-
Episode 41: Meta macht AGI und KIs beeinflussen unsere Kultur
In dieser Folge reden Florian und Ich über demnächst erscheinene Modelle, Metas Pläne, und neue Hardware in VR. Mehr Informationen auf dem Discord Serverhttps://discord.gg/3YzyeGJHthoder auf https://mkannen.tech — read more
-
Episode 40: CES, Wahlen, und KI in der Forschung
In dieser Episode reden Nico und Ich über die CES, humanoide Roboter und neue Forschungsergbnisse durch KI. Mehr Informationen auf dem Discord Serverhttps://discord.gg/3YzyeGJHthoder auf https://mkannen.tech — read more
-
Episode 36: Gerüchte, Mixtral, und Fusion
In dieser Episode reden Florian und Ich über GPT-4.5 Gerüchte, alternative Computer, und Forschritte in Medizin und Fusion. Frohe Weihnachten! Mehr Informationen auf dem Discord Serverhttps://discord.gg/3YzyeGJHthoder auf https://mkannen.tech — read more
-
Episode 35: NeurIPS, Mixtral und Phi-2
In dieser Folge reden Nico und Ich über die ganzen Neuigkeiten die im Zuge der NeurIPS raus kamen, darunter neue Modelle und Paper. Mehr Informationen auf dem Discord Serverhttps://discord.gg/3YzyeGJHthoder auf https://mkannen.tech — read more
-
Gemini is here
Google Deepmind just released their new Gemini models. They come in 3 sizes. Nano will be used on devices like the Pixel phones, and Pro will be used in their products such as Bard, and Ultra is going to be released at the beginning of next year. The models are multimodal and can input, audio, — read more
-
Google found a way to improve math skills in LLMs
LLMs are powerful tools, but they often struggle with tasks that require logical and algorithmic reasoning, such as arithmetic. A team of researchers from Google has developed a new technique to teach LLMs how to perform arithmetic operations by using in-context learning and algorithmic prompting. Algorithmic prompting means that the model is given detailed explanations — read more
-
Microsoft published the next Version of Kosmos
Researchers at Microsoft have unveiled Kosmos-2 the successor of Kosmos-1, a Multimodal Large Language Model (MLLM) that integrates the capability of perceiving object descriptions and grounding text in the visual world. By representing refer expressions as links in Markdown format, Kosmos-2 achieves the vital task of grounding text to visual elements, enabling multimodal grounding, referring — read more
-
RoboCat handles every Robot
Deepmind published a new blog post where they present their newest AI which is based on their previous work Gato. RoboCat is a self-improving AI agent for robotics that learns to perform a variety of tasks across different arms and then self-generates new training data to improve its technique. It is the first agent to — read more
-
Synthetic Human Embryos
After earlier experiments on mice, it is now possible to create human embryos out of stem cells. This allows us to make human life without sperm or eggs. Since the experiments are limited by ethical concerns they stopped the growth of the embryo at an early stage. This research could lead to a better understanding — read more
-
New OpenAI Update
OpenAI announced a set of changes to their model APIs. The biggest announcement is the addition of function calls for both GPT-3.5 and 4. This allows developers to enable plugins and other external tools for the models. They also released new versions of GPT-3.5 and 4 that are better at following directions and a Version — read more
-
DeepMind Makes Everything Faster
After DeepMind developed AlphaTensor last year and found a new algorithm for matrix multiplication, they did it again. This time they developed AlphaDev which found a new algorithm for sorting. This sounds not as exciting as a new language model, but sorting algorithms run billions of times every hour. Optimizing central algorithms like sorting and — read more
-
Copilots for everyone
Microsoft Build is currently underway, with Microsoft showcasing a range of new and upcoming products, including various Copilots such as Copilot for Bing, GitHub, and Edge. In their pipeline, they also have plans to launch a Copilot specifically designed for Windows. These Copilots are all built using Microsoft’s new Azure AI Studio Platform, which is — read more
-
Intel Presents New Hardware
Intel just announced a new supercomputer named Aurora. It is expected to offer more than 2 exaflops of peak double-precision compute performance and is based on their new GPU series which outperforms even the new H100 cards from NVIDIA. They are going to use Aurora to train their own LLMs up to a trillion parameters. — read more
-
US Senate Holds an AI Hearing
Today the US Senate held an AI testimony to discuss the risks and chances of AI and possible ways to regulate the sector nationally and globally. Witnesses testifying include Sam Altman, CEO of OpenAI; Gary Marcus, professor emeritus at New York University, and Christina Montgomery, vice president and chief privacy and trust officer at IBM. — read more
-
Google IO Summary
Google IO happened yesterday and the keynote focused heavily on AI. Some of the things that I found most important are: PaLM 2 is their new LLM. It comes in different sizes from small enough for pixel phones, to big enough to beat ChatGPT-3.5. It is used in Bard and many of their productivity tools. — read more
-
Claude comes with 100K context
Anthropic, the OpenAI competitor just announced a new version of their LLM Claude. This new Version has a context length of 100K tokens, which corresponds to around 75K words. It is not clear from the announcement how they implemented that and how the full context gets fed into the attention layers. OpenAI is planning to — read more