-
Predictions for 2026
Another year has passed, and it is time for my yearly predictions. As you can see from the title, I decided not to review the last year and go over my previous predictions. I always felt a bit weird reviewing myself, and I leave it to you to judge my last year’s predictions. There is — read more
-
Episode 71: GPT-5.2, Claude Opus 4.5 und Deepseek
In dieser Folge reden Florian und Ich über viele neue Modelle wie Deepseek 3.2, Mistral und andere. Mehr Informationen auf dem Discord Server https://discord.gg/3YzyeGJHth oder auf https://mkannen.tech — read more
-
Episode 69: Datencenter im Weltraum und die KI Blase
In dieser Episode reden Florian und Ich über Datencenter im Weltraum, die Debatte um die KI Blase, und neue open-source Modelle, die die Lücke schließen. Mehr Informationen auf dem Discord Server https://discord.gg/3YzyeGJHth oder auf https://mkannen.tech — read more
-
Episode 66: Grok 4, OpenAI und Meta’s SuperAI Team
In dieser Episode reden Florian und Ich über die aktuelle Kontroverse um Grok, OpenAIs aktuelle Probleme, und wie Alignment von Modellen schief gehen kann. Mehr Informationen auf dem Discord Server https://discord.gg/3YzyeGJHth oder auf https://mkannen.tech — read more
-
Episode 63: o3 und die Ferne Zukunft
In dieser Folge reden Florian und Ich über die neuen Reasoning Modelle von OpenAI und Google. Außerdem reden wir über einige mögliche Zukunftstechnologien. Mehr Informationen auf dem Discord Server https://discord.gg/3YzyeGJHth oder auf https://mkannen.tech — read more
-
Episode 56: OpenAI o1 Review
In dieser Episode reden Florian und Ich über das neue Model o1 und was es besonders macht. Außerdem reden wir über den Hardwaremarkt, Alpha Proteo und die US Politik. Mehr Informationen auf dem Discord Server https://discord.gg/3YzyeGJHth oder auf https://mkannen.tech — read more
-
Episode 35: NeurIPS, Mixtral und Phi-2
In dieser Folge reden Nico und Ich über die ganzen Neuigkeiten die im Zuge der NeurIPS raus kamen, darunter neue Modelle und Paper. Mehr Informationen auf dem Discord Serverhttps://discord.gg/3YzyeGJHthoder auf https://mkannen.tech — read more
-
Episode 30: OpenAI DevDay KeyNote News
In dieser Episode reden Florian und Ich über die Annoucements von OpenAIs Keynote; Unter anderem GPT-4 Turbo. Außerdem reden wir über Apple, GitHub und die Folgen von Automatisierung. Mehr Informationen auf dem Discord Serverhttps://discord.gg/3YzyeGJHthoder auf https://mkannen.tech — read more
-
New OpenAI Update
OpenAI announced a set of changes to their model APIs. The biggest announcement is the addition of function calls for both GPT-3.5 and 4. This allows developers to enable plugins and other external tools for the models. They also released new versions of GPT-3.5 and 4 that are better at following directions and a Version — read more
-
AI helps with AI Understanding
One of the main problems of LLMs is that they are black boxes and how they produce an output is not understandable for humans. Understanding what different neurons are representing and how they influence the model is important to make sure they are reliable and do not contain dangerous trends. OpenAI applied GPT-4 to find — read more
-
OpenAI Open-Sources a New Text-to-3D model
Shap-E can generate 3D assets from text or images. Unlike their earlier model Point-E, this one can directly generate the parameters of implicit functions that can be rendered as both textured meshes and neural radiance fields. It is also faster to run and open-source! Read the paper here. Just like video generation, the quality is — read more
-
New Image generation approach
OpenAI developed a new approach to image generation called consistency models. Current models, like Dalle-2 or stable diffusion, iteratively diffuse the result. This new approach goes straight to the final result which makes the process way faster and cheaper. While not as good as some diffusion models yet, they will likely improve and become an — read more
-
The New Wave of GPT Agents
Since GPT-3.5 and GPT-4 APIs are available many companies and start-ups have implemented them into their products. Now developers have started to do it the other way around. They build systems around GPT-4 to enable it to search, use APIs, execute code, and interact with itself. Examples are HuggingGPT or AutoGPT. They are based on — read more
-
Open Letter to pause bigger AI models
A group of researchers and notable people released an open letter in which they call for a 6 month stop from developing models that are more advanced than GPT-4. Some of the notable names are researchers from competing companies like Deepmind, Google, and Stability AI like Victoria Krakovna, Noam Shazeer, and Emad Mostaque. But also — read more
-
Listen to OpenAI
Many people saw the new episode of the Lex Friedman Podcast with Sam Altman, where he talks about some social and political implications of GPT-4. But fewer people saw the podcast with Ilya Sutskever, the Chief Scientist at OpenAI, which is way more technical and in my opinion even more exciting and enjoyable. I really — read more
-
Sparks of Artificial General Intelligence: Early experiments with GPT-4
Microsoft researchers have conducted an investigation on an early version of OpenAI’s GPT-4, and they have found that it exhibits more general intelligence than previous AI models. The model can solve novel and difficult tasks spanning mathematics, coding, vision, medicine, law, psychology, and more, without needing any special prompting. Furthermore, in all of these tasks, — read more
-
ChatGPT’s biggest update jet
OpenAI announced that they will introduce plugins to ChatGPT. Two of them developed by OpenAi themself allow the model to search the web for information and run generated python code. Other third-party plugins like Wolfram allow the model to use other APIs to perform certain tasks. the future capabilities of a model enhanced this way — read more
-
From GPT-4 to Proto-AGI
Deutsche Version Artificial General Intelligence (AGI) is the ultimate goal of many AI researchers and enthusiasts. It refers to the ability of a machine to perform any intellectual task that a human can do, such as reasoning, learning, creativity, and generalization. However, we are still far from achieving AGI with our current AI systems. One — read more
-
GPTs are GPTs: How Large Language Models Could Transform the U.S. Labor Market
A new study by OpenAI and the University of Pennsylvania investigates the potential impact of Generative Pre-trained Transformer (GPT) models on the U.S. labor market. The paper, titled “GPTs are GPTs: An Early Look at the Labor Market Impact Potential of Large Language Models,” assesses occupations based on their correspondence with GPT capabilities, using both — read more
-
GPT-4 is here
OpenAI presented its new GPT model today. GPT-4 has a context window of 32K tokens and outperforms humans and previous models like GPT-3.5 in almost all language tasks. It is also multimodal and supports images as inputs. Read more here or watch the presentation here. OpenAI just released GPT-4, a game-changer in AI language models. — read more