AI Blog: September 2025

9.30.2025

Claude 4.5 Sonnet: Best Coding Model In The World! Powerful + Agentic! (Fully Tested)

Claude Sonnet 4.5 is here, and it’s absolutely groundbreaking. Touted as the best coding model ever, it excels at building complex agents, using computers, and shows major gains in reasoning and math.

9.29.2025

Google's Gemini 2.5 Flash Lite is now the fastest
proprietary model

Google announced updates to Gemini 2.5 Flash and 2.5 Flash Lite, its LLMs designed for speed and efficiency, and for the application programming interface (API) to Gemini Live, its AI voice generation model for enterprise functions like customer support calls.

9.26.2025

Liquid AI offers new small, task-specific Liquid Nano models

Liquid AI, a startup pursuing alternatives to the popular "transformer"-based AI models that have come to define the generative AI era, is announcing not one, not two, but a whole family of six different types of AI models called Liquid Nanos that it says are better suited to the "reality of most AI deployments" in enterprises and organizations than the larger foundation models from rivals like OpenAI, Google, and Anthropic.

9.25.2025

Apple’s memory-saving AI breakthrough could save
enterprises millions

Apple researchers have developed a breakthrough framework that dramatically reduces the memory requirements for AI systems engaged in long conversational interactions, a development that could significantly lower costs for enterprise deployments of chatbots and virtual assistants.

9.24.2025

Why Codex Is the Beginning of New Engineering

The way we write code just changed forever. OpenAI’s latest updates to Codex aren’t just a small step forward—they’re a whole new chapter in engineering. In this video, I walk you through what’s really happening with Codex across the cloud, desktop (CLI), GitHub integrations, and even VS Code extensions.

You’ll see me experiment live: from adding playful sparkles to a homepage logo, to swapping out fonts across an app, to spinning up pull requests and letting Codex review them. Along the way, I share why this shift is about more than faster coding—it’s about scaling yourself from one developer into ten.

By the end, you’ll see the full lifecycle: firing off multiple requests, reviewing results, merging PRs, and understanding how this ties into Git workflows. This is the first real glimpse of what “new engineering” looks like—and it’s already here, for just $20 a month.

9.23.2025

China's Alibaba challenges U.S. tech giants with open source Qwen3-Omni AI model accepting text, audio, image and video

Chinese search giant Alibaba's Qwen team of AI researchers has debuted what may be its most impressive model yet: Qwen3-Omni, an open source large language model (LLM) that the company bills as the first "natively end-to-end omni-modal AI unifying text, image, audio & video in one model.

Nvidia plans to invest up to $100B in OpenAI

Nvidia announced Monday it plans to invest up to $100 billion in OpenAI as part of a deal to build out massive data centers for training and running AI models. The companies say they signed a letter of intent to deploy 10 gigawatts — enough to power millions of homes — worth of Nvidia systems to power OpenAI’s next generation of AI infrastructure.

9.22.2025

Mistral's updated Magistral Small 1.2 reasoning model can analyze images and fit on a Macbook

The company said on X that its new Magistral Small 1.2 and Magistral Medium 1.2 LLMs are "minor" updates to its Magistral 1.1 series.

But the updates may actually be more significant than this well-funded European AI darling is letting on: both models are equipped with a vision encoder, allowing them to analyze imagery submitted by users. And they both offer performance improvements on key benchmarks, as well as enhanced usability features.

9.19.2025

Replit's NEW AI Agents Are INSANE

ReplitAgent3 brings a new automation and agent builder. We show how to use Replit, give a full Replit review, test n8n AI agent features, and explore n8n automation directly inside Replit’s Agents and Automations beta.

9.18.2025

Meet Alibaba's open source Tongyi DeepResearch Agent

This week, another AI agent research team at Alibaba — the Tongyi Lab, not to be confused with the Qwen Team that releases foundation models under the same parent company — unveiled a powerful, new open source agent specifically for conducting "deep research" across the web and compiling through, accurate reports and other materials for individuals and organizations.

9.17.2025

You need to learn MCP RIGHT NOW!!

You need to learn MCP Right Now!! The Model Context Protocol (MCP) changes everything. It gives LLMs like Claude and ChatGPT the ability to use real tools through APIs instead of clunky GUIs. I’ll break down how MCP works, why it’s so powerful, and how you can run it using Docker containers. I’ll even show you how to build your own custom MCP servers so you can connect AI to literally ANYTHING: Obsidian, Brave, Kali Linux, and more.

9.16.2025

OpenAI upgrades Codex with a new version of GPT-5

OpenAI announced Monday that it’s releasing a new version of GPT-5 to its AI coding agent, Codex. The company says its new model, called GPT-5-Codex, spends its “thinking” time more dynamically than previous models and could spend anywhere from a few seconds to seven hours on a coding task. As a result, it performs better on agentic coding benchmarks.

9.15.2025

Agentic swarm coding is the new enterprise moat

In just a few months, the ability of AI to perform complex software engineering has accelerated at a dizzying, non-linear pace. The improvement has been documented in several ways. Driven by several major advances in technology and practice, it amounts to a fundamental change in how software is created, one that is already making last year’s paradigms obsolete.

The era of "vibe coding" – the conversational, often exploratory practice of prompting an AI for code, coined by Andrej Karpathy – has given way to a more disciplined concept: agentic swarm coding.

9.12.2025

K2 Think arrives from UAE as 'world’s fastest open-source
AI model'

Not to be confused with Chinese AI lab Moonshot's recently released, powerful, open source model Kimi K2, another new open source large language model (LLM) called "K2 Think" debuted today, and it's already making waves among AI power users and observers for its claims of being the "world’s fastest open-source AI model" and the "most advanced open-source AI reasoning system ever created."

9.11.2025

CodeBuff: ClaudeCode KILLER! New AI Coding Agent is Quite Powerful, FREE, & Opensource!

Codebuff is an AI coding assistant that edits your codebase through natural language instructions. Instead of using one model for everything, it coordinates specialized agents that work together to understand your project and make precise changes.

9.10.2025

Qwen3-Max arrives in preview with 1 trillion parameters, blazing fast response speed, and API availability

After a busy summer in which the AI lab released a whole fleet of new open source AI models with support for English and Chinese — models that matched or outperformed top U.S. lab offerings from Google, OpenAI, and Anthropic — it has now unveiled its largest large language model (LLM) to date, Qwen3-Max-Preview (Instruct) with more than 1 trillion parameters.

9.09.2025

This new framework lets LLM agents learn from experience, no fine-tuning required

A new learning paradigm developed by University College London (UCL) and Huawei Noah’s Ark Lab enables large language model (LLM) agents to dynamically adapt to their environment without fine-tuning the underlying language model. The method allows agents to continuously improve their performance by using a structured memory system that updates itself as the agent gathers experience.

9.08.2025

Spec Kit: Github's NEW tool That FINALLY Fixes AI Coding

Spec-driven development is here to change the way we code. In this video, we break down GitHub’s new open-source toolkit, Spec Kit, and show how it transforms messy vibe coding into structured, reliable workflows. From the core concepts to a live demo project, you’ll see why Spec Kit might be the tool that finally makes AI coding click.

9.05.2025

Google's mobile-ready EmbeddingGemma ranks highest in embedding leaderboard among small parameter models

Google’s open-source Gemma is already a small model designed to run on devices like smartphones. However, Google continues to expand the Gemma family of models and optimize these for local usage on phones and laptops.

9.04.2025

LangChain 1.0 alpha consolidates agent design, reducing adoption risk for enterprises

The emerging field of AI orchestration just got an additional boost of confidence with the alpha release of LangChain’s two flagship platforms, LangGraph and LangChain version 1.0.

Graduating to a 1.0 signals that both LangGraph and LangChain have reached stability, thereby lessening any enterprise risks associated with adopting the two platforms.

9.03.2025

Unified Codex CLI + IDE Extension | MCP Setup Guide
& First Look

In this setup guide, I'll show you how to set up MCP servers for both Codex CLI and the new IDE extension. This powerful combination lets you seamlessly move between your IDE, command line, and then send it off to the cloud.

9.02.2025

Nous Research drops Hermes 4 AI models that outperform ChatGPT without content restrictions

Nous Research, a secretive artificial intelligence startup that has emerged as a leading voice in the open-source AI movement, quietly released Hermes 4 on Monday, a family of large language models that the company claims can match the performance of leading proprietary systems while offering unprecedented user control and minimal content restrictions.