6.03.2026

Perplexity AI unveils hybrid local-cloud inference system
at Computex 2026

Perplexity AI unveiled what it calls the first hybrid local-server inference orchestrator at Computex 2026 on Monday night, demonstrating software that autonomously decides — in real time and mid-task — which AI workloads stay on a user's device and which get routed to frontier models in the cloud.

6.02.2026

MiniMax M3 IS INSANE! BEST Opensource AI Model!

In this video, I fully test MiniMax M3, the new open-weight frontier model from MiniMax that combines coding, agentic reasoning, multimodal understanding, and long-context capabilities into one model. M3 supports up to a 1 million token context window, is natively multimodal from day one, and delivers some seriously impressive benchmark results across SWE-Bench Pro, BrowseComp, SVG-Bench, KernelBench Hard, OSWorld Verified, and more.

What makes this release even more insane is the pricing. MiniMax M3 is not only competing with models like Opus 4.7 and GPT-5.5, but in several benchmarks it actually beats them while being dramatically cheaper. MiniMax is also offering huge token plans, aggressive API pricing, and open-weight access, making this one of the most accessible frontier-level models available right now.



6.01.2026

Running Local AI on AMD

In this video, we look at running local AI work jobs for LLMs, images and video models, but running it on an AMD GPUs and processors.