Ollama: MLX Gets a Major Upgrade and Web Search Goes Live
Daniel Hiltgen led a massive MLX infrastructure overhaul with header vendoring that simplifies the build process, while Parth Sareen introduced experimental web search capabilities. The team also updated ROCm to version 7.2 for Linux and fixed several build issues, showing strong momentum across multiple fronts.
Duration: PT4M3S
Episode overview
This episode is a short developer briefing from Ollama.
It explains recent repository work in plain language.
- Show: Ollama
- Published: 2026-03-10T10:05:52Z
- Audio duration: PT4M3S
Transcript excerpt
This excerpt keeps the crawler page concise. Listen to the episode or use the RSS feed for the full update.
Hey there, code friends! Welcome back to another episode of the Ollama podcast. I'm your host, and wow, do we have an exciting day to dive into! March 10th brought us some absolutely fantastic changes that are going to make developers' lives so much easier.
Let's jump right into the star of today's show - Daniel Hiltgen just dropped what I'm calling a game-changing MLX improvement. This pull request is massive, folks - we're talking over 5,000 lines added across 142 files. But here's why this matters for you: remember how you used to need to run cmake before building…
Daniel implemented what's called "header vendoring" for MLX. Think of it like this - instead of your build process having to hunt around for the right header files every single time, they're now packaged right there with the code. It's like going from having to assemble your toolbox every morning to just grabbing it…
But wait, there's more! Parth Sareen brought us something really cool - experimental web search and web fetch routes. This is one of those features that makes you go "why didn't we have this before?" Now you can easily access web search and fetch APIs through Ollama Cloud without needing your own API…
Speaki…
Nearby episodes from Ollama
- Launch Command Gets a Major Polish
- Spring Cleaning and Performance Gains
- Thinking Streams and Local Tool Power-ups
- Stability First - Error Handling and Performance Fixes
- Simplifying the Sampling Story
- Cloud Models Get Smarter & Build Performance Boost
- Cloud Integrations Get Some Love
- Smarter Constraints and Qwen3.5 Boost