Ollama: Tool Calling Gets Smarter

Four significant pull requests merged today focusing on tool calling improvements and system reliability. Jeffrey Morgan and Parth Sareen led major enhancements to Qwen3 and GLM parsers for better tool calling behavior, while Eva H fixed a critical Windows startup crash and Patrick Devine added peak memory usage visibility for MLX models.

Duration: PT3M47S

Episode overview

This episode is a short developer briefing from Ollama.

It explains recent repository work in plain language.

  • Show: Ollama
  • Published: 2026-02-27T11:02:49Z
  • Audio duration: PT3M47S

Transcript excerpt

This excerpt keeps the crawler page concise. Listen to the episode or use the RSS feed for the full update.

Hey there, wonderful developers! Welcome back to another episode of the Ollama podcast. I'm so excited to share what's been happening in the codebase because today feels like one of those days where everything just clicks into place.

So grab your favorite beverage and let's dive into what the team accomplished yesterday. We had four fantastic pull requests merge, and honestly, there's a beautiful theme running through all of this work around making Ollama smarter and more reliable.

Let's start with the star of the show - Jeffrey Morgan's work on fixing Qwen3 tool calling in thinking mode. Now, if you've ever worked with language models that can use tools, you know how tricky the parsing can get, especially when the model is in that "thinking" state where it's reasoning through a problem.…

Building on that theme, Parth Sareen jumped in with some excellent work on stable tool call indexing for both GLM47 and Qwen3 parsers. This might sound like a small detail, but consistent indexing is absolutely crucial when you're dealing with multiple tool calls. Think about it - if your model wants to use three…

Now, switching gears to reliability, Eva H came through with a fix that Windows users…

A…

Nearby episodes from Ollama

  1. Cloud Integration Drama and AI Model Expansion
  2. Smarter Sampling and Crash Prevention
  3. Building Bridges for Better Model Compatibility
  4. MLX Runner Gets Rock Solid
  5. Cleaner Shutdowns and Faster Startups
  6. Qwen 3.5 Architecture Lands with Safety Upgrades
  7. Memory Management Revolution
  8. Nemotron Architecture Lands with Unified Cache Vision