New Model Support and Memory Management Wins

Today brought some exciting developments with GLM-4.7-Flash model support landing alongside crucial memory management fixes for image generation models. Jeffrey Morgan led the charge with major architecture additions, while the team also tackled integration improvements and API consistency issues that'll make life easier for downstream developers.

Duration: PT4M7S

Episode overview

This episode is a short developer briefing from Ollama.

It explains recent repository work in plain language.

  • Show: Ollama
  • Published: 2026-01-20T11:03:12Z
  • Audio duration: PT4M7S

Transcript excerpt

This excerpt keeps the crawler page concise. Listen to the episode or use the RSS feed for the full update.

Hey there, fellow developers! Welcome back to another episode of the Ollama podcast. I'm your host, and wow, do we have some fascinating changes to dive into today from January 20th. Grab your favorite beverage because we're about to explore some really cool model architecture work and some clever performance…

Let's start with the big story of the day - Jeffrey Morgan just merged support for a brand new model architecture called GLM-4.7-Flash. This is huge! We're talking about over 2,500 lines of code changes across 17 files to add the `Glm4MoeLiteForCausalLM` architecture. Now, if you're wondering what makes this…

The implementation includes new converters, model definitions, and parsers - it's like watching a well-orchestrated symphony of code organization. And here's a fun detail: Jeffrey mentioned they should probably rename the renderer and parser to "glm47" to keep things consistent. These kinds of naming considerations…

Now, speaking of smart optimizations, Jeffrey also tackled a really clever performance issue around image generation models. Picture this scenario - you're trying to delete a model with `ollama rm`, but the system accidentally loads the entire model into…

But…

Da…

Nearby episodes from Ollama

  1. Memory Magic and Command Makeover
  2. Making Ollama Play Nice with Everyone
  3. The Great Cleanup - Manifests Get Their Own Home
  4. New Model Architecture and Image Generation Fixes
  5. FLUX.2 Image Generation Arrives
  6. Image Generation Goes Native and Parser Cleanup Magic
  7. Dynamic Loading and Experimental Models Take Center Stage
  8. Release Day Rescue Mission