bennetbo (Member) commented on Oct 10, 2025

Previously, we were guessing the context window size here:

fn get_max_tokens(name: &str) -> u64 {

This is inaccurate and must be updated manually whenever new models ship. This PR ensures that we extract the context window size via the same API request that the Ollama CLI makes when running ollama show <model-name> (relevant code is here).
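As a hypothetical illustration (the model name is just an example, and a local Ollama server on the default port 11434 is assumed), you can inspect this metadata yourself with: curl http://localhost:11434/api/show -d '{"model": "llama3.2"}'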

The format looks like this:

{
  "model_info": {
    "general.architecture": "llama",
    "llama.context_length": 132000
  }
}
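For illustration, here is a minimal sketch of deriving the context window from that response shape using serde_json. This is not the actual implementation in this PR, and the function name is hypothetical; the idea is that the lookup key is "<architecture>.context_length", where the architecture itself comes from "general.architecture".

use serde_json::Value;

// Sketch: derive the context window from an Ollama show response.
// Returns None if the fields are missing (e.g. on older Ollama versions).
fn context_length_from_show(response: &Value) -> Option<u64> {
    let model_info = response.get("model_info")?;
    let architecture = model_info.get("general.architecture")?.as_str()?;
    model_info
        .get(format!("{architecture}.context_length").as_str())
        .and_then(Value::as_u64)
}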

Once this PR is merged, we could technically remove the old code:

fn get_max_tokens(name: &str) -> u64 {

I decided to keep it for now, as it is unclear whether the necessary fields are available via the API on older Ollama versions; a sketch of that fallback shape follows.
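In other words, the old table acts as a fallback. A hedged sketch of that shape (the wrapper function name is hypothetical; get_max_tokens is the existing hardcoded lookup):

fn effective_max_tokens(name: &str, api_context_length: Option<u64>) -> u64 {
    // Prefer the value reported by the Ollama API; fall back to the
    // hardcoded per-model guess when the API does not provide one.
    api_context_length.unwrap_or_else(|| get_max_tokens(name))
}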

Release Notes:

  • Fixed an issue where Ollama models would use the wrong context window size
@cla-bot added the cla-signed label (the user has signed the Contributor License Agreement) on Oct 10, 2025
@bennetbo enabled auto-merge (squash) on Oct 10, 2025 at 12:47
@bennetbo merged commit 3d5ddcc into main on Oct 10, 2025 (25 checks passed)
@bennetbo deleted the ollama-context-length-via-api branch on Oct 10, 2025 at 12:59