
Fix gpt-oss tool calling: pass tool args and tool responses as json #19620

Closed

matteoserva wants to merge 1 commit into ggml-org:master from matteoserva:fix-gptoss-tools

Conversation

@matteoserva
Contributor

The gpt-oss template expects JSON objects for tool arguments and tool responses, but llama.cpp passes strings, causing them to be escaped twice.

fixes #19520
supersedes #19553
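A minimal sketch of the double-escaping described above (in Python for a runnable illustration; the actual fix is in C++ in common/chat.cpp, and the `args_str` value is a made-up example):

```python
import json

# Tool arguments as emitted by the model: already a JSON-encoded string.
args_str = '{"location": "Paris"}'

# Passing the raw string to the template makes the renderer serialize it
# again, so every quote gets escaped a second time.
double_escaped = json.dumps(args_str)
print(double_escaped)  # "{\"location\": \"Paris\"}"

# Parsing first yields a JSON object the template can embed verbatim.
args_obj = json.loads(args_str)
print(json.dumps(args_obj))  # {"location": "Paris"}
```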

@matteoserva matteoserva requested a review from pwilkin as a code owner February 14, 2026 08:06
Contributor

@aldehir left a comment


We need a better way to handle tool content. This will break clients that return plain strings, which is perfectly valid.

Also, have you tested this against the Unsloth template? They patched this particular issue, and they appear to handle both cases, but this needs to be verified.

Additionally, please adjust the code style to match the rest of the file.

Comment thread on common/chat.cpp (lines +2036 to +2037):

```cpp
std::string response = msg.value("content",std::string());
adjusted_message["content"] = json::parse(response);
```
Contributor


There is no guarantee that clients return valid JSON as the tool response. Perhaps fall back to a string when parsing fails.
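The suggested fallback could be sketched like this (a runnable Python illustration; the actual code would wrap nlohmann::json's `json::parse` in a try/catch, and `parse_or_keep` is a hypothetical name):

```python
import json

def parse_or_keep(content: str):
    """Return parsed JSON when content is valid JSON, else the raw string."""
    try:
        return json.loads(content)
    except (json.JSONDecodeError, TypeError):
        return content

print(parse_or_keep('{"ok": true}'))  # {'ok': True}
print(parse_or_keep('plain text'))    # plain text
```

This keeps object responses unescaped in the rendered prompt while still accepting clients that return plain strings.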

@matteoserva
Contributor Author

You were right. The updated model fixed everything. Thanks very much.

@bfroemel

> You were right. The updated model fixed everything. Thanks very much

Unfortunately, the template fix wasn't picked up everywhere: most importantly, not upstream (OpenAI), and even https://huggingface.co/ggml-org/gpt-oss-120b-GGUF hasn't adopted it. So IMO the proposed code fix (with the changes suggested by @aldehir) could make sense and shouldn't interfere with the Unsloth template fix.

btw: I could never really understand why some people on reddit reported bad gpt-oss performance, and I always attributed it to some kind of OpenAI bashing and/or bots. Now I see that issues like this are indistinguishable from bad model performance for average users. It would be great if we had some automated way to verify that the model sees valid prompts; maybe in the future we'll have a small set of end-to-end test cases per model where API requests can be verified against expected prompts.

@pwilkin
Member

pwilkin commented Feb 18, 2026

@bfroemel that's exactly how the autoparser project started :) I wanted llama.cpp to be a reliable local agentic coding backend, and to my frustration I realized that something like five theoretically supported models that I tested in a row broke down within the first few tool-call requests. Then I started digging and realized that a lot of what is commonly attributed to "bad model quality" is actually bad parser behavior.

@aldehir
Contributor

aldehir commented Feb 18, 2026

I agree, this should be fixed for all templates. I'll try to bring this PR to the finish line when I find time, unless @matteoserva wants to. An easy workaround is to use the Unsloth template for now.

@matteoserva
Contributor Author

> I agree, this should be fixed for all templates. I'll try to bring this PR to the finish line when I find time, unless @matteoserva wants to. An easy workaround is to use the Unsloth template for now.

Yes, please do. :)



Development

Successfully merging this pull request may close these issues.

Eval bug: gpt-oss tool call template escapes arguments and responses twice

4 participants