Draft: parse tool_calls arguments by la1ty · Pull Request #4921 · xorbitsai/inference

la1ty · 2026-05-18T05:32:27Z

Add parsing for JSON-encoded arguments in tool_calls. Tested with open-zread and Qwen3.5 models.

Close #4914 .

Note: This is a draft PR and it works on my computer. I hope the subsequent modifications can be made by the official maintainers.

TODO

I made a quick investigation in llm_family.json and it seemed that there are four kinds of handling logics in chat templates:

accept	template	models
both	`tool_call.arguments \| tojson`	`DianJin-R1-32B`, `HuatuoGPT-o1-70B`, `InternVL3-78B`, `Ovis2-34B`, `QwQ-32B`, `XiYanSQL-QwenCoder-32B`, `Fin-R1`, `Qwen1.5-Chat`, `Qwen2-Instruct`, `Qwen2.5-Coder-32B-Instruct`, `Qwen2.5-Instruct`, `GPT-OSS`, `KAT-V1-40B`, `DeepSeek-V3.1`
both	`{%- set arguments = tool['function']['arguments'] %}{%- if arguments is not string %}...`	`DeepSeek-R1-0528`, `Qwen3`, `Qwen3-Instruct`, `Qwen3-Thinking`, `Baichuan-M2-32B`, `Qwen3-VL-Instruct`, `Qwen3-VL-Thinking`, `Qwen3-Next-Instruct`, `Qwen3-Next-Thinking`, `Qwen3-Omni-Instruct`, `Qwen3-Omni-Thinking`, `DeepSeek-V3.2`, `Kimi-K2.5`
`arguments` as string	`'\n' + tool['function']['arguments'] + '\n'`	`DeepSeek-Prover-V2-7B`, `DeepSeek-V2.5`, `DeepSeek-V3-0324`, `DeepSeek-R1`, `DeepSeek-R1-0528-Qwen3-8B`, `DeepSeek-R1-Distill-Llama-8B`, `DeepSeek-R1-Distill-Qwen-32B`, `Skywork-OR1-7B`, `QwenLong-L1-32B`
`arguments` as dict	`{%- for arg_name, arg_val in tool_call.arguments \| items %}`	`Llama-3.1-70B-Instruct`, `Qwen3-Coder`, `GLM-4.5`, `GLM-4.5V`, `Seed-OSS-36B-Instruct`, `MiniMax-M2`, `GLM-4.6`, `GLM-4.7`, `GLM-4.7-Flash`, `MiniMax-M2.5`, `GLM-5`, `Qwen3.5`, `Qwen3.6`, `MiniMax-M2.7`, `GLM-5.1`

So models that treat arguments as string will be unable to handling function callings if this PR is merged but the chat template remains the same.

Add parsing for JSON-encoded arguments in tool_calls.

gemini-code-assist

Code Review

This pull request modifies _transform_messages in xinference/model/llm/utils.py to automatically parse JSON-encoded tool call arguments into dictionaries, facilitating easier iteration within Jinja2 templates. Reviewers pointed out that this global conversion is a breaking change for templates expecting string-based arguments and suggested implementing a fromjson filter as a safer, opt-in alternative. Additionally, feedback suggests adding more robust type checking to ensure that the function objects and parsed results are indeed dictionaries before processing them.

gemini-code-assist · 2026-05-18T05:33:36Z

+            # Parse JSON-encoded arguments in tool_calls to dicts,
+            # so Jinja2 templates can iterate them with |items.
+            if new_message.get("tool_calls"):
+                tool_calls = []
+                for tc in new_message["tool_calls"]:
+                    tc = dict(tc)
+                    func = tc.get("function")
+                    if func and isinstance(func.get("arguments"), str):
+                        func = dict(func)
+                        try:
+                            func["arguments"] = json.loads(func["arguments"])
+                        except (json.JSONDecodeError, TypeError):
+                            pass
+                        tc["function"] = func
+                    tool_calls.append(tc)
+                new_message["tool_calls"] = tool_calls


This change introduces a breaking change for models whose chat templates expect tool_calls arguments to be a JSON string. As noted in the PR description, templates using string concatenation or assuming a string type will fail. Instead of a global conversion in _transform_messages, a safer approach would be to add a fromjson filter to the Jinja2 environment in _compile_jinja_template. This allows templates that need structured data to opt-in without breaking others. If this global change is desired, it should likely be made conditional based on the model family or template requirements.

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

qinxuye · 2026-05-19T03:21:05Z

https://github.com/xorbitsai/inference/actions/runs/26015899223/job/76465909355?pr=4921

Could you fix the error?

la1ty · 2026-05-19T07:22:53Z

https://github.com/xorbitsai/inference/actions/runs/26015899223/job/76465909355?pr=4921

Could you fix the error?

Okay, it is fixed.

Draft: parsing tool_calls arguments

43cd9bf

Add parsing for JSON-encoded arguments in tool_calls.

XprobeBot added this to the v2.x milestone May 18, 2026

gemini-code-assist Bot reviewed May 18, 2026

View reviewed changes

la1ty and others added 2 commits May 18, 2026 13:35

Apply suggestion from @gemini-code-assist[bot]

871b4a4

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

fix linting

b0a77f3

la1ty added 2 commits May 19, 2026 12:15

Merge branch 'xorbitsai:main' into draft-parse-arguments

5743c10

fix test

9cc4e0f

la1ty changed the title ~~Draft: parsing tool_calls arguments~~ Draft: parse tool_calls arguments May 19, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Draft: parse tool_calls arguments#4921

Draft: parse tool_calls arguments#4921
la1ty wants to merge 5 commits into
xorbitsai:mainfrom
la1ty:draft-parse-arguments

la1ty commented May 18, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot May 18, 2026

Uh oh!

Uh oh!

qinxuye commented May 19, 2026

Uh oh!

la1ty commented May 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

la1ty commented May 18, 2026

TODO

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot May 18, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

qinxuye commented May 19, 2026

Uh oh!

la1ty commented May 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants