text-generation-webui/extensions/openai
oobabooga 8d43123f73 API: Fix function calling for Qwen, Mistral, GPT-OSS, and other models
The tool call response parser only handled JSON-based formats, causing
tool_calls to always be empty for models that use non-JSON formats.

Add parsers for three additional tool call formats:
- Qwen3.5: <tool_call><function=name><parameter=key>value</parameter>
- Mistral/Devstral: functionName{"arg": "value"}
- GPT-OSS: <|channel|>commentary to=functions.name<|message|>{...}

Also fix multi-turn tool conversations crashing with Jinja2
UndefinedError on tool_call_id by preserving tool_calls and
tool_call_id metadata through the chat history conversion.
2026-03-06 00:55:33 -03:00
..
cache_embedding_model.py Make /v1/embeddings functional, add request/response types 2023-11-10 07:34:27 -08:00
completions.py API: Fix function calling for Qwen, Mistral, GPT-OSS, and other models 2026-03-06 00:55:33 -03:00
embeddings.py Openai embedding fix to support jina-embeddings-v2 (#4642) 2023-11-18 20:24:29 -03:00
errors.py extensions/openai: Fixes for: embeddings, tokens, better errors. +Docs update, +Images, +logit_bias/logprobs, +more. (#3122) 2023-07-24 11:28:12 -03:00
images.py Image: Several fixes 2025-12-05 05:58:57 -08:00
logits.py New llama.cpp loader (#6846) 2025-04-18 09:59:37 -03:00
models.py Properly fix the /v1/models endpoint 2025-06-19 10:25:55 -07:00
moderations.py Lint 2023-11-16 18:03:06 -08:00
script.py API: Add parallel request support for llama.cpp and ExLlamaV3 2026-03-05 16:49:58 -08:00
tokens.py Add types to the encode/decode/token-count endpoints 2023-11-07 19:32:14 -08:00
typing.py Add adaptive-p sampler and n-gram speculative decoding support 2026-03-04 09:41:29 -08:00
utils.py API: Fix function calling for Qwen, Mistral, GPT-OSS, and other models 2026-03-06 00:55:33 -03:00