text-generation-webui

mirror of https://github.com/oobabooga/text-generation-webui.git synced 2026-03-10 23:53:49 +01:00

History

oobabooga 8d43123f73 API: Fix function calling for Qwen, Mistral, GPT-OSS, and other models The tool call response parser only handled JSON-based formats, causing tool_calls to always be empty for models that use non-JSON formats. Add parsers for three additional tool call formats: - Qwen3.5: <tool_call><function=name><parameter=key>value</parameter> - Mistral/Devstral: functionName{"arg": "value"} - GPT-OSS: <\|channel\|>commentary to=functions.name<\|message\|>{...} Also fix multi-turn tool conversations crashing with Jinja2 UndefinedError on tool_call_id by preserving tool_calls and tool_call_id metadata through the chat history conversion.		2026-03-06 00:55:33 -03:00
..
cache_embedding_model.py	Make /v1/embeddings functional, add request/response types	2023-11-10 07:34:27 -08:00
completions.py	API: Fix function calling for Qwen, Mistral, GPT-OSS, and other models	2026-03-06 00:55:33 -03:00
embeddings.py	Openai embedding fix to support jina-embeddings-v2 (#4642 )	2023-11-18 20:24:29 -03:00
errors.py	extensions/openai: Fixes for: embeddings, tokens, better errors. +Docs update, +Images, +logit_bias/logprobs, +more. (#3122 )	2023-07-24 11:28:12 -03:00
images.py	Image: Several fixes	2025-12-05 05:58:57 -08:00
logits.py	New llama.cpp loader (#6846 )	2025-04-18 09:59:37 -03:00
models.py	Properly fix the /v1/models endpoint	2025-06-19 10:25:55 -07:00
moderations.py	Lint	2023-11-16 18:03:06 -08:00
script.py	API: Add parallel request support for llama.cpp and ExLlamaV3	2026-03-05 16:49:58 -08:00
tokens.py	Add types to the encode/decode/token-count endpoints	2023-11-07 19:32:14 -08:00
typing.py	Add adaptive-p sampler and n-gram speculative decoding support	2026-03-04 09:41:29 -08:00
utils.py	API: Fix function calling for Qwen, Mistral, GPT-OSS, and other models	2026-03-06 00:55:33 -03:00