Commit graph

21 commits

| Author | SHA1 | Message | Date |
|---|---|---|---|
| oobabooga | e18f32cba7 | Remove hardcoded trust_remote_code=True in embedding loader | 2026-04-06 17:47:50 -07:00 |
| oobabooga | 193424cc93 | API: Fix IPv6 address formatting | 2026-04-06 10:07:52 -07:00 |
| oobabooga | c26ffdd24c | API: add instruction_template support to the model load endpoint | 2026-04-06 07:02:53 -07:00 |
| oobabooga | 4d6230a944 | Follow-up to d78fc46114 | 2026-04-06 06:48:48 -07:00 |
| oobabooga | d78fc46114 | Fix "address already in use" on server restart (Linux/macOS) | 2026-04-05 16:42:27 -07:00 |
| oobabooga | c63a79ee48 | Image generation: Embed generation metadata in API image responses | 2026-04-04 23:15:14 -07:00 |
| oobabooga | 2eef90a323 | API: Remove deprecated "settings" parameter from model load endpoint | 2026-04-04 11:00:14 -07:00 |
| oobabooga | 9183dc444e | API: Fix loader args leaking between sequential model loads | 2026-04-04 10:48:53 -07:00 |
| oobabooga | 7aab2fdf9a | API: Improve cache clearing in logprobs | 2026-04-02 17:50:42 -07:00 |
| oobabooga | 091037ec20 | Fix top_logprobs_ids missing for llama.cpp loader | 2026-04-02 16:13:45 -03:00 |
| oobabooga | ea1f8c71f2 | API: Optimize prompt logprobs and refactor ExLlamav3 forward pass | 2026-04-02 14:31:11 -03:00 |
| oobabooga | c10c6e87ae | API: Add token ids to logprobs output | 2026-04-02 07:17:27 -07:00 |
| oobabooga | 4073164be0 | Fix ExLlamav3 OOM on prompt logprobs and qwen3_5_moe HF compat | 2026-04-01 19:44:55 -07:00 |
| oobabooga | 71c1a52afe | API: Implement echo + logprobs for /v1/completions endpoint | 2026-03-31 07:43:11 -07:00 |
| oobabooga | e154140021 | Rename "truncation length" to "context length" in logs | 2026-03-25 07:21:02 -07:00 |
| oobabooga | 02f18a1d65 | API: Add thinking block signature field, fix error codes, clean up logging | 2026-03-23 07:06:38 -07:00 |
| oobabooga | 2c4f364339 | Update API docs to mention Anthropic support | 2026-03-21 18:38:11 -07:00 |
| oobabooga | 0216893475 | API: Add Anthropic-compatible /v1/messages endpoint | 2026-03-20 20:38:55 -07:00 |
| oobabooga | 7c79143a14 | API: Fix _start_cloudflared raising after first attempt instead of exhausting retries | 2026-03-20 15:03:49 -03:00 |
| oobabooga | 1a910574c3 | API: Fix debug_msg truthy check for OPENEDAI_DEBUG=0 | 2026-03-20 14:57:01 -03:00 |
| oobabooga | bf6fbc019d | API: Move OpenAI-compatible API from extensions/openai to modules/api | 2026-03-20 14:46:00 -03:00 |