mamei16
abb7cc02e9
Re-introduce inline LaTeX rendering with more robust exception handling (#7402)
2026-03-04 17:44:19 -03:00
mamei16
68109bc5da
Improve process_markdown_content (#7403)
2026-03-04 17:26:13 -03:00
weiguang li
952e2c404a
Bump sentence-transformers from 2.2.2 to 3.3.1 in superbooga (#7406)
2026-03-04 17:08:08 -03:00
oobabooga
cdf0e392e6
llama.cpp: Reorganize speculative decoding UI and use recommended ngram-mod defaults
2026-03-04 12:05:08 -08:00
oobabooga
eb90daf098
ExLlamaV2: Don't expose unused seed parameter
2026-03-04 11:14:50 -08:00
oobabooga
0ffb75de7c
Update Transformers to 5.3.0
2026-03-04 11:12:54 -08:00
oobabooga
d8af0505a8
ExLlamav3_HF: Optimize prefill and fix CFG cache initialization
2026-03-04 11:09:58 -08:00
oobabooga
9b916f02cd
ExLlamaV3: Attach AdaptiveP, fix speculative decoding parameter, add seed
2026-03-04 10:51:15 -08:00
oobabooga
5d93f4e800
Fix requires_grad warning in logits API
2026-03-04 10:43:23 -08:00
oobabooga
64eb77e782
Fix the logits API endpoint with transformers
2026-03-04 10:41:47 -08:00
oobabooga
22141679e3
Update the custom gradio wheels
2026-03-04 10:01:31 -08:00
oobabooga
65de4c30c8
Add adaptive-p sampler and n-gram speculative decoding support
2026-03-04 09:41:29 -08:00
oobabooga
f010aa1612
Replace PyPDF2 with pymupdf for PDF text extraction
pymupdf produces cleaner text (e.g. no concatenated words in headers),
handles encrypted and malformed PDFs that PyPDF2 failed on, and
supports non-Latin scripts.
2026-03-04 06:43:37 -08:00
oobabooga
f4d787ab8d
Delegate GPU layer allocation to llama.cpp's --fit
2026-03-04 06:37:50 -08:00
oobabooga
8a3d866401
Fix temperature_last having no effect in llama.cpp server sampler order
2026-03-04 06:10:51 -08:00
oobabooga
11dc6fdfce
Update the custom gradio wheels
2026-03-04 06:04:33 -08:00
oobabooga
7d42b6900e
Update the custom gradio wheels
2026-03-04 05:47:59 -08:00
oobabooga
8cbb7661a8
Remove no longer needed dark theme localstorage code
2026-03-03 18:51:24 -08:00
oobabooga
866c48e55b
Simplify dark theme handling using gradio fork's new dark_theme parameter
2026-03-03 18:41:47 -08:00
oobabooga
b3fd0d16e0
Use a new gr.Headless component for efficient chat streaming
2026-03-03 18:12:03 -08:00
oobabooga
d584ede72e
Avoid a circular import
2026-03-03 17:59:47 -08:00
oobabooga
c0bff831e3
Update custom gradio wheels
2026-03-03 17:21:18 -08:00
oobabooga
2260e530c9
Remove gradio monkey-patches (moved to gradio fork)
2026-03-03 17:17:36 -08:00
oobabooga
e9f22813e4
Replace gradio with my gradio 4.37.2 fork
2026-03-03 16:51:27 -08:00
dependabot[bot]
3519890c8e
Bump flask-cloudflared from 0.0.14 to 0.0.15 in /requirements/full (#7380)
2026-03-03 21:41:51 -03:00
dependabot[bot]
9c604628a0
Bump flask-cloudflared from 0.0.14 to 0.0.15 in /requirements/portable (#7382)
2026-03-03 21:41:46 -03:00
oobabooga
fbd2acfa19
Remove triton-windows from non-CUDA requirements
2026-03-03 16:16:55 -08:00
oobabooga
5fd79b23d1
Add CUDA 13.1 portable builds
2026-03-03 15:36:41 -08:00
oobabooga
b8fcc8ea32
Update llama.cpp, remove noavx2 builds, add ROCm Windows portable builds
2026-03-03 15:27:19 -08:00
Pádraic Slattery
d7dd533b99
chore: Update outdated GitHub Actions versions (#7384)
2026-03-03 17:54:12 -03:00
oobabooga
9576c5a5f4
Update Miniforge from 25.3.0 to 26.1.0
2026-03-03 12:33:20 -08:00
oobabooga
9814d3d0ae
Patch gradio 4.x for huggingface-hub 1.x compatibility
2026-03-03 12:20:37 -08:00
oobabooga
38d0eeefc0
Update dependencies: torch 2.9.1, transformers 5.2, exllamav3 0.0.22, accelerate 1.12, huggingface-hub 1.5
2026-03-03 12:01:02 -08:00
oobabooga
ddd74324fe
Update PyTorch to 2.9.1 and ROCm to 6.4
2026-03-03 11:38:52 -08:00
oobabooga
efc72d5c32
Update Python from 3.11 to 3.13
2026-03-03 11:03:26 -08:00
oobabooga
aecbc5a8ac
Merge remote-tracking branch 'refs/remotes/origin/dev' into dev
2026-01-28 08:30:28 -08:00
oobabooga
c54e8a2b3d
Try to spawn llama.cpp on port 5001 instead of random port
2026-01-28 08:23:55 -08:00
oobabooga
dc2bbf1861
Refactor thinking block detection and add Solar Open support
2026-01-28 08:21:34 -08:00
dependabot[bot]
cae1fef42d
Bump triton-windows in /requirements/full (#7368)
2026-01-14 21:30:59 -03:00
q5sys (JT)
7493fe7841
feat: Add a dropdown to save/load user personas (#7367)
2026-01-14 20:35:08 -03:00
jakubartur
21b979c02a
Fix code block copy button on HTTP (Clipboard API fallback) (#7358)
2026-01-14 19:34:21 -03:00
oobabooga
a731861127
Update README
2026-01-13 15:38:32 -08:00
oobabooga
d79cdc614c
Update llama.cpp
2026-01-08 11:24:15 -08:00
oobabooga
332fd40653
Update llama.cpp
2026-01-07 19:06:23 -08:00
dependabot[bot]
50a35b483c
Update bitsandbytes requirement in /requirements/full (#7353)
2026-01-06 15:27:23 -03:00
dependabot[bot]
45fbec0320
Update torchao requirement in /requirements/full (#7356)
2026-01-06 15:27:10 -03:00
oobabooga
b0968ed8b4
Update flash-linear-attention
2026-01-06 10:26:43 -08:00
oobabooga
36747cf99c
Lint
2026-01-06 10:24:34 -08:00
oobabooga
2fcbadec67
Merge remote-tracking branch 'refs/remotes/origin/dev' into dev
2026-01-06 10:24:07 -08:00
oobabooga
bb3b7bc197
Update llama.cpp
2026-01-06 10:23:58 -08:00