oobabooga
|
6b1b2e2373
|
Update README
|
2025-08-17 22:19:20 -07:00 |
|
oobabooga
|
8a14aa62ff
|
Update README
|
2025-08-17 22:06:59 -07:00 |
|
oobabooga
|
8cdb911a6e
|
Update README
|
2025-08-17 22:06:12 -07:00 |
|
oobabooga
|
6bf31479d9
|
Update README
|
2025-08-17 22:00:21 -07:00 |
|
oobabooga
|
320f7339cd
|
Update README
|
2025-08-17 21:56:35 -07:00 |
|
oobabooga
|
3dec47eaf8
|
Small one-click installer changes
|
2025-08-17 21:43:46 -07:00 |
|
oobabooga
|
35707c2dd8
|
Update README
|
2025-08-17 21:39:57 -07:00 |
|
oobabooga
|
58797a9eb5
|
Minor change after 9651b5c873
|
2025-08-17 14:18:23 -07:00 |
|
oobabooga
|
64eba9576c
|
mtmd: Fix a bug when "include past attachments" is unchecked
|
2025-08-17 14:08:40 -07:00 |
|
oobabooga
|
3a91ca2dd1
|
Update flash attention
|
2025-08-17 13:57:23 -07:00 |
|
oobabooga
|
9651b5c873
|
Make CUDA 12.8 the default CUDA option, remove the CUDA 12.4 option
Exllamav3 doesn't compile with torch 2.6 anymore, and torch 2.7
requires newer CUDA.
|
2025-08-17 13:26:09 -07:00 |
|
oobabooga
|
a633793a00
|
Bump exllamav3 to 0.0.6
|
2025-08-17 13:19:42 -07:00 |
|
oobabooga
|
dbabe67e77
|
ExLlamaV3: Enable the --enable-tp option, add a --tp-backend option
|
2025-08-17 13:19:11 -07:00 |
|
oobabooga
|
d771ca4a13
|
Fix web search (attempt)
|
2025-08-14 12:05:14 -07:00 |
|
oobabooga
|
73a8a737b2
|
docs: Improve the multimodal examples slightly
|
2025-08-13 18:23:18 -07:00 |
|
altoiddealer
|
57f6e9af5a
|
Set multimodal status during Model Loading (#7199)
|
2025-08-13 16:47:27 -03:00 |
|
oobabooga
|
725a8bcf60
|
Small docs change
|
2025-08-13 06:49:28 -07:00 |
|
oobabooga
|
331eab81f7
|
mtmd: Explain base64 inputs in the API docs
|
2025-08-13 06:46:10 -07:00 |
|
oobabooga
|
bd05fb899e
|
Update README
|
2025-08-12 14:19:18 -07:00 |
|
oobabooga
|
41b95e9ec3
|
Lint
|
2025-08-12 13:37:37 -07:00 |
|
oobabooga
|
2f979ce294
|
docs: Add a multimodal tutorial
|
2025-08-12 13:33:49 -07:00 |
|
oobabooga
|
7301452b41
|
UI: Minor info message change
|
2025-08-12 13:23:24 -07:00 |
|
oobabooga
|
8d7b88106a
|
Revert "mtmd: Fail early if images are provided but the model doesn't support them (llama.cpp)"
This reverts commit d8fcc71616.
|
2025-08-12 13:20:16 -07:00 |
|
oobabooga
|
2f6a629393
|
UI: Minor improvement after 0e88a621fd
|
2025-08-12 08:51:01 -07:00 |
|
oobabooga
|
2238302b49
|
ExLlamaV3: Add speculative decoding
|
2025-08-12 08:50:45 -07:00 |
|
oobabooga
|
0882970a94
|
Update llama.cpp
|
2025-08-12 07:00:24 -07:00 |
|
oobabooga
|
d8fcc71616
|
mtmd: Fail early if images are provided but the model doesn't support them (llama.cpp)
|
2025-08-11 18:02:33 -07:00 |
|
oobabooga
|
e6447cd24a
|
mtmd: Update the llama-server request
|
2025-08-11 17:42:35 -07:00 |
|
oobabooga
|
c47e6deda2
|
Update README
|
2025-08-11 16:20:20 -07:00 |
|
oobabooga
|
0e3def449a
|
llama.cpp: --swa-full to llama-server when streaming-llm is checked
|
2025-08-11 15:17:25 -07:00 |
|
oobabooga
|
0e88a621fd
|
UI: Better organize the right sidebar
|
2025-08-11 15:16:03 -07:00 |
|
oobabooga
|
1e3c4e8bdb
|
Update llama.cpp
|
2025-08-11 14:40:59 -07:00 |
|
oobabooga
|
765af1ba17
|
API: Improve a validation
|
2025-08-11 12:39:48 -07:00 |
|
oobabooga
|
a78ca6ffcd
|
Remove a comment
|
2025-08-11 12:33:38 -07:00 |
|
oobabooga
|
dfd9c60d80
|
Merge remote-tracking branch 'refs/remotes/origin/dev' into dev
|
2025-08-11 12:33:27 -07:00 |
|
oobabooga
|
999471256c
|
Lint
|
2025-08-11 12:32:17 -07:00 |
|
Mykeehu
|
1ba1211ca0
|
Fix edit window and buttons in Messenger theme (#7100)
|
2025-08-11 16:13:56 -03:00 |
|
oobabooga
|
b10d525bf7
|
UI: Update a tooltip
|
2025-08-11 12:05:22 -07:00 |
|
oobabooga
|
b62c8845f3
|
mtmd: Fix /chat/completions for llama.cpp
|
2025-08-11 12:01:59 -07:00 |
|
oobabooga
|
38c0b4a1ad
|
Default ctx-size to 8192 when not found in the metadata
|
2025-08-11 07:39:53 -07:00 |
|
oobabooga
|
52d1cbbbe9
|
Fix an import
|
2025-08-11 07:38:39 -07:00 |
|
oobabooga
|
1cb800d392
|
Docs: small change
|
2025-08-11 07:37:10 -07:00 |
|
oobabooga
|
4809ddfeb8
|
Exllamav3: small sampler fixes
|
2025-08-11 07:35:22 -07:00 |
|
oobabooga
|
4d8dbbab64
|
API: Fix sampler_priority usage for ExLlamaV3
|
2025-08-11 07:26:11 -07:00 |
|
oobabooga
|
c5340533c0
|
mtmd: Add another API example
|
2025-08-10 20:39:04 -07:00 |
|
oobabooga
|
9ec310d858
|
UI: Fix the color of italic text
|
2025-08-10 07:54:21 -07:00 |
|
oobabooga
|
cc964ee579
|
mtmd: Increase the size of the UI image preview
|
2025-08-10 07:44:38 -07:00 |
|
oobabooga
|
6fbf162d71
|
Default max_tokens to 512 in the API instead of 16
|
2025-08-10 07:21:55 -07:00 |
|
oobabooga
|
1fb5807859
|
mtmd: Fix API text completion when no images are sent
|
2025-08-10 06:54:44 -07:00 |
|
oobabooga
|
0ea62d88f6
|
mtmd: Fix "continue" when an image is present
|
2025-08-09 21:47:02 -07:00 |
|