4944 commits

Author SHA1 Message Date
oobabooga 73442a2b6d UI: Better handle the chat input position with CSS 2025-08-27 05:43:13 -07:00
    This also solves scrolling issues with the main chat content when the height of the textarea increases.
oobabooga 8042f76399 Make portable installs functional with Python 3.13 2025-08-27 05:37:01 -07:00
oobabooga ccc8a2229d Revert "UI: Preserve chat scroll position on textarea resize" 2025-08-26 13:59:54 -07:00
    This reverts commit 750adf793d.
oobabooga 750adf793d UI: Preserve chat scroll position on textarea resize 2025-08-26 12:19:23 -07:00
oobabooga 02ca96fa44 Multiple fixes 2025-08-25 22:17:22 -07:00
oobabooga 6a7166fffa Add support for the Seed-OSS template 2025-08-25 19:46:48 -07:00
oobabooga 8fcb4b3102 Make bot_prefix extensions functional again 2025-08-25 19:10:46 -07:00
oobabooga 8f660aefe3 Fix chat-instruct replies leaking the bot name sometimes 2025-08-25 18:50:16 -07:00
oobabooga a531328f7e Fix the GPT-OSS stopping string 2025-08-25 18:41:58 -07:00
oobabooga 6c165d2e55 Fix the chat template 2025-08-25 18:28:43 -07:00
oobabooga b657be7381 Obtain stopping strings in chat mode 2025-08-25 18:22:08 -07:00
oobabooga ded6c41cf8 Fix impersonate for chat-instruct 2025-08-25 18:16:17 -07:00
oobabooga c1aa4590ea Code simplifications, fix impersonate 2025-08-25 18:05:40 -07:00
oobabooga b330ec3517 Simplifications 2025-08-25 17:54:15 -07:00
oobabooga 3ad5970374 Make the llama.cpp --verbose output less verbose 2025-08-25 17:43:21 -07:00
oobabooga adeca8a658 Remove changes to the jinja2 templates 2025-08-25 17:36:01 -07:00
oobabooga aad0104c1b Remove a function 2025-08-25 17:33:13 -07:00
oobabooga f919cdf881 chat.py code simplifications 2025-08-25 17:20:51 -07:00
oobabooga d08800c359 chat.py improvements 2025-08-25 17:03:37 -07:00
oobabooga 3bc48014a5 chat.py code simplifications 2025-08-25 16:48:21 -07:00
oobabooga 1f77427088 Update llama.cpp 2025-08-24 19:56:22 -07:00
oobabooga 2478294c06 UI: Preload the instruct and chat fonts 2025-08-24 12:37:41 -07:00
oobabooga 8be798e15f llama.cpp: Fix stderr deadlock while loading some multimodal models 2025-08-24 12:20:05 -07:00
oobabooga 7fe8da8944 Minor simplification after f247c2ae62 2025-08-22 14:42:56 -07:00
oobabooga f247c2ae62 Make --model work with absolute paths, eg --model /tmp/gemma-3-270m-it-IQ4_NL.gguf 2025-08-22 11:47:33 -07:00
oobabooga fd41f2fafc Update llama.cpp 2025-08-22 11:18:56 -07:00
oobabooga 9e7b326e34 Lint 2025-08-19 06:50:40 -07:00
oobabooga 1972479610 Add the TP option to exllamav3_HF 2025-08-19 06:48:22 -07:00
oobabooga e0f5905a97 Code formatting 2025-08-19 06:34:05 -07:00
oobabooga 5b06284a8a UI: Keep ExLlamav3_HF selected if already selected for EXL3 models 2025-08-19 06:23:21 -07:00
oobabooga cbba58bef9 UI: Fix code blocks having an extra empty line 2025-08-18 15:50:09 -07:00
oobabooga 8805a50d24 Update llama.cpp 2025-08-18 15:31:01 -07:00
oobabooga 7d23a55901 Fix model unloading when switching loaders (closes #7203) 2025-08-18 09:05:47 -07:00
oobabooga 08594e5263 Installer: Slight improvement 2025-08-18 05:59:46 -07:00
oobabooga 15f99b1b71 Installer: Fix a requirement file 2025-08-18 05:51:46 -07:00
oobabooga 6b1b2e2373 Update README 2025-08-17 22:19:20 -07:00
oobabooga 8a14aa62ff Update README 2025-08-17 22:06:59 -07:00
oobabooga 8cdb911a6e Update README 2025-08-17 22:06:12 -07:00
oobabooga 6bf31479d9 Update README 2025-08-17 22:00:21 -07:00
oobabooga 320f7339cd Update README 2025-08-17 21:56:35 -07:00
oobabooga 3dec47eaf8 Small one-click installer changes 2025-08-17 21:43:46 -07:00
oobabooga 35707c2dd8 Update README 2025-08-17 21:39:57 -07:00
oobabooga 58797a9eb5 Minor change after 9651b5c873 2025-08-17 14:18:23 -07:00
oobabooga 64eba9576c mtmd: Fix a bug when "include past attachments" is unchecked 2025-08-17 14:08:40 -07:00
oobabooga 3a91ca2dd1 Update flash attention 2025-08-17 13:57:23 -07:00
oobabooga 9651b5c873 Make CUDA 12.8 the default CUDA option, remove the CUDA 12.4 option 2025-08-17 13:26:09 -07:00
    Exllamav3 doesn't compile with torch 2.6 anymore, and torch 2.7 requires newer CUDA.
oobabooga a633793a00 Bump exllamav3 to 0.0.6 2025-08-17 13:19:42 -07:00
oobabooga dbabe67e77 ExLlamaV3: Enable the --enable-tp option, add a --tp-backend option 2025-08-17 13:19:11 -07:00
oobabooga d771ca4a13 Fix web search (attempt) 2025-08-14 12:05:14 -07:00
oobabooga 73a8a737b2 docs: Improve the multimodal examples slightly 2025-08-13 18:23:18 -07:00