Commit graph

1961 commits

Author SHA1 Message Date
oobabooga 74eedf6050 Remove the CFG slider 2025-11-27 16:28:40 -08:00
oobabooga 9e33c6bfb7 Add missing files 2025-11-27 15:56:58 -08:00
oobabooga 666816a773 Small fixes 2025-11-27 15:48:53 -08:00
oobabooga 21f992e7f7 Organize the UI 2025-11-27 15:42:11 -08:00
oobabooga 148a5d1e44 Keep things more modular 2025-11-27 15:32:01 -08:00
oobabooga 0adda7a5c5 Lint 2025-11-27 14:39:21 -08:00
oobabooga aa074409cb Better events for the dimensions 2025-11-27 14:38:50 -08:00
oobabooga be799ba8eb Lint 2025-11-27 14:25:49 -08:00
oobabooga a873692234 Image generation now functional 2025-11-27 14:24:35 -08:00
oobabooga 2f11b3040d Add functions 2025-11-27 13:53:46 -08:00
oobabooga aa63c612de Progress on model loading 2025-11-27 13:46:54 -08:00
oobabooga 164c6fcdbf Add the UI structure 2025-11-27 13:44:07 -08:00
oobabooga 4ad2ad468e Add basic structure 2025-11-27 10:10:11 -08:00
GodEmperor785 400bb0694b Add slider for --ubatch-size for llama.cpp loader, change defaults for better MoE performance (#7316) 2025-11-21 16:56:02 -03:00
oobabooga 8f0048663d More modular HTML generator 2025-11-21 07:09:16 -08:00
oobabooga 0d4eff284c Add a --cpu-moe flag for llama.cpp 2025-11-19 05:23:43 -08:00
Trenten Miller 6871484398 fix: Rename 'evaluation_strategy' to 'eval_strategy' in training 2025-10-28 16:48:04 -03:00
oobabooga a156ebbf76 Lint 2025-10-15 13:15:01 -07:00
oobabooga c871d9cdbd Revert "Same as 7f06aec3a1 but for exllamav3_hf" (reverts commit deb37b821b) 2025-10-15 13:05:41 -07:00
oobabooga b5a6904c4a Make --trust-remote-code immutable from the UI/API 2025-10-14 20:47:01 -07:00
mamei16 308e726e11 log error when llama-server request exceeds context size (#7263) 2025-10-12 23:00:11 -03:00
oobabooga 655c3e86e3 Fix "continue" missing an initial space in chat-instruct/chat modes 2025-10-11 17:00:25 -07:00
oobabooga c7dd920dc8 Fix metadata leaking into branched chats 2025-10-11 14:12:05 -07:00
oobabooga 78ff21d512 Organize the --help message 2025-10-10 15:21:08 -07:00
oobabooga 0d03813e98 Update modules/chat.py (Co-authored-by: Copilot) 2025-10-09 21:01:13 -03:00
oobabooga deb37b821b Same as 7f06aec3a1 but for exllamav3_hf 2025-10-09 13:02:38 -07:00
oobabooga 7f06aec3a1 exllamav3: Implement the logits function for /v1/internal/logits 2025-10-09 11:24:25 -07:00
oobabooga 218dc01b51 Add fallbacks after 93aa7b3ed3 2025-10-09 10:59:34 -07:00
oobabooga 282aa19189 Safer profile picture uploading 2025-10-09 09:26:35 -07:00
oobabooga 93aa7b3ed3 Better handle multigpu setups with transformers + bitsandbytes 2025-10-09 08:49:44 -07:00
Remowylliams 38a7fd685d chat.py fixes Instruct mode History 2025-10-05 11:34:47 -03:00
oobabooga 1e863a7113 Fix exllamav3 ignoring the stop button 2025-09-19 16:12:50 -07:00
stevenxdavis dd6d2223a5 Changing transformers_loader.py to Match User Expectations for --bf16 and Flash Attention 2 (#7217) 2025-09-17 16:39:04 -03:00
oobabooga 9e9ab39892 Make exllamav3_hf and exllamav2_hf functional again 2025-09-17 12:29:22 -07:00
oobabooga f3829b268a llama.cpp: Always pass --flash-attn on 2025-09-02 12:12:17 -07:00
oobabooga c6ea67bbdb Lint 2025-09-02 10:22:03 -07:00
oobabooga 00ed878b05 Slightly more robust model loading 2025-09-02 10:16:26 -07:00
oobabooga 387e249dec Change an info message 2025-08-31 16:27:10 -07:00
oobabooga 8028d88541 Lint 2025-08-30 21:29:20 -07:00
oobabooga 13876a1ee8 llama.cpp: Remove the --flash-attn flag (it's always on now) 2025-08-30 20:28:26 -07:00
oobabooga 3a3e247f3c Even better way to handle continue for thinking blocks 2025-08-30 12:36:35 -07:00
oobabooga cf1aad2a68 Fix "continue" for GPT-OSS for partial thinking blocks 2025-08-30 12:16:45 -07:00
oobabooga 96136ea760 Fix LaTeX rendering for equations with asterisks 2025-08-30 10:13:32 -07:00
oobabooga a3eb67e466 Fix the UI failing to launch if the Notebook prompt is too long 2025-08-30 08:42:26 -07:00
oobabooga a2b37adb26 UI: Preload the correct fonts for chat mode 2025-08-29 09:25:44 -07:00
oobabooga cb8780a4ce Safer check for is_multimodal when loading models (avoids an unrelated multimodal error when a model fails to load due to lack of memory) 2025-08-28 11:13:19 -07:00
oobabooga cfc83745ec UI: Improve right sidebar borders in light mode 2025-08-28 08:34:48 -07:00
oobabooga ba6041251d UI: Minor change 2025-08-28 06:20:00 -07:00
oobabooga a92758a144 llama.cpp: Fix obtaining the maximum sequence length for GPT-OSS 2025-08-27 16:15:40 -07:00
oobabooga 030ba7bfeb UI: Mention that Seed-OSS uses enable_thinking 2025-08-27 07:44:35 -07:00