UI: Set max_updates_second to 12 by default

When the tokens/second is at ~50 and the model is a thinking model,
the markdown rendering for the streaming message becomes a CPU
bottleneck.
oobabooga 2025-04-30 14:53:15 -07:00
parent a4bf339724
commit b46ca01340


@@ -47,7 +47,7 @@ settings = {
     'max_new_tokens_max': 4096,
     'prompt_lookup_num_tokens': 0,
     'max_tokens_second': 0,
-    'max_updates_second': 0,
+    'max_updates_second': 12,
     'auto_max_new_tokens': True,
     'ban_eos_token': False,
     'add_bos_token': True,
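The idea behind `max_updates_second` can be sketched as a simple rate limiter over the token stream: accumulate tokens as fast as they arrive, but only re-render (yield) the message at most N times per second, with a final flush so no text is lost. This is a hypothetical illustration, not the repository's actual implementation; the function and parameter names here are invented for the example.

```python
import time

def throttle_updates(token_stream, max_updates_second=12):
    """Yield the accumulated text at most max_updates_second times
    per second, plus one final flush with the complete text.

    Hypothetical sketch of update throttling; not the webui's API.
    With max_updates_second=0 (the old default), every token
    triggers a re-render.
    """
    min_interval = 1 / max_updates_second if max_updates_second > 0 else 0.0
    last_update = 0.0
    text = ""
    for token in token_stream:
        text += token
        now = time.monotonic()
        if now - last_update >= min_interval:
            last_update = now
            yield text  # the expensive markdown re-render happens here
    yield text  # always flush the final, complete message
```

At ~50 tokens/second, this caps markdown re-renders at 12/second instead of 50/second, which is what relieves the CPU bottleneck described above.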