llama.cpp: Disable jinja by default (we use Python jinja, not cpp jinja)

This was causing template compilation issues with Qwen models.
oobabooga 2026-04-02 21:56:06 -07:00
parent 42dfcdfc5b
commit a1cb5b5dc0


@@ -418,6 +418,7 @@ class LlamaServer:
             "--ubatch-size", str(shared.args.ubatch_size),
             "--port", str(self.port),
             "--no-webui",
+            "--no-jinja",
             "--flash-attn", "on",
         ]
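For context, a minimal sketch of the resulting command line. The `SimpleNamespace` stand-in for `shared.args` and the hard-coded port are assumptions for illustration; only the flags themselves come from the diff above. With `--no-jinja`, llama-server skips its built-in (C++) jinja chat-template engine, leaving template rendering to the Python side.

```python
from types import SimpleNamespace

# Hypothetical stand-ins for shared.args and self.port
args = SimpleNamespace(ubatch_size=512)
port = 8080

# Server command after this commit: --no-jinja disables the C++ jinja
# engine so chat templates can be rendered in Python instead.
cmd = [
    "llama-server",
    "--ubatch-size", str(args.ubatch_size),
    "--port", str(port),
    "--no-webui",
    "--no-jinja",
    "--flash-attn", "on",
]
```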