Full documentation update to match current codebase

This commit is contained in:
oobabooga 2026-03-05 12:46:21 -03:00
parent 1c2548fd89
commit 1ffe540c97
10 changed files with 388 additions and 326 deletions

View file

@ -1,20 +1,19 @@
## What Works
| Loader | Loading 1 LoRA | Loading 2 or more LoRAs | Training LoRAs | Multimodal extension | Perplexity evaluation |
|----------------|----------------|-------------------------|----------------|----------------------|-----------------------|
| Transformers | ✅ | ✅\*\* | ✅\* | ✅ | ✅ |
| llama.cpp | ❌ | ❌ | ❌ | ❌ | use llamacpp_HF |
| llamacpp_HF | ❌ | ❌ | ❌ | ❌ | ✅ |
| ExLlamav2_HF | ✅ | ✅ | ❌ | ❌ | ✅ |
| ExLlamav2 | ✅ | ✅ | ❌ | ❌ | use ExLlamav2_HF |
| AutoGPTQ | ✅ | ❌ | ❌ | ✅ | ✅ |
| AutoAWQ | ? | ❌ | ? | ? | ✅ |
| HQQ | ? | ? | ? | ? | ✅ |
| Loader | Loading LoRAs | Training LoRAs | Multimodal | Perplexity evaluation |
|----------------|---------------|----------------|------------|-----------------------|
| llama.cpp | ❌ | ❌ | ✅\* | ❌ |
| Transformers | ✅ | ✅ | ✅\*\* | ✅ |
| ExLlamav3_HF | ❌ | ❌ | ❌ | ✅ |
| ExLlamav3 | ❌ | ❌ | ✅ | ❌ |
| ExLlamav2_HF | ✅ | ❌ | ❌ | ✅ |
| ExLlamav2 | ✅ | ❌ | ❌ | ❌ |
| TensorRT-LLM | ❌ | ❌ | ❌ | ❌ |
❌ = not implemented
❌ = not supported
✅ = implemented
✅ = supported
\* Training LoRAs with GPTQ models also works with the Transformers loader. Make sure to check "auto-devices" and "disable_exllama" before loading the model.
\* Via the `mmproj` parameter (multimodal projector file).
\*\* Multi-LoRA in PEFT is tricky and the current implementation does not work reliably in all cases.
\*\* Via the `send_pictures` extension.