Commit graph

434 commits

Author SHA1 Message Date
oobabooga dcfd09b61e
Update README.md 2023-05-11 09:49:57 -03:00
oobabooga 7a49ceab29
Update README.md 2023-05-11 09:42:39 -03:00
oobabooga 57dc44a995
Update README.md 2023-05-10 12:48:25 -03:00
oobabooga 181b102521
Update README.md 2023-05-10 12:09:47 -03:00
Carl Kenner 814f754451
Support for MPT, INCITE, WizardLM, StableLM, Galactica, Vicuna, Guanaco, and Baize instruction following (#1596) 2023-05-09 20:37:31 -03:00
Wojtab e9e75a9ec7
Generalize multimodality (llava/minigpt4 7b and 13b now supported) (#1741) 2023-05-09 20:18:02 -03:00
oobabooga 00e333d790 Add MOSS support 2023-05-04 23:20:34 -03:00
oobabooga b6ff138084 Add --checkpoint argument for GPTQ 2023-05-04 15:17:20 -03:00
Ahmed Said fbcd32988e
added no_mmap & mlock parameters to llama.cpp and removed llamacpp_model_alternative (#1649)
---------

Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>
2023-05-02 18:25:28 -03:00
oobabooga f39c99fa14 Load more than one LoRA with --lora, fix a bug 2023-04-25 22:58:48 -03:00
oobabooga b6af2e56a2 Add --character flag, add character to settings.json 2023-04-24 13:19:42 -03:00
eiery 78d1977ebf
add n_batch support for llama.cpp (#1115) 2023-04-24 03:46:18 -03:00
Andy Salerno 654933c634
New universal API with streaming/blocking endpoints (#990)
Previous title: Add api_streaming extension and update api-example-stream to use it

* Merge with latest main

* Add parameter capturing encoder_repetition_penalty

* Change some defaults, minor fixes

* Add --api, --public-api flags

* remove unneeded/broken comment from blocking API startup. The comment is already correctly emitted in try_start_cloudflared by calling the lambda we pass in.

* Update on_start message for blocking_api, it should say 'non-streaming' and not 'streaming'

* Update the API examples

* Change a comment

* Update README

* Remove the gradio API

* Remove unused import

* Minor change

* Remove unused import

---------

Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>
2023-04-23 15:52:43 -03:00
oobabooga 7438f4f6ba Change GPTQ triton default settings 2023-04-22 12:27:30 -03:00
oobabooga fe02281477
Update README.md 2023-04-22 03:05:00 -03:00
oobabooga 038fa3eb39
Update README.md 2023-04-22 02:46:07 -03:00
oobabooga 505c2c73e8
Update README.md 2023-04-22 00:11:27 -03:00
oobabooga f8da9a0424
Update README.md 2023-04-18 20:25:08 -03:00
oobabooga c3f6e65554
Update README.md 2023-04-18 20:23:31 -03:00
oobabooga eb15193327
Update README.md 2023-04-18 13:07:08 -03:00
oobabooga 7fbfc489e2
Update README.md 2023-04-18 12:56:37 -03:00
oobabooga f559f9595b
Update README.md 2023-04-18 12:54:09 -03:00
loeken 89e22d4d6a
added windows/docker docs (#1027) 2023-04-18 12:47:43 -03:00
oobabooga 8275989f03
Add new 1-click installers for Linux and MacOS 2023-04-18 02:40:36 -03:00
oobabooga 301c687c64
Update README.md 2023-04-17 11:25:26 -03:00
oobabooga 89bc540557 Update README 2023-04-17 10:55:35 -03:00
practicaldreamer 3961f49524
Add note about --no-fused_mlp ignoring --gpu-memory (#1301) 2023-04-17 10:46:37 -03:00
sgsdxzy b57ffc2ec9
Update to support GPTQ triton commit c90adef (#1229) 2023-04-17 01:11:18 -03:00
oobabooga 3e5cdd005f
Update README.md 2023-04-16 23:28:59 -03:00
oobabooga 39099663a0
Add 4-bit LoRA support (#1200) 2023-04-16 23:26:52 -03:00
oobabooga 705121161b
Update README.md 2023-04-16 20:03:03 -03:00
oobabooga 50c55a51fc
Update README.md 2023-04-16 19:22:31 -03:00
Forkoz c6fe1ced01
Add ChatGLM support (#1256)
---------

Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>
2023-04-16 19:15:03 -03:00
oobabooga c96529a1b3
Update README.md 2023-04-16 17:00:03 -03:00
oobabooga 004f275efe
Update README.md 2023-04-14 23:36:56 -03:00
oobabooga 83964ed354
Update README.md 2023-04-14 23:33:54 -03:00
oobabooga c41037db68
Update README.md 2023-04-14 23:32:39 -03:00
v0xie 9d66957207
Add --listen-host launch option (#1122) 2023-04-13 21:35:08 -03:00
oobabooga 403be8a27f
Update README.md 2023-04-13 21:23:35 -03:00
Light 97e67d136b Update README.md 2023-04-13 21:00:58 +08:00
Light 15d5a043f2 Merge remote-tracking branch 'origin/main' into triton 2023-04-13 19:38:51 +08:00
oobabooga 7dfbe54f42 Add --model-menu option 2023-04-12 21:24:26 -03:00
MarlinMr 47daf891fe
Link to developer.nvidia.com (#1104) 2023-04-12 15:56:42 -03:00
Light f3591ccfa1 Keep minimal change. 2023-04-12 23:26:06 +08:00
oobabooga 461ca7faf5
Mention that pull request reviews are welcome 2023-04-11 23:12:48 -03:00
oobabooga 749c08a4ff
Update README.md 2023-04-11 14:42:10 -03:00
IggoOnCode 09d8119e3c
Add CPU LoRA training (#938)
(It's very slow)
2023-04-10 17:29:00 -03:00
oobabooga f035b01823
Update README.md 2023-04-10 16:20:23 -03:00
Jeff Lefebvre b7ca89ba3f
Mention that build-essential is required (#1013) 2023-04-10 16:19:10 -03:00
MarkovInequality 992663fa20
Added xformers support to Llama (#950) 2023-04-09 23:08:40 -03:00
oobabooga bce1b7fbb2
Update README.md 2023-04-09 02:19:40 -03:00
oobabooga f7860ce192
Update README.md 2023-04-09 02:19:17 -03:00
oobabooga ece8ed2c84
Update README.md 2023-04-09 02:18:42 -03:00
MarlinMr ec979cd9c4
Use updated docker compose (#877) 2023-04-07 10:48:47 -03:00
MarlinMr 2c0018d946
Cosmetic change of README.md (#878) 2023-04-07 10:47:10 -03:00
oobabooga 848c4edfd5
Update README.md 2023-04-06 22:52:35 -03:00
oobabooga e047cd1def Update README 2023-04-06 22:50:58 -03:00
loeken 08b9d1b23a
creating a layer with Docker/docker-compose (#633) 2023-04-06 22:46:04 -03:00
oobabooga d9e7aba714
Update README.md 2023-04-06 13:42:24 -03:00
oobabooga eec3665845
Add instructions for updating requirements 2023-04-06 13:24:01 -03:00
oobabooga 4a28f39823
Update README.md 2023-04-06 02:47:27 -03:00
eiery 19b516b11b
fix link to streaming api example (#803) 2023-04-05 14:50:23 -03:00
oobabooga 7617ed5bfd
Add AMD instructions 2023-04-05 14:42:58 -03:00
oobabooga 770ef5744f Update README 2023-04-05 14:38:11 -03:00
oobabooga 65d8a24a6d Show profile pictures in the Character tab 2023-04-04 22:28:49 -03:00
oobabooga b24147c7ca Document --pre_layer 2023-04-03 17:34:25 -03:00
oobabooga 525f729b8e
Update README.md 2023-04-02 21:12:41 -03:00
oobabooga 53084241b4
Update README.md 2023-04-02 20:50:06 -03:00
oobabooga b6f817be45
Update README.md 2023-04-01 14:54:10 -03:00
oobabooga 88fa38ac01
Update README.md 2023-04-01 14:49:03 -03:00
oobabooga 4b57bd0d99
Update README.md 2023-04-01 14:38:04 -03:00
oobabooga b53bec5a1f
Update README.md 2023-04-01 14:37:35 -03:00
oobabooga 9160586c04
Update README.md 2023-04-01 14:31:10 -03:00
oobabooga 7ec11ae000
Update README.md 2023-04-01 14:15:19 -03:00
oobabooga 012f4f83b8
Update README.md 2023-04-01 13:55:15 -03:00
oobabooga 2c52310642 Add --threads flag for llama.cpp 2023-03-31 21:18:05 -03:00
oobabooga cbfe0b944a
Update README.md 2023-03-31 17:49:11 -03:00
oobabooga 5c4e44b452
llama.cpp documentation 2023-03-31 15:20:39 -03:00
oobabooga d4a9b5ea97 Remove redundant preset (see the plot in #587) 2023-03-30 17:34:44 -03:00
oobabooga 41b58bc47e
Update README.md 2023-03-29 11:02:29 -03:00
oobabooga 3b4447a4fe
Update README.md 2023-03-29 02:24:11 -03:00
oobabooga 5d0b83c341
Update README.md 2023-03-29 02:22:19 -03:00
oobabooga c2a863f87d
Mention the updated one-click installer 2023-03-29 02:11:51 -03:00
oobabooga 010b259dde Update documentation 2023-03-28 17:46:00 -03:00
oobabooga 036163a751 Change description 2023-03-27 23:39:26 -03:00
oobabooga 30585b3e71 Update README 2023-03-27 23:35:01 -03:00
oobabooga 49c10c5570
Add support for the latest GPTQ models with group-size (#530)
**Warning: old 4-bit weights will not work anymore!**

See here how to get up to date weights: https://github.com/oobabooga/text-generation-webui/wiki/LLaMA-model#step-2-get-the-pre-converted-weights
2023-03-26 00:11:33 -03:00
oobabooga 70f9565f37
Update README.md 2023-03-25 02:35:30 -03:00
oobabooga 04417b658b
Update README.md 2023-03-24 01:40:43 -03:00
oobabooga 143b5b5edf
Mention one-click-bandaid in the README 2023-03-23 23:28:50 -03:00
oobabooga 6872ffd976
Update README.md 2023-03-20 16:53:14 -03:00
oobabooga dd4374edde Update README 2023-03-19 20:15:15 -03:00
oobabooga 9378754cc7 Update README 2023-03-19 20:14:50 -03:00
oobabooga 7ddf6147ac
Update README.md 2023-03-19 19:25:52 -03:00
oobabooga ddb62470e9 --no-cache and --gpu-memory in MiB for fine VRAM control 2023-03-19 19:21:41 -03:00
oobabooga 0cbe2dd7e9
Update README.md 2023-03-18 12:24:54 -03:00
oobabooga d2a7fac8ea
Use pip instead of conda for pytorch 2023-03-18 11:56:04 -03:00
oobabooga a0b1a30fd5
Specify torchvision/torchaudio versions 2023-03-18 11:23:56 -03:00
oobabooga a163807f86
Update README.md 2023-03-18 03:07:27 -03:00
oobabooga a7acfa4893
Update README.md 2023-03-17 22:57:46 -03:00
oobabooga dc35861184
Update README.md 2023-03-17 21:05:17 -03:00
oobabooga f2a5ca7d49
Update README.md 2023-03-17 20:50:27 -03:00
oobabooga 8c8286b0e6
Update README.md 2023-03-17 20:49:40 -03:00
oobabooga 0c05e65e5c
Update README.md 2023-03-17 20:25:42 -03:00
oobabooga 66e8d12354
Update README.md 2023-03-17 19:59:37 -03:00
oobabooga 9a871117d7
Update README.md 2023-03-17 19:52:22 -03:00
oobabooga d4f38b6a1f
Update README.md 2023-03-17 18:57:48 -03:00
oobabooga ad7c829953
Update README.md 2023-03-17 18:55:01 -03:00
oobabooga 4426f941e0
Update the installation instructions. Tldr use WSL 2023-03-17 18:51:07 -03:00
oobabooga ebef4a510b Update README 2023-03-17 11:58:45 -03:00
oobabooga cdfa787bcb Update README 2023-03-17 11:53:28 -03:00
oobabooga dd1c5963da Update README 2023-03-16 12:45:27 -03:00
oobabooga 445ebf0ba8
Update README.md 2023-03-15 20:06:46 -03:00
oobabooga 09045e4bdb
Add WSL guide 2023-03-15 19:42:06 -03:00
oobabooga 128d18e298
Update README.md 2023-03-14 17:57:25 -03:00
oobabooga 1236c7f971
Update README.md 2023-03-14 17:56:15 -03:00
oobabooga b419dffba3
Update README.md 2023-03-14 17:55:35 -03:00
oobabooga 87192e2813 Update README 2023-03-14 08:02:21 -03:00
oobabooga 3da73e409f Merge branch 'main' into Zerogoki00-opt4-bit 2023-03-14 07:50:36 -03:00
Ayanami Rei b746250b2f Update README 2023-03-13 20:20:45 +03:00
oobabooga 66b6971b61 Update README 2023-03-13 12:44:18 -03:00
oobabooga ddea518e0f Document --auto-launch 2023-03-13 12:43:33 -03:00
oobabooga d97bfb8713
Update README.md 2023-03-13 12:39:33 -03:00
oobabooga bdff37f0bb
Update README.md 2023-03-13 11:05:51 -03:00
oobabooga d168b6e1f7
Update README.md 2023-03-12 17:54:07 -03:00
oobabooga 54e8f0c31f
Update README.md 2023-03-12 16:58:00 -03:00
oobabooga 3375eaece0 Update README 2023-03-12 15:01:32 -03:00
oobabooga 17210ff88f
Update README.md 2023-03-12 14:31:24 -03:00
oobabooga 4dc1d8c091
Update README.md 2023-03-12 12:46:53 -03:00
oobabooga 89e9493509 Update README 2023-03-12 11:23:20 -03:00
draff 28fd4fc970 Change wording to be consistent with other args 2023-03-10 23:34:13 +00:00
draff 804486214b Re-implement --load-in-4bit and update --llama-bits arg description 2023-03-10 23:21:01 +00:00
draff e6c631aea4 Replace --load-in-4bit with --llama-bits
Replaces --load-in-4bit with a more flexible --llama-bits arg to allow for 2 and 3 bit models as well. This commit also fixes a loading issue with .pt files which are not in the root of the models folder
2023-03-10 21:36:45 +00:00
oobabooga 7c3d1b43c1
Merge pull request #204 from MichealC0/patch-1
Update README.md
2023-03-09 23:04:09 -03:00
oobabooga 1a3d25f75d
Merge pull request #206 from oobabooga/llama-4bit
Add LLaMA 4-bit support
2023-03-09 21:07:32 -03:00
oobabooga eb0cb9b6df Update README 2023-03-09 20:53:52 -03:00
oobabooga d41e3c233b
Update README.md 2023-03-09 18:02:44 -03:00
oobabooga 33414478bf Update README 2023-03-09 11:13:03 -03:00
oobabooga e7adf5fe4e Add Contrastive Search preset #197 2023-03-09 10:27:11 -03:00
Chimdumebi Nebolisa 4dd14dcab4
Update README.md 2023-03-09 10:22:09 +01:00
oobabooga b4bfd87319
Update README.md 2023-03-06 20:55:01 -03:00
oobabooga d0e8780555
Update README.md 2023-03-06 20:17:59 -03:00
oobabooga 18ccfcd7fe
Update README.md 2023-03-06 20:15:55 -03:00
oobabooga 91823e1ed1
Update README.md 2023-03-06 16:48:31 -03:00
oobabooga aa7ce0665e Merge branch 'main' of github.com:oobabooga/text-generation-webui 2023-03-06 10:58:41 -03:00
oobabooga bf56b6c1fb Load settings.json without the need for --settings settings.json
This is for setting UI defaults
2023-03-06 10:57:45 -03:00
oobabooga 2de9f122cd
Update README.md 2023-03-06 09:34:49 -03:00
oobabooga 736f61610b Update README 2023-03-04 01:33:52 -03:00
oobabooga 76378c6cc2 Update README 2023-03-02 11:27:15 -03:00
oobabooga f4b130e2bd
Update README.md 2023-02-27 15:15:45 -03:00