461 commits

Author SHA1 Message Date
Tom Jobbins 876fbb97c0
Allow downloading model from HF branch via UI (#1662)
---------

Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>
2023-05-05 13:59:01 -03:00
oobabooga 95d04d6a8d Better warning messages 2023-05-03 21:43:17 -03:00
Tom Jobbins 3c67fc0362 Allow groupsize 1024, needed for larger models eg 30B to lower VRAM usage (#1660) 2023-05-02 00:46:26 -03:00
oobabooga a777c058af Precise prompts for instruct mode 2023-04-26 03:21:53 -03:00
oobabooga f39c99fa14 Load more than one LoRA with --lora, fix a bug 2023-04-25 22:58:48 -03:00
oobabooga b6af2e56a2 Add --character flag, add character to settings.json 2023-04-24 13:19:42 -03:00
oobabooga caaa556159 Move extensions block definition to the bottom 2023-04-24 03:30:35 -03:00
oobabooga b1ee674d75 Make interface state (mostly) persistent on page reload 2023-04-24 03:05:47 -03:00
oobabooga 47809e28aa Minor changes 2023-04-24 01:04:48 -03:00
Andy Salerno 654933c634
New universal API with streaming/blocking endpoints (#990)
Previous title: Add api_streaming extension and update api-example-stream to use it

* Merge with latest main

* Add parameter capturing encoder_repetition_penalty

* Change some defaults, minor fixes

* Add --api, --public-api flags

* remove unneeded/broken comment from blocking API startup. The comment is already correctly emitted in try_start_cloudflared by calling the lambda we pass in.

* Update on_start message for blocking_api, it should say 'non-streaming' and not 'streaming'

* Update the API examples

* Change a comment

* Update README

* Remove the gradio API

* Remove unused import

* Minor change

* Remove unused import

---------

Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>
2023-04-23 15:52:43 -03:00
oobabooga 2dca8bb25e Sort imports 2023-04-21 17:20:59 -03:00
oobabooga c238ba9532 Add a 'Count tokens' button 2023-04-21 17:18:34 -03:00
oobabooga 2d766d2e19 Improve notebook mode button sizes 2023-04-21 02:37:58 -03:00
oobabooga b4af319fa2 Add a workaround for GALACTICA on some systems 2023-04-19 01:43:10 -03:00
oobabooga 61126f4674 Change the button styles 2023-04-19 00:56:24 -03:00
oobabooga 649e4017a5 Style improvements 2023-04-19 00:36:28 -03:00
oobabooga c58c1d89bd Clean method to prevent gradio from phoning home 2023-04-18 03:56:20 -03:00
oobabooga e1b80e6fe6 Comment the gradio patch 2023-04-18 01:57:59 -03:00
oobabooga 36f7c022f2 Rename a file 2023-04-18 01:38:33 -03:00
oobabooga 00186f76f4 Monkey patch gradio to prevent it from calling home 2023-04-18 01:13:16 -03:00
oobabooga c3dc348d1c Don't show 'None' in the LoRA list 2023-04-17 13:52:23 -03:00
oobabooga 209fcd21d5 Reorganize Parameters tab 2023-04-17 00:33:22 -03:00
oobabooga b937c9d8c2 Add skip_special_tokens checkbox for Dolly model (#1218) 2023-04-16 14:24:49 -03:00
oobabooga a9c7ef4159 Exclude yaml files from model list 2023-04-16 12:47:30 -03:00
Mikel Bober-Irizar 16a3a5b039
Merge pull request from GHSA-hv5m-3rp9-xcpf
* Remove eval of API input

* Remove unnecessary eval/exec for security

* Use ast.literal_eval

* Use ast.literal_eval

---------

Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>
2023-04-16 01:36:50 -03:00
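
The security fix above (16a3a5b039) replaces eval of API input with ast.literal_eval. The following is a minimal sketch of that technique, not the repository's actual code: the helper name parse_api_value and its error message are hypothetical. ast.literal_eval accepts only Python literals (strings, numbers, tuples, lists, dicts, sets, booleans, None), so an input containing a function call is rejected instead of executed.

```python
import ast


def parse_api_value(raw: str):
    """Parse a string holding a Python literal without executing any code.

    Hypothetical helper for illustration only; not taken from the repository.
    """
    try:
        # literal_eval only builds literal values; it never calls functions.
        return ast.literal_eval(raw)
    except (ValueError, SyntaxError):
        # Raised for anything that is not a plain literal or is not valid syntax.
        raise ValueError(f"unsupported API input: {raw!r}") from None


if __name__ == "__main__":
    print(parse_api_value("[1, 2, 3]"))
    print(parse_api_value("{'temperature': 0.7}"))
    try:
        parse_api_value("__import__('os').system('echo unsafe')")
    except ValueError as exc:
        print("rejected:", exc)
```

With eval, the last input would have run a shell command; here it only raises ValueError.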
oobabooga ac189011cb Add "Save current settings for this model" button 2023-04-15 12:54:02 -03:00
oobabooga b9dcba7762 Don't overwrite --gpu_memory on boot (#1237/#1235) 2023-04-15 11:59:31 -03:00
oobabooga 628f8e6168 Reorganize chat buttons 2023-04-14 23:17:15 -03:00
oobabooga c4aa1a42b1 Fix chat history downloading 2023-04-14 19:38:30 -03:00
oobabooga 3a337cfded Use argparse defaults 2023-04-14 15:35:06 -03:00
oobabooga 43e01282b3 Don't override user initial wbits/groupsize 2023-04-14 15:24:03 -03:00
Alex "mcmonkey" Goodwin 64e3b44e0f
initial multi-lora support (#1103)
---------

Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>
2023-04-14 14:52:06 -03:00
oobabooga ebb81eb176 Add Undo/Regenerate buttons to notebook mode 2023-04-14 14:34:56 -03:00
oobabooga 581f739b2f Reorganize 2023-04-14 11:38:26 -03:00
oobabooga 8e31f2bad4 Automatically set wbits/groupsize/instruct based on model name (#1167) 2023-04-14 11:07:28 -03:00
v0xie 9d66957207 Add --listen-host launch option (#1122) 2023-04-13 21:35:08 -03:00
oobabooga c13e8651ad Suppress "TypedStorage is deprecated" warnings 2023-04-13 12:09:42 -03:00
oobabooga 17ce7c8671 Suppress annoying bitsandbytes welcome message 2023-04-13 12:04:39 -03:00
oobabooga fbb448ce4f If only 1 model is available, load that model 2023-04-13 11:44:10 -03:00
oobabooga 5744b31593 Reorganize some buttons 2023-04-13 11:05:47 -03:00
Xan 6e19ae4b2f Fix gpt-j model type in UI (#1129) 2023-04-13 10:17:20 -03:00
oobabooga ddbd237ec9 Better way to sort the models/loras 2023-04-12 22:56:32 -03:00
oobabooga 7dfbe54f42 Add --model-menu option 2023-04-12 21:24:26 -03:00
oobabooga 86c10c6f0c Add some labels 2023-04-12 18:39:21 -03:00
oobabooga 0baa50bcc4 Update a comment 2023-04-12 18:26:15 -03:00
oobabooga 5d1d0bd11f Add the GPU index to the label 2023-04-12 18:24:19 -03:00
oobabooga 13789fd200 Handle the no-GPU / multi-GPU cases 2023-04-12 18:21:14 -03:00
oobabooga 1566d8e344 Add model settings to the Models tab 2023-04-12 17:20:18 -03:00
oobabooga 80f4eabb2a Fix send_pictures extension 2023-04-12 10:27:06 -03:00
oobabooga 2289d3686f Update API example 2023-04-11 22:43:43 -03:00