Commit graph

68 commits

Author SHA1 Message Date
oobabooga 1b69694fe9 Add types to the encode/decode/token-count endpoints 2023-11-07 19:32:14 -08:00
oobabooga f6ca9cfcdc Add /v1/internal/model-info endpoint 2023-11-07 18:59:02 -08:00
oobabooga 6e2e0317af
Separate context and system message in instruction formats (#4499) 2023-11-07 20:02:58 -03:00
oobabooga 3fc505dc0f Document unused parameters 2023-11-07 08:56:09 -08:00
oobabooga 3d59346871 Implement echo/suffix parameters 2023-11-07 08:43:45 -08:00
oobabooga 48c9c31440 Document the "preset" option in the API 2023-11-07 08:23:17 -08:00
oobabooga 79b3f5a546
Add /v1/internal/stop-generation to OpenAI API (#4498) 2023-11-07 00:10:42 -03:00
oobabooga 97c21e5667 Don't strip leading spaces in OpenAI API 2023-11-06 19:09:41 -08:00
oobabooga 4a45dc4041 Reorder the parameters in the FastAPI documentation 2023-11-06 09:55:36 -08:00
oobabooga 0db81355bc Reorder a parameter 2023-11-06 07:11:49 -08:00
oobabooga b87c6213ae Remove obsolete endpoint 2023-11-06 05:45:45 -08:00
oobabooga 28fd535f9c Make chat API more robust 2023-11-06 05:22:01 -08:00
oobabooga ec17a5d2b7
Make OpenAI API the default API (#4430) 2023-11-06 02:38:29 -03:00
俞航 84d957ba62
[Fix] fix openai embedding_model loading as str (#4147) 2023-11-05 20:42:45 -03:00
kabachuha e18a0460d4
fix openai extension not working because of absent new defaults (#4477) 2023-11-04 16:12:51 -03:00
tdrussell 72f6fc6923
Rename additive_repetition_penalty to presence_penalty, add frequency_penalty (#4376) 2023-10-25 12:10:28 -03:00
tdrussell 4440f87722
Add additive_repetition_penalty sampler setting. (#3627) 2023-10-23 02:28:07 -03:00
hronoas db7ecdd274
openai: fix empty models list on query present in url (#4139) 2023-10-16 17:02:47 -03:00
Jesus Alvarez ed66ca3cdf
Add HTTPS support to APIs (openai and default) (#4270)
---------

Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>
2023-10-13 01:31:13 -03:00
oobabooga 08cf150c0c
Add a grammar editor to the UI (#4061) 2023-09-24 18:05:24 -03:00
oobabooga b227e65d86 Add grammar to llama.cpp loader (closes #4019) 2023-09-24 07:10:45 -07:00
Chenxiao Wang 9de2dfa887
extensions/openai: Fix error when preparing cache for embedding models (#3995) 2023-09-24 00:58:28 -03:00
Chenxiao Wang 347aed4254
extensions/openai: load extension settings via settings.yaml (#3953) 2023-09-17 22:39:29 -03:00
Shulzhenko Anatolii 0668f4e67f
Add speechrecognition dependency for OpenAI extension (#3959) 2023-09-16 13:49:48 -03:00
wizd cc7f345c29
add whisper api to openai plugin (#3958) 2023-09-16 12:04:04 -03:00
oobabooga f5fb1ee666 Change a comment 2023-09-15 20:16:30 -07:00
oobabooga 2c1b548cea Minor fix 2023-09-15 20:14:32 -07:00
oobabooga 8f97e87cac Lint the openai extension 2023-09-15 20:11:16 -07:00
saltacc f01b9aa71f
Add customizable ban tokens (#3899) 2023-09-15 18:27:27 -03:00
Chang Chi, Meng b61d9aef19
openai API: add support for chunked transfer encoding in POST requests (#3870) 2023-09-12 15:54:42 -03:00
oobabooga df123a20fc Prevent extra keys from being saved to settings.yaml 2023-09-11 20:13:10 -07:00
oobabooga 9331ab4798
Read GGUF metadata (#3873) 2023-09-11 18:49:30 -03:00
oobabooga cec8db52e5
Add max_tokens_second param (#3533) 2023-08-29 17:44:31 -03:00
oobabooga c237ce607e Move characters/instruction-following to instruction-templates 2023-08-06 17:50:32 -07:00
oobabooga 0af10ab49b
Add Classifier Free Guidance (CFG) for Transformers/ExLlama (#3325) 2023-08-06 17:22:48 -03:00
matatonic 8f98268252
extensions/openai: include content-length for json replies (#3416) 2023-08-03 16:10:49 -03:00
oobabooga e931844fe2
Add auto_max_new_tokens parameter (#3419) 2023-08-02 14:52:20 -03:00
matatonic 9ae0eab989
extensions/openai: +Array input (batched) , +Fixes (#3309) 2023-08-01 22:26:00 -03:00
matatonic 90a4ab631c
extensions/openai: Fixes for: embeddings, tokens, better errors. +Docs update, +Images, +logit_bias/logprobs, +more. (#3122) 2023-07-24 11:28:12 -03:00
oobabooga e202190c4f lint 2023-07-12 11:33:25 -07:00
matatonic 3e7feb699c
extensions/openai: Major openai extension updates & fixes (#3049)
* many openai updates

* total reorg & cleanup.

* fixups

* missing import os for images

* +moderations, custom_stopping_strings, more fixes

* fix bugs in completion streaming

* moderation fix (flagged)

* updated moderation categories

---------

Co-authored-by: Matthew Ashton <mashton-gitlab@zhero.org>
2023-07-11 18:50:08 -03:00
oobabooga 3443219cbc
Add repetition penalty range parameter to transformers (#2916) 2023-06-29 13:40:13 -03:00
oobabooga c52290de50
ExLlama with long context (#2875) 2023-06-25 22:49:26 -03:00
matatonic b45baeea41
extensions/openai: Major docs update, fix #2852 (critical bug), minor improvements (#2849) 2023-06-24 22:50:04 -03:00
matatonic 1e97aaac95
extensions/openai: docs update, model loader, minor fixes (#2557) 2023-06-17 19:15:24 -03:00
matatonic 4a17a5db67
[extensions/openai] various fixes (#2533) 2023-06-06 01:43:04 -03:00
matatonic a6d3f010a5
extensions/openai: include all available models in Model.list (#2368)
Co-authored-by: Matthew Ashton <mashton-gitlab@zhero.org>
2023-05-30 22:13:37 -03:00
matatonic e5b756ecfe
Fixes #2331, IndexError: string index out of range (#2383) 2023-05-30 22:07:40 -03:00
matatonic ebcadc0042
extensions/openai: cross_origin + chunked_response (updated fix) (#2423) 2023-05-30 21:54:24 -03:00
matatonic df50f077db
fixup missing tfs top_a params, defaults reorg (#2443) 2023-05-30 21:52:33 -03:00