Commit graph

175 commits

Author SHA1 Message Date
kalomaze b6077b02e4
Quadratic sampling (#5403)
---------

Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>
2024-02-04 00:20:02 -03:00
Forkoz 528318b700
API: Remove tiktoken from logit bias (#5391) 2024-01-28 21:42:03 -03:00
oobabooga aa575119e6 API: minor fix 2024-01-22 04:38:43 -08:00
oobabooga 821dd65fb3 API: add a comment 2024-01-22 04:15:51 -08:00
oobabooga 6247eafcc5 API: better handle temperature = 0 2024-01-22 04:12:23 -08:00
oobabooga 817866c9cf Lint 2024-01-22 04:07:25 -08:00
oobabooga aad73667af Lint 2024-01-22 03:25:55 -08:00
Cohee fbf8ae39f8
API: Allow content arrays for multimodal OpenAI requests (#5277) 2024-01-22 08:10:26 -03:00
Ercan 166fdf09f3
API: Properly handle Images with RGBA color format (#5332) 2024-01-22 08:08:51 -03:00
lmg-anon db1da9f98d
Fix logprobs tokens in OpenAI API (#5339) 2024-01-22 08:07:42 -03:00
Stefan Daniel Schwarz 232c07bf1f
API: set do_sample=false when temperature=0 (#5275) 2024-01-17 23:58:11 -03:00
oobabooga e055967974
Add prompt_lookup_num_tokens parameter (#5296) 2024-01-17 17:09:36 -03:00
Samuel Weinhardt 952a05a7c8
Correct field alias types for OpenAI extension (#5257) 2024-01-14 13:30:36 -03:00
oobabooga bb2c4707c4 API: fix bug after previous commit 2024-01-09 19:08:02 -08:00
oobabooga 4332e24740 API: Make user_name/bot_name the official and name1/name2 the alias 2024-01-09 19:06:11 -08:00
oobabooga a4c51b5a05 API: add "user_name" and "bot_name" aliases for name1 and name2 2024-01-09 19:02:45 -08:00
oobabooga 29c2693ea0
dynatemp_low, dynatemp_high, dynatemp_exponent parameters (#5209) 2024-01-08 23:28:35 -03:00
oobabooga 0d07b3a6a1
Add dynamic_temperature_low parameter (#5198) 2024-01-07 17:03:47 -03:00
kalomaze 48327cc5c4
Dynamic Temperature HF loader support (#5174)
---------

Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>
2024-01-07 10:36:26 -03:00
Philipp Claßen 3eca20c015
Typo fixed in variable names (#5184) 2024-01-06 03:05:03 -03:00
kabachuha dbe438564e
Support for sending images into OpenAI chat API (#4827) 2023-12-22 22:45:53 -03:00
oobabooga 23818dc098 Better logger
Credits: vladmandic/automatic
2023-12-19 20:38:33 -08:00
Felipe Ferreira 11f082e417
[OpenAI Extension] Add more types to Embeddings Endpoint (#4895) 2023-12-15 00:26:16 -03:00
Kim Jaewon e53f99faa0
[OpenAI Extension] Add 'max_logits' parameter in logits endpoint (#4916) 2023-12-15 00:22:43 -03:00
oobabooga 39d2fe1ed9
Jinja templates for Instruct and Chat (#4874) 2023-12-12 17:23:14 -03:00
oobabooga 2a335b8aa7 Cleanup: set shared.model_name only once 2023-12-08 06:35:23 -08:00
oobabooga 2c5a1e67f9
Parameters: change max_new_tokens & repetition_penalty_range defaults (#4842) 2023-12-07 20:04:52 -03:00
oobabooga b6d16a35b1 Minor API fix 2023-11-21 17:56:28 -08:00
oobabooga f0d66cf817 Add missing file 2023-11-19 10:12:13 -08:00
Jordan Tucker cb836dd49c
fix: use shared chat-instruct_command with api (#4653) 2023-11-19 01:19:10 -03:00
oobabooga 771e62e476
Add /v1/internal/lora endpoints (#4652) 2023-11-19 00:35:22 -03:00
oobabooga ef6feedeb2
Add --nowebui flag for pure API mode (#4651) 2023-11-18 23:38:39 -03:00
oobabooga 0fa1af296c
Add /v1/internal/logits endpoint (#4650) 2023-11-18 23:19:31 -03:00
oobabooga 8f4f4daf8b
Add --admin-key flag for API (#4649) 2023-11-18 22:33:27 -03:00
wizd af76fbedb8
Openai embedding fix to support jina-embeddings-v2 (#4642) 2023-11-18 20:24:29 -03:00
oobabooga e0a7cc5e0f Simplify CORS code 2023-11-16 20:11:55 -08:00
oobabooga c0233bb9d3 Minor message change 2023-11-16 18:36:57 -08:00
oobabooga 510a01ef46 Lint 2023-11-16 18:03:06 -08:00
oobabooga a475aa7816 Improve API documentation 2023-11-15 18:39:08 -08:00
oobabooga a85ce5f055 Add more info messages for truncation / instruction template 2023-11-15 16:20:31 -08:00
oobabooga e6f44d6d19 Print context length / instruction template to terminal when loading models 2023-11-15 16:00:51 -08:00
oobabooga be125e2708 Add /v1/internal/model/unload endpoint 2023-11-15 15:48:33 -08:00
oobabooga 52758f15da Remove sentence-transformers requirement (for #1575) 2023-11-10 07:35:29 -08:00
oobabooga c5be3f7acb Make /v1/embeddings functional, add request/response types 2023-11-10 07:34:27 -08:00
oobabooga 0777b0d3c7 Add system_message parameter, document model (unused) parameter 2023-11-10 06:47:18 -08:00
oobabooga 4aabff3728 Remove old API, launch OpenAI API with --api 2023-11-10 06:39:08 -08:00
GuizzyQC 6a7cd01ebf
Fix bug with /internal/model/load (#4549)
Update shared.model_name after loading model through API call
2023-11-10 00:16:38 -03:00
oobabooga d86f1fd2c3 OpenAI API: stop streaming on client disconnect (closes #4521) 2023-11-09 06:37:32 -08:00
oobabooga effb3aef42 Prevent deadlocks in OpenAI API with simultaneous requests 2023-11-08 20:55:39 -08:00
oobabooga 678fd73aef Document /v1/internal/model/load and fix a bug 2023-11-08 17:41:12 -08:00
MrMojoR 1754a3761b
Include trust remote code usage in openai api's embedder (#4513) 2023-11-08 11:25:43 -03:00
hronoas 6c7aad11f3
openai extension: wrong frequency_penalty type (#4512) 2023-11-08 11:23:51 -03:00
oobabooga 881e8a6e70
Small bug fix in /v1/internal/model/load 2023-11-08 02:34:13 -03:00
oobabooga 050ff36bd6 Revert "Add a comment to /v1/models"
This reverts commit 38b07493a0.
2023-11-07 21:09:47 -08:00
oobabooga 38b07493a0 Add a comment to /v1/models 2023-11-07 21:07:12 -08:00
oobabooga 2358706453 Add /v1/internal/model/load endpoint (tentative) 2023-11-07 20:58:06 -08:00
oobabooga 43c53a7820 Refactor the /v1/models endpoint 2023-11-07 19:59:27 -08:00
oobabooga 1b69694fe9 Add types to the encode/decode/token-count endpoints 2023-11-07 19:32:14 -08:00
oobabooga f6ca9cfcdc Add /v1/internal/model-info endpoint 2023-11-07 18:59:02 -08:00
oobabooga 6e2e0317af
Separate context and system message in instruction formats (#4499) 2023-11-07 20:02:58 -03:00
oobabooga 3fc505dc0f Document unused parameters 2023-11-07 08:56:09 -08:00
oobabooga 3d59346871 Implement echo/suffix parameters 2023-11-07 08:43:45 -08:00
oobabooga 48c9c31440 Document the "preset" option in the API 2023-11-07 08:23:17 -08:00
oobabooga 79b3f5a546
Add /v1/internal/stop-generation to OpenAI API (#4498) 2023-11-07 00:10:42 -03:00
oobabooga 97c21e5667 Don't strip leading spaces in OpenAI API 2023-11-06 19:09:41 -08:00
oobabooga 4a45dc4041 Reorder the parameters in the FastAPI documentation 2023-11-06 09:55:36 -08:00
oobabooga 0db81355bc Reorder a parameter 2023-11-06 07:11:49 -08:00
oobabooga b87c6213ae Remove obsolete endpoint 2023-11-06 05:45:45 -08:00
oobabooga 28fd535f9c Make chat API more robust 2023-11-06 05:22:01 -08:00
oobabooga ec17a5d2b7
Make OpenAI API the default API (#4430) 2023-11-06 02:38:29 -03:00
俞航 84d957ba62
[Fix] fix openai embedding_model loading as str (#4147) 2023-11-05 20:42:45 -03:00
kabachuha e18a0460d4
fix openai extension not working because of absent new defaults (#4477) 2023-11-04 16:12:51 -03:00
tdrussell 72f6fc6923
Rename additive_repetition_penalty to presence_penalty, add frequency_penalty (#4376) 2023-10-25 12:10:28 -03:00
tdrussell 4440f87722
Add additive_repetition_penalty sampler setting. (#3627) 2023-10-23 02:28:07 -03:00
hronoas db7ecdd274
openai: fix empty models list on query present in url (#4139) 2023-10-16 17:02:47 -03:00
Jesus Alvarez ed66ca3cdf
Add HTTPS support to APIs (openai and default) (#4270)
---------

Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>
2023-10-13 01:31:13 -03:00
oobabooga 08cf150c0c
Add a grammar editor to the UI (#4061) 2023-09-24 18:05:24 -03:00
oobabooga b227e65d86 Add grammar to llama.cpp loader (closes #4019) 2023-09-24 07:10:45 -07:00
Chenxiao Wang 9de2dfa887
extensions/openai: Fix error when preparing cache for embedding models (#3995) 2023-09-24 00:58:28 -03:00
Chenxiao Wang 347aed4254
extensions/openai: load extension settings via settings.yaml (#3953) 2023-09-17 22:39:29 -03:00
Shulzhenko Anatolii 0668f4e67f
Add speechrecognition dependency for OpenAI extension (#3959) 2023-09-16 13:49:48 -03:00
wizd cc7f345c29
add whisper api to openai plugin (#3958) 2023-09-16 12:04:04 -03:00
oobabooga f5fb1ee666 Change a comment 2023-09-15 20:16:30 -07:00
oobabooga 2c1b548cea Minor fix 2023-09-15 20:14:32 -07:00
oobabooga 8f97e87cac Lint the openai extension 2023-09-15 20:11:16 -07:00
saltacc f01b9aa71f
Add customizable ban tokens (#3899) 2023-09-15 18:27:27 -03:00
Chang Chi, Meng b61d9aef19
openai API: add support for chunked transfer encoding in POST requests (#3870) 2023-09-12 15:54:42 -03:00
oobabooga df123a20fc Prevent extra keys from being saved to settings.yaml 2023-09-11 20:13:10 -07:00
oobabooga 9331ab4798
Read GGUF metadata (#3873) 2023-09-11 18:49:30 -03:00
oobabooga cec8db52e5
Add max_tokens_second param (#3533) 2023-08-29 17:44:31 -03:00
oobabooga c237ce607e Move characters/instruction-following to instruction-templates 2023-08-06 17:50:32 -07:00
oobabooga 0af10ab49b
Add Classifier Free Guidance (CFG) for Transformers/ExLlama (#3325) 2023-08-06 17:22:48 -03:00
matatonic 8f98268252
extensions/openai: include content-length for json replies (#3416) 2023-08-03 16:10:49 -03:00
oobabooga e931844fe2
Add auto_max_new_tokens parameter (#3419) 2023-08-02 14:52:20 -03:00
matatonic 9ae0eab989
extensions/openai: +Array input (batched) , +Fixes (#3309) 2023-08-01 22:26:00 -03:00
matatonic 90a4ab631c
extensions/openai: Fixes for: embeddings, tokens, better errors. +Docs update, +Images, +logit_bias/logprobs, +more. (#3122) 2023-07-24 11:28:12 -03:00
oobabooga e202190c4f lint 2023-07-12 11:33:25 -07:00
matatonic 3e7feb699c
extensions/openai: Major openai extension updates & fixes (#3049)
* many openai updates

* total reorg & cleanup.

* fixups

* missing import os for images

* +moderations, custom_stopping_strings, more fixes

* fix bugs in completion streaming

* moderation fix (flagged)

* updated moderation categories

---------

Co-authored-by: Matthew Ashton <mashton-gitlab@zhero.org>
2023-07-11 18:50:08 -03:00
oobabooga 3443219cbc
Add repetition penalty range parameter to transformers (#2916) 2023-06-29 13:40:13 -03:00
oobabooga c52290de50
ExLlama with long context (#2875) 2023-06-25 22:49:26 -03:00