Commit graph

70 commits

Author SHA1 Message Date
oobabooga 9849aac0f1 Don't show .pt models in the list 2023-03-09 21:54:50 -03:00
oobabooga 74102d5ee4 Insert to the path instead of appending 2023-03-09 20:51:22 -03:00
oobabooga 2965aa1625 Check if the .pt file exists 2023-03-09 20:48:51 -03:00
oobabooga 828a524f9a Add LLaMA 4-bit support 2023-03-09 15:50:26 -03:00
oobabooga e91f4bc25a Add RWKV tokenizer 2023-03-06 08:45:49 -03:00
oobabooga c33715ad5b Move towards HF LLaMA implementation 2023-03-05 01:20:31 -03:00
oobabooga bd8aac8fa4 Add LLaMA 8-bit support 2023-03-04 13:28:42 -03:00
oobabooga ed8b35efd2 Add --pin-weight parameter for FlexGen 2023-03-04 01:04:02 -03:00
oobabooga ea5c5eb3da Add LLaMA support 2023-03-03 14:39:14 -03:00
oobabooga 659bb76722 Add RWKVModel class 2023-03-01 12:08:55 -03:00
oobabooga 6837d4d72a Load the model by name 2023-02-28 02:52:29 -03:00
oobabooga 70e522732c Move RWKV loader into a separate file 2023-02-27 23:50:16 -03:00
oobabooga ebc64a408c RWKV support prototype 2023-02-27 23:03:35 -03:00
oobabooga 8e3e8a070f Make FlexGen work with the newest API 2023-02-26 16:53:41 -03:00
oobabooga 65326b545a Move all gradio elements to shared (so that extensions can use them) 2023-02-24 16:46:50 -03:00
oobabooga f6f792363b Separate command-line params by spaces instead of commas 2023-02-24 08:55:09 -03:00
luis 5abdc99a7c gpu-memory arg change 2023-02-23 18:43:55 -05:00
oobabooga 7224343a70 Improve the imports 2023-02-23 14:41:42 -03:00
oobabooga e46c43afa6 Move some stuff from server.py to modules 2023-02-23 13:42:23 -03:00
oobabooga 1dacd34165 Further refactor 2023-02-23 13:28:30 -03:00