tortoise-tts

mirror of https://github.com/neonbjb/tortoise-tts.git synced 2026-02-17 13:14:32 +01:00

Author	SHA1	Message	Date
William Gaylord	e8c53c6a87	Add CPU only support to hifigan_decoder.py Allow higigan_decoder to run when not using a GPU.	2023-10-18 12:35:44 -05:00
manmay-nakhashi	ab270c7b31	add fast api for tortoise	2023-10-18 18:52:39 +05:30
NourEldin Osama	4d4a423971	Update autoregressive.py Fix AttributeError: module 'torch.backends.cuda' has no attribute 'is_available'	2023-08-11 20:08:19 +03:00
manmay nakhashi	4ad73a143d	check cuda in deepspeed init	2023-08-11 21:02:08 +05:30
Jerry-Master	8d67995ba7	Addes MPS support	2023-08-06 19:01:10 +02:00
Jerry-Master	b4988c24b3	Added MPS support for do_tts	2023-08-06 17:41:30 +02:00
manmay-nakhashi	180d65d0fb	add parallelize back as it was already there	2023-07-16 15:42:53 +05:30
manmay-nakhashi	45462c6cf1	pass half from TexttoSpeech args	2023-07-16 14:25:42 +05:30
manmay-nakhashi	19f5250454	add half because kv_cache increases memory footprint	2023-07-16 00:49:17 +05:30
manmay-nakhashi	a88534adb2	added kv_cache	2023-07-15 23:00:19 +05:30
manmay-nakhashi	82724cca54	import deepspeed only if use_deepspeed is True	2023-07-10 07:47:46 +05:30
manmay-nakhashi	5a9707d93c	added deepspeed inference	2023-07-09 18:40:10 +05:30
James Betker	25dd04650c	Revert autocast "fix".	2023-04-02 04:04:18 -07:00
Sergey	4e990b4327	Update diffusion_decoder.py Chanhged import autocast from torch	2023-03-29 13:20:18 +03:00
James Betker	5dc3e269b3	Merge pull request #233 from kianmeng/fix-typos Fix typos	2023-01-17 18:24:24 -07:00
원빈 정	b3d67dcc6b	Add reference of univnet implementation	2023-01-06 15:57:02 +09:00
Kian-Meng Ang	551fe655ff	Fix typos Found via `codespell -S *.json -L splitted,nd,ser,broadcat`	2023-01-06 11:04:36 +08:00
James Betker	958c6d2f73	Get rid of checkpointing It isn't needed in inference.	2022-06-15 22:09:15 -06:00
Johan Nordberg	d8f98c07b4	Remove some assumptions about working directory This allows cli tool to run when not standing in repository dir	2022-05-29 01:10:19 +00:00
James Betker	f56f3d5468	Fix import issue for CVVP	2022-05-26 08:44:20 -06:00
Johan Nordberg	a52e3026ba	Revive CVVP model	2022-05-25 10:22:50 +00:00
James Betker	8139afd0e5	Remove CVVP After training a similar model for a different purpose, I realized that this model is faulty: the contrastive loss it uses only pays attention to high-frequency details which do not contribute meaningfully to output quality. I validated this by comparing a no-CVVP output with a baseline using tts-scores and found no differences.	2022-05-17 12:21:25 -06:00
James Betker	ffd0238a16	v2.2	2022-05-06 00:11:10 -06:00
James Betker	29b2f36f55	Remove entmax dep	2022-05-02 21:43:14 -06:00
James Betker	8c7f709c12	k I think this works..	2022-05-02 21:31:31 -06:00
James Betker	9acce239d3	fix paths	2022-05-02 20:56:28 -06:00
James Betker	cdf44d7506	more fixes	2022-05-02 16:44:47 -06:00
James Betker	39ec1b0db5	Support totally random voices (and make fixes to previous changes)	2022-05-02 15:40:03 -06:00
James Betker	0ffc191408	Add support for extracting and feeding conditioning latents directly into the model - Adds a new script and API endpoints for doing this - Reworks autoregressive and diffusion models so that the conditioning is computed separately (which will actually provide a mild performance boost) - Updates README This is untested. Need to do the following manual tests (and someday write unit tests for this behemoth before it becomes a problem..) 1) Does get_conditioning_latents.py work? 2) Can I feed those latents back into the model by creating a new voice? 3) Can I still mix and match voices (both with conditioning latents and normal voices) with read.py?	2022-05-01 17:25:18 -06:00
James Betker	f7c8decfdb	Move everything into the tortoise/ subdirectory For eventual packaging.	2022-05-01 16:24:24 -06:00

30 commits