213 commits

Author SHA1 Message Date
Josh Ziegler 5b0e50eaa6 avoid mutable default in aligner 2022-05-26 16:20:09 -04:00
James Betker f56f3d5468 Fix import issue for CVVP 2022-05-26 08:44:20 -06:00
James Betker 3acca1445a
Merge pull request #64 from jnordberg/revive-cvvp
Revive CVVP model
2022-05-25 15:59:09 -06:00
Johan Nordberg b681fa9d11 Skip CLVP if cvvp_amount is 1
Also fixes formatting bug in log message
2022-05-25 11:12:53 +00:00
Johan Nordberg a52e3026ba Revive CVVP model 2022-05-25 10:22:50 +00:00
James Betker 7f9f1dbfc3 Fix bug 2022-05-22 05:50:26 -06:00
James Betker e118785aaf Support combining voices in do_tts 2022-05-22 05:28:15 -06:00
James Betker e882484c4a Update read.py to support multiple candidates 2022-05-22 05:26:01 -06:00
James Betker 8feb18b03f Merge remote-tracking branch 'origin/main' 2022-05-22 05:13:50 -06:00
James Betker 12a767c7f5 Commit comparisons with naturalspeech
This is the first TTS engine I've seen come along that has comparable performance
to Tortoise, though what has been released is pretty sparse on actual results. Still,
it's an interesting comparison.
2022-05-22 05:13:08 -06:00
James Betker eae0414f94
Merge pull request #58 from kwibjo/main
Update README.md
2022-05-21 10:41:55 -06:00
Jai Mu 5bff5dd819
Update README.md
Useless update but it was bothering me.
2022-05-22 00:56:06 +09:30
James Betker b98860552a
Merge pull request #57 from wavymulder/main
Updated train_lescault voices
2022-05-19 15:56:47 -06:00
Tristan Drake cdc138a3df Updated lescault voices 2022-05-19 17:39:25 -04:00
James Betker f4bd9c4dd0 Fix faulty merge 2022-05-19 10:37:57 -06:00
James Betker 1a8c9f741a Merge remote-tracking branch 'origin/main'
# Conflicts:
#	tortoise/read.py
2022-05-19 10:34:54 -06:00
James Betker 6d3157ebff Remove faulty 3rd example for train_mouse 2022-05-19 10:30:02 -06:00
James Betker 550874cbec Update broken train_empire voice 2022-05-19 10:26:46 -06:00
James Betker 2ba8d5bf97 Update requirements to specify version of transformers 2022-05-19 10:22:04 -06:00
James Betker 4641933d74
Merge pull request #55 from jnordberg/models-dir
Make models dir configurable
2022-05-19 09:51:21 -06:00
Johan Nordberg e34ffca8fb Allow passing additional voice directories when loading voices 2022-05-19 21:02:11 +09:00
Johan Nordberg 20220893af Allow setting models path from environment variable 2022-05-19 21:02:09 +09:00
James Betker 8139afd0e5 Remove CVVP
After training a similar model for a different purpose, I realized that
this model is faulty: the contrastive loss it uses only pays attention
to high-frequency details which do not contribute meaningfully to
output quality. I validated this by comparing a no-CVVP output with
a baseline using tts-scores and found no differences.
2022-05-17 12:21:25 -06:00
James Betker 5d5aacc38c v2.4 2022-05-17 12:15:13 -06:00
James Betker aef86d21bf Add a way to get deterministic behavior from tortoise and add debug states for reporting 2022-05-17 12:11:18 -06:00
James Betker 9eac62598a Merge remote-tracking branch 'origin/main' 2022-05-17 11:22:40 -06:00
James Betker 24612f81c2 Add chapter 1 of GoT for read.py demos 2022-05-17 11:21:57 -06:00
James Betker 160963b105 Add conditioning latent example 2022-05-17 11:21:37 -06:00
James Betker b5fc8f198b
Merge pull request #49 from faad3/main
Fix bug in load_voices in audio.py
2022-05-17 11:20:44 -06:00
Danila Berezin dc3d7b1667
Fix bug in load_voices in audio.py
The read.py script did not work with .pth latents, so I fixed the bug in audio.py. It seems that in the elif statement, voice, voices should have been clip, clips. Also, torch.stack doesn't work with tuples, so I had to split this operation.
2022-05-17 18:34:54 +03:00
James Betker 11e80b0dae
Merge pull request #42 from jnordberg/main
Improve sentence splitting
2022-05-14 08:52:46 -06:00
James Betker 50690e4465 Automatically pick batch size based on available GPU memory 2022-05-13 10:30:02 -06:00
James Betker cb7adf16af Remove samples_generator
Was useful but the page is more detailed now.
2022-05-13 10:28:16 -06:00
Johan Nordberg 5197904660 Improve sentence splitting 2022-05-13 11:02:17 +00:00
James Betker 8c0b3855bf Release notes for 2.3 2022-05-12 20:26:24 -06:00
James Betker 1a4f0fa350 update model paths (including clvp2!) 2022-05-12 20:18:11 -06:00
James Betker 75b0e03ab3 Add error message 2022-05-12 20:15:40 -06:00
James Betker ec16c0208c add eval script for testing 2022-05-12 20:15:22 -06:00
James Betker 7d5e7dbba8 CLVP2! 2022-05-12 13:23:03 -06:00
James Betker fda5130819 Add support for multiple output candidates in do_tts. 2022-05-12 11:25:35 -06:00
James Betker 6ed77b0ea4 update examples 2022-05-12 11:25:03 -06:00
James Betker 0005d02940 read.py: allow user-specified splits 2022-05-12 11:24:55 -06:00
James Betker 945bd88f21
Merge pull request #36 from e0xextazy/main
Optimizing graphics card memory
2022-05-11 21:46:16 -06:00
Mark Baushenko cbccc5e953
Optimizing graphics card memory
Gradients are no longer stored during inference; they took up most of the video memory
2022-05-11 16:35:11 +03:00
James Betker cfd436be00 update requirements to (hopefully) provide better compatibility
man python deps are garbage.....
2022-05-10 23:42:37 -06:00
James Betker 609e3a26c6 whoops 2022-05-10 18:25:14 -06:00
James Betker eae2b92385 Upload finetuned ljspeech examples 2022-05-10 18:23:24 -06:00
James Betker 099bf8363c Update README with suggestions for windows installation 2022-05-08 20:44:44 -06:00
James Betker 317d55c252 re-enable redaction 2022-05-06 09:36:42 -06:00
James Betker 8672075914 temporarily disable redaction 2022-05-06 09:06:20 -06:00