Samples
Performance comparison on zero-shot TTS
Samples
Models | Reference speech | Small | Medium | Large | Proposed(dense) | Proposed(sparse) | Original |
---|---|---|---|---|---|---|---|
Female(nonprofessional) | |||||||
Male(nonprofessional) | |||||||
Female(professional) | |||||||
Male(professional) |
Lightweight Zero-shot Text-to-Speech with Mixture of Adapters
Kenichi Fujita, Hiroshi Sato, Takanori Ashihara, Hiroki Kanagawa
Marc Delcroix, Takafumi Moriya, and Yusuke Ijima
Models | Reference speech | Small | Medium | Large | Proposed(dense) | Proposed(sparse) | Original |
---|---|---|---|---|---|---|---|
Female(nonprofessional) | |||||||
Male(nonprofessional) | |||||||
Female(professional) | |||||||
Male(professional) |