Samples
Performance comparison on zero-shot TTS
Samples
| Models | Reference speech | Small | Medium | Large | Proposed(dense) | Proposed(sparse) | Original |
|---|---|---|---|---|---|---|---|
| Female(nonprofessional) | |||||||
| Male(nonprofessional) | |||||||
| Female(professional) | |||||||
| Male(professional) |
Lightweight Zero-shot Text-to-Speech with Mixture of Adapters
Kenichi Fujita, Hiroshi Sato, Takanori Ashihara, Hiroki Kanagawa
Marc Delcroix, Takafumi Moriya, and Yusuke Ijima
| Models | Reference speech | Small | Medium | Large | Proposed(dense) | Proposed(sparse) | Original |
|---|---|---|---|---|---|---|---|
| Female(nonprofessional) | |||||||
| Male(nonprofessional) | |||||||
| Female(professional) | |||||||
| Male(professional) |