Text-to-Speech
Papers related to text-to-speech
2024
- VALL-E: Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers
- Voicebox
- MQTTS: A Vector Quantized Approach for Text to Speech Synthesis on Real-World Spontaneous Speech
- CrossSpeech
- CONCSS: Contrastive-based Context Comprehension for Dialogue-appropriate Prosody in Conversational Speech Synthesis
- GenerSpeech
- VarianceFlow: High-Quality and Controllable Text-to-Speech Using Variance Information via Normalizing Flow
- DSE-TTS
- UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding
- CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-training
- P-Flow
- Matcha-TTS
- M2-CTTS: End-to-End Multi-Scale Multi-Modal Conversational Text-to-Speech Synthesis