Text-to-Speech

Papers on text-to-speech

2024

VALL-E: Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers 22 Feb
Voicebox 21 Feb
MQTTS: A Vector Quantized Approach for Text to Speech Synthesis on Real-World Spontaneous Speech 15 Feb
CrossSpeech 07 Feb
CONCSS: Contrastive-based Context Comprehension for Dialogue-appropriate Prosody in Conversational Speech Synthesis 01 Feb
GenerSpeech 31 Jan
VarianceFlow: High-Quality and Controllable Text-to-Speech using Variance Information via Normalizing Flow 25 Jan
DSE-TTS 25 Jan
UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding 18 Jan
CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-training 10 Jan
P-Flow 10 Jan
Matcha-TTS 04 Jan
M2-CTTS: End-to-End Multi-Scale Multi-Modal Conversational Text-to-Speech Synthesis 03 Jan

2023

Mega-TTS 28 Dec
Encoding Speaker-specific Latent Speech Feature for Speech Synthesis 21 Dec
Diff-TTS 20 Dec
VITS [ICML 2021] 07 Dec
CyFi-TTS [ICASSP 2023] 07 Dec
Glow-TTS: A Generative Flow for Text-to-Speech via Monotonic Alignment Search 23 Nov
AdaSpeech 23 Nov
Meta-StyleSpeech 16 Nov
FastSpeech 2 09 Nov

Powered by PRML Lab. Speech team

PRML Lab. in Korea University (Director: Prof. Seong-Whan Lee)
