Seminar

Reading, summarizing, and sharing papers

2024

VALL-E: Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers 22 Feb
Voicebox 21 Feb
MQTTS: A Vector Quantized Approach for Text to Speech Synthesis on Real-World Spontaneous Speech 15 Feb
CrossSpeech 07 Feb
CONCSS: Contrastive-based Context Comprehension for Dialogue-appropriate Prosody in Conversational Speech Synthesis 01 Feb
GenerSpeech 31 Jan
VarianceFlow: High-Quality and Controllable Text-to-Speech Using Variance Information via Normalizing Flow 25 Jan
DSE-TTS 25 Jan
UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding 18 Jan
Flow matching: summary of equations 11 Jan
CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-training 10 Jan
P-Flow 10 Jan
Matcha-TTS 04 Jan
M2-CTTS: End-to-End Multi-Scale Multi-Modal Conversational Text-to-Speech Synthesis 03 Jan

2023

Mega-TTS 28 Dec
Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling 28 Dec
Encoding Speaker-specific Latent Speech Feature for Speech Synthesis 21 Dec
Fine-Grained Emotional Control of Text-to-Speech: Learning to Rank Inter-And Intra-Class Emotion Intensities 21 Dec
Diff-TTS 20 Dec
VITS [ICML 2021] 07 Dec
Denoising Diffusion Probabilistic Models 07 Dec
Glow-TTS: A Generative Flow for Text-to-Speech via Monotonic Alignment Search 23 Nov
AdaSpeech 23 Nov
Meta-StyleSpeech 16 Nov
Variational Inference with Normalizing Flows 16 Nov
Glow: Generative Flow with Invertible 1×1 Convolutions 16 Nov
Emo-Q 14 Nov
FastSpeech 2 09 Nov
FluentSpeech 03 Nov
[Author], [Paper title], [Journal/Conference], [year] 26 Oct

Powered by PRML Lab. Speech team

PRML Lab. Speech Team

PRML Lab. at Korea University (Director: Prof. Seong-Whan Lee)
