Publications

Recent Publications (Refer to this Google Scholar Page for the Full List of Publications)

Mels-TTS

Enhancing Document-Level Machine Translation via filtered synthetic corpora and two-stage LLM adaptation

Ireh Kim, Tesia Sker, and Chanwoo Kim

Proc. ICASSP, 2026.

Mels-TTS

Controllable Singing Voice Synthesis using Phoneme-Level Energy Sequence

Yerin Ryu, Inseop Shin, and Chanwoo Kim

Proc. ASRU, 2025.

Mels-TTS

A Novel Chain-of-Thought Reasoning Approach for Alzheimer’s Disease Detection Using Large Language and Vision-Language Models

Chanwoo Park and Chanwoo Kim

IEEE Trans. Neural Systems and Rehabilitation Engineering Nov. 2025 (TNSRE) (Top 2% in the JCR category of "Rehabilitation")

Mels-TTS

Reasoning-Based Approach with Chain-of-Thought for Alzheimer’s Detection Using Speech and Large Language Models

Chanwoo Park, Anna Seo Gyoung Choi, Sunghye Cho and Chanwoo Kim

Proc. Interspeech, 2025.

Mels-TTS

Wave-U-Mamba: An End-To-End Framework For High-Quality And Efficient Speech Super Resolution

Yongjoon Lee, and Chanwoo Kim

Proc. ICASSP, 2025.

Mels-TTS

Mels-Tts: Multi-Emotion Multi-Lingual Multi-Speaker Text-To-Speech System Via Disentangled Style Tokens

Heejin Choi, Jae-Sung Bae, Joun Yeop Lee, Seongkyu Mun, Jihwan Lee, Hoon-Young Cho, Chanwoo Kim

Proc. ICASSP, 2024.

Latent Filling

Latent Filling: Latent Space Data Augmentation for Zero-Shot Speech Synthesis

Jae-Sung Bae, Joun Yeop Lee, Ji-Hyun Lee, Seongkyu Mun, Taehwa Kang, Hoon-Young Cho, Chanwoo Kim

Proc. ICASSP, 2024.

Hierarchical Timbre-Cadence

Hierarchical Timbre-Cadence Speaker Encoder for Zero-shot Speech Synthesis

Joun Yeop Lee, Jae-Sung Bae, Seongkyu Mun, Jihwan Lee, Ji-Hyun Lee, Hoon-Young Cho, Chanwoo Kim

Proc. INTERSPEECH, 2023.

Self-Supervised Accent Learning

Self-Supervised Accent Learning for Under-Resourced Accents Using Native Language Data

Mehul Kumar, Jiyeon Kim, Dhananjaya Gowda, Abhinav Garg, Chanwoo Kim

Proc. ICASSP, 2023.

Conformer On-Device

Conformer-Based on-Device Streaming Speech Recognition with KD Compression and Two-Pass Architecture

Jinhwan Park, Sichen Jin, Junmo Park, Sungsoo Kim, Dhairya Sandhyana, Changheon Lee, Myoungji Han, Jungin Lee, Seokyeong Jung, Changwoo Han, Chanwoo Kim

Proc. SLT, 2022.

Macro-Block Dropout

Macro-Block Dropout for Improved Regularization in Training End-to-End Speech Recognition Models

Chanwoo Kim, Sathish Indurti, Jinhwan Park, Wonyong Sung

Proc. SLT, 2022.