😃 About me

Hello! I am Dongheon Lee, a research scientist intern at Meta Reality Labs and a postdoctoral researcher at KAIST. My research focuses on multichannel sound source separation (speech enhancement, speech separation, and audio source separation), including real-time, on-device models and array-agnostic processing. I am also interested in AR/VR and speech recognition. I received my Ph.D. (2025) and B.S. (2020) from KAIST, advised by Professor Jung-Woo Choi.

Alert! I am looking for work starting in June 2026. If you would like to work with me, please feel free to contact me!

📌 Research Interests

  • Audio Signal Processing
  • Speech Signal Processing
  • Microphone Array Processing
  • Generative AI (Speech/Audio)

🔥 News

  • 2025.07:  🎉 Ranked 1st in DCASE 2025 Task 4: Spatial Semantic Segmentation of Sound Scenes
  • 2025.06:  🎉 Joined Meta Reality Labs as a research scientist intern
  • 2025.03:  🎉 One paper accepted to Forum Acusticum Euronoise 2025
  • 2025.02:  🎉 Completed my Ph.D. at the School of Electrical Engineering, KAIST

🏫 Education

  • Mar.2020 - Feb.2025: Ph.D. Electrical Engineering (KAIST)

   Dissertation: “Unified Auditory Scene Analysis and Separation using Dense Frequency-Time Attentive Network”

   Advisor: Prof. Jung-Woo Choi

  • Mar.2016 - Feb.2020: B.S. Electrical Engineering (KAIST)

  • Mar.2014 - Feb.2016: Changwon Science High School

   Early Graduation

📎 Selected Publications

[6] Dongheon Lee, Jung-Woo Choi, “DeFT-Mamba: Multichannel universal sound separation and polyphonic audio classification,” IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2025 (oral)

[5] Dongheon Lee, Jung-Woo Choi, “DeFTAN-II: Efficient multichannel speech enhancement with subgroup processing,” IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP) 2024

[4] Dongheon Lee, Jung-Woo Choi, “DeFTAN-AA: Array geometry-agnostic multichannel speech enhancement,” International Speech Communication Association (Interspeech) 2024 (oral)

[3] Dongheon Lee, Dayun Choi, Jung-Woo Choi, “DeFT-AN RT: Real-time multichannel speech enhancement using Dense Frequency Time Attentive Network and non-overlapping synthesis window,” International Speech Communication Association (Interspeech) 2023

[2] Dongheon Lee, Jung-Woo Choi, “DeFT-AN: Dense Frequency-Time Attentive Network for multichannel speech enhancement,” IEEE Signal Processing Letters (IEEE SPL) 2023

[1] Dongheon Lee, Byeongho Jo, Jung-Woo Choi, “Direction-of-arrival estimation with blind surface impedance compensation for spherical microphone array,” Journal of Acoustical Society of America Express Letters (JASA EL) 2021

📝 Publications (14 publications, 13 first-author)

2025

DCASE 2025

[C09] Self-guided target sound extraction and classification through universal sound separation model and multiple clues

Younghoo Kwon*, Dongheon Lee*, Dohwan Kim, and Jung-Woo Choi (*: Equal Contribution)

DCASE Technical Report (DCASE) 2025, 1st rank (Winner)

EuroNoise 2025

[C07] Universal auditory scene analysis model for source separation, event localization, and detection

Dongheon Lee, Jung-Woo Choi

Forum Acusticum Euronoise (EuroNoise) 2025

ICASSP 2025

[C06] DeFT-Mamba: Multichannel universal sound separation and polyphonic audio classification

Dongheon Lee, Jung-Woo Choi

IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2025 (oral)

Demo Page

2024

IEEE/ACM TASLP 2024

[J05, C05] DeFTAN-II: Efficient multichannel speech enhancement with subgroup processing

Dongheon Lee, Jung-Woo Choi

IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP) 2024

GitHub Repo, Demo Page

Interspeech 2024

[C04] DeFTAN-AA: Array geometry-agnostic multichannel speech enhancement

Dongheon Lee, Jung-Woo Choi

International Speech Communication Association (Interspeech) 2024 (oral)

Demo Page

2023

Interspeech 2023

[C03] DeFT-AN RT: Real-time multichannel speech enhancement using Dense Frequency Time Attentive Network and non-overlapping synthesis window

Dongheon Lee, Dayun Choi, and Jung-Woo Choi

International Speech Communication Association (Interspeech) 2023

GitHub Repo

IEEE SPL 2023

[J04, C02] DeFT-AN: Dense Frequency-Time Attentive Network for multichannel speech enhancement

Dongheon Lee, Jung-Woo Choi

IEEE Signal Processing Letters (IEEE SPL) 2023

GitHub Repo

2022

Inter-Noise 2022

[C01] Inter-channel Conv-TasNet for source-agnostic multichannel audio enhancement

Dongheon Lee, Jung-Woo Choi

51st International Congress and Exposition on Noise Control Engineering (Inter-Noise) 2022 (oral)

2021

Nanoscale 2021
ArXiv 2021

[J02] Inter-channel Conv-TasNet for multichannel speech enhancement

Dongheon Lee, Seongrae Kim, and Jung-Woo Choi

ArXiv, 2021

GitHub Repo

JASA EL 2021

[J01] Direction-of-arrival estimation with blind surface impedance compensation for spherical microphone array

Dongheon Lee, Byeongho Jo, Jung-Woo Choi

Journal of Acoustical Society of America Express Letters (JASA EL) 2021

Academic Activities

💬 Invited Talks

  • 2024 Machine Learning and Big Data, Next-Generation ICT Research Center
  • 2023 On-device multichannel speech enhancement system, AICube
  • 2023 AI Specialized Training Program, KAIST-Hwaseong Hub

🏆 Awards

  • 1st Rank (Winner) of DCASE 2025 Task 4: Spatial Semantic Segmentation of Sound Scenes (2025)

  • Outstanding Teaching Assistant Award, EE488B: Audio Signal Processing (2024)

  • Excellence Paper Award, Acoustical Society of Korea (2021)

  • 1st Prize in Internship Project, SK Hynix (2019)

📑 Reviewer

Conference

  • IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA): 2025 ~

  • IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP): 2025 ~

  • International Speech Communication Association (Interspeech): 2025 ~

Journal

  • IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP)

  • IEEE Signal Processing Letters (IEEE SPL)

📏 Teaching Assistant

  • Signals and Systems, Mar. 2021 – Feb. 2024

  • Audio Signal Processing, Feb. 2024 – Jun. 2024

  • Individual Research, Mar. 2021 – Aug. 2024

🎏 Patents

  • Method and device of suppressing outdoor noise by using a microphone array, KR 10-2022-0131773

  • Method and apparatus for array geometry agnostic denoising and dereverberation based on deep learning, KR 10-2024-0063256

  • Voice enhancement method and system in eXtended Reality space for multi-party voice conversation, KR 10-2023-0150037