您的瀏覽器不支援JavaScript語法,網站的部份功能在JavaScript沒有啟用的狀態下無法正常使用。

中央研究院 資訊科學研究所

活動訊息

友善列印

列印可使用瀏覽器提供的(Ctrl+P)功能

學術演講

:::

TIGP (SNHCC) -- Deep Learning-based Speech Assessment Metrics and its Applications

  • 講者李安德 博士 (中央研究院資訊科技創新研究中心)
    邀請人:TIGP (SNHCC)
  • 時間2023-12-04 (Mon.) 14:00 ~ 16:00
  • 地點資訊所新館106演講廳
摘要
Most conventional speech assessment metrics require a golden clean reference to calculate the evaluation score. Such a scenario has limited applicability in real-world scenarios since clean reference is not always accessible. To address this limitation, non-intrusive speech assessment metrics have caught great attention in recent years. Recently, with the emergence of the deep learning model and the availability of training data, many studies have involved the deep learning model to deploy a non-intrusive speech assessment model. However, despite the good performance achieved by the deep learning-based speech assessment model, the generalization of the model remains a challenge. In this talk, we would like to introduce several approaches to improve the generalization of the deep learning-based speech assessment model. Additionally, we aim to introduce the direct integration between deep learning-based speech assessment models and speech enhancement systems.
BIO
Dr. Ryandhimas E. Zezario received a Ph.D. degree in Computer Science and Information Engineering from National Taiwan University in 2023. He is currently a Postdoctoral Researcher at the Research Center for Information Technology Innovation, Academia Sinica, Taipei, Taiwan. He was awarded the Gold Prize for the best non-intrusive system and 1st place for the Hearing Industry Research Consortium student prizes at the Clarity Prediction Challenge 2022. His research interests include speech enhancement, non-intrusive quality assessment, speech processing, speech/speaker recognition.