Waveform-domain Speech Enhancement Using Spectrogram Encoding for Robust Speech Recognition Published in IEEE/ACM Trans. on ASLP, 2024 Hao Shi, Masato Mimura, and Tatsuya Kawahara Download Paper | Download BibTeX
Adapting Pretrained Speech Recognition Models for Code-Switching through Encoding Refining and Language-Aware Attention-based Decoding Published in IEEE-ICASSP, 2025 Jiahui Zhao, Hao Shi, Tianrui Wang, Hexin Liu, Zhaoheng Ni, Lingxuan Ye, and Longbiao Wang Download Paper | Download BibTeX
Reducing the Gap between Pretrained Speech Enhancement and Recognition Models Using a Real Speech-Trained Bridging Module Published in IEEE-ICASSP, 2025 Zhongjian Cui, Chenrui Cui, Tianrui Wang, Mengnan He, Hao Shi, Meng Ge, Caixia Gong, Longbiao Wang, and Jianwu Dang Download Paper | Download BibTeX
Serialized Speech Information Guidance with Overlapped Encoding Separation for Multi-Speaker Automatic Speech Recognition Published in IEEE-SLT, 2024 Hao Shi, Yuan Gao, Zhaoheng Ni, and Tatusya Kawahara Download Paper | Download BibTeX
Error Correction by Paying Attention to Both Acoustic and Confidence References for Automatic Speech Recognition Published in Interspeech, 2024 Yuchun Shu, Bo Hu, Yifeng He, Hao Shi, Longbiao Wang, and Jianwu Dang Download Paper | Download BibTeX
Dual-path Adaptation of Pretrained Feature Extraction Module for Robust Automatic Speech Recognition Published in Interspeech, 2024 Hao Shi, and Tatusya Kawahara Download Paper | Download BibTeX
Speech Emotion Recognition with Multi-level Acoustic and Semantic Information Extraction and Interaction Published in Interspeech, 2024 Yuan Gao, Hao Shi, Chenhui Chu, and Tatsuya Kawahara Download Paper | Download BibTeX
Ensemble Inference for Diffusion Model-based Speech Enhancement Published in IEEE-ICASSPW, 2024 Hao Shi, Naoyuki Kamo, Marc Delcroix, Tomohiro Nakatani, and Shoko Araki Download Paper | Download BibTeX
Diffusion-Based Speech Enhancement with Joint Generative and Predictive Decoders Published in IEEE-ICASSP, 2024 Hao Shi, Kazuki Shimada, Masato Hirano, Takashi Shibuya, Yuichiro Koyama, Zhi Zhong, Shusuke Takahashi, Tatsuya Kawahara, and Yuki Mitsufuji Download Paper | Download BibTeX
Enhancing Two-stage Finetuning for Speech Emotion Recognition Using Adapters Published in IEEE-ICASSP, 2024 Yuan Gao, Hao Shi, Chenhui Chu, Tatsuya Kawahara Download Paper | Download BibTeX
Extending Audio Masked Autoencoders Toward Audio Restoration Published in WASPAA, 2023 Zhi Zhong, Hao Shi, Masato Hirano, Kazuki Shimada, Kazuya Tateishi, Takashi Shibuya, Shusuke Takahashi, Yuki Mitsufuji Download Paper | Download BibTeX
Time-domain Speech Enhancement Assisted by Multi-resolution Frequency Encoder and Decoder Published in IEEE-ICASSP, 2023 Hao Shi, Masato Mimura, Longbiao Wang, Jianwu Dang, and Tatsuya Kawahara Download Paper | Download BibTeX
Adaptive Attention Network with Domain Adversarial Training for Multi-Accent Speech Recognition Published in ISCSLP, 2022 Yanbing Yang, Hao Shi, Yuqin Lin, Meng Ge, Longbiao Wang, Qingzhi Hou and Jianwu Dang Download Paper | Download BibTeX
Subband-Based Spectrogram Fusion for Speech Enhancement by Combining Mapping and Masking Approaches Published in APSIPA ASC, 2022 Hao Shi, Longbiao Wang, Sheng Li, Jianwu Dang, and Tatsuya Kawahara Download Paper | Download BibTeX
Fusing Multiple Bandwidth Spectrograms for Improving Speech Enhancement Published in APSIPA ASC, 2022 Hao Shi, Yuchun Shu, Longbiao Wang, Jianwu Dang, and Tatsuya Kawahara Download Paper | Download BibTeX
Self-Distillation Based on High-level Information Supervision for Compressing End-to-End ASR Model Published in Interspeech, 2022 Qiang Xu, Tongtong Song, Longbiao Wang, Hao Shi, Yuqin Lin, Yongjie Lv, Meng Ge, Qiang Yu, and Jianwu Dang Download Paper | Download BibTeX
Language-specific Characteristic Assistance for Code-switching Speech Recognition Published in Interspeech, 2022 Tongtong Song, Qiang Xu, Meng Ge, Longbiao Wang, Hao Shi, Yongjie Lv, Yuqin Lin, and Jianwu Dang Download Paper | Download BibTeX
Monaural speech enhancement based on spectrogram decomposition for convolutional neural network-sensitive feature extraction Published in Interspeech, 2022 Hao Shi, Longbiao Wang, Sheng Li, Jianwu Dang, and Tatsuya Kawahara Download Paper | Download BibTeX
Spectrograms Fusion-based End-to-end Robust Automatic Speech Recognition Published in APSIPA ASC, 2021 Hao Shi, Longbiao Wang, Sheng Li, Cunhang Fan, Jianwu Dang, and Tatsuya Kawahara Download Paper | Download BibTeX
Simultaneous Progressive Filtering-based Monaural Speech Enhancement Published in ICONIP, 2021 Haoran Yin, Hao Shi, Longbiao Wang, Luya Qiang, Sheng Li, Meng Ge, Gaoyan Zhang, and Jianwu Dang Download Paper | Download BibTeX
Speech Dereverberation Based on Scale-aware Mean Square Error Loss Published in ICONIP, 2021 Luya Qiang, Hao Shi, Meng Ge, Haoran Yin, Nan Li, Longbiao Wang, Sheng Li, and Jianwu Dang Download Paper | Download BibTeX
Singing Voice Extraction with Attention-Based Spectrograms Fusion Published in Interspeech, 2020 Hao Shi, Longbiao Wang, Sheng Li, Chenchen Ding, Meng Ge, Nan Li, Jianwu Dang, and Hiroshi Seki Download Paper | Download BibTeX
Spectrograms Fusion with Minimum Difference Masks Estimation for Monaural Speech Dereverberation Published in IEEE-ICASSP, 2020 Hao Shi, Longbiao Wang, Meng Ge, Sheng Li, and Jianwu Dang Download Paper | Download BibTeX
Environment-Dependent Attention-Driven Recurrent Convolutional Neural Network for Robust Speech Enhancement Published in Interspeech, 2019 Meng Ge, Longbiao Wang, Nan Li, Hao Shi, Jianwu Dang, and Xiangang Li Download Paper | Download BibTeX
Investigation of Adapter for Automatic Speech Recognition in Noisy Environment Published in SIG Technical Reports, 2023 Hao Shi, and Tatsuya Kawahara Download Paper | Download BibTeX