Sitemap
A list of all the posts and pages found on the site. For you robots out there, an XML version is also available for digesting.
Pages
Posts
Future Blog Post
Published:
This post will show up by default. To disable scheduling of future posts, edit config.yml and set future: false.
Blog Post number 4
Published:
This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.
Blog Post number 3
Published:
This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.
Blog Post number 2
Published:
This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.
Blog Post number 1
Published:
This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.
portfolio
Portfolio item number 1
Short description of portfolio item number 1
Portfolio item number 2
Short description of portfolio item number 2
publications
Environment-Dependent Attention-Driven Recurrent Convolutional Neural Network for Robust Speech Enhancement
Published in Interspeech, 2019
Meng Ge, Longbiao Wang, Nan Li, Hao Shi, Jianwu Dang, and Xiangang Li
Spectrograms Fusion with Minimum Difference Masks Estimation for Monaural Speech Dereverberation
Published in IEEE-ICASSP, 2020
Hao Shi, Longbiao Wang, Meng Ge, Sheng Li, and Jianwu Dang
Singing Voice Extraction with Attention-Based Spectrograms Fusion
Published in Interspeech, 2020
Hao Shi, Longbiao Wang, Sheng Li, Chenchen Ding, Meng Ge, Nan Li, Jianwu Dang, and Hiroshi Seki
Speech Dereverberation Based on Scale-aware Mean Square Error Loss
Published in ICONIP, 2021
Luya Qiang, Hao Shi, Meng Ge, Haoran Yin, Nan Li, Longbiao Wang, Sheng Li, and Jianwu Dang
Simultaneous Progressive Filtering-based Monaural Speech Enhancement
Published in ICONIP, 2021
Haoran Yin, Hao Shi, Longbiao Wang, Luya Qiang, Sheng Li, Meng Ge, Gaoyan Zhang, and Jianwu Dang
Spectrograms Fusion-based End-to-end Robust Automatic Speech Recognition
Published in APSIPA ASC, 2021
Hao Shi, Longbiao Wang, Sheng Li, Cunhang Fan, Jianwu Dang, and Tatsuya Kawahara
Monaural speech enhancement based on spectrogram decomposition for convolutional neural network-sensitive feature extraction
Published in Interspeech, 2022
Hao Shi, Longbiao Wang, Sheng Li, Jianwu Dang, and Tatsuya Kawahara
Language-specific Characteristic Assistance for Code-switching Speech Recognition
Published in Interspeech, 2022
Tongtong Song, Qiang Xu, Meng Ge, Longbiao Wang, Hao Shi, Yongjie Lv, Yuqin Lin, and Jianwu Dang
Self-Distillation Based on High-level Information Supervision for Compressing End-to-End ASR Model
Published in Interspeech, 2022
Qiang Xu, Tongtong Song, Longbiao Wang, Hao Shi, Yuqin Lin, Yongjie Lv, Meng Ge, Qiang Yu, and Jianwu Dang
Fusing Multiple Bandwidth Spectrograms for Improving Speech Enhancement
Published in APSIPA ASC, 2022
Hao Shi, Yuchun Shu, Longbiao Wang, Jianwu Dang, and Tatsuya Kawahara
Subband-Based Spectrogram Fusion for Speech Enhancement by Combining Mapping and Masking Approaches
Published in APSIPA ASC, 2022
Hao Shi, Longbiao Wang, Sheng Li, Jianwu Dang, and Tatsuya Kawahara
Adaptive Attention Network with Domain Adversarial Training for Multi-Accent Speech Recognition
Published in ISCSLP, 2022
Yanbing Yang, Hao Shi, Yuqin Lin, Meng Ge, Longbiao Wang, Qingzhi Hou, and Jianwu Dang
Time-domain Speech Enhancement Assisted by Multi-resolution Frequency Encoder and Decoder
Published in IEEE-ICASSP, 2023
Hao Shi, Masato Mimura, Longbiao Wang, Jianwu Dang, and Tatsuya Kawahara
Extending Audio Masked Autoencoders Toward Audio Restoration
Published in WASPAA, 2023
Zhi Zhong, Hao Shi, Masato Hirano, Kazuki Shimada, Kazuya Tateishi, Takashi Shibuya, Shusuke Takahashi, and Yuki Mitsufuji
Investigation of Adapter for Automatic Speech Recognition in Noisy Environment
Published in SIG Technical Reports, 2023
Hao Shi and Tatsuya Kawahara
Enhancing Two-stage Finetuning for Speech Emotion Recognition Using Adapters
Published in IEEE-ICASSP, 2024
Yuan Gao, Hao Shi, Chenhui Chu, and Tatsuya Kawahara
Diffusion-Based Speech Enhancement with Joint Generative and Predictive Decoders
Published in IEEE-ICASSP, 2024
Hao Shi, Kazuki Shimada, Masato Hirano, Takashi Shibuya, Yuichiro Koyama, Zhi Zhong, Shusuke Takahashi, Tatsuya Kawahara, and Yuki Mitsufuji
Ensemble Inference for Diffusion Model-based Speech Enhancement
Published in IEEE-ICASSPW, 2024
Hao Shi, Naoyuki Kamo, Marc Delcroix, Tomohiro Nakatani, and Shoko Araki
Waveform-domain Speech Enhancement Using Spectrogram Encoding for Robust Speech Recognition
Published in IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024
Hao Shi, Masato Mimura, and Tatsuya Kawahara
Speech Emotion Recognition with Multi-level Acoustic and Semantic Information Extraction and Interaction
Published in Interspeech, 2024
Yuan Gao, Hao Shi, Chenhui Chu, and Tatsuya Kawahara
Dual-path Adaptation of Pretrained Feature Extraction Module for Robust Automatic Speech Recognition
Published in Interspeech, 2024
Hao Shi and Tatsuya Kawahara
Error Correction by Paying Attention to Both Acoustic and Confidence References for Automatic Speech Recognition
Published in Interspeech, 2024
Yuchun Shu, Bo Hu, Yifeng He, Hao Shi, Longbiao Wang, and Jianwu Dang
Serialized Speech Information Guidance with Overlapped Encoding Separation for Multi-Speaker Automatic Speech Recognition
Published in IEEE-SLT, 2024
Hao Shi, Yuan Gao, Zhaoheng Ni, and Tatsuya Kawahara
Reducing the Gap between Pretrained Speech Enhancement and Recognition Models Using a Real Speech-Trained Bridging Module
Published in IEEE-ICASSP, 2025
Zhongjian Cui, Chenrui Cui, Tianrui Wang, Mengnan He, Hao Shi, Meng Ge, Caixia Gong, Longbiao Wang, and Jianwu Dang
Adapting Pretrained Speech Recognition Models for Code-Switching through Encoding Refining and Language-Aware Attention-based Decoding
Published in IEEE-ICASSP, 2025
Jiahui Zhao, Hao Shi, Tianrui Wang, Hexin Liu, Zhaoheng Ni, Lingxuan Ye, and Longbiao Wang
talks
Talk 1 on Relevant Topic in Your Field
Published:
This is a description of your talk, which is a markdown file that can be all markdown-ified like any other post. Yay markdown!
Conference Proceeding talk 3 on Relevant Topic in Your Field
Published:
This is a description of your conference proceedings talk, note the different field in type. You can put anything in this field.
teaching
Teaching experience 1
Undergraduate course, University 1, Department, 2014
This is a description of a teaching experience. You can use markdown like any other post.
Teaching experience 2
Workshop, University 1, Department, 2015
This is a description of a teaching experience. You can use markdown like any other post.