Hi, I am a PhD student at the School of Data Science, the Chinese University of Hong Kong (Shenzhen), supervised by Prof. Haizhou Li. Prior to that, I received my Bachelor Degree from Southern University of Science and Technology, supervised by Prof. Tom Ko. My research interests include automatic speech recognition, speech translation and speech pre-training. I have published several papers at the top international AI conferences such as ICASSP, INTERSPEECH, ACL and EMNLP.
📖 Educations
- 2022.09 - now, Ph.D., the Chinese University of Hong Kong (Shenzhen).
- 2016.09 - 2020.06, B.Eng, Southern University of Science and Technology.
- 2018.09 - 2019.05, Visiting Student, the University of Edinburgh.
💻 Internships
- 2021.06 - 2022.04, MSRA NLC group, Beijing, mentored by Dr. Long Zhou and Dr. Shujie Liu.
- 2019.06 - 2019.08, Tencent, Shenzhen.
📝 Publications
-
CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning, Chutong Meng, Junyi Ao, Tom Ko, Mingxuan Wang, Haizhou Li, INTERSPEECH 2023 |
-
Self-Supervised Acoustic Word Embedding Learning via Correspondence Transformer Encoder, Jingru Lin, Xianghu Yue, Junyi Ao, Haizhou Li, INTERSPEECH 2023
-
token2vec: A Joint Self-Supervised Pre-training Framework Using Unpaired Speech and Text, Xianghu Yue, Junyi Ao, Xiaoxue Gao, Haizhou Li, ICASSP 2023
-
Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data, Junyi Ao, Ziqiang Zhang, Long Zhou, Shujie Liu, Haizhou Li, Tom Ko, Lirong Dai, Jinyu Li, Yao Qian, Furu Wei, INTERSPEECH 2022 |
-
SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing, Junyi Ao, Rui Wang, Long Zhou, Chengyi Wang, Shuo Ren, Yu Wu, Shujie Liu, Tom Ko, Qing Li, Yu Zhang, Zhihua Wei, Yao Qian, Jinyu Li, Furu Wei, ACL 2022 |
-
SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-training, Ziqiang Zhang, Long Zhou, Junyi Ao, Shujie Liu, Lirong Dai, Jinyu Li, Furu Wei, EMNLP 2022 |
-
LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT, Rui Wang, Qibing Bai, Junyi Ao, Long Zhou, Zhixiang Xiong, Zhihua Wei, Yu Zhang, Tom Ko, Haizhou Li, INTERSPEECH 2022 |
-
The YiTrans Speech Translation System for IWSLT 2022 Offline Shared Task, Ziqiang Zhang, Junyi Ao, Long Zhou, Shujie Liu, Furu Wei, Jinyu Li, ACL@IWSLT 2022 |
-
Multi-View Self-Attention Based Transformer for Speaker Recognition, Rui Wang, Junyi Ao, Long Zhou, Shujie Liu, Zhihua Wei, Tom Ko, Qing Li, Yu Zhang, ICASSP 2022
-
Improving Attention-based End-to-end ASR by Incorporating an N-gram Neural Network, Junyi Ao, Tom Ko, ISCSLP 2021
📜 Preprints
- USED: Universal Speaker Extraction and Diarization, Junyi Ao, Mehmet Sinan Yıldırım, Ruijie Tao, Meng Ge, Shuai Wang, Yanmin Qian, Haizhou Li, arXiv preprint arXiv:2309.10674
📚 Teaching
- DDA3020 Machine Learning, Spring 2023
- CSC3100 Data Structures, Fall 2022