Publication Sonification Use Cases in Highly Automated Vehicles: Designing and Evaluating Use Cases in Level 4 Automation Chihab Nadri, Sangjin Ko, Colin Diggs, R. Michael Winters, Sreehari Vattakkand, Myounghoon Jeon International Journal of Human-Computer Interaction | February 2023, pp. 1-11
Publication i-Code: An Integrative and Composable Multimodal Learning Framework Ziyi Yang, Yuwei Fang, Chenguang Zhu, Reid Pryzant, Dongdong Chen, Yu Shi, Yichong Xu, Yao Qian, Mei Gao, Yi-Ling Chen, Liyang Lu, Yujia Xie, Robert Gmyr, Noel Codella, Naoyuki Kanda, Bin Xiao, Lu Yuan, Takuya Yoshioka, Michael Zeng, Xuedong Huang AAAI 2023 | February 2023
Publication Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers Chengyi Wang, Sanyuan Chen, Yu Wu, Ziqiang Zhang, Long Zhou, Shujie Liu, Zhuo Chen, Yanqing Liu, Huaming Wang, Jinyu Li, Lei He, Sheng Zhao, Furu Wei January 2023
Microsoft Research Blog Microsoft Soundscape – New Horizons with a Community-Driven Approach December 12, 2022 Editor’s note, June 19, 2023 – The date existing installations of the Soundscape iOS app can continue to be used has been extended to Aug. 30, 2023. The article has been updated to reflect that…
Publication SoftCorrect: Error Correction with Soft Detection for Automatic Speech Recognition Yichong Leng, Xu Tan, Wenjie Liu, Kaitao Song, Rui Wang, Xiang-Yang Li, Tao Qin, Edward Lin, Tie-Yan Liu The 37th AAAI Conference on Artificial Intelligence | December 2022
Publication Acoustics as a guide to welding quality Dimitra Emmanouilidou The Journal of the Acoustical Society of America | December 2022, Vol 152(4): pp. A142
Publication TeleMelody: Lyric-to-Melody Generation with a Template-Based Two-Stage Method Zeqian Ju, Peiling Lu, Xu Tan, Rui Wang, Chen Zhang, Songruoyao Wu, Kejun Zhang, Xiangyang Li, Tao Qin, Tie-Yan Liu EMNLP 2022 | November 2022
Publication Self-supervised learning with bi-label masked speech prediction for streaming multi-talker speech recognition Zili Huang, Zhuo Chen, Naoyuki Kanda, Jian Wu, Yiming Wang, Jinyu Li, Takuya Yoshioka, Xiaofei Wang, Peidong Wang arXiv:2211.05564 | November 2022
Publication Speech separation with large-scale self-supervised learning Zhuo Chen, Naoyuki Kanda, Jian Wu, Yu Wu, Xiaofei Wang, Takuya Yoshioka, Jinyu Li, Sunit Sivasankaran, Sefik Emre Eskimez arXiv:2211.05172 | November 2022
Publication Real-Time Target Sound Extraction Bandhav Veluri, Justin Chan, Malek Itani, Tuochao Chen, Takuya Yoshioka, Shyamnath Gollakota arXiv:2211.02250 | November 2022