Publication Multi-View Learning for Speech Emotion Recognition Daniel Tompkins, Dimitra Emmanouilidou, Soham Deshmukh, Benjamin Elizalde International Conference on Acoustics, Speech and Signal Processing | June 2023
Publication Pengi: An Audio Language Model for Audio Tasks Soham Deshmukh, Benjamin Elizalde, Rita Singh, Huaming Wang NeurIPS 2023 | May 2023
Publication Any-to-Any Generation via Composable Diffusion Zineng Tang, Ziyi Yang, Chenguang Zhu, Michael Zeng, Mohit Bansal NeurIPS 2023 | May 2023
Publication WINC: A Wireless IoT Network for Multi-Noise Source Cancellation Ishani Janveja, Jiaming Wang, Junfeng Guan, Suraj Jog, Haitham Hassanieh IPSN’23 | May 2023
Publication Investigations in Audio Captioning: Addressing Vocabulary Imbalance and Evaluating Suitability of Language-Centric Performance Metrics Sandeep Kothinti, Dimitra Emmanouilidou European Signal Processing Conference (EUSIPCO) | May 2023
Publication i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data Ziyi Yang, Mahmoud Khademi, Yichong Xu, Reid Pryzant, Yuwei Fang, Chenguang Zhu, Dongdong Chen, Yao Qian, Mei Gao, Yi-Ling Chen, Robert Gmyr, Naoyuki Kanda, Noel Codella, Bin Xiao, Yu Shi, Lu Yuan, Takuya Yoshioka, Michael Zeng, Xuedong Huang North American Chapter of the Association for Computational Linguistics (NAACL) 2024 | May 2023
Publication GETMusic: Generating Any Music Tracks with a Unified Representation and Diffusion Framework Ang Lv, Xu Tan, Peiling Lu, Wei Ye, Shikun Zhang, Jiang Bian, Rui Yan May 2023
Publication Gaze & Tongue: A Subtle, Hands-Free Interaction for Head-Worn Devices Tan Gemicioglu, R. Michael Winters, Yu-Te Wang, Thom Gable, Ann Paradiso, Ivan Tashev CHI EA ’23: Extended Abstracts of the 2023 CHI Conference on Human Factors in Computing Systems | April 2023 Finalist – Best Interactivity Demo
Publication Speak Foreign Languages with Your Own Voice: Cross-Lingual Neural Codec Language Modeling Ziqiang Zhang, Long Zhou, Chengyi Wang, Sanyuan Chen, Yu Wu, Shujie Liu, Zhuo Chen, Yanqing Liu, Huaming Wang, Jinyu Li, Lei He, Sheng Zhao, Furu Wei March 2023
Video Neural Interfaces – Towards a new generation of human-computer interface February 10, 2023 | Yacine Achiakh, Alain Sirois Talk by Yacine Achiakh, co-founder & CEO of Wisear, and Alain Sirois, co-founder & CTO of Wisear. 01:27:52