News & features
Research Focus: Week of August 26, 2024
Learn what’s next for AI at Research Forum on Sept. 3; WizardArena simulates human-annotated chatbot games; MInference speeds pre-filling for long-context LLMs via dynamic sparse attention; Reef: Fast succinct non-interactive zero-knowledge regex proofs.
Research Focus: Week of April 15, 2024
In this issue: New research on appropriate reliance on generative AI; Power management opportunities for LLMs in the cloud; LLMLingua-2 improves task-agnostic prompt compression; Enhancing COMET to embrace under-resourced African languages:
LLMLingua: Innovating LLM efficiency with prompt compression
| Huiqiang Jiang, Qianhui Wu, Chin-Yew Lin, Yuqing Yang, and Lili Qiu
Advanced prompting technologies for LLMs can lead to excessively long prompts, causing issues. Learn how LLMLingua compresses prompts up to 20x, maintaining quality, reducing latency, and supporting improved UX.
Efficient and hardware-friendly neural architecture search with SpaceEvo
| Li Lyna Zhang, Jiahang Xu, Quanlu Zhang, Yuqing Yang, Ting Cao, and Mao Yang
A persistent challenge in deep learning is optimizing neural network models for diverse hardware configurations, balancing performance and low latency. Learn how SpaceEvo automates hardware-aware neural architecture search to fine-tune DNN models for swift execution on diverse devices.
ICLR 2022 highlights from Microsoft Research Asia: Expanding the horizon of machine learning techniques and applications
| Shun Zheng, Jiang Bian, Tie-Yan Liu, Li Zhao, Tao Qin, Yue Wang, Dongsheng Li, Yuqing Yang, and Xufang Luo
ICLR (International Conference on Learning Representations) (opens in new tab) is recognized as one of the top conferences in the field of deep learning. Many influential papers on artificial intelligence, statistics, and data science—as well as important application fields such…
Awards | The 19th ACM International Conference on Mobile Systems, Applications, and Services (MobiSys 2021) | June 2021
Li Lyna Zhang, Ting Cao, and Yuqing Yang Mobisys 2021 Best Paper Award
Li Lyna Zhang, Ting Cao, and Yuqing Yang Mobisys 2021 Best Paper Award nn-Meter: Towards Accurate Latency Prediction of Deep-Learning Model Inference on Diverse Edge Devices.