新闻与深度文章
| Arindam Mitra, Ahmed Awadallah, 和 Yash Lara
Orca-AgentInstruct, from Microsoft Research, can generate diverse, high-quality synthetic data at scale to post-train and fine-tune base LLMs for expanded capabilities, continual learning, and increased performance.
New Research | FLASH: Workflow automation agent for diagnosing recurring incidents; METAREFLECTION: Learning instructions for language agents using past reflections; Boosting LLM training efficiency through faster communication between GPUs; and more.
新闻报道 | Eye on AI (podcast)
Why AI Agents Are the Next Big Thing in Tech
In this episode of the Eye on AI podcast, we dive deep into the world of AI agents with Ece Kamar, VP of Research and Managing Director of AI Frontiers Lab at Microsoft. Ece shares her unique insights on the…
Explore multimodal & small language models, plus advanced benchmarks for AI evaluation. Microsoft researchers are working on breakthroughs in weather prediction, materials design, even a new kind of computer for AI inference and hard optimization problems.
| Vidhisha Balachandran, Jingya Chen, Neel Joshi, Besmira Nushi, Hamid Palangi, Eduardo Salinas, Vibhav Vineet, James Woffinden-Luey, 和 Safoora Yousefi
How can we rigorously evaluate and understand state-of-the-art progress in AI? Eureka is an open-source framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings. Learn more about the extended findings.
In this episode, learn about the latest multimodal AI models, advanced benchmarks for AI evaluation and model self-improvement, and an entirely new kind of computer for AI inference and hard optimization. Discover how these research breakthroughs and more can help…
| Ching-An Cheng, Adith Swaminathan, 和 Allen Nie
Introducing Trace, Microsoft and Stanford University’s novel AI optimization framework, now available as a Python library. Trace adapts dynamically and optimizes a wide range of applications from language models to robot control.
Abstracts: July 18, 2024
| Gretchen Huizinga 和 Arindam Mitra
Senior Researcher Arindam Mitra introduces AgentInstruct. Using raw data sources, the automated multi-agent framework can create diverse, high-quality synthetic data at scale for the post-training of small and large language models.
新闻报道 | Microsoft News Center
Why AI sometimes gets it wrong — and big strides to address it
Around the time GPT-4 was making headlines for acing standardized tests, Microsoft researchers and collaborators were putting other AI models through a different type of test — one designed to make the models fabricate information.