Return to Microsoft Research Lab – New York City

Reinforcement Learning

新闻与深度文章

photo of Ida Momennejad for the AI Frontiers Microsoft Research Podcast series

微软研究院播客

AI Frontiers: Rethinking intelligence with Ashley Llorens and Ida Momennejad

2024年3月28日 | Ashley Llorens 和 Ida Momennejad

Principal Researcher Ida Momennejad brings her expertise in cognitive neuroscience and computer science to this in-depth conversation about general intelligence and what the evolution of the brain across species can teach us about building AI.

Microsoft Research Podcast - Abstracts hero with a microphone icon

微软研究院播客

Abstracts: January 25, 2024

2024年1月25日 | Gretchen Huizinga, Jordan Ash, 和 Dipendra Misra

On “Abstracts,” Jordan Ash & Dipendra Misra discuss the parameter reduction method LASER. Tune in to learn how selective removal of stored data alone can boost LLM performance, then sign up for Microsoft Research Forum for more on LASER &…

Research Focus Edition 30 December 6, 2023

微软研究院博客

Research Focus: Week of December 4, 2023

2023年12月6日

Research Focus: Using LLMs in a Rust-based formal verification framework; Rethinking network measurements with user feedback; 3D telemedicine using HoloportationTM communication technology could enhance overseas surgical visits.

奖项 | International World Wide Web Conference

John Langford, Rob Schapire and co-authors receive the 2023 Seoul Test of Time Award

2023年5月2日

The International World Wide Web Conference Committee (IW3C2) announced today that the 2023 Seoul Test of Time Award will be presented to the authors of the paper “A Contextual-Bandit Approach to Personalized News Article Recommendation;” Wei Chu, (Ant Group), Lihong…

SelfTune interaction with Client (Developer Machine) into Data Store (Azure ML Workspace)

微软研究院博客

Automatic post-deployment management of cloud applications

2023年4月18日 | Nagarajan Natarajan, Lei Zhao, Rodrigo Fonseca, Tatiana Racheva, 和 Yogesh Bansal

In the first two blog posts in this series, we presented our vision for Cloud Intelligence/AIOps (AIOps) research, and scenarios where innovations in AI technologies can help build and operate complex cloud platforms and services effectively and efficiently at scale.…

微软研究院博客

Research Collection – Shall we play a game?

2021年2月5日

From a research point of view, games offer an amazing environment in which to develop new machine learning algorithms and techniques. And we hope, in due course, that those new algorithms will feed back not just into gaming, but into…

新闻报道 | Machine Learning (Theory)

HOMER: Provable Exploration in Reinforcement Learning

2020年7月21日

Last week at ICML 2020, Mikael Henaff, Akshay Krishnamurthy, John Langford and Dipendra Misra had a paper on a new reinforcement learning (RL) algorithm that solves three key problems in RL: (i) global exploration, (ii) decoding latent dynamics, and (iii) optimizing a given…

新闻报道 | Medium | Machine Learning

HOMER: Provable Exploration in Reinforcement Learning

2020年7月14日

At ICML 2020, Mikael Henaff, Akshay Krishnamurthy, John Langford and Dipendra Misra published a paper presenting a new reinforcement learning (RL) algorithm called HOMER that addresses three main problems in real-world RL problem: (i) exploration, (ii) decoding latent dynamics, and (iii) optimizing…

微软研究院播客

Provably efficient reinforcement learning with Dr. Akshay Krishnamurthy

2020年6月3日

MSR’s New York City lab is home to some of the best reinforcement learning research on the planet but if you ask any of the researchers, they’ll tell you they’re very interested in getting it out of the lab and…