新闻与深度文章
| Ashley Llorens 和 Ida Momennejad
Principal Researcher Ida Momennejad brings her expertise in cognitive neuroscience and computer science to this in-depth conversation about general intelligence and what the evolution of the brain across species can teach us about building AI.
Abstracts: January 25, 2024
| Gretchen Huizinga, Jordan Ash, 和 Dipendra Misra
On “Abstracts,” Jordan Ash & Dipendra Misra discuss the parameter reduction method LASER. Tune in to learn how selective removal of stored data alone can boost LLM performance, then sign up for Microsoft Research Forum for more on LASER &…
We’re proud to have 100+ accepted papers At NeurIPS 2023, plus 18 workshops. Several submissions were chosen as oral presentations and spotlight posters, reflecting groundbreaking concepts, methods, or applications. Here’s an overview of those submissions.
| Jessica Maghakian, Akanksha Saran, Cheng Tan, 和 Paul Mineiro
In reinforcement learning, handcrafting reward functions is difficult and can yield algorithms that don’t generalize well. IGL-P, an interaction-grounded learning strategy, learns personalized rewards for different people in recommender system scenarios.
奖项 | International World Wide Web Conference
John Langford, Rob Schapire and co-authors receive the 2023 Seoul Test of Time Award
The International World Wide Web Conference Committee (IW3C2) announced today that the 2023 Seoul Test of Time Award will be presented to the authors of the paper “A Contextual-Bandit Approach to Personalized News Article Recommendation;” Wei Chu, (Ant Group), Lihong…
新闻报道 | Machine Learning (Theory)
HOMER: Provable Exploration in Reinforcement Learning
Last week at ICML 2020, Mikael Henaff, Akshay Krishnamurthy, John Langford and Dipendra Misra had a paper on a new reinforcement learning (RL) algorithm that solves three key problems in RL: (i) global exploration, (ii) decoding latent dynamics, and (iii) optimizing a given…
新闻报道 | Medium | Machine Learning
HOMER: Provable Exploration in Reinforcement Learning
At ICML 2020, Mikael Henaff, Akshay Krishnamurthy, John Langford and Dipendra Misra published a paper presenting a new reinforcement learning (RL) algorithm called HOMER that addresses three main problems in real-world RL problem: (i) exploration, (ii) decoding latent dynamics, and (iii) optimizing…
MSR’s New York City lab is home to some of the best reinforcement learning research on the planet but if you ask any of the researchers, they’ll tell you they’re very interested in getting it out of the lab and…
新闻报道 | Microsoft Research Webinar Series
Machine Learning and Fairness Webinar
In this webinar led by Microsoft researchers Jenn Wortman Vaughan and Hanna Wallach, 15-year veterans of the machine learning field, you’ll learn how to make detecting and mitigating biases a first-order priority in your development and deployment of ML systems.