Microsoft Research Blog
See what we mean – Visually grounded natural language navigation is going places
How do humans communicate efficiently? The common belief is that the words humans use to communicate – such as dog, for instance – invoke similar understanding of the physical concepts. Indeed, there exists a common conception about the physical appearance…
A picture from a dozen words – A drawing bot for realizing everyday scenes—and even stories
If you were asked to draw a picture of several people in ski gear, standing in the snow, chances are you’d start with an outline of three or four people reasonably positioned in the center of the canvas, then sketch…
Envisioning privacy preserving image-based localization for augmented reality
New camera localization technology for sensitive environments can keep images and map data confidential Advances in augmented reality (AR) and mobile robotics promise to revolutionize how we see and interact with our physical world in the future. Today, AR and…
HELP! Training assistive indoor agents to ask for assistance via imitation learning
| Debadeepta Dey, Khanh Nguyen, Chris Brockett, et Bill Dolan
Today people use personal digital assistants for help with scheduling, playing music, turning on or adjusting other devices, and answering basic questions such as “What time’s the game on?” or “Where’s the nearest hardware store?” But what if these assistants…