À propos
Kevin Lin is a Senior Researcher at Microsoft Azure AI, working on computer vision and vision-language multimodal intelligence.
He received his Ph.D. degree in electrical engineering from the University of Washington, in 2020, and his M.S. degree in computer science from National Taiwan University, in 2014.
More information can be found in his personal homepage (opens in new tab).
Featured content
End-to-End Human Pose and Mesh Reconstruction with Transformers
We present a new method, called MEsh TRansfOrmer (METRO), to reconstruct 3D human pose and mesh vertices from a single image. Code and model can be found at https://github.com/microsoft/MeshTransformer
Novel object captioning surpasses human performance on benchmarks
Consider for a moment what it takes to visually identify and describe something to another person. Now imagine that the other person can’t see the object or image, so every detail matters. How do you decide what information is important…
Skeletal Tracking on Azure Kinect
Microsoft has released a new RGB-D sensor called Azure Kinect. In this project, we develop the skeletal tracking SDK for Azure Kinect. The product is called Azure Kinect Body Tracking SDK. It consists of 2D pose estimation and 3D model fitting. The 2D pose estimation is a neural network based solution and its input is the IR image of the depth sensor.