Projects
Making Azure’s big bet possible Recent innovations in generative large language models (LLMs) have made their applications and use-cases ubiquitous. This has led to large-scale deployments of these models, using complex, expensive, and power-hungry AI accelerators, most commonly GPUs. These…
The Zissou project is exploring immersion cooling in large-scale cloud platforms. Our main motivation is that chip power has been steadily increasing since the end of Dennard scaling.
Established:
Function as a Service (FaaS) is a software paradigm that is becoming increasingly popular. Multiple cloud providers offer FaaS as the interface to usage-driven, stateless (serverless) backend services. FaaS offers an intuitive, event-based interface for developing cloud-based applications. In contrast…
Power Capping and Oversubscription is a collaboration between MSR, Azure Compute, CO+I, and AHSI to harvest stranded datacenter resources via smart performance-aware power capping and oversubscription.
Established:
No public cloud can host large latency-sensitive services, such as search engines, in a way that is economic for those services today! Project LEAP (short for Lean, Efficient, And Predictable) addresses the research challenges in enabling cloud platforms to host…
Established:
Resource Central is a general ML and prediction-serving system deployed in Azure Compute. It trains ML models offline and uses them to produce predictions online.