Project Turing header: electric pulse on black background

AI at Scale

Models, infrastructure and hardware for next-generation AI applications

Nouvelles et reportages

Microsoft Research Focus 03: Week of November 7th, 2022

Blog de recherche Microsoft

Research Focus: Week of November 7, 2022

novembre 8, 2022

Welcome to Research Focus, a new series of blog posts that highlights notable publications, events, code/datasets, new hires and other milestones from across the research community at Microsoft. Barun Patra, Saksham Singhal, Shaohan Huang, Zewen Chi, Li Dong, Furu Wei,…

Blog de recherche Microsoft

DeepSpeed Compression: A composable library for extreme compression and zero-cost quantization

juillet 20, 2022 | DeepSpeed Team et Andrey Proskurin

Large-scale models are revolutionizing deep learning and AI research, driving major improvements in language understanding, generating creative texts, multi-lingual translation and many more. But despite their remarkable capabilities, the models’ large size creates latency and cost constraints that hinder the…

Dans l’actualité | ZDNet

Microsoft improves Translator and Azure AI services with new AI ‘Z-code’ models

March 22, 2022

Microsoft is updating its Translator and other Azure AI services with a set of AI models called Z-code, officials announced on March 22. These updates will improve the quality of machine translations, as well as help these services support more…

Blog de recherche Microsoft

DeepSpeed: Advancing MoE inference and training to power next-generation AI scale

janvier 19, 2022 | DeepSpeed Team et Andrey Proskurin

In the last three years, the largest trained dense models have increased in size by over 1,000 times, from a few hundred million parameters to over 500 billion parameters in Megatron-Turing NLG 530B (MT-NLG). Improvements in model quality with size…

SuperGLUE leaderboards showing T-NLRv5 at the top

Blog de recherche Microsoft

Efficiently and effectively scaling up language model pretraining for best language representation model on GLUE and SuperGLUE

décembre 2, 2021 | Jianfeng Gao et Saurabh Tiwary

As part of Microsoft AI at Scale (opens in new tab), the Turing family of NLP models are being used at scale across Microsoft to enable the next generation of AI experiences. Today, we are happy to announce that the…

Dans l’actualité | Microsoft Translator Blog

Multilingual translation at scale: 10000 language pairs and beyond

November 22, 2021

Microsoft is on a quest for AI at Scale with high ambition to enable the next generation of AI experiences. The Microsoft Translator ZCode team is working together with Microsoft Project Turing and Microsoft Research Asia to advance language and…

Blog de recherche Microsoft

Turing Bletchley: A Universal Image Language Representation model by Microsoft

novembre 1, 2021 | Saurabh Tiwary

Today, the Microsoft Turing team (opens in new tab) is thrilled to introduce Turing Bletchley, a 2.5-billion parameter Universal Image Language Representation model (T-UILR) that can perform image-language tasks in 94 languages. T-Bletchley has an image encoder and a universal language encoder that vectorize…

Figure 1. Trend of sizes of state-of-the-art NLP models over time

Blog de recherche Microsoft

Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, the World’s Largest and Most Powerful Generative Language Model

octobre 11, 2021 | Ali Alvi et Paresh Kharya

We are excited to introduce the DeepSpeed- and Megatron-powered Megatron-Turing Natural Language Generation model (MT-NLG), the largest and the most powerful monolithic transformer language model trained to date, with 530 billion parameters. It is the result of a research collaboration…

XTREME leaderboard showing T-ULRv5 at the top.

Blog de recherche Microsoft

Microsoft Turing Universal Language Representation model, T-ULRv5, tops XTREME leaderboard and trains 100x faster

septembre 28, 2021 | Saurabh Tiwary et Lidong Zhou

Today, we are excited to announce that with our latest Turing universal language representation model (T-ULRv5), a Microsoft-created model is once again the state of the art and at the top of the Google XTREME public leaderboard. Resulting from a…