Research talk: An intelligent data-driven paradigm towards cloud reliability
Cloud systems are perhaps the most complicated computing systems developed so far, yet our daily lives depend heavily on their continuous and reliable operation. To achieve systematic cloud reliability, we propose an intelligent data-driven paradigm based on AIOps (artificial intelligence for IT operations). We collect heterogeneous data such as traces, logs, key performance indicators (KPIs), topologies, and incidents from multiple sources in cloud systems, and perform data-driven operations on them, including anomaly detection, failure diagnosis, and fault localization, to improve cloud resilience. We conduct experiments in industrial settings that demonstrate the applicability of the proposed paradigm and its effectiveness in carrying out AIOps tasks reliably in cloud computing environments.
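To make one of the operations named above concrete, here is a minimal sketch of anomaly detection on a single KPI series. The abstract does not say which detection method the paradigm uses; the trailing-window z-score below is a common baseline chosen purely for illustration. The function name `detect_kpi_anomalies`, its parameters, and the latency samples are all hypothetical.

```python
from statistics import mean, stdev


def detect_kpi_anomalies(values, window=5, threshold=3.0):
    """Return indices of points that deviate from the mean of the
    preceding `window` points by more than `threshold` standard deviations.

    Illustrative baseline only; not the method presented in the talk.
    """
    anomalies = []
    for i in range(window, len(values)):
        history = values[i - window:i]
        mu = mean(history)
        sigma = stdev(history)
        if sigma == 0:
            # Flat history: flag any departure from the constant value.
            if values[i] != mu:
                anomalies.append(i)
            continue
        if abs(values[i] - mu) / sigma > threshold:
            anomalies.append(i)
    return anomalies


# Hypothetical request-latency KPI samples in milliseconds.
latency_ms = [100, 102, 99, 101, 100, 103, 98, 250, 101, 100]
print(detect_kpi_anomalies(latency_ms))
```

A production pipeline would typically combine many KPIs with logs and traces before diagnosis; this sketch covers only the single-series detection step.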
Learn more about the 2021 Microsoft Research Summit: https://Aka.ms/researchsummit
- Track: Cloud Intelligence/AIOps
- Speakers: Michael Lyu
- Affiliation: Chinese University of Hong Kong
Michael Lyu, Professor, Chinese University of Hong Kong