“If life is a marathon, then health is the key to its duration.” Health is not only the foundation of happiness and societal progress but also a pivotal aspect of the intelligent era. AI’s integration into healthcare represents a transformative tool for maintaining and enhancing human well-being. From aiding early disease detection and progression prediction to personalizing precision medicine and accelerating medical research and drug development, AI’s unique value and potential are increasingly evident.
Over recent years, Microsoft Research Asia has deepened its collaboration with medical institutions and academic experts, attracting professionals in healthcare to foster AI’s profound application in medical health, thereby contributing to a healthier global community.
Early detection, early treatment: AI in disease detection and rehabilitation
The early diagnosis of diseases is vital for enhancing treatment outcomes and patient quality of life. Rehabilitation training, a critical component of many treatment regimens, plays a significant role in restoring patient functions. Traditional methods face limitations due to resource distribution, geographic constraints, and a shortage of medical professionals, affecting the accessibility and efficiency of healthcare services. AI can support medical staff by providing automated, intelligent early disease detection, enabling timely intervention and treatment.
AI-powered voice recognition for speech rehabilitation in cleft palate patients
Cleft palate and cleft lip, prevalent congenital deformities in the oral and maxillofacial region, often result in hyper nasal speech due to velopharyngeal insufficiency. Microsoft Research Asia, in collaboration with medical institutions, recognizes hypernasality detection as crucial for treating cleft palate patients.
Traditionally, speech-language pathologists assess hypernasality, but their limited availability and concentration in certain hospitals necessitate extensive, costly cross-regional patient travel. An automated hypernasality assessment method would not only aid pathologists in making accurate evaluations but also facilitate remote patient diagnosis and treatment, significantly reducing costs.
Leveraging transfer learning technology, Microsoft Research Asia has developed an innovative approach using an automatic speech recognition (ASR) model to enhance hypernasality assessment. This innovative model excels in extracting acoustic features and demonstrates robust generalization capabilities. Comparative studies on two cleft palate datasets reveal that it surpasses existing methods, significantly enhancing the precision of pathologists’ diagnostic processes.
Following hypernasality evaluations, physicians devise tailored speech therapy regimens for patients. Microsoft Research Asia has advanced this process by developing the Masked Pre-training Pronunciation Assessment (MPA) model. This model supports end-to-end training and adapts to both unsupervised and supervised learning environment, enabling user-friendly remote deployment. Utilizing reference texts and integrating masking with prediction tactics, the MPA model adeptly circumvents issues of misalignment or misrecognition in pronunciation assessments, offering more precise speech correction support for individuals with cleft palate.
Microsoft Research Asia is actively collaborating with healthcare providers to assess the feasibility of deploying this cutting-edge speech assessment technology. The goal is to enhance the efficiency of medical diagnoses and treatments, lower the financial burden on patients, and extend the benefits of this technology to numerous cleft palate sufferers in isolated regions.
Related papers:
- Paper on hypernasality detection: Improving Hypernasality Estimation with Automatic Speech Recognition in Cleft Palate Speech (opens in new tab). INTERSPEECH 2022.
- Paper on speech assessment: End-to-End Word-Level Pronunciation Assessment with MASK Pre-training (opens in new tab). INTERSPEECH 2023.
Voice analysis model enhances Alzheimer’s disease screening
Alzheimer’s disease, a prevalent neurodegenerative condition primarily affecting the elderly, leads to progressive cognitive decline, including memory loss, language difficulties, and impaired reasoning. While there’s no cure for Alzheimer’s, early detection and intervention are key to decelerating its progression.
Traditional diagnostic methods, such as brain scans, blood tests, and cognitive assessments, are extensive and expensive. However, research indicates that early Alzheimer’s can be detected through speech analysis, identifying symptoms like fluent aphasia and word retrieval challenges.
Capitalizing on this insight, Microsoft Research Asia has pioneered speech and language analysis technologies to detect Alzheimer’s indicators from sophisticated acoustic and linguistic data. A novel task-oriented model has also been introduced, correlating language descriptions with cognitive tasks.
In the ADReSS dataset’s subtask involving “Cookie Theft” picture descriptions (Figure 1), these methods attained a 91.4% accuracy rate. This innovative approach, merging speech and semantic analysis, significantly increases disease detection accuracy. The model’s high efficiency and performance on new test sets offer promising prospects for Alzheimer’s screening at scale.
Related paper:
- Leveraging Pretrained Representations with Task-related Keywords for Alzheimer’s Disease Detection (opens in new tab). ICASSP 2023.
Advancing autism diagnosis: Unsupervised detection of stereotypical behaviors
Autism Spectrum Disorder (ASD) typically manifests in early childhood, presenting challenges in social interaction and communication, accompanied by repetitive behaviors. These behaviors, which may include actions like persistent hand-flapping or head-banging, serve as vital indicators for ASD diagnosis. Early detection and intervention are crucial for improving outcomes, yet traditional methods relying on prolonged observation by specialists are not always efficient. Hence, the development of a swift, automated detection system is invaluable.
Traditional approaches have utilized computer vision and supervised learning to analyze video data of individuals with ASD. However, these methods face limitations due to the diverse range of stereotypical behaviors and privacy concerns associated with video data collection.
Addressing these challenges, Microsoft Research Asia, in collaboration with medical institutions, has innovated an unsupervised approach using video anomaly recognition. The new Dual-Stream Stereotypical Behavior Detector (DS-SBD) model leverages the temporal dynamics of human posture and repetitive motion patterns. Remarkably, DS-SBD requires only non-anomalous behavior for training and can identify previously unseen stereotypical behaviors during inference, such as identifying circling behaviors in the training data.
Extensive studies validate that DS-SBD’s unsupervised technique has increased the micro-average AUROC from 60.43% to 71.04% and the macro-average AUROC from 56.45% to 73.39%, signifying a substantial improvement in both accuracy and the scope of detectable behaviors. This breakthrough outperforms current state-of-the-art methods and is poised to set a new standard in the field. While DS-SBD marks a significant advancement in recognizing stereotypical behaviors, it represents only one facet of the broader ASD diagnostic process. Comprehensive early diagnosis and intervention strategies will benefit from continued interdisciplinary collaboration and societal engagement.
Related paper:
- Unsupervised Video Anomaly Detection for Stereotypical Behaviors in Autism (opens in new tab). ICASSP 2023.
Advancing neonatal seizure detection through brainwave analysis
Epilepsy in children is a multifaceted, often recurring neurological disorder that predominantly occurs in the formative years (0-18 years). The prompt identification of epilepsy in newborns is vital to safeguard their developmental trajectory.
The genesis of epileptic seizures lies in the abnormal discharges of neurons within the brain, rendering brainwave analysis a pivotal tool for epilepsy diagnosis. Nonetheless, the nascent state of neonatal brain development, coupled with the pronounced noise in brainwave data and the marked variability among infants, renders the detection of neonatal epilepsy a formidable medical challenge.
Microsoft Research Asia and its collaborators have unveiled a deep learning paradigm, harnessing artificial intelligence and electroencephalogram (EEG) signals – dubbed the Spatial-Temporal EEG Network (STATENet). This model adeptly processes neural signals, nimbly adjusts to neonatal EEG channel variations, and addresses the challenges outlined above. Additionally, the team has introduced a model-level integration technique that synergistically combines outcomes from various spatial-temporal deep models, thereby bolstering the STATENet model’s generalization ability across diverse neonatal subjects.
Extensive studies utilizing a comprehensive dataset of real-world neonatal EEG data have demonstrated the STATENet model’s substantial enhancement in detection precision. The model’s area under the precision-recall curve (AUPRC) witnessed an improvement exceeding 30% relative to prevailing top-tier methods, equipping physicians with a novel diagnostic instrument for pediatric epilepsy.
Moreover, Microsoft Research Asia has pioneered the inaugural cross-dataset EEG model capable of deciphering any EEG data, thus achieving a ‘one-to-many’ brainwave comprehension. This breakthrough underpins the AI Neurologist system, designed to augment brainwave signal analysis in both clinical and research settings, elevating diagnostic accuracy from 75% to 90% in a case study. The associated models are now open source on GitHub, inviting global research participation to extend this technology’s impact across the medical spectrum and catalyze new diagnostic and therapeutic innovations.
Related papers:
- Paper on neonatal seizure detection: Protecting the Future: Neonatal Seizure Detection with Spatial-Temporal Modeling (opens in new tab). IEEE International Conference on Systems, Man, and Cybernetics (SMC), 2023.
- Paper on brainwave pre-training model: Learning Topology-Agnostic EEG Representations with Geometry-Aware Modeling. (opens in new tab)
Enhancing disease progression prediction and personalized care: The role of AI in precision medicine
Precision medicine represents a transformative approach to healthcare, tailoring treatments to individual patient profiles. Despite the promise, the complexity and unique nature of diseases present significant hurdles. AI emerges as a powerful ally, leveraging data analysis, pattern detection, and predictive modeling to forecast disease progression and risks. This capability is especially crucial in chronic disease management, aiding clinicians and patients in mitigating illness severity and preventing complications.
Graph neural networks: A novel approach to Parkinson’s disease progression
Parkinson’s disease, a prevalent neurodegenerative condition in seniors, progresses gradually. Patient conditions may remain stable or even improve over time with proper medication and therapy, maintaining optimal physical functions. Yet, Parkinson’s presents a spectrum of symptoms, from sleep disturbances to motor challenges, making disease progression prediction complex.
Researchers at Microsoft Research Asia advocate for the analysis of multimodal data to extract similar symptoms, thus enhancing prediction accuracy. Graph neural networks (GNNs) excel in mapping patient interconnections, forming networks where nodes represent patients linked by shared attributes. Selecting these attributes, however, largely demands expert knowledge and experience.
To overcome this, Microsoft Research Asia collaborated closely with medical institutions. Based on recommendations from professional medical personnel, a new algorithm called AdaMedGraph was proposed. AdaMedGraph autonomously identifies key features to construct patient similarity graphs, harmonizing with existing knowledge and integrating expert-designed graphs into a comprehensive model. Unifying individual and collective data, this innovation simplifies the graph construction process.
Evaluated on two public datasets, the Parkinson’s Progression Markers Initiative (PPMI) and the Parkinson’s Disease Biomarkers Program (PDBP), AdaMedGraph outperformed benchmarks in predicting Parkinson’s progression over 24 months, setting the stage for personalized treatment strategies.
Moreover, AdaMedGraph’s robust generalization ability shines in metabolic syndrome prediction, achieving an AUROC of 0.675 on test datasets. This underscores the model’s efficacy in integrating intra- and inter-patient data for individual disease progression forecasting, inspiring new avenues in medical research.
Related paper:
Enhancing interdisciplinary collaboration to maximize AI’s potential
Microsoft Research Asia endeavors extend beyond mere disease detection and progression prediction. In collaboration with the medical sector, Microsoft Research Asia is probing the vast capabilities of AI in advancing drug development and medical research. This includes leveraging state-of-the-art technology in constructing artificial retinas, analyzing drug dependency, advancing cancer therapies, and exploring human metabolism, among other areas.
As AI technology matures and progresses, its practical application potential becomes increasingly evident. Yet, unlocking AI’s full value across diverse sectors necessitates essential interdisciplinary and cross-domain collaboration. “The synergistic collaboration with medical professionals from healthcare and research institutions have enabled Microsoft Research Asia to conduct extensive research projects within the medical and health domain. Our continuous exploration into AI’s application in critical healthcare aspects—ranging from disease detection to rehabilitation and disease progression forecasting—is a testament to our collective dedication. We invite more exceptional individuals passionate about interdisciplinary research to join us in our quest to safeguard human health and foster medical advancements,” expressed Lili Qiu, assistant managing director of Microsoft Research Asia.
Note: The medical health research conducted by Microsoft Research Asia, as discussed in this article, is purely exploratory and guided by professional medical entities and research institutions. Our aim is to further scientific advancement and offer theoretical and technical support for the future medical applications benefiting humanity. All research is in strict adherence to Microsoft’s responsible AI principles, upholding fairness, inclusiveness, reliability and safety, transparency, privacy and security, and accountability. The technologies and methodologies referenced herein are in the R&D phase and are not yet commercialized products or services, nor do they represent medical advice or treatment plans. For health-related concerns, we advise consulting with certified medical practitioners.