项目
-
- Language Understanding: Don’t just recognize the words a user spoke, but understand what they mean.
- Noise Robustness: How do we make the system work when background noise is present?
- Voice search: Users can search for information such as a business from your phone.
- Automatic Grammar Induction: How do create grammars to ease the development of spoken language systems?
- (MiPad) Multimodal Interactive Pad: Our first multimodal prototype.
- SALT (Speech Enabled Language Tags): A markup language for the multimodal web
- From Captions to Visual Concepts and Back: Image captioning and understanding
- Intent Understanding: Not recognize the words the user says, but understand what they mean.
- Multimodal Conversational User Interface
- Personalized Language Model for improved accuracy
- Recurrent Neural Networks for Language Processing
- Speech Technology for Computational Phonetics and Reading Assessment
- (Whisper) Speech Recognition: Our previous dictation-oriented speech recognition project is a state-of-the-art general-purpose speech recognizer.
- (WhisperID) Speaker Identification: Who is doing the talking?
- Speech Application Programming Interface (SAPI) Development Toolkit: The Whisper speech recognizer can be used by developers to produce applications using speech recognition
Current Projects
加载中…
成立:
In most organizations, staff spend many hours in meetings. This project addresses all levels of analysis and understanding, from speaker tracking and robust speech transcription to meaning extraction and summarization, with the goal of increasing productivity both during the meeting…