Building Next-Gen Multimodal Foundation Models for General-Purpose Assistants

LLaVA is an open-source project that collaborates with the research community to advance the state of the art in AI. LLaVA is the first end-to-end trained large multimodal model (LMM) to achieve impressive chat capabilities in the spirit of the multimodal GPT-4. The LLaVA family continues to grow, supporting more modalities, capabilities, and applications.
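For readers who want to try the model directly, below is a minimal sketch of chatting with a LLaVA checkpoint through the Hugging Face transformers integration. The checkpoint name llava-hf/llava-1.5-7b-hf, the example image URL, and the USER/ASSISTANT prompt format are assumptions drawn from the community-converted weights, not details stated on this page.

```python
# Minimal sketch: one-turn chat with a LLaVA checkpoint via Hugging Face transformers.
# The checkpoint "llava-hf/llava-1.5-7b-hf" is an assumed community conversion.
import requests
from PIL import Image
from transformers import AutoProcessor, LlavaForConditionalGeneration

model_id = "llava-hf/llava-1.5-7b-hf"  # assumed checkpoint name
processor = AutoProcessor.from_pretrained(model_id)
model = LlavaForConditionalGeneration.from_pretrained(model_id)

# Any RGB image works; this URL is only an illustration.
url = "https://llava-vl.github.io/static/images/view.jpg"
image = Image.open(requests.get(url, stream=True).raw)

# LLaVA-1.5 checkpoints expect an <image> placeholder in the prompt.
prompt = "USER: <image>\nWhat is shown in this image? ASSISTANT:"
inputs = processor(text=prompt, images=image, return_tensors="pt")

output = model.generate(**inputs, max_new_tokens=100)
print(processor.decode(output[0], skip_special_tokens=True))
```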

People

An open research collaboration across universities and multiple Microsoft teams, pushing the state of the art in new capabilities, scale, and applications.

Hao Cheng

Principal Researcher

Michel Galley

Senior Principal Researcher

Jianfeng Gao

Distinguished Scientist & Vice President

Yong Jae Lee

Associate Professor

University of Wisconsin-Madison

Lars Liden

Principal Research Software Engineer Manager

Haotian Liu

Ph.D. Student

University of Wisconsin-Madison

Xiaodong Liu

Senior Principal Researcher

Yadong Lu

Researcher

Microsoft Azure AI

Matt Mazzola

Senior Research Software Engineer

Tristan Naumann

Principal Researcher

Hoifung Poon

General Manager, Health Futures

Yelong Shen

Principal Researcher

Microsoft Azure AI

Swadheen Shukla

Principal Program Manager

Irina Spiridonova

Senior Software Engineer

Andrea Tupini

Research Software Engineer

Naoto Usuyama

Principal Researcher

Cliff Wong

Principal Data Scientist

Jianwei Yang

Principal Researcher

Sheng Zhang

Principal Researcher