Huiqiang Jiang

Research SDE 2

关于

Huiqiang is a Research SDE in MSRA Shanghai Lab.

Huiqiang’s research primarily concentrates on efficient methods to accelerate inference or training, including dynamic sparse attention (MInference, RetrievalAttention), prompt compression (LLMLingua), KV-cache compression, speculative decoding, model compression, sparse inference (PIT), neural architecture search (NAS), and efficient tuning, with a particular emphasis on LLMs. Additionally, he is interested in addressing typical challenges in natural language processing, such as information extraction.

He’s looking for one research intern in efficient methods. Please get in touch with him (hjiang[aT]microsoft[DoT.]com) if you are interested in the research topics.