Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs
Suyu Ge, Yunan Zhang, Liyuan Liu, Minjia Zhang, Jiawei Han, Jianfeng Gao
ICLR 2024 (Oral) | May 2024
Suyu Ge, Yunan Zhang, Liyuan Liu, Minjia Zhang, Jiawei Han, Jianfeng Gao
ICLR 2024 (Oral) | May 2024
Liyuan Liu, Chengyu Dong, Xiaodong Liu, Bin Yu, Jianfeng Gao
NeurIPS 2023 (Oral) | December 2023
Qingru Zhang, Chandan Singh, Liyuan Liu, Xiaodong Liu, Bin Yu, Jianfeng Gao, Tuo Zhao
ICLR 2024 | November 2023
Chengyu Dong, Liyuan Liu, Jingbo Shang
NeurIPS 2022 (Oral) | November 2022
Suyu Ge, Yunan Zhang, Liyuan Liu, Minjia Zhang, Jiawei Han, Jianfeng Gao
ICLR 2024 (Oral) | May 2024
Liyuan Liu, Chengyu Dong, Xiaodong Liu, Bin Yu, Jianfeng Gao
NeurIPS 2023 (Oral) | December 2023
Qingru Zhang, Chandan Singh, Liyuan Liu, Xiaodong Liu, Bin Yu, Jianfeng Gao, Tuo Zhao
ICLR 2024 | November 2023
Chengyu Dong, Liyuan Liu, Jingbo Shang
NeurIPS 2022 (Oral) | November 2022
Liyuan Liu, Chengyu Dong, Xiaodong Liu, Bin Yu, Jianfeng Gao
NeurIPS 2023 (Oral) | December 2023
Suyu Ge, Yunan Zhang, Liyuan Liu, Minjia Zhang, Jiawei Han, Jianfeng Gao
ICLR 2024 (Oral) | May 2024
Liyuan Liu, Chengyu Dong, Xiaodong Liu, Bin Yu, Jianfeng Gao
NeurIPS 2023 (Oral) | December 2023
Qingru Zhang, Chandan Singh, Liyuan Liu, Xiaodong Liu, Bin Yu, Jianfeng Gao, Tuo Zhao
ICLR 2024 | November 2023
Chengyu Dong, Liyuan Liu, Jingbo Shang
NeurIPS 2022 (Oral) | November 2022