DéjàVu: KV-cache Streaming for Fast, Fault-tolerant Generative LLM Serving
Foteini Strati, Sara Mcallister, Amar Phanishayee, Jakub Tarnawski, Ana Klimovic
ICML 2024 | July 2024
Irene Wang, Jakub Tarnawski, Amar Phanishayee, Divya Mahajan
2024 International Conference on Machine Learning | July 2024
Wei Hao, Daniel Mendoza, Rafael da Silva, Deepak Narayanan, Amar Phanishayee, Asaf Cidon, Junfeng Yang
2024 International Conference on Machine Learning | July 2024
Muhammad Adnan, Amar Phanishayee, Janardhan (Jana) Kulkarni, Prashant J. Nair, Divya Mahajan
arXiv:2404.14632 | April 2024
Published by Microsoft
Saurabh Agarwal, Amar Phanishayee, Shivaram Venkataraman
European Conference on Computer Systems (ACM EuroSys 2024 - spring accept) | December 2023
arXiv:2312.12621 | December 2023
Ankit Bhardwaj, Amar Phanishayee, Deepak Narayanan, Mihail Tarta, Ryan Stutsman
arXiv:2311.18174 | November 2023
Published by arXiv | Nov 2023
Wei Hao, Daniel Mendoza, Rafael da Silva, Deepak Narayanan, Amar Phanishayee
arXiv:2307.07507 | July 2023
Published by arXiv
Jack Kosaian, Amar Phanishayee
arXiv:2212.07936 | December 2022
Published by arXiv
Youjie Li, Amar Phanishayee, Derek Murray, Jakub Tarnawski, Nam Sung Kim
VLDB 2022 | September 2022
Jayashree Mohan, Amar Phanishayee, Janardhan (Jana) Kulkarni, Vijay Chidambaram
USENIX Symposium on Operating Systems Design and Implementation (OSDI 2022) | July 2022
Jakub Tarnawski, Deepak Narayanan, Amar Phanishayee
NeurIPS 2021 | December 2021
Deepak Narayanan, Mohammad Shoeybi, Jared Casper, Patrick LeGresley, Mostofa Patwary, Vijay Anand Korthikanti, Dmitri Vainbrand, Prethvi Kashinkunti, Julie Bernauer, Bryan Catanzaro, Amar Phanishayee, Matei Zaharia
International Conference for High Performance Computing, Networking, Storage and Analysis (SC 2021) | November 2021
Best Student Paper
Jayashree Mohan, Amar Phanishayee, Janardhan (Jana) Kulkarni, Vijay Chidambaram
October 2021
Jack Kosaian, Amar Phanishayee, Matthai Philipose, Debadeepta Dey, Rashmi Vinayak
2021 International Conference on Machine Learning (ICML 2021) | July 2021
Deepak Narayanan, Amar Phanishayee, Kaiyu Shi, Xie Chen, Matei Zaharia
International Conference on Machine Learning (ICML 2021) | July 2021
Youjie Li, Amar Phanishayee, Derek Murray, Nam Sung Kim
HotOS Workshop | May 2021
Jayashree Mohan, Amar Phanishayee, Vijay Chidambaram
USENIX FAST 2021 | February 2021
Jayashree Mohan, Amar Phanishayee, Ashish Raniwala, Vijay Chidambaram
VLDB 2021 | January 2021
Jakub Tarnawski, Amar Phanishayee, Nikhil R. Devanur, Divya Mahajan, Fanny Nina Paravecino
NeurIPS 2020 | December 2020
Deepak Narayanan, Keshav Santhanam, Fiodar Kazhamiaka, Amar Phanishayee, Matei Zaharia
USENIX Symposium on Operating Systems Design and Implementation (OSDI 2020) | November 2020
Hongyu Zhu, Amar Phanishayee, Gennady Pekhimenko
USENIX ATC 2020 | July 2020
Kevin Hsieh, Amar Phanishayee, Onur Mutlu, Phillip Gibbons
International Conference on Machine Learning (ICML 2020) | July 2020
Deepak Narayanan, Amar Phanishayee, Kaiyu Shi, Xie Chen, Matei Zaharia
arXiv:2006.09503 | June 2020
Published by arXiv
Guanhua Wang, Shivaram Venkataraman, Amar Phanishayee, Jorgen Thelin, Nikhil Devanur, Ion Stoica
Conference on Machine Learning and Systems (MLSys 2020) | March 2020
Kshiteej Mahajan, Arjun Balasubramanian, Arjun Singhvi, Shivaram Venkataraman, Aditya Akella, Amar Phanishayee, Shuchi Chawla
USENIX NSDI 2020 | February 2020
Deepak Narayanan, Aaron Harlap, Amar Phanishayee, Vivek Seshadri, Nikhil Devanur, Greg Ganger, Phil Gibbons, Matei Zaharia
ACM Symposium on Operating Systems Principles (SOSP 2019) | October 2019
Aarati Kakaraparthy, Abhay Venkatesh, Amar Phanishayee, Shivaram Venkataraman
USENIX HotCloud | July 2019
Myeongjae Jeon, Shivaram Venkataraman, Amar Phanishayee, Junjie Qian, Wencong Xiao, Fan Yang
2019 USENIX Annual Technical Conference | May 2019
Deepak Narayanan, Keshav Santhanam, Amar Phanishayee, Matei Zaharia
NeurIPS Workshop on Systems for Machine Learning | December 2018
Liang Luo, Jacob Nelson, Luis Ceze, Amar Phanishayee, Arvind Krishnamurthy
SOCC 2018 | October 2018
Hongyu Zhu, Mohamed Akrout, Bojian Zheng, Andrew Pelegris, Amar Phanishayee, Bianca Schroeder, Gennady Pekhimenko
International Symposium on Workload Characterization (IISWC 2018) | August 2018
Animesh Jain, Amar Phanishayee, Jason Mars, Lingjia Tang, Gennady Pekhimenko
International Symposium on Computer Architecture (ISCA 2018) | June 2018
Aaron Harlap, Deepak Narayanan, Amar Phanishayee, Vivek Seshadri, Nikhil Devanur, Greg Ganger, Phil Gibbons
June 2018
arXiv preprint
Myeongjae Jeon, Shivaram Venkataraman, Amar Phanishayee, Junjie Qian, Wencong Xiao, Fan Yang
MSR-TR-2018-13 | May 2018
Published by Microsoft