DDS: DPU-optimized Disaggregated Storage [Extended Report]
- Qizhen Zhang ,
- Phil Bernstein ,
- Badrish Chandramouli ,
- Jason Hu ,
- Yiming Zheng
VLDB 2024 |
This extended report presents DDS, a novel disaggregated storage architecture enabled by emerging networking hardware, namely DPUs (Data Processing Units). DPUs can optimize the latency and CPU consumption of disaggregated storage servers. However, utilizing DPUs for DBMSs requires careful design of the network and storage paths and the interface exposed to the DBMS. To fully benefit from DPUs, DDS heavily uses DMA, zero-copy, and userspace I/O to minimize overhead when improving throughput. It also introduces an offload engine that eliminates host CPUs by executing client requests directly on the DPU. Adopting DDS’ API requires minimal DBMS modification. Our experimental study and production system integration show promising results — DDS achieves higher disaggregated storage throughput with an order of magnitude lower latency, and saves up to tens of CPU cores per storage server.