[reading review] Exploiting Cloud Object Storage for High-Performance Analytics

In this paper, the authors explore the characteristics of cloud object storage and develop AnyBlob, which uses io_uring to improve throughput and saturate network bandwidth. They run experiments on the TPC-H benchmark using Umbra in different configurations.
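To make "saturating network bandwidth" concrete, here is a rough back-of-the-envelope sketch based on Little's law: the number of requests that must be in flight equals the target throughput times the duration of one request, divided by the request size. The function name and all numbers below are hypothetical illustrations, not figures taken from the paper.

```python
import math


def requests_to_saturate(bandwidth_gbit_s: float,
                         request_duration_s: float,
                         chunk_mib: float) -> int:
    """Estimate the concurrent requests needed to keep a link busy.

    Little's law: concurrency = (requests per second) * (seconds per request).
    All inputs are assumptions supplied by the caller.
    """
    bytes_per_s = bandwidth_gbit_s * 1e9 / 8      # link speed in bytes/s
    chunk_bytes = chunk_mib * 1024 * 1024         # size of one ranged request
    in_flight_bytes = bytes_per_s * request_duration_s
    return math.ceil(in_flight_bytes / chunk_bytes)


# Hypothetical setup: 100 Gbit/s NIC, 200 ms per request, 16 MiB chunks.
print(requests_to_saturate(100, 0.2, 16))
```

Under these assumed numbers, well over a hundred requests must be outstanding at once, which is why an asynchronous interface such as io_uring is attractive compared with one blocking thread per request.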

  • Strengths: achieves high-bandwidth data processing; uses io_uring to saturate network bandwidth

  • Future work: could explore more network technologies



