For many latency-sensitive SQL workloads, Presto is often bound by retrieving distant data. In this talk, Rohit Jain from Facebook and Bin Fan from Alluxio will introduce their teams’ collaboration on adding a local on-SSD Alluxio cache inside Presto workers to improve unsatisfied Presto latency.


This talk will focus on:

  • Insights of the Presto workloads at Facebook w.r.t. cache effectiveness
  • API and internals of the Alluxio local cache, from design trade-offs (e.g. caching granularity, concurrency level and etc) to performance optimizations.
  • Initial performance analysis and timeline to deliver this feature for general Presto users.
  • Discussion on our future work to optimize cache performance with deeper integration with Presto



Interested in learning more? 


Save your spot

Online Meetup | Optimizing Latency-Sensitive Queries for Presto at Facebook: A Collaboration Between Presto & Alluxio

_______________

Thursday, May 7

10AM PST

Rohit Jain is a software engineer at Facebook. He is currently developing solutions to help low latency queries in Presto at Facebook.

Software Engineer, Facebook

Speaker: Rohit Jain

Speaker: Yutian "James" Sun

Software Engineer, Facebook

Yutian "James" Sun is a Software Engineer at Facebook working on large-scale distributed database systems. Major interests are query optimization, data federation, and low-latency query execution. James received his Ph.D in Computer Science from University of California, Santa Barbara focusing on data integration and data-centric processes.

Bin Fan is the founding engineer and VP of Open Source at Alluxio, Inc. Prior to Alluxio, he worked for Google to build the next-generation storage infrastructure. Bin received his Ph.D. in Computer Science from Carnegie Mellon University on the design and implementation of distributed systems.

Speaker: Bin Fan

Founding Engineer & VP of OS, Alluxio

...a data orchestration layer for compute in any cloud. It unifies data silos on-premise and across any cloud to give you data locality, accessibility, and elasticity.


Whether it’s accelerating big data frameworks on the public cloud, running big data workloads in hybrid cloud environments, or enabling big data on object stores or multiple clouds, Alluxio reduces the complexities associated with orchestrating data for today’s big data and AI/ML workloads.

Alluxio is...