Over the last few years, organizations have worked towards separating storage from compute to gain benefits in cost, data duplication, and data latency. The cloud resolves most of these issues, but at the expense of needing a way to query data on remote storage. Alluxio and Presto are a powerful combination for addressing the compute problem, and they are part of the strategy Simbiose Ventures used to create StorageQuery, a platform for querying files in cloud storage with SQL.
This talk will focus on:
- How Alluxio fits into StorageQuery's tech stack;
- Advantages of using Alluxio as a cache layer and unified filesystem;
- Development of a new under file system for Backblaze B2 and fine-grained code documentation;
- ShannonDB remote storage mode.
Online Meetup | StorageQuery: federated querying on object stores, powered by Alluxio and Presto
Tuesday, August 25
Abner Ferreira is a backend developer at Simbiose Ventures. He is currently working on implementing fine-grained logs and code-level documentation for Alluxio.
Speaker: Abner Ferreira
Developer, Simbiose Ventures
Caio Pavanelli is a team lead and backend developer at Simbiose Ventures. He is currently focused on customizing PrestoSQL and Alluxio to build the StorageQuery platform. He holds an M.Sc. in Electrical Engineering from Centro Universitário da FEI, where he worked on cognitive robotics.
Speaker: Caio Pavanelli
Team Lead, Simbiose Ventures
Bin Fan is the founding engineer and VP of Open Source at Alluxio, Inc. Prior to Alluxio, he worked at Google building next-generation storage infrastructure. Bin received his Ph.D. in Computer Science from Carnegie Mellon University, where he worked on the design and implementation of distributed systems.
Speaker: Bin Fan
Founding Engineer & VP of Open Source, Alluxio
...a data orchestration layer for compute in any cloud. It unifies data silos on-premise and across any cloud to give you data locality, accessibility, and elasticity.
Whether it’s accelerating big data frameworks on the public cloud, running big data workloads in hybrid cloud environments, or enabling big data on object stores or multiple clouds, Alluxio reduces the complexities associated with orchestrating data for today’s big data and AI/ML workloads.