Community Office Hours
Alluxio can help data scientists and data engineers interact with different storage systems in a hybrid cloud environment. With Alluxio as the data access layer for big data and machine learning applications, data processing pipelines can run more efficiently, without explicit ETL steps and the data duplication across storage systems that those steps cause.
In this Office Hour you'll learn:
- How to set up Alluxio so that applications can seamlessly read from and write to different storage systems (including cloud storage like AWS S3 and Azure Blob Storage, and on-prem storage like HDFS)
- How to analyze data access patterns and manage the data lifecycle in Alluxio using the Alluxio web UI and shell commands
- Open session for discussion of any topic, such as solving the problems that come with separating compute and storage
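
To make the first topic concrete, here is a minimal sketch of the idea behind a single namespace that spans several storage systems. It is a toy model written for illustration, not Alluxio's API or implementation: the `MountTable` class, its methods, and the example URIs (`hdfs://namenode:9000/data`, `s3://example-bucket/warehouse`) are all assumptions. It shows how an application path could be resolved to the underlying storage by picking the longest matching mount point.

```python
from typing import Dict, Optional


class MountTable:
    """Toy unified namespace: maps path prefixes to storage URIs (hypothetical)."""

    def __init__(self) -> None:
        self._mounts: Dict[str, str] = {}

    def mount(self, mount_point: str, storage_uri: str) -> None:
        # Normalize trailing slashes; keep "/" itself as the root mount point.
        key = mount_point.rstrip("/") or "/"
        self._mounts[key] = storage_uri.rstrip("/")

    def resolve(self, path: str) -> str:
        # Choose the longest mount point that covers the path on a
        # directory boundary, so "/s3" covers "/s3/a" but not "/s3x/a".
        best: Optional[str] = None
        for mount_point in self._mounts:
            if mount_point == "/":
                covers = path.startswith("/")
            else:
                covers = path == mount_point or path.startswith(mount_point + "/")
            if covers and (best is None or len(mount_point) > len(best)):
                best = mount_point
        if best is None:
            raise KeyError(f"no mount point covers {path}")
        remainder = path if best == "/" else path[len(best):]
        return self._mounts[best] + remainder


if __name__ == "__main__":
    table = MountTable()
    table.mount("/", "hdfs://namenode:9000/data")        # on-prem storage (assumed URI)
    table.mount("/s3", "s3://example-bucket/warehouse")  # cloud storage (assumed URI)
    for p in ("/logs/app.log", "/s3/sales/part-0.parquet"):
        print(p, "->", table.resolve(p))
```

The application only ever sees paths such as `/logs/app.log` or `/s3/sales/part-0.parquet`; which storage system serves each path is decided by the mount table rather than by the application code.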
Interested in learning more?
Alluxio for Hybrid Cloud | HDFS and AWS S3 demo
Speaker: Bin Fan
Evangelist and Founding Member at Alluxio
Bin Fan is the founding engineer of Alluxio, Inc. and a PMC member of the Alluxio open source project. Prior to Alluxio, he worked at Google, where he won the Technical Infrastructure Award. Bin received his Ph.D. in Computer Science from Carnegie Mellon University, where he worked on distributed systems.
...a data orchestration layer for compute in any cloud. It unifies data silos on-premise and across any cloud to give you data locality, accessibility, and elasticity.
Whether it’s accelerating big data frameworks on the public cloud, running big data workloads in hybrid cloud environments, or enabling big data on object stores or multiple clouds, Alluxio reduces the complexities associated with orchestrating data for today’s big data and AI/ML workloads.