Alluxio

Alluxio is an open-source virtual distributed file system.
Starting with Alluxio's Lab here is highly recommended.

General

  • Alluxio targets architectures that separate storage from the compute engine, so the two can scale independently. The compute engine can read from different data sources, such as Amazon S3 and HDFS.
  • Alluxio can manage multiple file systems and expose them through a single namespace, which decouples applications from the details of accessing each data source.
  • Alluxio provides a file interface and structured data formats, and it manages metadata for the files it serves (how many blocks a file has and which server holds each block).
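
The single-namespace idea above can be illustrated with a toy model. This is only a sketch of the concept, not Alluxio's API: the `MountTable` class, the mount points, the bucket name, and the HDFS address are all hypothetical. It resolves a namespace path to an under-storage URI by longest-prefix match.

```python
from dataclasses import dataclass, field


@dataclass
class MountTable:
    """Toy unified namespace: maps namespace prefixes to under-storage URIs.

    Hypothetical helper for illustration only, not part of Alluxio.
    """

    mounts: dict = field(default_factory=dict)

    def mount(self, namespace_path: str, ufs_uri: str) -> None:
        # Normalize so "/s3/" and "/s3" are the same mount point; keep "/" as root.
        key = namespace_path.rstrip("/") or "/"
        self.mounts[key] = ufs_uri.rstrip("/")

    def resolve(self, path: str) -> str:
        # A mount point matches if it equals the path or is a parent directory of it.
        candidates = [
            p for p in self.mounts
            if path == p or path.startswith(p.rstrip("/") + "/")
        ]
        if not candidates:
            raise KeyError(f"no mount point covers {path}")
        # The longest matching prefix wins, so nested mounts override the root.
        best = max(candidates, key=len)
        suffix = path[len(best.rstrip("/")):]
        return self.mounts[best] + suffix


# Hypothetical mounts: an HDFS directory at the root and an S3 bucket under /s3.
table = MountTable()
table.mount("/", "hdfs://namenode:8020/warehouse")
table.mount("/s3", "s3://example-bucket/")

print(table.resolve("/tables/t1"))
print(table.resolve("/s3/logs/a.txt"))
```

The application only ever sees paths such as `/tables/t1` or `/s3/logs/a.txt`; which storage system backs each path is decided by the mount table.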

Architecture

alluxio_architecture_1.png

Best scenario

  • Alluxio acts as a cache layer for hotspot data.
  • Alluxio hides the differences in data access between storage systems, such as cloud providers (S3, Azure file storage) and HDFS.
  • Alluxio is not recommended for live data processing.
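
The cache-layer scenario can be sketched as a read-through cache placed in front of a slow store. This is a conceptual illustration rather than Alluxio's implementation: `ReadThroughCache`, the `fetch` callback, and the LRU eviction policy are assumptions chosen for the example.

```python
from collections import OrderedDict
from typing import Callable


class ReadThroughCache:
    """Toy read-through cache with LRU eviction (hypothetical, for illustration)."""

    def __init__(self, fetch: Callable[[str], bytes], capacity: int) -> None:
        self._fetch = fetch          # reads from the slow under-storage
        self._capacity = capacity    # maximum number of cached files
        self._entries = OrderedDict()
        self.hits = 0
        self.misses = 0

    def read(self, path: str) -> bytes:
        if path in self._entries:
            # Cache hit: mark the entry as most recently used.
            self._entries.move_to_end(path)
            self.hits += 1
            return self._entries[path]
        # Cache miss: fetch from the under-storage and keep a copy.
        self.misses += 1
        data = self._fetch(path)
        self._entries[path] = data
        if len(self._entries) > self._capacity:
            # Evict the least recently used entry.
            self._entries.popitem(last=False)
        return data


# Stand-in for a remote store such as S3 or HDFS (hypothetical contents).
remote = {"/a": b"alpha", "/b": b"beta", "/c": b"gamma"}
cache = ReadThroughCache(fetch=lambda path: remote[path], capacity=2)

for path in ["/a", "/a", "/b", "/c", "/a"]:
    cache.read(path)

print(cache.hits, cache.misses)
```

Repeated reads of the same hot file are served from the cache, while a file that has been evicted has to be fetched from the remote store again.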

More info