Apache Hadoop Ozone

Ozone is a scalable, distributed object store for Hadoop. Applications like Apache Spark, Hive and YARN, can run against Ozone without any modifications. Ozone comes with a Java client library and a command line interface which makes it easy to use Ozone. This client library supports both RPC and REST protocols.

Ozone consists of volumes, buckets, and Keys.

  • Volumes are similar to user accounts. Only administrators can create or delete volumes.
  • Buckets are similar to directories. A bucket can contain any number of keys, but buckets cannot contain other buckets.
  • Keys are similar to files. A bucket can contain any number of keys.