Apache Ozone

Ozone is a scalable, redundant, and distributed object store for Hadoop.

Apart from scaling to billions of objects of varying sizes, Ozone can function effectively in containerized environments like Kubernetes.

Applications like Apache Spark, Hive and YARN, work without any modifications when using Ozone. Ozone comes with a Java client library, S3 protocol support, and a command line interface which makes it easy to use Ozone.

Ozone consists of volumes, buckets, and keys:

  • Volumes are similar to user accounts. Only administrators can create or delete volumes.
  • Buckets are similar to directories. A bucket can contain any number of keys, but buckets cannot contain other buckets.
  • Keys are similar to files.