Apache Ozone

Ozone is a scalable, redundant, and distributed object store for Big data workloads.

Apart from scaling to billions of objects of varying sizes, Ozone can function effectively in containerized environments like Kubernetes.

Applications like Apache Spark, Hive and YARN, work without any modifications when using Ozone. Ozone comes with a Java client library, S3 protocol support, and a command line interface which makes it easy to use Ozone.

Ozone consists of volumes, buckets, and keys:

  • Volumes are similar to user accounts. Only administrators can create or delete volumes.
  • Buckets are similar to directories. A bucket can contain any number of keys, but buckets cannot contain other buckets.
  • Keys are similar to files.

Check out the Getting Started guide to dive right in and learn how to run Ozone on your machine or in the cloud.