Write Pipelines
Write pipelines are a fundamental component of Apache Ozone's storage architecture, enabling reliable data storage across distributed nodes. This document provides a comprehensive overview of write pipelines, covering both replication and erasure coding approaches, their architecture, implementation details, and usage patterns.
What are Write Pipelines?
Write pipelines are groups of Datanodes that work together as a unit to store and replicate data in Ozone. They serve as the foundation for Ozone's data redundancy strategy, providing:
- A coordinated path for write operations across multiple nodes
- Consistency guarantees for data replication
- Efficient management of data distribution and storage
The Storage Container Manager (SCM) is responsible for creating and managing write pipelines, selecting appropriate Datanodes based on factors like availability, capacity, and network topology.
Pipeline Types
Ozone supports different types of write pipelines to accommodate various durability and storage efficiency requirements: