Placement Group States
When checking a cluster’s status (e.g., running ceph -w or ceph -s),
Ceph will report on the status of the placement groups. A placement group has
one or more states. The optimum state for placement groups in the placement group
map is active + clean.
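Ceph reports the states of a placement group as a single string joined with plus signs, for example active+clean. A minimal Python sketch of testing such a string follows. The helper names are hypothetical and not part of any Ceph API, and the check only asks whether both active and clean are present, since other states (scrubbing, for instance) may accompany them.

```python
def parse_pg_state(state):
    """Split a combined PG state string such as 'active+clean' into its parts."""
    # Tolerate spaces around '+', as in the prose form 'active + clean'.
    return frozenset(part.strip() for part in state.split("+") if part.strip())


def is_active_clean(state):
    """Return True when the state string contains both 'active' and 'clean'."""
    return {"active", "clean"} <= parse_pg_state(state)


if __name__ == "__main__":
    for s in ("active+clean", "active + clean", "active+undersized+degraded"):
        print(f"{s!r}: active and clean = {is_active_clean(s)}")
```

A stricter check could compare the parsed set for equality with {"active", "clean"}, at the cost of flagging placement groups that are merely being scrubbed.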
creating
  Ceph is still creating the placement group.
activating
  The placement group is peered but not yet active.
active
  Ceph will process requests to the placement group.
clean
  Ceph has replicated all objects in the placement group the correct number of times.
down
  A replica with necessary data is down, so the placement group is offline.
laggy
  A replica is not acknowledging new leases from the primary in a timely fashion; IO is temporarily paused.
wait
  The set of OSDs for this PG has just changed and IO is temporarily paused until the previous interval’s leases expire.
scrubbing
  Ceph is checking the placement group metadata for inconsistencies.
deep
  Ceph is checking the placement group data against stored checksums.
degraded
  Ceph has not yet replicated some objects in the placement group the correct number of times.
inconsistent
  Ceph detects inconsistencies in one or more replicas of an object in the placement group (e.g., objects are the wrong size, or objects are missing from one replica after recovery finished).
peering
  The placement group is undergoing the peering process.
repair
  Ceph is checking the placement group and repairing any inconsistencies it finds (if possible).
recovering
  Ceph is migrating/synchronizing objects and their replicas.
forced_recovery
  The user has enforced a high recovery priority for this PG.
recovery_wait
  The placement group is waiting in line to start recovery.
recovery_toofull
  A recovery operation is waiting because the destination OSD is over its full ratio.
recovery_unfound
  Recovery stopped due to unfound objects.
backfilling
  Ceph is scanning and synchronizing the entire contents of a placement group instead of inferring what contents need to be synchronized from the logs of recent operations. Backfill is a special case of recovery.
forced_backfill
  The user has enforced a high backfill priority for this PG.
backfill_wait
  The placement group is waiting in line to start backfill.
backfill_toofull
  A backfill operation is waiting because the destination OSD is over the backfillfull ratio.
backfill_unfound
  Backfill stopped due to unfound objects.
incomplete
  Ceph detects that a placement group is missing information about writes that may have occurred, or does not have any healthy copies. If you see this state, try to start any failed OSDs that may contain the needed information. In the case of an erasure-coded pool, temporarily reducing min_size may allow recovery.
stale
  The placement group is in an unknown state: the monitors have not received an update for it since the placement group mapping changed.
remapped
  The placement group is temporarily mapped to a different set of OSDs from what CRUSH specified.
undersized
  The placement group has fewer copies than the configured pool replication level.
peered
  The placement group has peered, but cannot serve client IO because it does not have enough copies to reach the pool’s configured min_size parameter. Recovery may occur in this state, so the PG may eventually heal up to min_size.
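The relationship between the number of available copies, the pool replication level, and min_size described in the last two entries can be sketched in Python as follows. The function and parameter names are hypothetical, and the state names undersized and peered are the ones Ceph reports for these two conditions. The sketch assumes peering has already completed, and real peering logic takes far more into account than a simple count.

```python
def copy_count_states(available_copies, size, min_size):
    """Illustrative only: derive the copy-count-related states of a peered PG.

    available_copies: copies currently available (hypothetical input, >= 1)
    size:             the pool's configured replication level
    min_size:         minimum number of copies required to serve client IO
    """
    if not 1 <= min_size <= size:
        raise ValueError("expected 1 <= min_size <= size")
    if available_copies < 1:
        raise ValueError("this sketch only covers PGs with at least one copy")

    states = set()
    if available_copies < size:
        # Fewer copies than the pool replication level.
        states.add("undersized")
    if available_copies >= min_size:
        # Enough copies to serve client IO.
        states.add("active")
    else:
        # Peered, but below min_size, so client IO is not served.
        states.add("peered")
    return states


if __name__ == "__main__":
    for copies in (3, 2, 1):
        print(copies, sorted(copy_count_states(copies, size=3, min_size=2)))
    # 3 ['active']
    # 2 ['active', 'undersized']
    # 1 ['peered', 'undersized']
```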
snaptrim_wait
  The placement group is queued to trim snapshots.
snaptrim_error
  Snapshot trimming stopped because of an error.
unknown
  The ceph-mgr hasn’t yet received any information about the PG’s state from an OSD since mgr started up.
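As a rough illustration of putting the state list to use, the Python sketch below tallies placement groups per individual state and counts those that are not both active and clean. The sample data is invented, and its shape (a list of entries with state_name and count keys) is an assumption modeled on the per-state PG summary in JSON status output; check it against your own cluster’s output before relying on it.

```python
from collections import Counter

# Invented sample data; the key names are an assumption, not a guaranteed format.
SAMPLE = [
    {"state_name": "active+clean", "count": 120},
    {"state_name": "active+clean+scrubbing+deep", "count": 2},
    {"state_name": "active+undersized+degraded", "count": 5},
    {"state_name": "active+remapped+backfill_wait", "count": 1},
]


def count_by_individual_state(pgs_by_state):
    """Sum PG counts per individual state across all combined state strings."""
    totals = Counter()
    for entry in pgs_by_state:
        for state in entry["state_name"].split("+"):
            totals[state] += entry["count"]
    return totals


def count_not_active_clean(pgs_by_state):
    """Count PGs whose combined state lacks 'active' or 'clean'."""
    total = 0
    for entry in pgs_by_state:
        states = set(entry["state_name"].split("+"))
        if not {"active", "clean"} <= states:
            total += entry["count"]
    return total


if __name__ == "__main__":
    for state, count in sorted(count_by_individual_state(SAMPLE).items()):
        print(f"{state}: {count}")
    print("not active+clean:", count_not_active_clean(SAMPLE))
```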