Notice
This document is for a development version of Ceph.
Cloud Restore
The Cloud Transition feature makes it possible to transition objects to a remote
cloud service. The cloud-restore
feature described below enables restoration
of those transitioned objects from the remote S3 endpoints into the local
RGW deployment.
This feature currently enables the restoration of objects transitioned to
S3-compatible cloud services. In order to faciliate this,
the retain_head_object
option should be set to true
in the tier-config
when configuring the storage class.
Objects can be restored using the S3 RestoreObject
API. The restored copies will be retained within RGW only for the number
of days
specified. However if days
is not provided, the restored copies
are considered permanent and will be treated as regular objects.
In addition, by enabling the allow_read_through
option,
the S3 GetObject
API can be used to restore the object temporarily.
Cloud Storage Class Tier Configuration
The tier configuration of the cloud storage class configured for data transition is used to restore objects as well:
{
"access_key": <access>,
"secret": <secret>,`
"endpoint": <endpoint>,
"region": <region>,
"host_style": <path | virtual>,
"acls": [ { "type": <id | email | uri>,
"source_id": <source_id>,
"dest_id": <dest_id> } ... ],
"target_path": <target_path>,
"target_storage_class": <target-storage-class>,
"multipart_sync_threshold": {object_size},
"multipart_min_part_size": {part_size},
"retain_head_object": <true | false>
}
The below options have been added to the tier configuration to facilitate object restoration.
restore_storage_class
(string)
The storage class to which object data is to be restored. Default value is STANDARD
.
Read-through Specific Configurables
allow_read_through
(true
|false
)
If true, enables read-through
. Objects can then be restored using the S3 GetObject
API.
read_through_restore_days
(integer)
The duration for which objects restored via read-through
are retained.
Default value is 1 day.
For example:
radosgw-admin zonegroup placement modify --rgw-zonegroup default \
--placement-id default-placement \
--storage-class CLOUDTIER \
--tier-config=endpoint=http://XX.XX.XX.XX:YY,\
access_key=<access_key>,secret=<secret>, \
retain_head_object=true, \
restore_storage_class=COLDTIER, \
allow_read_through=true, \
read_through_restore_days=10
S3 Glacier Specific Configurables
To restore objects archived in an S3 Glacier or Tape cloud storage class, the
data must first be restored to the cloud service before being read and
downloaded into RGW. To enable this process, ensure the storage class
is configured with --tier-type=cloud-s3-glacier
. Additionally,
the following configurables should be set accordingly:
glacier_restore_days
(integer)
The duration for which the objects are to be restored on the remote cloud service.
glacier_restore_tier_type
(Standard
|Expedited
)
The type of retrieval within the cloud service, which may represent different
pricing. Supported options are Standard
and Expedited
.
For example:
radosgw-admin zonegroup placement add --rgw-zonegroup=default \
--placement-id=default-placement \
--storage-class=CLOUDTIER-GLACIER --tier-type=cloud-s3-glacier
radosgw-admin zonegroup placement modify --rgw-zonegroup default \
--placement-id default-placement \
--storage-class CLOUDTIER \
--tier-config=endpoint=http://XX.XX.XX.XX:YY,\
access_key=XXXXX,secret=YYYYY, \
retain_head_object=true, \
target_storage_class=Glacier, \
............ \
............ \
restore_storage_class=COLDTIER, \
glacier_restore_days=2, \
glacier_restore_tier_type=Expedited
[
{
"key": "default-placement",
"val": {
"name": "default-placement",
"tags": [],
"storage_classes": [
"CLOUDTIER-GLACIER",
"STANDARD"
],
"tier_targets": [
{
"key": "CLOUDTIER-GLACIER",
"val": {
"tier_type": "cloud-s3-glacier",
"storage_class": "CLOUDTIER-GLACIER",
"retain_head_object": "true",
"s3": {
"endpoint": http://XX.XX.XX.XX:YY,
"access_key": "XXXXX",
"secret": "YYYYY",
"host_style": "path",
"target_storage_class": "Glacier",
.......
.......
}
"allow_read_through": true,
"read_through_restore_days": 10,
"restore_storage_class": "COLDTIER",
"s3-glacier": {
"glacier_restore_days": 2
"glacier_restore_tier_type": "Expedited"
}
}
}
]
}
}
]
Examples of Restore Objects
Using the S3 RestoreObject CLI
Th S3 restore-object CLI supports these options:
aws s3api restore-object --bucket <value> \
--key <value> \
[--version-id <value>] \
--restore-request (structure) { \
Days=<integer> \
}
Note: Days
is optional and if not provided, the object is restored permanently.
Example 1:
aws s3api restore-object --bucket bucket1 --key doc1.rtf \
[--version-id 3sL4kqtJlcpXroDTDmJ+rmSpXd3dIbrHY+MTRCxf3vjVBH40Nr8X8gdRQBpUMLUo] \
--restore-request Days=10 \
....
This will restore the object doc1.rtf
at an optional version,
for the duration of 10 days.
Example 2:
aws s3api restore-object --bucket bucket1 --key doc2.rtf --restore-request {} ....
This will restore the object doc2.rtf
permanently and it will be treated as regular object.
Using the S3 GetObject CLI
Ensure that the allow_read_through
tier-config option is enabled.
Example 3:
aws s3api get-object --bucket bucket1 --key doc3.rtf ....
This will restore the object doc3.rtf
for read_through_restore_days
days.
Note: The above CLI command may time out if object restoration takes too long. You can verify the restore status before reissuing the command.
Verifying the Restoration Status
Verify the status of the restoration by issuing
an S3 HeadObject
request. The response includes the x-amz-restore
header if object restoration
is in progress or a copy of it is already restored.
Example:
aws s3api head-object --key doc1.rtf --bucket bucket1 ....
The radosgw-admin
CLI can be used to check restoration status and other
details.
Example:
radosgw-admin object stat --bucket bucket1 --object doc1.rtf
Restored Object Properties
Storage
Objects are restored to the storage class configured via restore_storage_class
in the tier-config. However, as
per https://docs.aws.amazon.com/cli/latest/reference/s3api/restore-object.html
the storage class of restored objects should remain unchanged. Therefore, for
temporary copies, the x-amz-storage-class
will continue to reflect the
original cloud-tier storage class.
mtime
The mtime
of the transitioned and restored objects should remain unchanged.
Lifecycle
Temporary
copies are not subject to transition to the cloud. However, as is the
case with cloud-transitioned objects, they can be deleted via regular LC (Life Cycle)
expiration rules or an external S3 delete
request.
Permanent
copies are treated as regular objects and are subject to applicable LC
policies.
Replication
Temporary
copies are not replicated and will be retained only by the zone
on which the restore request is initiated.
Permanent
copies are replicated like other regular objects.
Versioned Objects
For versioned objects, if an object has been cloud-transitioned, it is in a
non-current state. After a restore, the same non-current object will be
updated with the downloaded data, and its HEAD
object will be modified accordingly.
Future Work
Admin Ops
Notifications
Brought to you by the Ceph Foundation
The Ceph Documentation is a community resource funded and hosted by the non-profit Ceph Foundation. If you would like to support this and our other efforts, please consider joining now.