S3 and Glacier

S3 Standard-IA and S3 One Zone-IA storage classes are designed for long-lived and infrequently accessed data that still requires millisecond access.

The S3 Glacier and S3 Glacier Deep Archive storage classes are designed for low-cost data archiving. The S3 Glacier and S3 Glacier Deep Archive objects are not available for real-time access. You must first restore the S3 Glacier and S3 Glacier Deep Archive objects before you can access them.

  • Glacier archives are encrypted by default, whereas encryption on S3 is an option you need to select.
  • Unlike S3’s “human-readable” key names, Glacier archives are given machine-generated IDs.

The biggest difference is the time it takes to retrieve your data. Retrieving the objects from an existing Glacier archive can take a number of hours. You can specify one of the following when initiating a job to retrieve an archive based on your access time and cost requirements:

  • Expedited - data accessed using Expedited retrievals are typically made available within 1–5 minutes
  • Standard - Standard retrievals typically complete within 3–5 hours.
  • Bulk - Bulk retrievals typically complete within 5–12 hours.

Read More

S3 Select & Glacier Select

Amazon S3 Select works on objects stored in CSV, JSON, or Apache Parquet format. It also works with objects that are compressed with GZIP or BZIP2 (for CSV and JSON objects only), and server-side encrypted objects.

Amazon Athena and Amazon Redshift Spectrum also runs SQL queries directly against data at rest in Amazon S3.

S3 Lifycycle

S3 Lifecyle rules consist of 2 types of actions:

  • Transition actions - Define when objects transion to another S3 storage class.
  • Expiration actions - Define when objects expires. Amazon deletes expired objects on your behalf.

In addition, Amazon S3 supports a bucket lifecycle rule AbortIncompleteMultipartUpload that you can use to direct Amazon S3 to abort incomplete multipart uploads.

Read More

Restore archieved objects.

Archived objects are not accessible in real time. You must first initiate a restore request and then wait until a temporary copy of the object is available for the duration that you specify in the request.

After you receive a temporary copy of the restored object, the object's storage class remains S3 Glacier or S3 Glacier Deep Archive.

You can then create a copy of the restored objects and store in other storage class such as S3 standard storage.

S3 Object Lock

Even with bucket versioning enabled, you are still be able to remove a object version by DELETE object <version>.

With S3 Object Lock, it helps prevent objects from being deleted or overwritten for a fixed amount of time or indefinitely

Read More

S3 Event

Amazon S3 supports the following destinations where it can publish events:

  • Amazon SNS
  • Amazon SQS
  • AWS Lambda

S3 Storage Class

  • Minimum 30-Day Storage Charge for S3-Intelligent-Tiering, S3 Standard-IA and S3 One Zone-IA.
  • Minimum 90-Day Storage Charge for Glacier
  • Minimum 180-Day Storage Charge for Glacier Deep Archieve
  • Storage Gateway connects to S3, S3 Glacier and Glacier Deep Archive.

Data Transer cost

  • Data transferred out to an Amazon EC2 instance, when the instance is in the same AWS Region as the S3 bucket (including to a different account in the same AWS region).
  • Data transferred out to Amazon CloudFront.

S3 Range GET?