
Best Practices Design Patterns: Optimizing Amazon S3 Performance

Published June 2019
Updated March 10, 2021

Notices

Customers are responsible for making their own independent assessment of the information in this document. This document: (a) is for informational purposes only, (b) represents current AWS product offerings and practices, which are subject to change without notice, and (c) does not create any commitments or assurances from AWS and its affiliates, suppliers or licensors. AWS products or services are provided "as is" without warranties, representations, or conditions of any kind, whether express or implied. The responsibilities and liabilities of AWS to its customers are controlled by AWS agreements, and this document is not part of, nor does it modify, any agreement between AWS and its customers.

© 2019 Amazon Web Services, Inc. or its affiliates. All rights reserved.

Contents

Introduction
Performance Guidelines for Amazon S3
    Measure Performance
    Scale Storage Connections Horizontally
    Use Byte-Range Fetches
    Retry Requests for Latency-Sensitive Applications
    Combine Amazon S3 (Storage) and Amazon EC2 (Compute) in the Same AWS Region
    Use Amazon S3 Transfer Acceleration to Minimize Latency Caused by Distance
    Use the Latest Version of the AWS SDKs
Performance Design Patterns for Amazon S3
    Using Caching for Frequently Accessed Content
    Timeouts and Retries for Latency-Sensitive Applications
    Horizontal Scaling and Request Parallelization for High Throughput
    Using Amazon S3 Transfer Acceleration to Accelerate Geographically Disparate Data Transfers
Contributors
Document Revisions

Abstract

When building applications that upload and retrieve data from Amazon S3, follow our best practices guidelines to optimize performance. We also offer more detailed Performance Design Patterns.

Introduction

Your applications can easily achieve thousands of transactions per second in request performance when uploading and retrieving data from Amazon S3. Amazon S3 automatically scales to high request rates. For example, your application can achieve at least 3,500 PUT/COPY/POST/DELETE and 5,500 GET/HEAD requests per second per prefix in a bucket. There are no limits to the number of prefixes in a bucket. You can increase your read or write performance by parallelizing reads. For example, if you create 10 prefixes in an Amazon S3 bucket to parallelize reads, you could scale your read performance to 55,000 read requests per second (a sketch of this pattern appears at the end of this introduction).

Some data lake applications on Amazon S3 scan many millions or billions of objects for queries that run over petabytes of data. These data lake applications achieve single-instance transfer rates that maximize the network interface use for their Amazon EC2 instance, which can be up to 100 Gb/s on a single instance. These applications then aggregate throughput across multiple instances to get multiple terabits per second. Other applications are sensitive to latency, such as social media messaging applications. These applications can achieve consistent small object latencies (and first-byte-out latencies for larger objects) of roughly 100–200 milliseconds.

Other AWS services can also help accelerate performance for different application architectures. For example, if you want higher transfer rates over a single HTTP connection or single-digit millisecond latencies, use Amazon CloudFront or Amazon ElastiCache for caching with Amazon S3.

Additionally, if you want fast data transport over long distances between a client and an S3 bucket, use Amazon S3 Transfer Acceleration. Transfer Acceleration uses the globally distributed edge locations in CloudFront to accelerate data transport over geographical distances.

If your Amazon S3 workload uses server-side encryption with AWS Key Management Service (SSE-KMS), see AWS KMS Limits in the AWS Key Management Service Developer Guide for information about the request rates supported for your use case.

The following topics describe best practice guidelines and design patterns for optimizing performance for applications that use Amazon S3. This guidance supersedes any previous guidance on optimizing performance for Amazon S3. For example, previous Amazon S3 performance guidelines recommended randomizing prefix naming with hashed characters to optimize performance for frequent data retrievals. You no longer have to randomize prefix naming for performance, and can use sequential date-based naming for your prefixes. Refer to Performance Guidelines and Performance Design Patterns for the most current information about performance optimization for Amazon S3.
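To illustrate the prefix-based parallel reads described above, here is a minimal sketch using boto3, the AWS SDK for Python. The bucket name, prefixes, and keys are hypothetical placeholders, and the concurrency level is illustrative:

```python
import boto3
from concurrent.futures import ThreadPoolExecutor

s3 = boto3.client("s3")

# Hypothetical keys spread across several prefixes; each prefix contributes
# its own request-rate allowance, so reads across prefixes scale additively.
keys = [f"prefix-{i}/data.bin" for i in range(10)]

def fetch(key: str) -> bytes:
    # Each call issues an independent GET over its own connection.
    return s3.get_object(Bucket="example-bucket", Key=key)["Body"].read()

# Issue the GETs in parallel across threads and connections.
with ThreadPoolExecutor(max_workers=10) as pool:
    objects = list(pool.map(fetch, keys))
```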

Performance Guidelines for Amazon S3

To obtain the best performance for your application on Amazon S3, we recommend the following guidelines.

Measure Performance

When optimizing performance, look at network throughput, CPU, and DRAM requirements. Depending on the mix of demands for these different resources, it might be worth evaluating different Amazon EC2 instance types. For more information about instance types, see Instance Types in the Amazon EC2 User Guide for Linux Instances. It's also helpful to look at DNS lookup time, latency, and data transfer speed using HTTP analysis tools when measuring performance.

Scale Storage Connections Horizontally

Spreading requests across many connections is a common design pattern to horizontally scale performance. When you build high performance applications, think of Amazon S3 as a very large distributed system, not as a single network endpoint like a traditional storage server. You can achieve the best performance by issuing multiple concurrent requests to Amazon S3. Spread these requests over separate connections to maximize the accessible bandwidth from Amazon S3. Amazon S3 doesn't have any limits for the number of connections made to your bucket.

Use Byte-Range Fetches

Using the Range HTTP header in a GET Object request, you can fetch a byte-range from an object, transferring only the specified portion. You can use concurrent connections to Amazon S3 to fetch different byte ranges from within the same object. This helps you achieve higher aggregate throughput versus a single whole-object request. Fetching smaller ranges of a large object also allows your application to improve retry times when requests are interrupted. For more information, see Getting Objects.

Typical sizes for byte-range requests are 8 MB or 16 MB. If objects are PUT using a multipart upload, it's a good practice to GET them in the same part sizes (or at least aligned to part boundaries) for best performance. GET requests can directly address individual parts; for example, GET ?partNumber=N.
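As a minimal sketch of these two access styles with boto3 (bucket and key names are illustrative), the Range header takes the standard HTTP form bytes=start-end, with inclusive offsets:

```python
import boto3

s3 = boto3.client("s3")

# Fetch only the first 8 MB of the object (byte offsets are inclusive).
response = s3.get_object(
    Bucket="example-bucket",
    Key="large-object.bin",
    Range="bytes=0-8388607",  # 8 MB = 8 * 1024 * 1024 bytes
)
chunk = response["Body"].read()

# If the object was uploaded with multipart upload, a GET can instead
# address an individual part directly by its part number.
part = s3.get_object(
    Bucket="example-bucket",
    Key="large-object.bin",
    PartNumber=1,
)
```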

Retry Requests for Latency-Sensitive Applications

Aggressive timeouts and retries help drive consistent latency. Given the large scale of Amazon S3, if the first request is slow, a retried request is likely to take a different path and quickly succeed. The AWS SDKs have configurable timeout and retry values that you can tune to the tolerances of your specific application.

Combine Amazon S3 (Storage) and Amazon EC2 (Compute) in the Same AWS Region

Although S3 bucket names are globally unique, each bucket is stored in a Region that you select when you create the bucket. To optimize performance, we recommend that you access the bucket from Amazon EC2 instances in the same AWS Region when possible. This helps reduce network latency and data transfer costs. For more information about data transfer costs, see Amazon S3 Pricing.

Use Amazon S3 Transfer Acceleration to Minimize Latency Caused by Distance

Amazon S3 Transfer Acceleration manages fast, easy, and secure transfers of files over long geographic distances between the client and an S3 bucket. Transfer Acceleration takes advantage of the globally distributed edge locations in Amazon CloudFront. As the data arrives at an edge location, it is routed to Amazon S3 over an optimized network path. Transfer Acceleration is ideal for transferring gigabytes to terabytes of data regularly across continents. It's also useful for clients that upload to a centralized bucket from all over the world.

You can use the Amazon S3 Transfer Acceleration Speed Comparison tool to compare accelerated and non-accelerated upload speeds across Amazon S3 Regions. The Speed Comparison tool uses multipart uploads to transfer a file from your browser to various Amazon S3 Regions with and without using Amazon S3 Transfer Acceleration.
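As a sketch of using Transfer Acceleration from boto3, acceleration must first be enabled on the bucket, and then requests are routed through the accelerate endpoint; the bucket name and file paths are illustrative:

```python
import boto3
from botocore.config import Config

s3 = boto3.client("s3")

# One-time setup: enable Transfer Acceleration on the bucket.
s3.put_bucket_accelerate_configuration(
    Bucket="example-bucket",
    AccelerateConfiguration={"Status": "Enabled"},
)

# Create a client that routes requests through the accelerate endpoint
# (bucketname.s3-accelerate.amazonaws.com) instead of the regional endpoint.
s3_accel = boto3.client(
    "s3",
    config=Config(s3={"use_accelerate_endpoint": True}),
)
s3_accel.upload_file("/tmp/large-file.bin", "example-bucket", "large-file.bin")
```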

Use the Latest Version of the AWS SDKs

The AWS SDKs provide built-in support for many of the recommended guidelines for optimizing Amazon S3 performance. The SDKs provide a simpler API for taking advantage of Amazon S3 from within an application and are regularly updated to follow the latest best practices. For example, the SDKs include logic to automatically retry requests on HTTP 503 errors and are investing in code to respond and adapt to slow connections.

The SDKs also provide the Transfer Manager, which automates horizontally scaling connections to achieve thousands of requests per second, using byte-range requests where appropriate. It's important to use the latest version of the AWS SDKs to obtain the latest performance optimization features.

You can also optimize performance when you are using HTTP REST API requests. When using the REST API, you should follow the same best practices that are part of the SDKs: allow for timeouts and retries on slow requests, and use multiple connections to fetch object data in parallel. For information about using the REST API, see the Amazon Simple Storage Service API Reference.

Performance Design Patterns for Amazon S3

When designing applications to upload and retrieve data from Amazon S3, use our best practices design patterns for achieving the best performance for your application. We also offer Performance Guidelines for you to consider when planning your application architecture. To optimize performance, you can use the following design patterns.

Using Caching for Frequently Accessed Content

Many applications that store data in Amazon S3 serve a "working set" of data that is repeatedly requested by users. If a workload is sending repeated GET requests for a common set of objects, you can use a cache such as Amazon CloudFront, Amazon ElastiCache, or AWS Elemental MediaStore to optimize performance. Successful cache adoption can result in low latency and high data transfer rates. Applications that use caching also send fewer direct requests to Amazon S3, which can help reduce request costs.
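Here is a minimal cache-aside sketch of this pattern (the ElastiCache for Redis variant is described in more detail below). It assumes an ElastiCache for Redis endpoint and the redis-py client; the endpoint, bucket, and TTL are illustrative placeholders:

```python
import boto3
import redis  # redis-py client, assumed to be installed

s3 = boto3.client("s3")
# Hypothetical ElastiCache for Redis endpoint; replace with your cluster's address.
cache = redis.Redis(host="my-cache.example.amazonaws.com", port=6379)

def get_object_cached(bucket: str, key: str, ttl_seconds: int = 300) -> bytes:
    """Check the cache first; on a miss, fetch from S3 and populate the cache."""
    cache_key = f"s3:{bucket}:{key}"
    body = cache.get(cache_key)
    if body is None:
        # Cache miss: fetch the object from Amazon S3.
        body = s3.get_object(Bucket=bucket, Key=key)["Body"].read()
        # Populate the cache so subsequent reads for this hot object skip S3.
        cache.set(cache_key, body, ex=ttl_seconds)
    return body
```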

Amazon CloudFront is a fast content delivery network (CDN) that transparently caches data from Amazon S3 in a large set of geographically distributed points of presence (PoPs). When objects might be accessed from multiple Regions, or over the internet, CloudFront allows data to be cached close to the users that are accessing the objects. This can result in high-performance delivery of popular Amazon S3 content. For information about CloudFront, see the Amazon CloudFront Developer Guide.

Amazon ElastiCache is a managed, in-memory cache. With ElastiCache, you can provision Amazon EC2 instances that cache objects in memory. This caching results in orders-of-magnitude reduction in GET latency and substantial increases in download throughput. To use ElastiCache, you modify application logic to both populate the cache with hot objects and check the cache for hot objects before requesting them from Amazon S3. For examples of using ElastiCache to improve Amazon S3 GET performance, see the blog post Turbocharge Amazon S3 with Amazon ElastiCache for Redis.

AWS Elemental MediaStore is a caching and content distribution system specifically built for video workflows and media delivery from Amazon S3. MediaStore provides end-to-end storage APIs specifically for video, and is recommended for performance-sensitive video workloads. For information about MediaStore, see the AWS Elemental MediaStore User Guide.

Timeouts and Retries for Latency-Sensitive Applications

There are certain situations where an application receives a response from Amazon S3 indicating that a retry is necessary. Amazon S3 maps bucket and object names to the object data associated with them. If an application generates high request rates (typically sustained rates of over 5,000 requests per second to a small number of objects), it might receive HTTP 503 slowdown responses. If these errors occur, each AWS SDK implements automatic retry logic using exponential backoff. If you are not using an AWS SDK, you should implement retry logic when receiving the HTTP 503 error. For information about back-off techniques, see Error Retries and Exponential Backoff in AWS in the Amazon Web Services General Reference.

Amazon S3 automatically scales in response to sustained new request rates, dynamically optimizing performance. While Amazon S3 is internally optimizing for a new request rate, you will receive HTTP 503 request responses temporarily until the optimization completes. After Amazon S3 internally optimizes performance for the new request rate, all requests are generally served without retries.
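If you are using an AWS SDK, the retry and timeout behavior described above is configurable. A minimal sketch with boto3; the timeout and attempt values are illustrative starting points rather than recommendations from this paper, and the bucket and key names are placeholders:

```python
import boto3
from botocore.config import Config

# Tune timeouts and retries to your application's latency tolerance.
config = Config(
    connect_timeout=1,  # seconds to wait when establishing a connection
    read_timeout=2,     # seconds to wait for a response on an open connection
    retries={
        "max_attempts": 5,   # total attempts, including the initial request
        "mode": "adaptive",  # exponential backoff with client-side rate limiting
    },
)

s3 = boto3.client("s3", config=config)
response = s3.get_object(Bucket="example-bucket", Key="example-key")
```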

For latency-sensitive applications, Amazon S3 advises tracking and aggressively retrying slower operations. When you retry a request, we recommend using a new connection to Amazon S3 and performing a fresh DNS lookup.

When you make large variably sized requests (for example, more than 128 MB), we advise tracking the throughput being achieved and retrying the slowest 5 percent of the requests. When you make smaller requests (for example, less than 512 KB), where median latencies are often in the tens of milliseconds range, a good guideline is to retry a GET or PUT operation after 2 seconds. If additional retries are needed, the best practice is to back off. For example, we recommend issuing one retry after 2 seconds and a second retry after an additional 4 seconds.

If your application makes fixed-size requests to Amazon S3, you should expect more consistent response times for each of these requests. In this case, a simple strategy is to identify the slowest 1 percent of requests and to retry them. Even a single retry is frequently effective at reducing latency.

If you are using AWS Key Management Service (AWS KMS) for server-side encryption, see Limits in the AWS Key Management Service Developer Guide for information about the request rates that are supported for your use case.

Horizontal Scaling and Request Parallelization for High Throughput

Amazon S3 is a very large distributed system. To help you take advantage of its scale, we encourage you to horizontally scale parallel requests to the Amazon S3 service endpoints. In addition to distributing the requests within Amazon S3, this type of scaling approach helps distribute the load over multiple paths through the network.

For high-throughput transfers, Amazon S3 advises using applications that use multiple connections to GET or PUT data in parallel. For example, this is supported by Amazon S3 Transfer Manager in the AWS Java SDK, and most of the other AWS SDKs provide similar constructs. For some applications, you can achieve parallel connections by launching multiple requests concurrently in different application threads, or in different application instances. The best approach to take depends on your application and the structure of the objects that you are accessing.

You can use the AWS SDKs to issue GET and PUT requests directly rather than employing the management of transfers in the AWS SDK. This approach lets you tune your workload more directly, while still benefiting from the SDK's support for retries and its handling of any HTTP 503 responses that might occur.
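In boto3, the analogous construct is the transfer manager behind upload_file and download_file, tuned through TransferConfig. A minimal sketch of a parallelized large-object download; the thresholds, concurrency, and names are illustrative, not prescriptive:

```python
import boto3
from boto3.s3.transfer import TransferConfig

s3 = boto3.client("s3")

# Split the download into byte-range requests issued in parallel.
config = TransferConfig(
    multipart_threshold=16 * 1024 * 1024,  # parallelize objects larger than 16 MB
    multipart_chunksize=16 * 1024 * 1024,  # 16 MB per ranged request
    max_concurrency=10,                    # concurrent connections to S3
)

s3.download_file(
    Bucket="example-bucket",
    Key="large-object.bin",
    Filename="/tmp/large-object.bin",
    Config=config,
)
```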

As a general rule, when you download large objects within a Region from Amazon S3 to Amazon EC2, we suggest making concurrent requests for byte ranges of an object at the granularity of 8–16 MB. Make one concurrent request for each 85–90 MB/s of desired network throughput. To saturate a 10 Gb/s network interface card (NIC), you might use about 15 concurrent requests over separate connections. You can scale up the concurrent requests over more connections to saturate faster NICs, such as 25 Gb/s or 100 Gb/s NICs (a sizing sketch appears at the end of this section).

Measuring performance is important when you tune the number of requests to issue concurrently. We recommend starting with a single request at a time. Measure the network bandwidth being achieved and the use of other resources that your application uses in processing the data. You can then identify the bottleneck resource (that is, the resource with the highest usage), and hence the number of requests that are likely to be useful. For example, if processing one request at a time leads to a CPU usage of 25 percent, it suggests that up to four concurrent requests can be accommodated. Measurement is essential, and it is worth confirming resource use as the request rate is increased.

If your application issues requests directly to Amazon S3 using the REST API, we recommend using a pool of HTTP connections and re-using each connection for a series of requests. Avoiding per-request connection setup removes the need to perform TCP slow-start and Secure Sockets Layer (SSL) handshakes on each request. For information about using the REST API, see the Amazon Simple Storage Service API Reference.

Finally, it's worth paying attention to DNS and double-checking that requests are being spread over a wide pool of Amazon S3 IP addresses. DNS queries for Amazon S3 cycle through a large list of IP endpoints. But caching resolvers, or application code that reuses a single IP address, do not benefit from address diversity and the load balancing that follows from it. Network utility tools such as the netstat command line tool can show the IP addresses being used for communication with Amazon S3, and we provide guidelines for DNS configurations to use. For more information about these guidelines, see DNS Considerations.
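To turn the sizing rule above into numbers, and to reuse pooled connections as recommended, here is a short sketch with boto3; the target bandwidth is illustrative, and the 87.5 MB/s figure is the midpoint of the 85–90 MB/s guidance above:

```python
import boto3
from botocore.config import Config

# Rule of thumb from above: one concurrent ranged request per ~85-90 MB/s
# of desired throughput.
target_gbps = 25                              # illustrative NIC speed
target_mb_per_s = target_gbps * 1000 / 8      # 25 Gb/s is about 3125 MB/s
concurrency = round(target_mb_per_s / 87.5)   # about 36 concurrent requests

# Reuse pooled HTTP connections so each request avoids TCP slow-start
# and TLS handshakes.
s3 = boto3.client("s3", config=Config(max_pool_connections=concurrency))
print(f"Sized for {concurrency} concurrent requests")
```

Note that the same arithmetic reproduces the 10 Gb/s example in the text: 1250 MB/s divided by 87.5 MB/s is about 15 concurrent requests.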

Using Amazon S3 Transfer Acceleration to Accelerate Geographically Disparate Data Transfers

Amazon S3 Transfer Acceleration is effective at minimizing or eliminating the latency caused by geographic distance between globally dispersed clients and a regional application using Amazon S3. Transfer Acceleration uses the globally distributed edge locations in Amazon CloudFront to accelerate data transport over geographical distances.

