Chapter 2: Memory Hierarchy Design (cse.msu.edu)


Computer Architecture: A Quantitative Approach, Fifth Edition
Chapter 2: Memory Hierarchy Design
Copyright 2012, Elsevier Inc. All rights reserved.

Introduction
- Programmers want unlimited amounts of memory with low latency
- Fast memory technology is more expensive per bit than slower memory
- Solution: organize the memory system into a hierarchy
  - Entire addressable memory space available in the largest, slowest memory
  - Incrementally smaller and faster memories, each containing a subset of the memory below it, proceed in steps up toward the processor
- Temporal and spatial locality ensure that nearly all references can be found in smaller memories
  - Gives the illusion of a large, fast memory being presented to the processor

Memory Hierarchy (figure)

Memory Performance Gap (figure)

Memory Hierarchy Design
- Memory hierarchy design becomes more crucial with recent multi-core processors:
  - Aggregate peak bandwidth grows with # cores:
    - Intel Core i7 can generate two references per core per clock
    - Four cores and 3.2 GHz clock:
      - 25.6 billion 64-bit data references/second
      - 12.8 billion 128-bit instruction references/second
      - = 409.6 GB/s!
  - DRAM bandwidth is only 6% of this (25 GB/s)
  - Requires:
    - Multi-port, pipelined caches
    - Two levels of cache per core
    - Shared third-level cache on chip
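The peak-bandwidth arithmetic above can be checked in a few lines (a sketch; the core count, clock rate, and reference widths are taken from the slide):

```python
# Peak bandwidth demanded by a hypothetical 4-core, 3.2 GHz Core i7
# issuing two data references per core per clock.
cores = 4
clock_hz = 3.2e9
data_refs = cores * clock_hz * 2      # two 64-bit data references per core per clock
inst_refs = cores * clock_hz          # one 128-bit instruction reference per core per clock

# 64-bit = 8 bytes, 128-bit = 16 bytes
peak_gbs = (data_refs * 8 + inst_refs * 16) / 1e9

dram_fraction = 25 / peak_gbs         # 25 GB/s of available DRAM bandwidth

print(peak_gbs)        # 409.6 GB/s
print(dram_fraction)   # roughly 6%
```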

Performance and Power
- High-end microprocessors have 10 MB of on-chip cache
  - Consumes a large amount of the area and power budget

Memory Hierarchy Basics
- When a word is not found in the cache, a miss occurs:
  - Fetch the word from a lower level in the hierarchy, requiring a higher-latency reference
  - The lower level may be another cache or the main memory
  - Also fetch the other words contained within the block
    - Takes advantage of spatial locality
  - Place the block into the cache in any location within its set, determined by the address
    - block address MOD number of sets
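The block-placement rule can be sketched directly; the block size and set count below are illustrative assumptions, not values from the slides:

```python
# Map a byte address to a cache set using the slide's
# (block address) MOD (number of sets) rule.
BLOCK_SIZE = 64   # bytes per block (assumed)
NUM_SETS = 128    # number of sets (assumed)

def set_index(byte_address):
    block_address = byte_address // BLOCK_SIZE   # strip the block offset
    return block_address % NUM_SETS              # MOD rule picks the set

# Addresses one block apart land in adjacent sets; addresses
# NUM_SETS blocks apart collide in the same set.
```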

Memory Hierarchy Basics
- n sets => n-way set associative
  - Direct-mapped cache: one block per set
  - Fully associative: one set
- Writing to cache: two strategies
  - Write-through: immediately update lower levels of the hierarchy
  - Write-back: only update lower levels of the hierarchy when an updated block is replaced
  - Both strategies use a write buffer to make writes asynchronous
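A minimal sketch contrasting the two write policies (the class names, dictionary-backed "memory", and single-level simplification are all illustrative assumptions):

```python
class WriteThroughCache:
    """On a write, update the cache and the lower level immediately."""
    def __init__(self, memory):
        self.memory = memory   # stands in for the lower level of the hierarchy
        self.data = {}
    def write(self, addr, value):
        self.data[addr] = value
        self.memory[addr] = value        # write-through: lower level updated now

class WriteBackCache:
    """On a write, mark the block dirty; update the lower level only on eviction."""
    def __init__(self, memory):
        self.memory = memory
        self.data = {}
        self.dirty = set()
    def write(self, addr, value):
        self.data[addr] = value
        self.dirty.add(addr)             # lower level NOT updated yet
    def evict(self, addr):
        if addr in self.dirty:
            self.memory[addr] = self.data[addr]   # write back on replacement
            self.dirty.discard(addr)
        self.data.pop(addr, None)
```

Write-back reduces lower-level traffic when a block is written repeatedly, at the cost of tracking a dirty bit per block.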

Memory Hierarchy Basics
- Miss rate: fraction of cache accesses that result in a miss
- Causes of misses:
  - Compulsory: first reference to a block
  - Capacity: blocks discarded and later retrieved
  - Conflict: program makes repeated references to multiple addresses from different blocks that map to the same location in the cache

Memory Hierarchy Basics
- Note that speculative and multithreaded processors may execute other instructions during a miss
  - Reduces the performance impact of misses

Memory Hierarchy Basics
Six basic cache optimizations:
- Larger block size
  - Reduces compulsory misses
  - Increases capacity and conflict misses, increases miss penalty
- Larger total cache capacity to reduce miss rate
  - Increases hit time, increases power consumption
- Higher associativity
  - Reduces conflict misses
  - Increases hit time, increases power consumption
- Higher number of cache levels
  - Reduces overall memory access time
- Giving priority to read misses over writes
  - Reduces miss penalty
- Avoiding address translation in cache indexing
  - Reduces hit time

Ten Advanced Optimizations
1. Small and simple first-level caches
- Critical timing path:
  - addressing tag memory, then
  - comparing tags, then
  - selecting the correct set
- Direct-mapped caches can overlap tag compare and transmission of data
- Lower associativity reduces power because fewer cache lines are accessed

Addressing Cache (figure; from Chapter 5 — Large and Fast: Exploiting Memory Hierarchy)

Example: Intrinsity FastMATH (figure)

Set Associative Cache Organization (figure)

L1 Size and Associativity (figure: access time vs. size and associativity)

L1 Size and Associativity (figure: energy per read vs. size and associativity)

Way Prediction
- To improve hit time, predict the way to pre-set the mux
  - Misprediction gives a longer hit time
- Prediction accuracy:
  - > 90% for two-way
  - > 80% for four-way
  - I-cache has better accuracy than D-cache
- First used on the MIPS R10000 in the mid-90s
- Used on the ARM Cortex-A8
- Extend to predict the block as well
  - "Way selection"
  - Increases misprediction penalty
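A toy model of way prediction, where each set remembers the last way that hit and the mux is pre-set to it. The cycle costs (1-cycle hit, +1 cycle on a mispredict) are illustrative assumptions, not figures from the slides:

```python
predicted_way = {}   # per-set predictor: the last way that hit in that set

def access_cycles(set_index, actual_way):
    guess = predicted_way.get(set_index, 0)   # mux pre-set to the predicted way
    predicted_way[set_index] = actual_way     # train the predictor on the outcome
    return 1 if guess == actual_way else 2    # misprediction -> longer hit time
```

Repeated hits to the same way in a set pay the fast 1-cycle path; switching ways pays the extra cycle once, then trains the predictor.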

Pipelining Cache
- Pipeline cache access to improve bandwidth
- Examples:
  - Pentium: 1 cycle
  - Pentium Pro through Pentium III: 2 cycles
  - Pentium 4 through Core i7: 4 cycles
- Increases branch misprediction penalty
- Makes it easier to increase associativity

Nonblocking Caches
- Allow hits before previous misses complete
  - "Hit under miss"
  - "Hit under multiple miss"
- L2 must support this
- In general, processors can hide an L1 miss penalty but not an L2 miss penalty

Multibanked Caches
- Organize the cache as independent banks to support simultaneous access
  - ARM Cortex-A8 supports 1-4 banks for L2
  - Intel i7 supports 4 banks for L1 and 8 banks for L2
- Interleave banks according to block address

Critical Word First, Early Restart
- Critical word first:
  - Request the missed word from memory first
  - Send it to the processor as soon as it arrives
- Early restart:
  - Request words in normal order
  - Send the missed word to the processor as soon as it arrives
- The effectiveness of these strategies depends on block size and the likelihood of another access to the portion of the block that has not yet been fetched
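The fetch order under critical word first can be sketched as a wrap-around sequence starting at the missed word, so the processor can restart as soon as the first transfer arrives (an 8-word block is assumed here):

```python
def critical_word_first_order(missed_word, words_per_block=8):
    """Return the order in which the block's words are transferred:
    the missed word first, then the rest, wrapping around."""
    return [(missed_word + i) % words_per_block for i in range(words_per_block)]

print(critical_word_first_order(5))   # [5, 6, 7, 0, 1, 2, 3, 4]
```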

Merging Write Buffer
- When storing to a block that is already pending in the write buffer, update the write buffer
- Reduces stalls due to a full write buffer
- Do not apply to I/O addresses
(figure: no write buffering vs. write buffering)
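A minimal sketch of the merging behavior (the dictionary-backed buffer and four-entry capacity are illustrative assumptions):

```python
write_buffer = {}   # block address -> pending data

def buffered_store(block_addr, data, capacity=4):
    """Accept a store into the write buffer; merge if the block is already pending."""
    if block_addr in write_buffer:
        write_buffer[block_addr] = data   # merge: update the existing entry in place
        return True
    if len(write_buffer) < capacity:
        write_buffer[block_addr] = data   # allocate a new buffer entry
        return True
    return False                          # buffer full: this store would stall
```

Because repeated stores to the same block coalesce into one entry, the buffer fills more slowly and the processor stalls less often.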

Compiler Optimizations
- Loop interchange:
  - Swap nested loops to access memory in sequential order
- Blocking:
  - Instead of accessing entire rows or columns, subdivide matrices into blocks
  - Requires more memory accesses but improves locality of accesses
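Both transformations can be sketched in a few lines. Python lists stand in for the row-major arrays these optimizations target in C/Fortran; N and the block size B are arbitrary:

```python
N = 4
A = [[i * N + j for j in range(N)] for i in range(N)]   # row-major N x N matrix

# Loop interchange: with row-major storage, making the column index the inner
# loop gives stride-1 accesses instead of stride-N accesses.
total = 0
for i in range(N):          # rows outer
    for j in range(N):      # columns inner: sequential memory order
        total += A[i][j]

# Blocking: tile a transpose into B x B blocks so each tile of A and T
# stays cache-resident while it is being touched.
B = 2
T = [[0] * N for _ in range(N)]
for ii in range(0, N, B):
    for jj in range(0, N, B):
        for i in range(ii, min(ii + B, N)):
            for j in range(jj, min(jj + B, N)):
                T[j][i] = A[i][j]
```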

Hardware Prefetching
- Fetch two blocks on a miss (include the next sequential block)
(figure: Pentium 4 prefetching)

Compiler Prefetching
- Insert prefetch instructions before the data is needed
- Non-faulting: a prefetch doesn't cause exceptions
- Register prefetch: loads data into a register
- Cache prefetch: loads data into the cache
- Combine with loop unrolling and software pipelining

Summary (figure)

Memory Technology
- Performance metrics:
  - Latency is the concern of the cache
  - Bandwidth is the concern of multiprocessors and I/O
- Access time: time between a read request and when the desired word arrives
- Cycle time: minimum time between unrelated requests to memory
- DRAM is used for main memory, SRAM for cache

Memory Technology
- SRAM:
  - Requires low power to retain bits
  - Requires 6 transistors/bit
- DRAM:
  - Must be re-written after being read
  - Must also be periodically refreshed
    - Every 8 ms
    - Each row can be refreshed simultaneously
  - One transistor/bit
  - Address lines are multiplexed:
    - Upper half of address: row access strobe (RAS)
    - Lower half of address: column access strobe (CAS)
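The multiplexed addressing can be sketched as a bit split: the upper half of the address is presented with RAS, then the lower half with CAS. The 8-bit row and column widths are illustrative assumptions:

```python
ROW_BITS = COL_BITS = 8   # assumed widths for a 16-bit DRAM address

def split_dram_address(addr):
    """Split an address into the row half (sent with RAS) and
    the column half (sent with CAS)."""
    row = (addr >> COL_BITS) & ((1 << ROW_BITS) - 1)  # upper half: RAS
    col = addr & ((1 << COL_BITS) - 1)                # lower half: CAS
    return row, col

print(split_dram_address(0xABCD))   # (0xAB, 0xCD)
```

Multiplexing halves the number of address pins at the cost of presenting the address in two steps.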

Memory Technology
- Amdahl: memory capacity should grow linearly with processor speed
  - Unfortunately, memory capacity and speed have not kept pace with processors
- Some optimizations:
  - Multiple accesses to the same row
  - Synchronous DRAM:
    - Added clock to the DRAM interface
    - Burst mode with critical word first
  - Wider interfaces
  - Double data rate (DDR)
  - Multiple banks on each DRAM device

Memory Optimizations (figure)

Memory Optimizations (figure)

Memory Optimizations
- DDR:
  - DDR2:
    - Lower power (2.5 V - 1.8 V)
    - Higher clock rates (266 MHz, 333 MHz, 400 MHz)
  - DDR3:
    - 1.5 V
    - 800 MHz
  - DDR4:
    - 1-1.2 V
    - 1600 MHz
- GDDR5 is graphics memory based on DDR3

Memory Optimizations
- Graphics memory:
  - Achieves 2-5x bandwidth per DRAM vs. DDR3
    - Wider interfaces (32 vs. 16 bits)
    - Higher clock rate
      - Possible because they are attached via soldering instead of socketed DIMM modules
- Reducing power in SDRAMs:
  - Lower voltage
  - Low-power mode (ignores clock, continues to refresh)

Memory Power Consumption (figure)

Flash Memory
- Type of EEPROM
- Must be erased (in blocks) before being overwritten
- Nonvolatile
- Limited number of write cycles
- Cheaper than SDRAM, more expensive than disk
- Slower than SDRAM, faster than disk

Memory Dependability
- Memory is susceptible to cosmic rays
- Soft errors: dynamic errors
  - Detected and fixed by error-correcting codes (ECC)
- Hard errors: permanent errors
  - Use spare rows to replace defective rows
- Chipkill: a RAID-like error recovery technique
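ECC in real memory systems typically uses SECDED codes over 64-bit words; as a minimal stand-in, a Hamming(7,4) code shows how a syndrome both detects and locates a single flipped bit:

```python
def hamming74_encode(d):
    """Encode 4 data bits [d1, d2, d3, d4] into a 7-bit Hamming codeword
    laid out as positions 1..7 = p1, p2, d1, p3, d2, d3, d4."""
    d1, d2, d3, d4 = d
    p1 = d1 ^ d2 ^ d4    # parity over positions 1, 3, 5, 7
    p2 = d1 ^ d3 ^ d4    # parity over positions 2, 3, 6, 7
    p3 = d2 ^ d3 ^ d4    # parity over positions 4, 5, 6, 7
    return [p1, p2, d1, p3, d2, d3, d4]

def hamming74_correct(codeword):
    """Recompute the parity checks; a nonzero syndrome is the 1-based
    position of the flipped bit. Fix it and return the 4 data bits."""
    c = codeword[:]
    s1 = c[0] ^ c[2] ^ c[4] ^ c[6]
    s2 = c[1] ^ c[2] ^ c[5] ^ c[6]
    s3 = c[3] ^ c[4] ^ c[5] ^ c[6]
    syndrome = s1 + 2 * s2 + 4 * s3
    if syndrome:
        c[syndrome - 1] ^= 1          # correct the single-bit error in place
    return [c[2], c[4], c[5], c[6]]   # extract d1..d4
```

Any single bit flip (data or parity) is corrected; this is the mechanism behind the slide's "detected and fixed by ECC", scaled down to 4 data bits.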

Virtual Memory
- Protection via virtual memory
  - Keeps processes in their own memory space
- Role of architecture:
  - Provide user mode and supervisor mode
  - Protect certain aspects of CPU state
  - Provide mechanisms for switching between user mode and supervisor mode
  - Provide mechanisms to limit memory accesses
  - Provide a TLB to translate addresses

Virtual Machines
- Support isolation and security
- Allow sharing a computer among many unrelated users
- Enabled by the raw speed of processors, making the overhead more acceptable
- Allow different ISAs and operating systems to be presented to user programs
  - "System Virtual Machines"
  - SVM software is called a "virtual machine monitor" or "hypervisor"
  - Individual virtual machines running under the monitor are called "guest VMs"

Impact of VMs on Virtual Memory
- Each guest OS maintains its own set of page tables
  - The VMM adds a level of memory between physical and virtual memory called "real memory"
  - The VMM maintains a shadow page table that maps guest virtual addresses to physical addresses
    - Requires the VMM to detect the guest's changes to its own page table
    - Occurs naturally if accessing the page table pointer is a privileged operation
