Piranha - Barroso

Piranha: Designing a Scalable CMP-based System for Commercial Workloads
Luiz André Barroso
Western Research Laboratory
April 27, 2001
Asilomar Microcomputer Workshop

What is Piranha?
• A scalable shared memory architecture based on chip multiprocessing (CMP) and targeted at commercial workloads
• A research prototype under development by Compaq Research and Compaq NonStop Hardware Development Group
• A departure from ever increasing processor complexity and system design/verification cycles

Importance of Commercial Applications
[Pie chart: Worldwide Server Customer Spending (IDC 1999). Legible segments: Scientific & engineering, Other, Decision support 14%, Business processing 22%.]
• Total server market size in 1999: $55-60B
  – technical applications: less than $6B
  – commercial applications: $40B

Price Structure of Servers
• IBM eServer 680 (220KtpmC; $43/tpmC)
  ▪ 24 CPUs
  ▪ 96GB DRAM, 18TB Disk
  ▪ $9M price tag
• Compaq ProLiant ML370 (32KtpmC; $12/tpmC)
  ▪ 4 CPUs
  ▪ 8GB DRAM, 2TB Disk
  ▪ $240K price tag

[Chart: normalized breakdown of HW cost (I/O, DRAM, CPU, Base) for IBM eServer 680 and Compaq ProLiant ML570]

Price per component:

  System                   $/CPU    $/MB DRAM   $/GB Disk
  IBM eServer 680          65,417   9           359
  Compaq ProLiant ML570    6,048    4           64

- Storage prices dominate (50%-70% in customer installations)
- Software maintenance/management costs even higher (up to $100M)
- Price of expensive CPUs/memory system amortized
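As a rough cross-check of the per-component prices on this slide, the sketch below multiplies each unit price by the configured quantity. It assumes the unit prices are in US dollars, that 1 GB = 1024 MB and 1 TB = 1024 GB, and that the Compaq price row applies to the 4-CPU configuration listed on the slide. The dictionary keys and function names are illustrative, not from the talk.

```python
# Unit prices and configurations as read from the slide.
# Currency and binary unit conversions are assumptions.
SYSTEMS = {
    "IBM eServer 680": {
        "cpus": 24, "dram_gb": 96, "disk_tb": 18,
        "per_cpu": 65_417, "per_mb_dram": 9, "per_gb_disk": 359,
    },
    "Compaq ProLiant": {
        "cpus": 4, "dram_gb": 8, "disk_tb": 2,
        "per_cpu": 6_048, "per_mb_dram": 4, "per_gb_disk": 64,
    },
}


def component_costs(cfg):
    """Return the cost of CPUs, DRAM, and disk for one configuration."""
    return {
        "cpu": cfg["cpus"] * cfg["per_cpu"],
        "dram": cfg["dram_gb"] * 1024 * cfg["per_mb_dram"],
        "disk": cfg["disk_tb"] * 1024 * cfg["per_gb_disk"],
    }


def disk_share(cfg):
    """Fraction of the three-component total that goes to disk."""
    costs = component_costs(cfg)
    return costs["disk"] / sum(costs.values())


for name, cfg in SYSTEMS.items():
    print(name, component_costs(cfg), f"disk share = {disk_share(cfg):.0%}")
```

Under these assumptions disk is the largest of the three components for both systems, which is in line with the slide's remark that storage prices dominate.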

Outline
• Importance of Commercial Workloads
• Commercial Workload Requirements
• Trends in Processor Design
• Piranha
• Design Methodology
• Summary

Studies of Commercial Workloads
• Collaboration with Kourosh Gharachorloo (Compaq WRL)
  – ISCA’98: Memory System Characterization of Commercial Workloads (with E. Bugnion)
  – ISCA’98: An Analysis of Database Workload Performance on Simultaneous Multithreaded Processors (with J. Lo, S. Eggers, H. Levy, and S. Parekh)
  – ASPLOS’98: Performance of Database Workloads on Shared-Memory Systems with Out-of-Order Processors (with P. Ranganathan and S. Adve)
  – HPCA’00: Impact of Chip-Level Integration on Performance of OLTP Workloads (with A. Nowatzyk and B. Verghese)
  – ISCA’01: Code Layout Optimizations for Transaction Processing Workloads (with A. Ramirez, R. Cohn, J. Larriba-Pey, G. Lowney, and M. Valero)

Studies of Commercial Workloads: summary
• Memory system is the main bottleneck
  – astronomically high CPI
  – dominated by memory stall times
  – instruction stalls as important as data stalls
  – fast/large L2 caches are critical
• Very poor Instruction Level Parallelism (ILP)
  – frequent hard-to-predict branches
  – large L1 miss ratios
  – Ld-Ld dependencies
  – disappointing gains from wide-issue out-of-order techniques!

Increasing Complexity of Processor Designs
• Pushing limits of instruction-level parallelism
  – multiple instruction issue
  – speculative out-of-order (OOO) execution
• Driven by applications such as SPEC
• Increasing design time and team size

  Three SGI processors (names not legible in the transcription):

  Transistor count (millions)          0.10    1.40    6.80
  Design team size                     20      55      100
  Design time (months)                 15      24      36
  Verification team size (% of total)  15%     20%     35%

  courtesy: John Hennessy, IEEE Computer, 32(8)
• Yielding diminishing returns in performance

Exploiting Higher Levels of Integration
[Block diagram: Alpha 21364 single chip with a 1GHz 21264 CPU, 64KB I and 64KB D caches, 1.5MB L2, Coherence Engine, Network Interface, and memory controllers (MEM-CTL); shown alongside a multi-node system of 364 chips, each with memory (M) and I/O (IO)]
• lower latency, higher bandwidth
• reuse of existing CPU core addresses complexity issues
• incrementally scalable glueless multiprocessing

Exploiting Parallelism in Commercial Apps
• Simultaneous Multithreading (SMT)
  – Example: Alpha 21464
  [Diagram: issue slots over time shared by thread 1 through thread 4 on one CPU]
• Chip Multiprocessing (CMP)
  – Example: IBM Power4
  [Diagram: two CPUs, each with I and D caches, sharing an L2, with MEM-CTL, Coherence, Network, and I/O blocks]
• SMT superior in single-thread performance
• CMP addresses complexity by using simpler cores

Outline
• Importance of Commercial Workloads
• Commercial Workload Requirements
• Trends in Processor Design
• Piranha
  – Architecture
  – Performance
• Design Methodology
• Summary

Piranha Project
• Explore chip multiprocessing for scalable servers
• Focus on parallel commercial workloads
• Small team, modest investment, short design time
• Address complexity by using:
  – simple processor cores
  – standard ASIC methodology

Give up on ILP, embrace TLP

Piranha Team Members

Research
  – Luiz André Barroso (WRL)
  – Kourosh Gharachorloo (WRL)
  – David Lowell (WRL)
  – Joel McCormack (WRL)
  – Mosur Ravishankar (WRL)
  – Rob Stets (WRL)
  – Yuan Yu (SRC)

NonStop Hardware Development ASIC Design Center
  – Tom Heynemann
  – Dan Joyce
  – Harland Maxwell
  – Harold Miller
  – Sanjay Singh
  – Scott Smith
  – Jeff Sprouse
  – several contractors

Former Contributors
  Robert McNamara, Basem Nayfeh, Andreas Nowatzyk, Joan Pendleton, Shaz Qadeer, Brian Robinson, Barton Sano, Daniel Scales, Ben Verghese

Piranha Processing Node
[Block diagram: single chip with eight CPUs, each with I and D caches, plus L2 banks, MEM-CTL blocks, the ICS, HE, RE, and Router]
• Alpha core: 1-issue, in-order, 500MHz
• L1 caches: I&D, 64KB, 2-way
• Intra-chip switch (ICS): 32GB/sec, 1-cycle delay
• L2 cache: shared, 1MB, 8-way
• Memory Controller (MC): RDRAM, 12.8GB/sec
• Protocol Engines (HE & RE): µprog., 1K µinstr., even/odd interleaving
• System Interconnect: 4-port Xbar router, topology independent, 32GB/sec total bandwidth

Piranha I/O Node
[Block diagram: one CPU with I and D caches, L2, MEM-CTL, HE, RE, ICS, FB, PCI-X, and a Router with 2 links @ 8GB/s]
• I/O node is a full-fledged member of system interconnect
  – CPU indistinguishable from Processing Node CPUs
  – participates in global coherence protocol

Example Configuration
[Diagram: interconnected Processing nodes (P) and I/O nodes (P-I/O)]
• Arbitrary topologies
• Match ratio of Processing to I/O nodes to application requirements

L2 Cache and Intra-Node Coherence
• No inclusion between L1s and L2 cache
  – total L1 capacity equals L2 capacity
  – L2 misses go directly to L1
  – L2 filled by L1 replacements
• L2 keeps track of all lines in the chip
  – sends Invalidates, Forwards
  – orchestrates L1-to-L2 write-backs to maximize chip-memory utilization
  – cooperates with Protocol Engines to enforce system-wide coherence
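The fill policy on this slide (memory fills go straight to L1, and the L2 is filled only by L1 replacements) can be illustrated with a toy model. The sketch below is a deliberately simplified assumption: a single L1, strict exclusion, LRU replacement, and fully associative caches. It does not model multiple L1s, forwards, or the duplicate-tag tracking the slide attributes to the L2, and the class name is hypothetical.

```python
from collections import OrderedDict


class ExclusiveHierarchy:
    """Toy single-core model: one L1 backed by an L2 that holds only L1 victims."""

    def __init__(self, l1_lines, l2_lines):
        self.l1 = OrderedDict()  # line address -> None, kept in LRU order
        self.l2 = OrderedDict()
        self.l1_lines = l1_lines
        self.l2_lines = l2_lines

    def access(self, line):
        """Return 'L1 hit', 'L2 hit', or 'L2 miss' for one access."""
        if line in self.l1:
            self.l1.move_to_end(line)
            return "L1 hit"
        if line in self.l2:
            del self.l2[line]        # line moves up; no copy stays in L2
            result = "L2 hit"
        else:
            result = "L2 miss"       # fill from memory bypasses the L2
        self._fill_l1(line)
        return result

    def _fill_l1(self, line):
        if len(self.l1) >= self.l1_lines:
            victim, _ = self.l1.popitem(last=False)  # evict the LRU line
            self._fill_l2(victim)                    # L2 is filled by L1 victims
        self.l1[line] = None

    def _fill_l2(self, line):
        if len(self.l2) >= self.l2_lines:
            self.l2.popitem(last=False)              # drop the LRU L2 line
        self.l2[line] = None


h = ExclusiveHierarchy(l1_lines=2, l2_lines=2)
trace = [h.access(a) for a in (0, 1, 2, 0, 3, 1)]
print(trace)
```

Because no line lives in both levels, the two caches together hold four distinct lines here. That is the motivation when L1 and L2 capacities are equal: an inclusive L2 of the same size would add no effective capacity.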

Inter-Node Coherence Protocol
• ‘Stealing’ ECC bits for memory directory

  Organization (data + ECC + directory bits)   Total directory bits
  8 x (64 + 8)                                 0
  4 x (128 + 9 + 7)                            28
  2 x (256 + 10 + 22)                          44
  1 x (512 + 11 + 53)                          53

• Directory (2b state + 40b sharing info)
  – two fields, each: state 2b | info on sharers 20b
• Dual representation: limited pointer + coarse vector
• “Cruise Missile” Invalidations (CMI)
  – limit fan-out/fan-in serialization with CV
• Several new protocol optimizations
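The bit counts on this slide are consistent with a standard single-error-correcting, double-error-detecting (SEC-DED) Hamming code over a 512-bit line that normally carries 8 check bits per 64 data bits. The slide does not name the code or the line size, so both are assumptions in the sketch below, and the function names are illustrative.

```python
LINE_DATA_BITS = 512  # assumed 64-byte memory line


def secded_bits(data_bits):
    """Check bits for a SEC-DED Hamming code over `data_bits` of data."""
    r = 0
    while (1 << r) < data_bits + r + 1:  # Hamming bound for single-error correction
        r += 1
    return r + 1                         # one extra parity bit for double-error detection


# Storage already present when ECC is computed per 64-bit word.
BASELINE_CHECK_BITS = (LINE_DATA_BITS // 64) * secded_bits(64)


def directory_bits(block_bits):
    """Bits freed per line when ECC is computed over `block_bits` instead of 64."""
    blocks = LINE_DATA_BITS // block_bits
    return BASELINE_CHECK_BITS - blocks * secded_bits(block_bits)


table = {b: directory_bits(b) for b in (64, 128, 256, 512)}
print(table)  # {64: 0, 128: 28, 256: 44, 512: 53}
```

Computing ECC over 256-bit blocks frees 44 bits per line, which matches the two 22-bit directory fields (2b state plus 20b sharer info each) shown on the slide.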

Simulated Architectures

Single-Chip Piranha Performance
[Bar charts: normalized execution time for OLTP and DSS on P1 (500MHz, 1-issue), INO (1GHz, 1-issue), OOO (1GHz, 4-issue), and P8 (500MHz, 1-issue)]
• Piranha’s performance margin 3x for OLTP and 2.2x for DSS
• Piranha has more outstanding misses → better utilizes memory system

Single-Chip Performance (Cont.)
[Charts: speedup versus number of cores (1 to 8); normalized breakdown of L1 misses (%) into L2 Hit, L2 Fwd, and L2 Miss for P1, P2, P4, and P8 (500 MHz, 1-issue)]
• Near-linear scalability
  – low memory latencies
  – effectiveness of highly associative L2 and non-inclusive caching

Potential of a Full-Custom Piranha
[Bar charts: normalized execution time for OLTP and DSS, including P8F (1.25GHz, 1-issue)]
• 5x margin over OOO for OLTP and DSS
• Full-custom design benefits substantially from boost in core speed

Managing Complexity in the Architecture
• Use of many simpler logic modules
  – shorter design
  – easier verification
  – only short wires*
  – faster synthesis
  – simpler chip-level layout
• Simplify intra-chip communication
  – all traffic goes through ICS (no backdoors)
• Use of microprogrammed protocol engines
• Adoption of large VM pages
• Implement sub-set of Alpha ISA
  – no VAX floating point, no multimedia instructions, etc.

Methodology Challenges
• Isolated sub-module testing
  – need to create robust bus functional models (BFM)
  – sub-modules’ behavior highly inter-dependent
  – not feasible with a small team
• System-level (integrated) testing
  – much easier to create tests
  – only one BFM at the processor interface
  – simpler to assert correct operation
  – Verilog simulation is too slow for comprehensive testing

Our Approach:
• Design in stylized C++ (synthesizable RTL level)
  – use mostly system-level, semi-random testing
  – simulations in C++ (faster & cheaper than Verilog)
    ▪ simulation speed 1000 clocks/second
  – employ directed tests to fill test coverage gaps
• Automatic C++ to Verilog translation
  – single design database
  – reduce translation errors
  – faster turnaround of design changes
  – risk: untested methodology
• Using industry-standard synthesis tools
• IBM ASIC process (Cu11)
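To make the semi-random testing idea concrete, here is a minimal sketch in Python rather than the project's own modeling language. It is not Piranha code: the 4-entry FIFO, its interface, and the stimulus probabilities are all hypothetical. A cycle-based model is driven with seeded random stimulus and checked every clock against a simple reference queue.

```python
import random
from collections import deque


class FifoRtl:
    """Cycle-based model of a 4-entry FIFO, written in a register-transfer style."""
    DEPTH = 4

    def __init__(self):
        self.mem = [0] * self.DEPTH
        self.head = 0    # read pointer register
        self.count = 0   # occupancy register

    def clock(self, push, data, pop):
        """Advance one clock; return the popped value, or None if nothing was popped."""
        do_pop = pop and self.count > 0
        do_push = push and self.count < self.DEPTH
        out = self.mem[self.head] if do_pop else None
        if do_push:
            tail = (self.head + self.count) % self.DEPTH
            self.mem[tail] = data
        # Register updates take effect at the end of the cycle.
        if do_pop:
            self.head = (self.head + 1) % self.DEPTH
        self.count += int(do_push) - int(do_pop)
        return out


def run_random_test(cycles, seed):
    """Drive the model with seeded random stimulus and compare against a reference queue."""
    rng = random.Random(seed)
    dut, ref = FifoRtl(), deque()
    for cycle in range(cycles):
        push, pop = rng.random() < 0.6, rng.random() < 0.5
        data = rng.randrange(256)
        # Reference behaviour, evaluated on the state before this clock edge.
        expected = ref[0] if (pop and ref) else None
        accept = push and len(ref) < FifoRtl.DEPTH
        if pop and ref:
            ref.popleft()
        if accept:
            ref.append(data)
        got = dut.clock(push, data, pop)
        assert got == expected, f"mismatch at cycle {cycle}"
    return cycles


print(run_random_test(cycles=10_000, seed=1))
```

Fixing the seed keeps any failure reproducible. Directed tests can then target corner cases the random stimulus rarely reaches, as the slide suggests.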

Piranha Methodology: Overview
[Flow diagram: C++ RTL Models are compiled by cxx for PS1 and translated by CLevel into Verilog Models, which feed PS1V and Physical Design]
• C++ RTL Models: cycle accurate and “synthesizeable”
• PS1: fast (C++) logic simulator
• Verilog Models: machine translated from C++ models
• Physical Design: leverages industry-standard Verilog-based tools
• cxx: C++ compiler
• CLevel: C++-to-Verilog translator
• PS1V: can “co-simulate” C++ and Verilog module versions and check correspondence
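The correspondence check attributed to PS1V can be pictured as running two versions of a module in lockstep on identical inputs and comparing their outputs every cycle. The sketch below is only an illustration in Python with hypothetical names; it says nothing about how PS1V is actually implemented. `CounterB` stands in for a machine-translated version of `CounterA`.

```python
import random


class CounterA:
    """Reference version: 8-bit counter with synchronous reset and enable."""

    def __init__(self):
        self.value = 0

    def clock(self, reset, enable):
        if reset:
            self.value = 0
        elif enable:
            self.value = (self.value + 1) % 256
        return self.value


class CounterB:
    """Stand-in for a translated version of the same module."""

    def __init__(self, width=8):
        self.mask = (1 << width) - 1
        self.value = 0

    def clock(self, reset, enable):
        self.value = 0 if reset else (self.value + int(enable)) & self.mask
        return self.value


def random_stimulus(cycles, seed):
    """Seeded list of (reset, enable) pairs."""
    rng = random.Random(seed)
    return [(rng.random() < 0.01, rng.random() < 0.9) for _ in range(cycles)]


def cosimulate(a, b, stimulus):
    """Run both versions in lockstep; return the first mismatching cycle, or None."""
    for cycle, (reset, enable) in enumerate(stimulus):
        if a.clock(reset, enable) != b.clock(reset, enable):
            return cycle
    return None


print(cosimulate(CounterA(), CounterB(), random_stimulus(5000, seed=3)))  # None
print(cosimulate(CounterA(), CounterB(width=7), [(False, True)] * 200))   # 127
```

The second call injects a deliberate width error to show that a divergence is reported at the first cycle where the two versions disagree.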

Summary
• CMP architectures are inevitable in the near future
• Piranha investigates an extreme point in CMP design
  – many simple cores
• Piranha has a large architectural advantage over complex single-core designs (3x) for database applications
• Piranha methodology enables faster design turnaround
• Key to Piranha is application focus:
  – One-size-fits-all solutions may soon be infeasible

Reference
• Papers on commercial workload performance & Piranha: research.compaq.com/wrl/projects/Database
