Collective Framework And Performance Optimization To Open .

2y ago

25 Views

2 Downloads

2.32 MB

24 Pages

Last View : 16d ago

Last Download : 3m ago

Upload by : Grady Mosby

Report this link

Download PDF

Transcription

Collective Framework and PerformanceOptimization to Open MPI for Cray XT 5platformsCray Users Group 20111Managed by UT-Battellefor the Department of EnergyGraham OpenMPI SC08

Collectives are Critical for HPC ApplicationPerformance A large percentage of application execution time is spent inthe global synchronization operations (collectives) Moving towards exascale systems (million processorcores), the time spent in collectives only increases Performance and scalability of HPC applications requiresefficient and scalable collective operations2Managed by UT-Battellefor the Department of EnergyGraham OpenMPI SC08

Weakness in current Open MPI implementationOpen MPI lacks support for Customized collective implementation for arbitrarycommunication hierarchies Concurrent progress of collectives on differentcommunication hierarchies Nonblocking collectives Taking advantage of capabilities of recent networkinterfaces (example offload capabilities) Efficient point-to-point message protocol for Cray XTplatforms3Managed by UT-Battellefor the Department of EnergyGraham OpenMPI SC08

Cheetah : A Framework for ScalableHierarchical CollectivesGoals of the framework Provide building blocks for implementing collectives forarbitrary communication hierarchy Support collectives tailored to the communicationhierarchy Support both blocking and nonblocking collectivesefficiently Enable building collectives customized for the hardwarearchitecture4Managed by UT-Battellefor the Department of EnergyGraham OpenMPI SC08

Cheetah Framework : Design principles Collective operation is split into collective primitives overdifferent communication hierarchies Collective primitives over the different hierarchies areallowed to progress concurrently Decouple the topology of a collective operation from theimplementation, enabling the reusability of primitives Design decisions are driven by nonblocking collectivedesign, blocking collectives are a special case ofnonblocking ones Use Open MPI component architecture5Managed by UT-Battellefor the Department of EnergyGraham OpenMPI SC08

Cheetah is Implemented as a Part of Open MUMAPTPCOLLIBOFFLOADBASEMUMACheetah ComponentsOpen MPI Components6Managed by UT-Battellefor the Department of EnergyGraham OpenMPI SC08

Cheetah Components and its Functions Base Collectives (BCOL) – Implements basic collectiveprimitives Subgrouping (SBGP) – Provides rules for grouping theprocesses Multilevel (ML) – Coordinates collective primitiveexecution, manages data and control buffers, and mapsMPI semantics to BCOL primitives Schedule – Defines the collective primitives that are partof collective operation Progress Engine – Responsible for starting, progressingand completing the collective primitives7Managed by UT-Battellefor the Department of EnergyGraham OpenMPI SC08

BCOL Component – Base collective primitives Provides collective primitives that are optimized for certaincommunication hierarchies– BASESMUMA: Shared memory– P2P: SeaStar 2 , Ethernet, InfiniBand– IBNET: ConnectX-2 A collective operation is implemented as a combination ofthese primitives– Example, n level Barrier can be a combination of Fanin ( firstn-1 levels), Barrier (nth level) and Fanout ( first n-1 levels)8Managed by UT-Battellefor the Department of EnergyGraham OpenMPI SC08

SBGP Component – Group the Processes Basedon the Communication HierarchyP2P SubgroupUMA SubroupUMA Group LeaderSocket SubroupsSocket Group LeaderCPU SocketAllocated CoreUnallocated CoreNode 19Managed by UT-Battellefor the Department of EnergyNode 2Graham OpenMPI SC08

Open MPI portals BTL optimizationSender MPI ProcessReceiver MPI ProcessMPI MessageOpen MPIMessagePortals MessageAckXPortal acknowledgment is not required for Cray XT 5 platforms asthey use Basic End to End Protocol (BEER) for message transfer10 Managed by UT-Battellefor the Department of EnergyGraham OpenMPI SC08

Experimental Setup Hardware :Jaguar– 18,688 Compute Nodes– 2.6 GHz AMD Opteron (Istanbul)– SeaStar 2 Routers connected in a 3D torus topology Benchmarks :– Point-to-Point : OSU Latency and Bandwidth– Collectives : Broadcast in a tight loop Barrier in a tight loop11 Managed by UT-Battellefor the Department of EnergyGraham OpenMPI SC08

1 Byte Open MPI P2P Latency is15% better than Cray MPIOMPI vs CRAY portals latency110OMPI with portals optimizationOMPI without portals optimizationCray MPI10090Latency (Usec)807060504030201001101001000Message size (bytes)12 Managed by UT-Battellefor the Department of EnergyGraham OpenMPI SC08100001000001e 06

Open MPI and Cray MPI bandwidth saturateat 2 Gbp/sOMPI vs CRAY portals bandwidth2500OMPI with portals optimizationOMPI without portals optimizationCray MPIBandwidth (Mb/s)2000150010005000110100100010000Message size (bytes)13 Managed by UT-Battellefor the Department of EnergyGraham OpenMPI SC081000001e 061e 07

Hierarchical Collective Algorithms14 Managed by UT-Battellefor the Department of EnergyGraham OpenMPI SC08

Flat Barrier AlgorithmHost 1Host 21234Step 11234Inter HostCommunicationStep 2115 Managed by UT-Battellefor the Department of Energy23Graham OpenMPI SC084

Hierarchical Barrier AlgorithmHost 1Host 21234Step 11234Inter HostCommunicationStep 21234Step 3116 Managed by UT-Battellefor the Department of Energy23Graham OpenMPI SC084

Cheetah’s Barrier Collective Outperforms theCray MPI Barrier by 10%140CheetahCray MPI120Latency (microsec.)10080604020002000400060008000MPI Processes17 Managed by UT-Battellefor the Department of EnergyGraham OpenMPI SC08100001200014000

Data Flow in a Hierarchical Broadcast AlgorithmSNODE 1SNODE 2Source of the Broadcast18 Managed by UT-Battellefor the Department of EnergyGraham OpenMPI SC08

Hierarchical Broadcast Algorithms Knownroot Hierarchical Broadcast– the suboperations are ordered based on the source of data– the suboperations are concurrently started after theexecution of suboperation with the source of broadcast– uses k-nomial tree for data distribution N-ary Hierarchical Broadcast– same as Knownroot algorithm but uses N-ary tree for datadistribution Sequential Hierarchical Broadcast– the suboperations are ordered sequentially– there is no concurrent execution19 Managed by UT-Battellefor the Department of EnergyGraham OpenMPI SC08

Cheetah’s Broadcast Collective Outperforms theCray MPI Broadcast by 10% (8 Byte)908070Latency (microsec.)6050403020Cray MPICheetah three level known k-nomialCheetah three level known n-aryCheetah three level sequential bcast100050001000015000MPI Processes20 Managed by UT-Battellefor the Department of EnergyGraham OpenMPI SC082000025000

Cheetah’s Broadcast Collective Outperforms theCray MPI Broadcast by 92% (4 KB)200Latency (microsec.)15010050Cray MPICheetah three-level known k-nomialCheetah three-level known NB n-aryCheetah three-level known NB k-nomialCheetah sequential bcast0021 Managed by UT-Battellefor the Department of Energy100002000030000MPI ProcessesGraham OpenMPI SC084000050000

Cheetah’s Broadcast Collective Outperforms theCray MPI Broadcast by 9% (4 MB)550005000045000Latency (Usec)40000350003000025000Cray MPICheetah three level known k-nomialCheetah three level known n-aryCheetah three level sequential bcast20000150000500010000MPI Processes22 Managed by UT-Battellefor the Department of EnergyGraham OpenMPI SC08150002000025000

Summary Cheetah’s Broadcast is 92% better than the Cray MPI’sBroadcast Cheetah’s Barrier outperforms Cray MPI’s Barrier by 10% Open MPI point-to-point message latency is 15% betterthan the Cray MPI (1 byte message) The key to the performance and scalability of thecollective operations––––Concurrent execution of sub-operationsScalable resource usage techniquesAsynchronous semantics and progressCustomized collective primitives for each of communicationhierarchy23 Managed by UT-Battellefor the Department of EnergyGraham OpenMPI SC08

Acknowledgements US Department of Energy ASCR FASTOSprogram National Center For Computational Sciences,ORNL24 Managed by UT-Battellefor the Department of EnergyGraham OpenMPI SC08

10 Managed by UT-Battelle for the Department of Energy Graham_OpenMPI_SC08 Open MPI portals BTL optimization Open MPI Message Portals Message MPI Message Ack Sender MPI Process Receiver MPI Process X Portal acknowledgment is not required for Cray XT 5 platforms as they use Basic End to End Protocol (BEER) for message transfer

Related Documents:

Derivative-free optimization methods

Since the eld { also referred to as black-box optimization, gradient-free optimization, optimization without derivatives, simulation-based optimization and zeroth-order optimization { is now far too expansive for a single survey, we focus on methods for local optimization of continuous-valued, single-objective problems.

32 Views

1y ago

Collective Bargaining Simulation - Western University

determine the terms and conditions of the Collective Agreement. This Collective Agreement then sets the rules for the workplace until the next bargaining round. The best way to learn about collective bargaining is to do it! We will be using a collective bargaining simulation designed by Dr. Kelly Williams-Whitt of the University of Lethbridge, AB.

13 Views

1y ago

I Fleet Assignment Using Collective Intelligence - NASA

fleet constraints. Finally, the total fleet size F is enforced using: slk - F 5 0 k This results in a total of 184 constraints. Collective Intelligence and Product Distribution Theory Collective Intelligence (COIN) is a framework for design- ing a collective, defined as a group of agents with a specified world utility or system-level objective.

8 Views

1y ago

Hierarchical topology and shape optimization of crash ...

An approach for the combined topology, shape and sizing optimization of profile cross-sections is the method of Graph and Heuristic Based Topology Optimization (GHT) [4], which separates the optimization problem into an outer optimization loop for the topology modification and an inner optimization loo

67 Views

2y ago

Topology Optimization of Front Leaf Spring Mounting Bracket

Structure topology optimization design is a complex multi-standard, multi-disciplinary optimization theory, which can be divided into three category Sizing optimization, Shape optimization and material selection, Topology optimization according to the structura

72 Views

2y ago

Review of Data-Driven Robust Optimization - IEOM

2. Robust Optimization Robust optimization is one of the optimization methods used to deal with uncertainty. When the parameter is only known to have a certain interval with a certain level of confidence and the value covers a certain range of variations, then the robust optimization approach can be used. The purpose of robust optimization is .

23 Views

1y ago

Topology Optimization Design of Automotive Engine Bracket

2. Topology Optimization Method Based on Variable Density 2.1. Basic Theory There are three kinds of structure optimization, they are: size optimization, shape optimization and topology op-timization. Three optimization methods correspond to the three stages of the product design process, namely the

12 Views

1y ago

AP Calculus Syllabus Mrs. Latta - bhamcityschools.org

alculus In Motion “Related Rates” * Related Rates MORE” 4.7 Applied Optimization Pg. 262-269 #2-8E, 12, 19 WS –Optimization(LL) NC #45(SM) MMM 19 Optimization MMM 20 Economic Optimization Problems WS – Optimization(KM) Calculus In Motion “Optimization-Applications” TEST: CH

38 Views

2y ago

Recent Views

Legal Proceedings and Legal Privilege Exemptions: Myth-busting - ICO

If asking for legal advice, say so, and start new email chain If giving legal advice, say so Involve lawyers (before litigation contemplated) Maintain confidentiality of legal advice documents Limit dissemination of legal advice (need to know; original only) Make internal communications re legal advice factual

1y ago

240 Views

Smart People Ask for (My) Advice: Seeking Advice Boosts .

advice strategically is likely to be a different experi-ence for the advice seeker than seeking advice with the intention of using it, from the advisor’s perspec-tive, strategic advice seeking may elicit the same per-ceptual effects as authentic advice seeking because the advice seeker’s intentions (and her reliance on advice)

3y ago

177 Views

Legal Action Group The Role of Advice Services in Health Outcomes

The Role of Advice Services in Health Outcomes Evidence Review and Mapping Study June 2015 The Role of Advice Services in Health Outcomes . tor.!Our! r,!

1y ago

170 Views

Legal Information vs Legal Advice Guidelines - TMCEC

giving legal advice. Legal advice is a written or oral statement that: o Interprets some aspect of the law, court rules, or court procedures; o Recommends a specific course of conduct a person should take in an actual or potential legal proceeding; or o Applies the law to the individual person's specific factual circumstances. What is Legal .

1y ago

225 Views

ProQual L2 Certificate Supporting Access to Legal Advice

R/502/7657 Communicating with legal advice clients 2 3 D/503/0822 Supporting clients to make use of the legal advice service 2 3 R/502/7660 Enabling legal advice clients to access signposting and referral opportunities 2 3 Optional Units - a minimum of 6 credits Unit Reference Number Unit Title Unit Level Credit Value

1y ago

173 Views

Guidance for opponents in civil legal aid cases - Scottish Legal Aid Board

injury case - may apply for civil legal aid (since this leaﬂet deals only with civil legal aid, where we refer to "legal aid" we mean "civil legal aid"). Legal aid is ﬁnancial help from public funds. It helps people who qualify to get legal advice and the help of a solicitor to put their case in court.

4m ago

110 Views

Priority Banking Tariff - Standard Chartered

Foreign exchange rate Free Free Free Free Free Free Free Free Free Free Free Free Free Free Free SMS Banking Daily Weekly Monthly. in USD or in other foreign currencies in VND . IDD rates min. VND 85,000 Annual Rental Fee12 Locker size Small Locker size Medium Locker size Large Rental Deposit12,13 Lock replacement

2y ago

206 Views

legal and ethical dimensions of practice - Dovetail

Material in this Guide should never be taken as providing you or any other person with legal advice. Legal advice regarding the application of the law to a particular circumstance or situation can only come from a legal practitioner. A range of sources for legal advice can be found in the Guide.

1y ago

167 Views

How Social Welfare Legal Advice and Social Prescribing can work .

The position of social welfare legal advice and its role in London's recovery The Mayor of London and partners should position social welfare legal advice as a core pillar of Londons recovery from the OVID-19 pandemic, with a core focus on ensuring adequate funding and practical support for advice agencies to ensure ongoing viability.

1y ago

172 Views

WHAT TO DO IF YOU ARE SEXUALLY HARASSED

There are many legal clinics or legal information centres you can contact to obtain legal information, educational resources or legal referrals. Alberta Central Alberta Community Legal Clinic (Red Deer) Centre for Public Legal Education Alberta Pro Bono Law Alberta Women's Centre Legal Advice Clinic (Calgary)

3y ago

245 Views

Legal Advocacy Essentials

Legal Advocacy Essentials: a core training for legal advocates Presented by the Washington State Coalition Against Domestic Violence, 2008. This information is not intended as a substitute for legal advice. 1 Legal Advocacy Essentials . A core training for legal advocates . Table of Contents . What is a legal advocate?

1y ago

249 Views

Legal & Corporate Services: Strategic Plan - CP6

the provision of legal advice, managing legal risk and managing the legal supply chain. By doing this well, the team will move towards its vision. Legal Services is made up of 4 teams, each serving different customers with a dedicated legal resource. This is summarised in the figure right. Although Legal Services has customerdistinct, -focussed .

1y ago

171 Views

Regulatory Guide RG 90 Example Statement of Advice: Scaled advice for a .

representatives and advisers who give personal advice to retail clients. It explains how and why we have developed an example Statement of Advice (SOA) for scaled advice (i.e. personal advice that is limited in scope) on personal insurance for a new retail client. The example SOA was developed in consultation with stakeholders, and we

1y ago

186 Views

Removal of licence disqualification - Legal Aid WA

agencies, permission must first be obtained from Legal Aid Western Australia. This Kit provides information about the law only and does not constitute legal advice. You should seek legal advice if you have a specific legal problem. Every effort is made to ensure that the information contai

2y ago

253 Views

Legal Information vs - txcourts.gov

giving legal advice. Legal advice is a written or oral statement that: Inter p rets some as ect of th elaw, courtles, or du s; Recomme nd s a pecific c ourse of ndu ters h ld k ein an actual or ntial legal proceeding; or 'sApplies th elaw to individu alperso n seci fic actu circums a . What is Legal Information?

1y ago

174 Views

Collective Framework And Performance Optimization To Open .

It looks like you're using an ad-blocker