A CUDA Implementation of the HPCG Benchmark - NVIDIA


A CUDA IMPLEMENTATION OF THE HPCG BENCHMARK
Everett Phillips, Massimiliano Fatica

OUTLINE
High Performance Conjugate Gradient Benchmark
  Motivation
  Overview
  Optimization
Performance Results
  Single GPU
  GPU Supercomputers
Conclusion

WHY HPCG?
HPL (Linpack): Top500 benchmark
  Supercomputer ranking / evaluation
  Dense linear algebra (Ax = b)
  Compute intensive: DGEMM (matrix-matrix multiply)
  O(N^3) FLOPS / O(N^2) data, 10-100 Flop/Byte
  Workload does not correlate with many modern applications

WHY HPCG?
New benchmark to supplement HPL
Common computation patterns not addressed by HPL:
  Numerical solution of PDEs
  Memory intensive
  Network

HPCG BENCHMARK
Preconditioned Conjugate Gradient algorithm
Sparse linear algebra (Ax = b), iterative solver
Bandwidth intensive: 1/6 Flop/Byte
Simple problem (sparsity pattern of matrix A)
  Simplifies matrix generation / solution validation
  Regular 3D grid, 27-point stencil
  Nx x Ny x Nz local domain / Px x Py x Pz processors
Communications: boundary (halo) exchange and global reduction
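The sparsity pattern follows directly from the regular grid: each row corresponds to one grid point and holds at most 27 nonzeros, one per point in its 3x3x3 neighborhood. A minimal sketch of how the column indices of one row could be enumerated (C/CUDA host code; the names nx, ny, nz, cols and the flattened indexing are illustrative, not taken from the reference implementation):

// Enumerate the (up to 27) column indices of the matrix row belonging to
// grid point (ix, iy, iz) in an nx x ny x nz local domain. Interior points
// get 27 nonzeros; points on the domain boundary get fewer.
int stencil_columns(int ix, int iy, int iz, int nx, int ny, int nz,
                    int *cols /* capacity >= 27 */)
{
    int nnz = 0;
    for (int dz = -1; dz <= 1; ++dz)
        for (int dy = -1; dy <= 1; ++dy)
            for (int dx = -1; dx <= 1; ++dx) {
                int jx = ix + dx, jy = iy + dy, jz = iz + dz;
                if (jx < 0 || jx >= nx || jy < 0 || jy >= ny ||
                    jz < 0 || jz >= nz)
                    continue;                            // neighbor outside the local domain
                cols[nnz++] = (jz * ny + jy) * nx + jx;  // flattened grid index = column index
            }
    return nnz;
}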

HPCG ALGORITHM
Multi-Grid preconditioner
Symmetric Gauss-Seidel smoother (SYMGS)
Sparse matrix-vector multiply (SPMV)
Dot product – MPI_Allreduce()
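For reference, these kernels plug into a standard preconditioned CG iteration. A hedged host-side sketch (the helper names MG, SpMV, dot and waxpby are placeholders for the multigrid preconditioner, sparse matrix-vector product, dot product and vector update, not the benchmark's actual routine names):

// PCG sketch: three dot products (global MPI_Allreduce reductions), one SPMV
// and one multigrid preconditioner application per iteration.
// On entry r = b - A*x; x, r, z, p, Ap are distributed vectors.
double rtz = 0.0, rtz_old = 0.0;
double normr = sqrt(dot(r, r)), normr0 = normr;
for (int it = 0; it < max_iters && normr > tolerance * normr0; ++it) {
    MG(A, r, z);                       // preconditioner: V-cycle with SYMGS smoother
    rtz = dot(r, z);                   // reduction #1
    double beta = (it == 0) ? 0.0 : rtz / rtz_old;
    waxpby(1.0, z, beta, p, p);        // p = z + beta * p
    SpMV(A, p, Ap);                    // sparse matrix-vector multiply
    double alpha = rtz / dot(p, Ap);   // reduction #2
    waxpby(1.0, x,  alpha, p,  x);     // x = x + alpha * p
    waxpby(1.0, r, -alpha, Ap, r);     // r = r - alpha * Ap
    normr = sqrt(dot(r, r));           // reduction #3: convergence check
    rtz_old = rtz;
}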

HPCG BENCHMARK
Problem setup: initialize data structures
Optimization (required to expose parallelism in the SYMGS smoother)
  Matrix analysis / reordering / data layout
  Time counted against the final performance result
Reference run: 50 iterations with the reference code; record the residual
Optimized run: converge to the reference residual
  Matrix reordering slows convergence (55-60 iterations)
  Additional iterations counted against the final performance result
Repeat to fill the target execution time (a few minutes typical, 1 hour for an official run)

HPCG SPMV (y = A x)
Exchange_Halo(x)  // neighbor communications
for row = 0 to nrows
    sum = 0
    for j = 0 to nonzeros_in_row[row]
        col = A_col[j]
        val = A_val[j]
        sum = sum + val * x[col]
    y[row] = sum
No dependencies between rows, safe to process rows in parallel
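Since the rows are independent, a natural CUDA mapping is one thread per row. A minimal sketch assuming a CSR-style layout (row_start, A_col, A_val); the actual implementation's data layout may differ:

// SPMV: y = A*x, one thread per row (CSR-style arrays assumed).
__global__ void spmv_kernel(int nrows, const int *row_start,
                            const int *A_col, const double *A_val,
                            const double *x, double *y)
{
    int row = blockIdx.x * blockDim.x + threadIdx.x;
    if (row >= nrows) return;
    double sum = 0.0;
    for (int j = row_start[row]; j < row_start[row + 1]; ++j)
        sum += A_val[j] * x[A_col[j]];   // irregular gather of x
    y[row] = sum;
}

A launch such as spmv_kernel<<<(nrows + 255) / 256, 256>>>(...) would run after the halo exchange has filled the ghost entries of x.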

HPCG SYMGS (A x = y, smooth x)
Exchange_Halo(x)  // neighbor communications
for row = 0 to nrows   (forward sweep; then backward sweep: for row = nrows to 0)
    sum = b[row]
    for j = 0 to nonzeros_in_row[row]
        col = A_col[j]
        val = A_val[j]
        if (col != row)
            sum = sum - val * x[col]
    x[row] = sum / A_diag[row]
If col < row, must wait for x[col] to be updated (dependencies between rows)

MATRIX REORDERING (COLORING)
SYMGS ordering requirement: previous rows must have new values
Reorder by color (independent rows)
2D example: 5-point stencil, red-black coloring
3D 27-point stencil: 8 colors
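Once rows are grouped by color, each sweep step can be one kernel launch per color: rows of the same color never reference each other, so they update in parallel. A sketch, assuming rows have been renumbered so that each color occupies a contiguous range (the color_start bookkeeping is illustrative):

// One SYMGS forward-sweep step for a single color. Rows [first_row, last_row)
// all share one color and are therefore mutually independent.
__global__ void symgs_color_kernel(int first_row, int last_row,
                                   const int *row_start, const int *A_col,
                                   const double *A_val, const double *A_diag,
                                   const double *b, double *x)
{
    int row = first_row + blockIdx.x * blockDim.x + threadIdx.x;
    if (row >= last_row) return;
    double sum = b[row];
    for (int j = row_start[row]; j < row_start[row + 1]; ++j) {
        int col = A_col[j];
        if (col != row)
            sum -= A_val[j] * x[col];   // neighbors have other colors: values already settled
    }
    x[row] = sum / A_diag[row];
}

// Host side: launch the colors in order for the forward sweep, then in
// reverse order for the backward sweep, e.g.
//   for (int c = 0; c < num_colors; ++c)
//       symgs_color_kernel<<<blocks(c), 256>>>(color_start[c], color_start[c + 1], ...);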

MATRIX REORDERING (COLORING)
Coloring to extract parallelism
Assignment of a "color" (integer) to vertices (rows), with no two adjacent vertices the same color
"Efficient Graph Matching and Coloring on the GPU" (Jonathan Cohen)
Luby / Jones-Plassmann based algorithm
  Compare hash of row index with neighbors
  Assign color if local extremum
  Optional: recolor to reduce the number of colors
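A sketch of one round of a hash-based (Luby / Jones-Plassmann style) coloring: every still-uncolored row hashes its index, compares against its competing neighbors, and claims the current color if it is the local maximum. The hash function and array names are illustrative; the implementation described in the talk may differ in details such as the recoloring pass.

// Simple integer mixing hash, used only to break ties pseudo-randomly.
__device__ unsigned int hash_row(unsigned int v)
{
    v ^= v >> 16; v *= 0x7feb352dU;
    v ^= v >> 15; v *= 0x846ca68bU;
    v ^= v >> 16;
    return v;
}

// One coloring round: uncolored rows whose hash beats every competing
// neighbor take `current_color` (color[] starts at -1 = uncolored).
__global__ void color_round_kernel(int nrows, const int *row_start,
                                   const int *A_col, int *color,
                                   int current_color)
{
    int row = blockIdx.x * blockDim.x + threadIdx.x;
    if (row >= nrows || color[row] != -1) return;        // already colored
    unsigned int my_hash = hash_row(row);
    for (int j = row_start[row]; j < row_start[row + 1]; ++j) {
        int col = A_col[j];
        if (col == row) continue;
        int c = color[col];
        if (c != -1 && c != current_color) continue;     // settled in an earlier round
        unsigned int h = hash_row(col);
        if (h > my_hash || (h == my_hash && col > row))  // neighbor wins the tie-break
            return;
    }
    color[row] = current_color;   // local maximum among competing neighbors
}
// Host: repeat with current_color = 0, 1, 2, ... until every row is colored.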

MORE OPTIMIZATIONS
Overlap computation with neighbor communication
Overlap 1/3 of MPI_Allreduce() with computation (one of the three dot-product reductions per iteration)
LDG loads for irregular access patterns (SPMV, SYMGS)
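LDG loads route read-only, irregularly indexed accesses through the read-only data cache. As an example, the gather of x in the SPMV and SYMGS sketches above could use the __ldg() intrinsic (available on compute capability 3.5 and later, e.g. the K20X):

// Read-only cached gather of x[col]; valid because x is not written by the SPMV kernel.
sum += A_val[j] * __ldg(&x[A_col[j]]);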

OPTIMIZATIONS
SPMV: overlap computation with communications
Without overlap, the steps run back to back:
  Gather to GPU send buffer
  Copy send buffer to CPU
  MPI send / MPI recv
  Copy recv buffer to GPU
  Launch SPMV kernel
(timeline figure: GPU and CPU activity over time)

OPTIMIZATIONS
SPMV: overlap computation with communications
  Gather to GPU send buffer
  Copy send buffer to CPU
  Launch SPMV interior kernel (runs concurrently with the halo exchange)
  MPI send / MPI recv
  Copy recv buffer to GPU
  Launch SPMV boundary kernel
(timeline figure: GPU stream A, GPU stream B, and CPU activity over time)
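A host-side sketch of the overlapped version using two CUDA streams: the interior rows need no halo data and run concurrently with the exchange, while the boundary rows wait for the received values. Kernel and buffer names are illustrative and the MPI exchange is abbreviated.

// Stream A carries the halo-exchange path, stream B runs the interior rows.
gather_send_buffer<<<gather_blocks, 256, 0, stream_a>>>(d_send_buf, d_x /*, ... */);
cudaMemcpyAsync(h_send_buf, d_send_buf, send_bytes,
                cudaMemcpyDeviceToHost, stream_a);

// Interior rows reference only local entries of x: start them right away.
spmv_interior_kernel<<<interior_blocks, 256, 0, stream_b>>>(/* interior rows */);

cudaStreamSynchronize(stream_a);             // send buffer now resident on the host
// MPI_Isend / MPI_Irecv with each neighbor, then MPI_Waitall (abbreviated)
cudaMemcpyAsync(d_x + nlocal, h_recv_buf, recv_bytes,
                cudaMemcpyHostToDevice, stream_a);   // append halo values to x

// Boundary rows need the halo, so they follow the copy on stream A.
spmv_boundary_kernel<<<boundary_blocks, 256, 0, stream_a>>>(/* boundary rows */);
cudaDeviceSynchronize();                     // both streams finished: y is complete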

RESULTS – SINGLE GPU
(single-GPU performance charts; figures not transcribed)

RESULTS – GPU SUPERCOMPUTERS
Titan @ ORNL: Cray XK7, 18688 nodes, 16-core AMD Interlagos + K20X, Gemini network (3D torus topology)
Piz Daint @ CSCS: Cray XC30, 5272 nodes, 8-core Xeon E5 + K20X, Aries network (Dragonfly topology)

RESULTS – GPU SUPERCOMPUTERS
1 GPU: 20.8 GFLOPS (ECC on); 7% iteration overhead at scale
Titan @ ORNL: 322 TFLOPS (18648 K20X), 89% efficiency (17.3 GFLOPS per GPU)
Piz Daint @ CSCS: 97 TFLOPS (5265 K20X), 97% efficiency (19.0 GFLOPS per GPU)

RESULTS – GPU SUPERCOMPUTERS
DDOT (-10%): MPI_Allreduce(), scales as log(#nodes)
MG (-2%): halo exchange (neighbor communication)
SPMV (-0%): overlapped with compute

SUPERCOMPUTER COMPARISON

CONCLUSIONS
GPUs proven effective for HPL, especially for power efficiency (high flop rate)
GPUs also very effective for HPCG (high memory bandwidth)
Stacked memory will give a huge boost
Future work will add CPU + GPU execution

ACKNOWLEDGMENTS
Oak Ridge Leadership Computing Facility (ORNL): Buddy Bland, Jack Wells, and Don Maxwell
Swiss National Supercomputing Center (CSCS): Gilles Fourestey and Thomas Schulthess
NVIDIA: Lung Sheng Chien and Jonathan Cohen
