Introduction To GPU Computing - Boston University

2y ago

43 Views

3 Downloads

5.14 MB

21 Pages

Last View : 9d ago

Last Download : 3m ago

Upload by : Joanna Keil

Report this link

Download PDF

Transcription

NagasakiUniversityIntroduction to GPUcomputingFelipe A. CruzNagasaki Advanced Computing CenterNagasaki University, Japan

NagasakiUniversityThe GPU evolution The Graphic Processing Unit (GPU) is a processor that was specialized forprocessing graphics. The GPU has recently evolved towards a more flexible architecture. Opportunity: We can implement *any algorithm*, not only graphics. Challenge: obtain efficiency and high performance.Tesla cardFelipe A. CruzTesla S1070: 4 cards

NagasakiUniversityContext:performance!Felipe A. CruzFelipe A. Cruz

NagasakiUniversityPaths to performanceFelipe A. Cruz

NagasakiUniversityPaths to performanceFasterprocessorsImportant method for improving performance.Processors speed improved from 1 MHz (1980s) to over 4GHz (2000s)Over 1,000 fold faster clock rates in 30 years!Felipe A. Cruz

NagasakiUniversityPaths to performanceFasterprocessorsMemory wall: large gap between memory and processor speed.Power wall: higher clock speeds require exponential power increase.Felipe A. Cruz

NagasakiUniversityPaths to s with many components that may execute in parallel.Benefit from aggregated performance.Felipe A. Cruz

NagasakiUniversityPaths to performance There are two ways to obtain a fastcomputing system: Fast processor (CPU) Use concurrency (GPU) For both, CPU and GPU, serialperformance is not increasing. Multicore and GPU performance isbased on concurrency. GPU many times *faster* than CPU!Felipe A. Cruz

NagasakiUniversityAfter thebuzz.Felipe A. CruzFelipe A. Cruz

NagasakiUniversityWhy accelerators? Accelerators have been around since the 70s! but they were very expensive. GPUs are commodity hardware, powerful, and cheap. New GPU generation are released every two years. Uni-processor speed is not doubling every two years anymore! GPUs are competitive alternatives in the search for performance.Felipe A. Cruz

NagasakiUniversityCan *I* get 100x? *Some* algorithms can get such speedups. It mostly depends on the non-parallel part (the one that does not accelerate witha GPU!). Complex applications make use of many algorithms! Redesign your application to be parallel-friendly is most important! The future is parallel. Let’s get ready!Felipe A. Cruz

NagasakiUniversityGPGPU general purpose? Technically: yes, you can program anything! Practically: we want performance. GPUs are specialized hardware: not flexible (as a CPU) CPU GPU combination of flexible and performance.Felipe A. Cruz

NagasakiUniversityUnderstandingthe GPUFelipe A. CruzFelipe A. Cruz

NagasakiUniversityWhat is a GPU? Graphics Processing Unit (GPU) Most computers have one Simple and regular design! Billions of transistors Performance: Felipe A. Cruz 1 Teraflop (Single precision) 500 Gflops (Double precision)Also: A heater for winter!

NagasakiUniversityHow it works:Many scootersSport carFelipe A. Cruz

NagasakiUniversityHow it works:Many scootersSport carDeliver many packageswithin a reasonable timescaleFelipe A. CruzDeliver a package assoon as possible

NagasakiUniversityHow it works:High throughputandreasonable latencyLow latencyandreasonable throughputCompute many jobswithin a reasonable timescaleCompute a job asfast as possibleFelipe A. Cruz

NagasakiUniversityWhat are “GPU cores”? Not the same than a CPU core! Basic component on a GPU is a ‘stream processor’, or SIMD processor(Single Instruction Multiple Data)SIMD: Same instruction for all cores! but each can operate over different data!Felipe A. Cruz

NagasakiUniversityGPU architectureSketch of the GPU architecture:The GPU spec: 1 Teraflop single precision .5 Teraflop double precision Shared memory: 16 KB to 48 KB L1 cache: 16 KB to 48 KB *1 Teraflop 10 12 floating point operationsObtain performance by concurrency!MIMD: Independent stream-processors.Felipe A. Cruz

NagasakiUniversityGPU issues Massively parallel algorithm needed! Balance computation and data movement! high concurrency and simple data patternCore is much faster than data-pathKeep the GPU busy!Latency of data movement! Felipe A. CruzMulti-threading

NagasakiUniversityObtaining performance Design algorithms for parallelism: fine grained and coarse grained!Map algorithm to the architecture: Felipe A. CruzSIMD at stream processor.MIMD at GPU chip.Consider memory hierarchy.Others: heterogeneous, distributed systems.

Introduction to GPU computing Felipe A. Cruz Nagasaki Advanced Computing Center Nagasaki University, Japan. Felipe A. Cruz Nagasaki University The GPU evolution The Graphic Processing Unit (GPU) is a processor that was specialized for processing graphics. The GPU has recently evolved towards a more ﬂexible architecture.

Related Documents:

THE GPU COMPUTING ERA - University of Wisconsin-Madison

the gpu computing era gpu computing is at a tipping point, becoming more widely used in demanding consumer applications and high-performance computing.this article describes the rapid evolution of gpu architectures—from graphics processors to massively parallel many-core multiprocessors, recent developments in gpu computing architectures, and how the enthusiastic

12 Views

1y ago

GPU Tutorial 1: Introduction to GPU Computing

GPU Tutorial 1: Introduction to GPU Computing Summary This tutorial introduces the concept of GPU computation. CUDA is employed as a framework for this, but the principles map to any vendor’s hardware. We provide an overview of GPU computation, its origins and development, before presenting both the CUDA hardware and software APIs. New Concepts

43 Views

2y ago

OpenCV on a GPU

OpenCV GPU header file Upload image from CPU to GPU memory Allocate a temp output image on the GPU Process images on the GPU Process images on the GPU Download image from GPU to CPU mem OpenCV CUDA example #include opencv2/opencv.hpp #include <

154 Views

2y ago

An Introduction to GPU Computing[2] (Read-Only)

GPU Computing in Matlab u Included in the Parallel Computing Toolbox. u Extremely easy to use. To create a variable that can be processed using the GPU, use the gpuArray function. u This function transfers the storage location of the argument to the GPU. Any functions which use this argument will then be computed by the GPU.

39 Views

2y ago

Take GPU processing power beyond graphics with GPU ...

limitation, GPU implementers made the pixel processor in the GPU programmable (via small programs called shaders). Over time, to handle increasing shader complexity, the GPU processing elements were redesigned to support more generalized mathematical, logic and flow control operations. Enabling GPU Computing: Introduction to OpenCL

65 Views

2y ago

Introduction to GPU computing for statisticicans

Will Landau (Iowa State University) Introduction to GPU computing for statisticicans September 16, 2013 20 / 32. Introduction to GPU computing for statisticicans Will Landau GPUs, parallelism, and why we care CUDA and our CUDA systems GPU computing with R CUDA and our CUDA systems Logging in

34 Views

2y ago

GPU Computing Advances in 3D Electromagnetic Simulation

Latest developments in GPU acceleration for 3D Full Wave Electromagnetic simulation. Current and future GPU developments at CST; detailed simulation results. Keywords: gpu acceleration; 3d full wave electromagnetic simulation, cst studio suite, mpi-gpu, gpu technology confere

32 Views

2y ago

GCSE English Language Paper 1 Revision

be looking at him through this square, lighted window of glazed paper. As if to protect himself from her. As if to protect her. In his outstretched, protecting hand there’s the stub end of a cigarette. She retrieves the brown envelope when she’s alone, and slides the photo out from among the newspaper clippings. She lies it flat on the table and stares down into it, as if she’s peering .

52 Views

3y ago

Recent Views

TENTH EDITION self-therapy for the stutterer

Stuttering Foundation of America self-therapy for the stutterer TENTH EDITION THE STUTTERING FOUNDATION PUBLICATION NO. 0012 self-therapy for the stutterer Publication No. 0012 First Edition—1978 Tenth Edition—2002 Revised Tenth Edition—2007 Published by Stuttering Foundation of America 3100 Walnut Grove Road, Suite 603 P.O. Box 11749 Memphis, Tennessee 38111-0749 Library of Congress .

3y ago

40 Views

Supply Chain Management: An International Journal

The organization is a partner of the Committee on Publication Ethics (COPE) and also works with Portico and the LOCKSS initiative for digital archive preservation. *Related content and download information correct at time of download. Downloaded by University of Nottingham At 06:12 31 October 2018 (PT) Modern slavery challenges to supply chain management Stefan Gold International Centre for .

3y ago

29 Views

Operation London Bridge - Fremington Parish Council

OPERATION LONDON BRIDGE . 1 CONTENTS Page 2 – 1. Introduction Page 3 – 2. Protocol Page 3 – 2.1 Implementation of Protocol Page 3 – 3. Flag Flying Page 3 – 4. Proclamation Day Schedule Page 4 – 4.1 Proclamation Day Page 4 – 4.2 Proclamation Day Protocol Page 5 – 5. Books of Condolence Page 6 – 5.1 Online Book of Condolence Page 6 – 6. Events During the Period of Mourning .

3y ago

62 Views

A CONTINUUM OF QUALITY: ON FIRE

ASTM D 5132 BSS 7230 MODEL 701-S MODEL 701-S-X (export) MODEL VC-1 MODEL VC-1-X (export) MODEL VC-2 MODEL VC-2-X (export) MODEL HC-1 MODEL HC-1-X (export) MODEL HC-2 MODEL HC-2-X (export) FAA Listed TM. FAA MULTI-PURPOSE SMALL SCALE FLAMMABILITY TESTER SPECIFICATIONS: FAR Part 25 Appendix F Part I (Vertical, Horizontal, 45 and 60 ) DRAPERY FLAMMABILITY The most widely cited .

3y ago

80 Views

Combustion Analysis of Nanoenergetic Materials

Osci 1 05 10 15 P a [MPa] Acc Osci. NEEM MURI Temperature Measurements for understanding Gas Generation Previous work: gas fraction at equilibrium Drawbacks: No intermediate gases (not present at equilibrium) nAl/MoO 3 30 Many of the equilibrium gases will not be realized until very high temperatures (ex. Cu: BP of 2835K) nAl/CuO in burn tube at 10 20 e ssure [MPa] 1atm in air nAl/MoO .

3y ago

37 Views

Wiring and testing electrical equipment and circuits

circuits to occur, strain on terminations, insufficient slack cable at terminations, continuity and polarity checks, insulation checks) K21 the care, handling and application of electrical test and measuring instruments (such as multimeter, insulation resistance tester, loop impedance test instruments) K22 applying approved test procedures; the safe working practices and procedures required .

3y ago

46 Views

GRID DIP METER DESIGN - makearadio

circuits). 2. Rough frequency and harmonic measurements 3. AM signal monitor receiver. 4. Simple RF signal generator including AM modulation if required. 5. Crystal Testing. 6. Use as a BFO for SSB and CW reception 7. Measurement of unknown capacitors and inductors I decided to include some extra features above the normal in functionality RF output from the oscillator enabling use of an .

3y ago

208 Views

OPHTHALMOLOGY GOALS AND OBJECTIVES

The objectives of Ophthalmology Residency Program are to: 1. Provide residents with a strong scientific understanding of the fundamentals of ophthalmology through a combination of mentoring and didactic education. 2. Provide residents with clinical skills in all subspecialties of ophthalmology. 3.

3y ago

60 Views

History of Computers

An analog computer does not store information digitally Values are stored as voltage levels Analog computers are particularly useful solving nonlinear simultaneous differential equations An electric circuit can be defined by an equation. An analog computer is programmed by creating a circuit that follows a desired equation.

3y ago

37 Views

Risk Management and Corporate Governance - OECD

Corporate Governance Risk Management and Corporate Governance Volume 2011/Number of issue,Year of edition Author (affiliation or title), Editor Tagline Groupe de travail/Programme (ligne avec top à 220 mm)

3y ago

66 Views

RF Design and Test Using MATLAB and NI Tools

RF Design and Test Using MATLAB and NI Tools . Antenna array, RF, and digital signal processing cannot be designed separately! – Large communication bandwidth digital signal processing is challenging – High-throughput DSP linearity requirements imposed over large bandwidth

3y ago

87 Views

Digital Signal Processing - Webspaces - Accueil

J.-P. Delmas et al. / Digital Signal Processing 95 (2019) 102579. lower far-ﬁeld DOA CRB. Furthermore, thanks to the decoupling be-tween the DOA and range parameters to the second-order w.r.t. the inverse of the range in the Fisher information matrix, the deriva-tion of closed-form approximate expressions of the CRB is greatly simpliﬁed.

3y ago

23 Views

History of U.S. Children’s Policy, 1900-Present

Social dislocations of the late 19th century, sparked by rapid industrialization, population growth, urbanization, and immigration, together with the economic crises of the late 1870s and 1890s, led to social reform movements in the 1890s and during the Progressive Era at the beginning of the 20th century. With respect to children, many reformers

3y ago

53 Views

EDUKASYONG PANGKATAWAN 5 Lesson Exemplars Karapatang Ari .

nakasaad sa ilalim ng makabagong kurikulum, ang K to 12 Currriculum. Layunin nito na mabigyan ng sapat na kaalaman at pagpapahalaga sa mga gawaing may kinalaman sa pagpapaunlad ng pangangatawan. Sa paghahanda ng mga aralin na nakapaloob sa exemplar na ito, isinasaalang-alang ang mga sumusunod na pangunahing kaisipan:

3y ago

99 Views

ELECTRICAL ENGINEERING GRADUATE

Electrical Engineering, or is not equivalent to the BSEE degree offered by Cal State LA, we may require you to complete certain prerequisite courses before being admitted to our program. These will normally be 300level courses, though the list mig0- ht contain a number of 2 or 400000-0-

3y ago

30 Views

Introduction To GPU Computing - Boston University

It looks like you're using an ad-blocker