PYTHON FOR HPC: BEST PRACTICES

3y ago

99 Views

13 Downloads

2.09 MB

26 Pages

Last View : 21d ago

Last Download : 3m ago

Upload by : Laura Ramon

Report this link

Download PDF

Transcription

PYTHON FOR HPC:BEST PRACTICESdrhgfdjhngngfmhgmghmghjmghfmfWILLIAM SCULLINAssistant Computational ScientistLeadership Computing FacilityArgonne National LaboratoryMay 4th, 2017ALCF Computational Performance Workshop

“PEOPLE ARE DOING HIGH PERFORMANCECOMPUTING WITH PYTHON. HOW DO WE STOPTHEM?”- SENIOR PERFORMANCE ENGINEER

WHY THIS TALK?§ Python is popular§ It’s becoming the de facto languagefor data science§ It’s behind a large number ofscientific workflows§ It’s not uncommon for prototypingor even implementing productionsoftware§ We tend to make a lot of mistages

WHY PYTHON?§ If you like a programming paradigm, it’s supported§ Most functions map to what you know already§ Easy to combine with other languages§ Easy to keep code readable and maintainable§ Lets you do just about anything without changing languages§ The price is right - no license management§ Code portability§ Fully Open Source§ Very low learning curve§ Commercial support options are available§ Comes with a highly enthusiastic and helpful community

WHY NOT PYTHON?§ Performance is often a secondary concern for developers and distributions– Most developers aren’t in HPC environments– Most developers aren’t in science environments– Many tools were designed to work best in generic environments§ Language maintainers favor consistency over compatibility– Backwards compatibility is seldom guaranteed§ Low learning curve– It’s easy to develop a code base that works, but won’t scale

PYTHON 2 OR 3?Python was originally developed as a system scripting language for the Amoeba distributed operating systemand has been developing ever since, with many backwards-incompatible changes made in the name of progresswithout too much delay on adoption. However, the changes from Python 2 to Python 3 were sufficiently radicalthat adoption has been slow going. That said:§§§§Python 3 is the future – and the future is hereAll major libraries now work under Python 3.5 Almost all popular tools work with Python 3.5 Python 3’s loader and more of the interpreter’s internals are written in Python§ This makes loading more I/O intensive which presents challenges for scaling§ It also makes it easier to write alternative interpreters that can be faster than CPython6

WHERE DO WE WANT TO SPEND OUR TIME?Share of execution time7

HOW DOES CPYTHON WORK?

THREADS AND PYTHON: A WORD ON THE GILTo keep memory coherent, Python only allows a single thread to run in the interpreter's memory space atonce. This is enforced by the Global Interpreter Lock, or GIL.The GIL isn’t all bad. It:§ Is mostly sidestepped for I/O (files and sockets)§ Makes writing modules in C much easier§ Makes maintaining the interpreter much easier§ Makes for any easy topic of conversation§ Encourages the development of other paradigms for parallelism§ Is almost entirely irrelevant in the HPC space as it neither impacts MPI or threading within compiledmodulesFor the gory details, see David Beazley's talk on theGIL: https://www.youtube.com/watch?v fwzPF2JLoeU

NUMPY AND SCIPYNumPy should almost always be your first stop for performanceimprovement. It provides:§§§§§N-dimensional homogeneous arrays (ndarray)Universal functions (ufunc)built-in linear algebra, FFT, PRNGsTools for integrating with C/C /FortranHeavy lifting done by optimized C/Fortran libraries such as Intel’s MKLor IBM’s ESSLSciPy extends NumPy with common scientific computing tools§ optimization§ additional linear algebra§ integration§ interpolation§ FFT§ signal and image processing§ ODE solversProblems arise when NumPy isn’t well built

NUMPY AND SCIPYOptimized and built with MKL via SpackInstalled via pipThe test on a KNL system: import timeit sum([timeit.timeit('import numpy as np; )') for i in 3655s

A WORD FROM OUR SPONSORS: CANNED PYTHONAt this point in history, there are few reasons for the average user to manually cobble together a Python stackfor themselves on an x86 64 system. All options are relatively equivalent with unique advantages anddisadvantages to weigh.We will be making two options available on Theta:§ The Intel Python distribution§ Optimized builds of Python built with LLNL/Spack via modulesYou may also wish to consider a commercial distribution:§ Continuum Analytics Anaconda§ Enthought CanopyBoth Intel Python and Continuum Analytics Anaconda build on the Conda package and environmentmanager. Enthought Canopy relies on virtualenv for environment management.Think of Conda as being like rpm or deb packages – easy to install binary packages, though managingdependencies becomes potentially problematic.Think of LLNL/Spack virtualenv as being like BSD or MacPorts – highly customizable, highly transparent, butpotentially a lot of time spent compiling.

WHY MPI?o It is (still) the HPC paradigm for inter-process communicationsSupported by every HPC center and vendor on the planetAPIs are stable, standardized, and portable across platforms and languagesWe’ll still be using it in 10 years o It makes full use of HPC interconnects and hardware Abstracts aspects of the network that may be very system specific Dask, Spark, Hadoop, and Protocol Buffers use sockets or files! Vendors generally optimize MPI for their hardware and softwareo Well-supported tools for development – even for Python Debuggers now handle mixed language applications Profilers are treating Python as a first-class citizen Many parallel solver packages have well-developed Python interfaceso Folks have been writing Python MPI bindings since at least 1996 David Beazley may have started this Other contenders: Pypar (Ole Nielsen), pyMPI (Patrick Miller, et al), Pydusa ( Timothy H. Kaiser), and Boost MPI Python (Andreas Klöckner and Doug Gregor)The community has mostly settled on mpi4py by Lisandro Dalcin15

A BOTTLENECK AT THE START: LOADING PYTHONWhen working in diskless environments or from shared file systems, keeptrack of how much time is spent in startup and module file loading. Parallelfile systems are generally optimized for large, sequential reads and writes.NFS generally serializes metadata transactions. This load time can havesubstantial impact on total runtimes.

MPI4PYPythonic wrapping of the system’s native MPIprovides almost all MPI-1,2 and common MPI-3 featuresvery well maintaineddistributed with major Python distributionsportable and scalable§ requires only: NumPy, Cython, and an MPI§ used to run a python application on 786,432 cores§ capabilities only limited by the system MPI§ http://mpi4py.readthedocs.io/en/stable/§§§§§

HOW MPI4PY WORKS.mpi4py jobs are launched like other MPI binaries:§ mpiexec –np {RANKS} python {PATH TO SCRIPT}§ an independent Python interpreter launches per rank§ no automatic shared memory, files, or state§ crashing an interpreter does crash the MPI program§ it is possible to embed an interpreter in a C/C program and launch aninterpreter that way§ if you crash or have trouble with simple codes§ CPython is a C binary and mpi4py is a binding§ you will likely get core files and mangled stack traces§ use ld or otool to check which MPI mpi4py is linked against§ ensure Python, mpi4py, and your code are available on all nodes andlibraries and paths are correct§ try running with a single rank§ rebuild with debugging symbols§

MPI4PY STARTUP AND SHUTDOWNImporting and MPI initialization§ importing mpi4py allows you to set runtime configuration options (e.g. automaticinitialization, thread level) via mpi4py.rc()§ by default importing the MPI submodule calls MPI Init()§ calling Init()or Init thread()more than once violates the MPI standard§ This will lead to a Python exception or an abort in C/C § use Is initialized() to test for initialization§MPI Finalize() will automatically run at interpreter exit§ there is generally no need to ever call Finalize()§ use Is finalized() to test for finalization if uncertain§ calling Finalize() more than once exits the interpreter with an error and may crashC/C /Fortran modules§

MPI4PY AND PROGRAM STRUCTUREAny code, even if after MPI.Init(), unless reserved to a given rankwill run on all ranks:from mpi4py import MPIcomm MPI.COMM WORLDrank comm.Get rank()mpisize comm.Get size()if rank%2 0:print(“Hello from an even rank: %d” %(rank))comm.Barrier()print(“Goodbye from rank %d” %(rank))

MPI4PY AND DATATYPESPython objects, unless they conform to a C data type, are pickled§ pickling and unpickling have significant compute overhead§ overhead impacts both senders and receivers§ pickling may also increase the memory size of an object§ use the lowercase methods, eg: recv(),send()§ Picklable Python objects include:§None, True, and False§ integers, long integers, floating point numbers, complex numbers§ normal and Unicode strings§ tuples, lists, sets, and dictionaries containing only picklable objects§ functions defined at the top level of a module§ built-in functions and classes defined at the top level of a module§ instances of such classes whose dict () or the result ofcalling getstate () is picklable§

MPI4PY AND DATATYPESBuffers, MPI datatypes, and NumPy objects aren’t pickled§ transmitted near the speed of C/C § NumPy datatypes are autoconverted to MPI datatypes§ buffers may need to be described as a 2/3-list/tuple§[data, MPI.DOUBLE] for a single double§ [data,count,MPI.INT] for an array of integers§ custom MPI datatypes are still possible§ use the capitalized methods, eg: Recv(), Send()§ When in doubt, ask if what is being processed can be represented as memorybuffer or only as PyObject§

MPI4PY: COLLECTIVES AND OPERATIONSCollectives operating on Python objects are naiveFor the most part collective reduction operations on Pythonobjects are serial§ Casing convention applies to methods:§ lowercased methods will work for general Python objects(albeit slowly)§ uppercase methods will work for NumPy/MPI data types atnear C speed§§

MPI4PY: PARALLEL I/OAll 30-something MPI-2 methods are supportedconventional Python I/O is not MPI safe!§ safe to read files, though there might be locking issues§ write a separate file per rank if you must use Python I/O§ h5py 2.2.0 and later support parallel I/O§ hdf5 must be built with parallel support§ make sure your hdf5 matches your MPI§ h5pcc must be present§ check things with: h5pcc -showconfig§ hdf5 and h5py from Anaconda are serial!§ anything which modifies the structure or metadata of a file must be donecollectively§ Generally as simple as:§§f h5py.File('parallel test.hdf5', 'w',driver 'mpio', comm MPI.COMM WORLD)

ENUMERATED ADMONISHMENTSBenchmark as you developProfileAsk if you can do an operation with NumPy or SciPyNever mix forking and threading – ie: Python multiprocessingCheck the build configurations of your important Python modulesBeware of thread affinity:aprun -n -N . –e KMP AFFINITY none -d . -j .7. Watch your data types8. Avoid Python threading9. Watch startup times carefully10. Google – someone else has likely already implemented the solutionyou seek11. Python distutils is always the wrong answer1.2.3.4.5.6.

ScriptCPythonPypySerial / 1Rank8 RanksSerial / 1Rank8 Ranks3.6770741.0657560.3136900.127450builtins pyobj mpi pi 4.0160201.0920050.3046630.110477numba mpi pi0.4163540.424889n/an/anumpy mpi /a0.344480n/abuiltins mpi pi

§ Python 3 is the future –and the future is here § All major libraries now work under Python 3.5 § Almost all popular tools work with Python 3.5 § Python 3’s loader and more of the interpreter’s internals are written in Python § This makes loading more I/O intensive which presents challenges for scaling

Related Documents:

Big Data Welcome - PSC

XSEDE HPC Monthly Workshop Schedule January 21 HPC Monthly Workshop: OpenMP February 19-20 HPC Monthly Workshop: Big Data March 3 HPC Monthly Workshop: OpenACC April 7-8 HPC Monthly Workshop: Big Data May 5-6 HPC Monthly Workshop: MPI June 2-5 Summer Boot Camp August 4-5 HPC Monthly Workshop: Big Data September 1-2 HPC Monthly Workshop: MPI October 6-7 HPC Monthly Workshop: Big Data

44 Views

1y ago

Python Programming for the Absolute Beginner Second Edition

Python Programming for the Absolute Beginner Second Edition. CONTENTS CHAPTER 1 GETTING STARTED: THE GAME OVER PROGRAM 1 Examining the Game Over Program 2 Introducing Python 3 Python Is Easy to Use 3 Python Is Powerful 3 Python Is Object Oriented 4 Python Is a "Glue" Language 4 Python Runs Everywhere 4 Python Has a Strong Community 4 Python Is Free and Open Source 5 Setting Up Python on .

669 Views

3y ago

Learning Python - سیّد صالح اعتمادی

Python 2 versus Python 3 - the great debate Installing Python Setting up the Python interpreter About virtualenv Your first virtual environment Your friend, the console How you can run a Python program Running Python scripts Running the Python interactive shell Running Python as a service Running Python as a GUI application How is Python code .

56 Views

1y ago

The Quick Python Book, Second Edition - Shen@uiuc

Python is readable 5 Python is complete—"batteries included" 6 Python is cross-platform 6 Python is free 6 1.3 What Python doesn't do as well 7 Python is not the fastest language 7 Python doesn't have the most libraries 8 Python doesn't check variable types at compile time 8 1.4 Why learn Python 3? 8 1.5 Summary 9

63 Views

1y ago

Bruksanvisning för bilstereo Bruksanvisning for bilstereo ... - Jula

Bruksanvisning för bilstereo . Bruksanvisning for bilstereo . Instrukcja obsługi samochodowego odtwarzacza stereo . Operating Instructions for Car Stereo . 610-104 . SV . Bruksanvisning i original

374 Views

1y ago

Python (2nd Edition): Learn Python in One Day and Learn It Well. Python ...

site "Python 2.x is legacy, Python 3.x is the present and future of the language". In addition, "Python 3 eliminates many quirks that can unnecessarily trip up beginning programmers". However, note that Python 2 is currently still rather widely used. Python 2 and 3 are about 90% similar. Hence if you learn Python 3, you will likely

33 Views

1y ago

Python (Quick Review)

There are currently two versions of Python in use; Python 2 and Python 3. Python 3 is not backward compatible with Python 2. A lot of the imported modules were only available in Python 2 for quite some time, leading to a slow adoption of Python 3. However, this not really an issue anymore. Support for Python 2 will end in 2020.

11 Views

3m ago

The Queen s Awards for Enterprise 2020 Press Book

The Queen’s Awards for Enterprise are the most prestigious awards for UK business, designed to recognise and encour-age outstanding achievements in the fields of Innovation, International Trade, Sustainable Development and Promoting Opportunity (through social mobility). The Queen’s Awards scheme was instituted by Royal Warrant in 1965 and the first Awards were made in 1966. This year, 220 .

44 Views

3y ago

Recent Views

Rocket Lawyer Legal Benefits Plan Summary - avinc

3) You can also hire a lawyer for ongoing representation. Rocket Lawyer On Call attorneys generally offer Rocket Lawyer members 40% off their normal rate. Alternatively, you can speak to your Rocket Lawyer customer representative or call us at (877) 881-0947. We'll contact you within one business day and connect you to a local lawyer.

1y ago

135 Views

Rocket Lawyer Legal Benefits Plan Summary

1y ago

155 Views

Gorilla exceptions and the ethically apathetic corporate lawyer

Here, Dare sees the lawyer as the instrument of the institution of law. Postema disagrees: 'The lawyer must recognise that the institution acts only through the voluntary activities of the lawyer and client. The lawyer is not the instrument of the institution, rather the institution is the instrument of the client and the client engages the .

1y ago

117 Views

ALABAMA Alabama Legal Help Alabama Lawyer Referral Service How Does the .

The Lawyer Referral Service does not have free attorneys. How Does the Lawyer Referral Service Work? When you call the Lawyer Referral Service's toll free number (1-800-392-5660) you will be asked to briefly state your problem. All information will be held in the strictest confidence. After listening to your problem, the Lawyer Referral .

4m ago

61 Views

TEXAS DISCIPLINARY RULES OF PROFESSIONAL CONDUCT VII .

the lawyer’s letterhead, business cards, office sign, fee contracts, and with the lawyer’s signature on pleadings and other legal documents. (f) A lawyer shall not use a firm name, letterhead, or other professional designation that violates Rule 7.02(a). Comment 1. A lawyer or law firm may not practice law using a name that is misleading as .File Size: 137KB

2y ago

147 Views

RISK MANAGEMENT FOR LEGAL SUPPORT STAFF

A lawyer’s letterhead or a business card may include the name of a non-lawyer assistant if the assistant’s ca-pacity is clearly indicated and the document is otherwise neither false nor misleading. 10. A lawyer may use a non-lawyer, non-employee f

2y ago

181 Views

Hiring and Working with an Attorney

questions and how well the lawyer listens to you. You also want to pay attention to how easy it is to understand the lawyer's explanation of your legal problem, and how you feel about the lawyer's abilities. During the interview, you may ask questions about the lawyer's background, qualifi-cations and experience, such as:

1y ago

139 Views

A Guide to Setting Up and Using Your Lawyer Trust Account

The ethical obligations for those who set up lawyer trust accounts are rooted in the principle that a lawyer who holds funds of a client or third person in trust, even for a . those assets from the lawyer's personal and business assets. Oregon Rules of Professional Conduct (ORPC) 1.15-1 and ORPC 1.15-2 set forth the ethical duties and

1y ago

118 Views

Ethics and the Virtual Practice of Law

An office or a law firm may be "virtual," but the lawyer is in a specific location. Must the lawyer be licensed to practice in the jurisdiction where the lawyer is physically located? Must the lawyer be licensed in the jurisdiction where the client and the client's matter is located? 8. E THICS & V IRTUAL P RACTICE Relevant Ethics Rules .

1y ago

116 Views

Enron and the Corporate Lawyer: A Primer on Legal and Ethical Issues

144 The Business Lawyer; Vol. 58, November 2002 responsibilities when the lawyer learns, or has reason to know, that officers or other agents of the lawyer's corporate client are engaged in conduct that violates the law or their fiduciary duty to the corporation and is likely to result in harm

1y ago

118 Views

LegalZoom Inc v. Rocket Lawyer Incorporated Doc. 11

LegalZoom.com Inc v. Rocket Lawyer Incorporated Doc. 11 Dockets.Justia.com. EXHIBIT 1 EXHIBIT 1. EXHIBIT 1 -21-EXHIBIT 2 EXHIBIT 2. EXHIBIT 2 -22-EXHIBIT 3 EXHIBIT 3. rocketlaywer incorporate-Google Search https://www.google.coml . Free Legal Documents & Legal Forms I Find a Lawyer I Rocket Lawyer

1y ago

109 Views

LegalZoom Inc v. Rocket Lawyer Incorporated Doc. 17

LegalZoom.com Inc v. Rocket Lawyer Incorporated Doc. 17 Dockets.Justia.com. EXHIBIT 1 EXHIBIT 1. EXHIBIT 1 -23-EXHIBIT 2 EXHIBIT 2. EXHIBIT 2 -24-EXHIBIT 3 EXHIBIT 3. rocketlaywer incorporate-Google Search https://www.google.coml . Free Legal Documents & Legal Forms I Find a Lawyer I Rocket Lawyer

1y ago

107 Views

iLJ -- 2017) -, Issued by ACPE, CAA, & UPL June 21, 2017 ADVISORY .

enforcing a legal document (called "document defense"). Users also receive a "free" 3Ominute consultation with a lawyer, and can use the "ask a lawyer" section of its website for legal advice. Participating lawyers do not pay Rocket Lawyer but agree to offer a discounted fee for additional services; Rocket Lawyer retains the monthly subscription fees. The Committees find that the .

1y ago

102 Views

ETHICAL ISSUES IN CLASS ACTIONS . - Parker Mills LLP

DAVID B. PARKER is a trial lawyer and founder of Parker Mills LLP in Los Angeles. His practice is focused on commercial, professional liability and insurance litigation. Often described as a "lawyer’s lawyer," his practice extends to counseling and litigation in legal ethics and disputes between and among lawyers.

3y ago

184 Views

10TH ANNUAL PERFORMANCE REPORT OF THE NATIONAL PRO BONO .

Target. 10TH annual PERFORMANCE REPORT oF The National Pro Bono Aspirational Target PERFORMANCE OF TARGET SIGNATORIES AT LEAST 35 HOURS OF “PRO BONO LEGA L SERVICE S” PER LAWYER PER YEAR 48.6% 35.7 pro bono hours per lawyer across Target Signatories3. Down from 36.0 pro bono hours per lawyer in FY2016 of Signatories met or exceeded the .

3y ago

159 Views

PYTHON FOR HPC: BEST PRACTICES

It looks like you're using an ad-blocker