OPTIMIZING BIG DATA ANALYTICS ON HETEROGENEOUS PROCESSORS

MAYANK DAGA, MAURICIO BRETERNITZ, JUNLI GU
AMD RESEARCH

HETEROGENEOUS PROCESSORS - EVERYWHERE
SMARTPHONES TO SUPER-COMPUTERS
[Figure: phone, tablet, notebook, workstation, dense server, supercomputer. From Phil Rogers' APU13 keynote]
OPTIMIZING BIG DATA ANALYTICS ON HETEROGENEOUS PROCESSORS | OCTOBER 23, 2015 | 2015 IEEE BIG DATA | SANTA CLARA, CA, USA

35 YEARS OF MICROPROCESSOR TREND DATA
[Figure: microprocessor trend data; annotation: homogeneous processors]

IMPORTANT SHIFTS
[Figure: timeline marking 2004 - 2005, 2007 - 2008, and 2010 - 2011; labels: GPU, CPU]
The era of heterogeneous computing is here!

CENTRAL PROCESSING UNIT (CPU)
Few BIG cores
Ideal for scalar and control-intensive parts of the application

GRAPHICS PROCESSING UNIT (GPU)
Lots of small cores
Resides over the PCIe bus
Ideal for data-parallel parts of the application

ACCELERATED PROCESSING UNIT (APU)
[Figure: APU die with a dual x86 CPU module (serial) and a GPU (data-parallel)]
A heterogeneous platform with CPU + GPU on the same silicon die
Ideal for both serial and data-parallel parts of the application
TDP of less than 100 watts

HOW IS AN APU DIFFERENT
Discrete GPU
[Figure: system memory, CPU, PCIe link, GPU; labels: 16 GB/s (PCIe), 300-500 GB/s]
  ‒ Manage data movement across PCIe
  ‒ Use local scratchpad memory (cache)
Accelerated Processing Unit (APU)
[Figure: APU with LDS and system memory]
  ‒ No data-movement overhead
  ‒ Programming is similar to a discrete GPU, but simpler
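To give a feel for the data-movement cost a discrete GPU pays, here is a small back-of-the-envelope sketch. Only the 16 GB/s PCIe figure comes from the slide; the 4 GB payload, the assumption that results are as large as inputs, and the function name are illustrative, and real transfers rarely reach peak bandwidth.

```python
def transfer_seconds(num_bytes, bandwidth_gb_per_s):
    """Ideal time to move num_bytes over a link (1 GB = 1e9 bytes, peak rate assumed)."""
    return num_bytes / (bandwidth_gb_per_s * 1e9)

PCIE_GB_PER_S = 16.0   # PCIe bandwidth quoted on the slide
payload_bytes = 4e9    # hypothetical 4 GB working set (assumption)

one_way = transfer_seconds(payload_bytes, PCIE_GB_PER_S)
round_trip = 2 * one_way  # assumes the results are as large as the inputs

print(f"host to GPU: {one_way:.2f} s, round trip: {round_trip:.2f} s")
```

On an APU the CPU and GPU share system memory, so this copy cost does not appear in the first place.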

MODERN WORKLOADS ARE HETEROGENEOUS
SCALAR CONTENT WITH A GROWING MIX OF PARALLEL CONTENT
Video is expected to represent two thirds of mobile data traffic by 2017
  ‒ Video processing is inherently parallel and can be accelerated
Big data is growing exponentially, with exabytes of data crawled monthly
  ‒ MapReduce is a heterogeneous workload
Rapid growth of sensor networks
  ‒ Drives an exponential increase in data
The Internet of Things (IoT) results in an explosion of data sources
  ‒ Another exponential growth in data at the local and cloud level

HETEROGENEOUS SYSTEM ARCHITECTURE (HSA)
[Figure: CPU cores 1 to N and GPU compute units (CU) 1 to M attached to unified coherent memory]
All processors use the same memory addresses
Full access to virtual and physical memory
Power efficient
Easy to program

LARGE-SCALE DATA ANALYTICS & HETEROGENEOUS COMPUTE (HC)
Large amounts of data stored in the cloud
  ‒ HC makes the cloud energy-efficient
Needs to be analyzed quickly
  ‒ GPU provides the computational horsepower
Different types of structured and unstructured data
  ‒ A heterogeneous platform for the heterogeneous data
[Figure label: 47% GPU]

PROGRAMMING HETEROGENEOUS PROCESSORS

PROGRAMMING LANGUAGES PROLIFERATING ON APU
[Figure: software stack; labels: OpenCL App, OpenMP App, C Runtimes, Various Runtimes, HSAIL, HSA Helper Libraries, HSA Core Runtime, HSA Finalizer, HSA Kernel Driver]

OpenCL: Performance
C++ AMP: Productivity

WHAT IS OPENCL
Open Computing Language (OpenCL) – Platform Model
  ‒ Open standard managed by the Khronos Group
Platform agnostic: CPUs, GPUs, FPGAs, DSPs
[Figure: platform model; labels: Host, Compute Device, Compute Unit, Processing Element]

OPENCL: EXECUTION MODEL
Host program
  ‒ Executes on the host (usually a CPU)
  ‒ Sends commands to the compute devices using a queue
Kernel
  ‒ Basic unit of executable code that runs on compute devices
  ‒ A grid of parallel threads executes the kernel on the compute device
[Figure: a grid of individual threads, organized into groups of threads that execute on the same GPU core]
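The grid-of-threads idea can be made concrete with a short host-side simulation in plain Python. It is a sketch, not OpenCL code: it only shows how a one-dimensional global thread index splits into a group index and an index within the group (what OpenCL calls a work-group and a work-item). The function name is hypothetical, and the sketch assumes the global size is a multiple of the group size and that there is no global offset.

```python
def work_item_ids(global_size, local_size):
    """Enumerate (global_id, group_id, local_id) for a 1-D grid of threads.

    Assumes global_size is a multiple of local_size and no global offset.
    """
    if global_size % local_size != 0:
        raise ValueError("global_size must be a multiple of local_size")
    ids = []
    for global_id in range(global_size):
        # Threads in the same group share a GPU core (compute unit).
        group_id, local_id = divmod(global_id, local_size)
        ids.append((global_id, group_id, local_id))
    return ids

for global_id, group_id, local_id in work_item_ids(8, 4):
    print(f"thread {global_id}: group {group_id}, position {local_id}")
```

With 8 threads in groups of 4, threads 0 to 3 form group 0 and threads 4 to 7 form group 1.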

CODE EXAMPLE
Original loop:
    for loop {
        // do work;
    }
Host-side code:
    1. OpenCL initialization
    2. Allocate memory
    3. Data copy to GPU
    4. Launch GPU kernel
    5. Data copy to host
Device-side code:
    kernel {
        // do work;
    }
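The loop-to-kernel transformation on this slide can be sketched in plain Python, with no OpenCL device involved. The loop body becomes a function of a single index, and a stand-in `launch` helper calls it once per index. The SAXPY computation and all names here are illustrative assumptions; a real host program would also perform the initialization, allocation, and copy steps listed above.

```python
def saxpy_loop(a, x, y):
    """Serial version: one loop does all the work."""
    out = [0.0] * len(x)
    for i in range(len(x)):
        out[i] = a * x[i] + y[i]
    return out

def saxpy_kernel(gid, a, x, y, out):
    """Device-side view: the loop body, written for a single index."""
    out[gid] = a * x[gid] + y[gid]

def launch(kernel, global_size, *args):
    """Stand-in for a kernel launch; a real device runs these calls in
    parallel and in no guaranteed order."""
    for gid in range(global_size):
        kernel(gid, *args)

x = [1.0, 2.0, 3.0]
y = [10.0, 20.0, 30.0]
out = [0.0] * len(x)              # stands in for "allocate memory"
launch(saxpy_kernel, len(x), 2.0, x, y, out)
print(out == saxpy_loop(2.0, x, y))
```

Because each call touches only its own index, the calls are independent, which is what makes the loop a candidate for a GPU kernel.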

GETTING STARTED RESOURCES (OPENCL)
AMD APP Programming SDK
  ‒ e/amd-accelerated-parallelprocessing-app-sdk/
AMD APP Programming Guide
  ‒ dia/2013/12/AMD OpenCL Programming User Guide2.pdf
  ‒ dia/2013/12/AMD OpenCL Programming Optimization Guide2.pdf
Works for both Windows and Linux

C++ AMP
C++, not C
Mainstream: programmed by millions
Minimal: just one language extension
Portable: mix and match hardware from any vendor
General and future proof: designed to cover the full range of heterogeneity

CODE EXAMPLE
Original loop:
    for loop {
        // do work;
    }
C++ AMP version:
    parallel_for_each(num_threads, [=] (t_idx) {
        // do work;
    });
Combination of library and extensions to the C++ standard
Single-source
Substantially boosts programmer productivity
  ‒ No asynchronous data transfers

GETTING STARTED RESOURCES (C++ AMP)
Compiler and runtime
  ‒ ng/wiki/Home
  ‒ Microsoft Visual Studio
Programming Guide
  ‒ aspx
Works for both Windows and Linux

PERFORMANCE
[Figure: performance of OpenCL and C++ AMP on App1, App2, App3, App4, and their geometric mean; a 2x level is marked. Source: Daga et al., IISWC 2015]

SUPPORT FOR POPULAR LIBRARIES
Computer Vision
  ‒ OpenCV
Data Science
  ‒ SciPy
  ‒ NumPy
Image Processing
  ‒ ImageMagick
Parallel Standard Template Library
  ‒ Bolt
Linear Algebra Library
  ‒ AMD clBLAS

OUR THREE-PRONGED APPROACH TO DATA ANALYTICS
Enhancing Programming Model
  ‒ HadoopCL and Apache Spark: flexibility, reliability, and programmability of Hadoop accelerated by OpenCL
Enhancing Data Operations
  ‒ Deep Neural Networks: achieved 2x better energy efficiency on the APU than on discrete GPUs
  ‒ Breadth-first Search: fastest single-GPU Graph500 implementation (June 2014)
  ‒ SpMV: state-of-the-art CSR-based SpMV (13x faster than prior CSR-SpMV and 2x faster than other storage formats)
Enhancing Data Organization
  ‒ In-Memory B+ Trees: efficient memory reorganization to achieve a 3x speedup on the APU over a multicore implementation
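For readers unfamiliar with the CSR format mentioned under SpMV, the sketch below shows the textbook sparse matrix-vector product over compressed sparse row arrays. It is the plain serial baseline, not the optimized GPU algorithm the slide refers to; the array names follow common convention and the 3x3 matrix is an illustrative example.

```python
def csr_spmv(row_ptr, col_idx, values, x):
    """Compute y = A @ x for a matrix A stored in CSR form.

    row_ptr[r]:row_ptr[r + 1] is the slice of col_idx/values holding row r.
    """
    num_rows = len(row_ptr) - 1
    y = [0.0] * num_rows
    for row in range(num_rows):          # rows are independent: the parallel axis
        acc = 0.0
        for k in range(row_ptr[row], row_ptr[row + 1]):
            acc += values[k] * x[col_idx[k]]
        y[row] = acc
    return y

# Illustrative matrix:
# [[10,  0,  0],
#  [ 0, 20, 30],
#  [ 0,  0, 40]]
row_ptr = [0, 1, 3, 4]
col_idx = [0, 1, 2, 2]
values = [10.0, 20.0, 30.0, 40.0]

print(csr_spmv(row_ptr, col_idx, values, [1.0, 2.0, 3.0]))  # [10.0, 130.0, 120.0]
```

Rows can hold very different numbers of nonzeros, which is one reason mapping this loop nest efficiently onto GPU threads is nontrivial.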

NESTED PROCESSING OF MACHINE LEARNING MAP-REDUCE, BIG DATA ON APUS
MAURICIO BRETERNITZ, PH.D.
AMD RESEARCH, TECHNOLOGY & ENGINEERING GROUP
Thanks: Max Grossman, Vivek Sarkar

MapReduce in 30 seconds: Estimating PI
1- Pick random points in the unit rectangle
2- Count the fraction inside the circle
Area: π/4
Map-Reduce:
Map: is the random point inside? Issue k = 1, v = 1; else k = 0, v = 1
Reduce: count 0 keys and count 1 keys
Programmer: writes { map, reduce } methods; the system does the rest
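The recipe above can be written out as a minimal single-process sketch. The map step emits a (key, 1) pair per random point, and the reduce step counts pairs per key. It assumes a circle of radius 0.5 centered in the unit square, so the inside fraction approaches π/4; the function names, the seed, and the sample count are illustrative, and a real MapReduce system would distribute both steps.

```python
import random
from collections import defaultdict

def map_point(x, y):
    """Map: key 1 if the point falls inside the circle, else key 0; value 1."""
    inside = (x - 0.5) ** 2 + (y - 0.5) ** 2 <= 0.25
    return (1 if inside else 0, 1)

def reduce_counts(pairs):
    """Reduce: sum the values for each key."""
    counts = defaultdict(int)
    for key, value in pairs:
        counts[key] += value
    return counts

def estimate_pi(num_points, seed=0):
    rng = random.Random(seed)
    pairs = (map_point(rng.random(), rng.random()) for _ in range(num_points))
    counts = reduce_counts(pairs)
    # Fraction inside is about pi / 4, so scale by 4.
    return 4.0 * counts[1] / num_points

print(estimate_pi(200_000))
```

The estimate converges slowly (the error shrinks with the square root of the sample count), which is why the example benefits from many parallel mappers.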

HADOOP
Open source implementation of the MapReduce programming model
[Figure: Map tasks emit values V1, V2, V3; after a barrier, Reduce tasks combine them]
Runs on a distributed network in several Java VMs
Distributed file system, reliability guarantees, speculative execution, Java programming language and libraries, implicit parallelism

TARGET ARCHITECTURE
Target: CLUSTER of APUs
Two-level parallelism:
  ‒ Across nodes in the cluster
  ‒ Within a node (APU)
      ‒ Multicore (CPU)
      ‒ Data parallel (GPU)

NESTED PROCESSING APPROACH
PARTITION WORK
nest: CPU + GPU
COMBINE RESULTS LOCALLY

WHY APUS – MAP REDUCE-REDUCE
SPLIT: enough work for one GPU
NO MEMORY LIMIT
CPU + GPU execution on each node
Aggregate node's results
Final reduction in cluster
[Figure: on each node, Map tasks feed Reduce tasks after a barrier; the per-node results V1, V2, V3 cross the network to a final Reduce]
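The map, reduce, reduce nesting can be sketched as two reduction levels: each node reduces its own split, and the cluster then reduces the per-node partial results. This is a single-process illustration using a word-count workload chosen for brevity; the function names are hypothetical and nothing here reflects HadoopCL's actual scheduling.

```python
from collections import Counter

def map_record(word):
    """Map: emit (key, 1) for each input record."""
    return (word, 1)

def node_reduce(records):
    """First-level reduce: aggregate one node's split locally (CPU + GPU on an APU)."""
    local = Counter()
    for key, value in map(map_record, records):
        local[key] += value
    return local

def cluster_reduce(node_results):
    """Second-level reduce: combine the per-node partial results."""
    total = Counter()
    for partial in node_results:
        total.update(partial)   # Counter.update adds counts key by key
    return total

splits = [["a", "b", "a"], ["b", "c"]]          # one split per node
partials = [node_reduce(split) for split in splits]
print(dict(cluster_reduce(partials)))
```

Only the small per-node summaries cross the network, which is the point of reducing locally first.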

HADOOPCL
Implemented with AMD's APARAPI: Java methods to GPU
Collaboration with Rice University:
Max Grossman (Rice University), Mauricio Breternitz (AMD Research), Vivek Sarkar (Rice University). "HadoopCL: MapReduce on Distributed Heterogeneous Platforms Through Seamless Integration of Hadoop and OpenCL." 2013 International Workshop on High Performance Data Intensive Computing, May 2013.
M. Grossman, M. Breternitz, V. Sarkar. "HadoopCL2: Motivating the Design of a Distributed, Heterogeneous Programming System With Machine-Learning Applications." IEEE Transactions on Parallel and Distributed Systems, 2014.

RUNNING HADOOPCL

    class PiMapper extends DoubleDoubleBoolIntHadoopCLMapper {
        // Emits (false, 1) for points outside the circle of radius 0.5
        // and (true, 1) for points inside it.
        public void map(double x, double y) {
            if (x * x + y * y > 0.25) {
                write(false, 1);
            } else {
                write(true, 1);
            }
        }
    }

Build and run (labels from the slide's diagram): job.waitForCompletion(true); javac; .class; hadoop jar Pi.jar input output

HadoopCL supports
  ‒ Java syntax & MapReduce abstractions
  ‒ Dynamic memory allocation
  ‒ A variety of data types (primitives, sparse vectors, tuples, etc.)
HadoopCL does not support
  ‒ Arbitrary inputs, outputs
  ‒ Object references

HADOOPCL CLUSTER ARCHITECTURE
[Figure: cluster diagram; labels: hadoop jar Pi.jar input output, NameNode, JobTracker, DataNode, TaskTracker, Map or Reduce Task, HadoopCL Child, HadoopCL ML Device Scheduler]
APU: CPU and GPU share work

HADOOPCL NODE ARCHITECTURE
[Figure: a HadoopCL Child running a Map or Reduce task; labels: Input Aggregator, Input Buffer, Input Buffer Manager, Buffer Runner, Kernel Executor, Launch, Retry, OpenCL Device, Output Buffer, Output Buffer Manager, Store Output, HDFS]
Each Child JVM encloses a data-driven pipeline of communication and computation tasks
Kernel Executor handles:
  ‒ Auto-generation and optimization of OpenCL kernels from JVM bytecode
  ‒ Transfer of inputs and outputs to the device
  ‒ Asynchronous launch of OpenCL kernels

HCL2 EXECUTION FLOW
[Figure: execution flow; labels: compile, JVM, runtime]

EVALUATION
Mahout clustering
  ‒ Mahout provides Hadoop MapReduce implementations of a variety of ML algorithms
  ‒ KMeans iteratively searches for K clusters
Evaluated on 1 NameNode and 3 DataNodes in an AMD APU cluster
Dataset built from the ASF e-mail archives
  ‒ 1.4 GB
  ‒ 1 iteration of searching for 64 clusters
  ‒ Recorded overall execution time, time spent on compute, and time spent on I/O in each mapper and reducer
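As a reminder of what one KMeans iteration computes, here is a minimal single-process sketch in MapReduce terms: the map step assigns each point to its nearest centroid, and the reduce step averages the points per cluster. It is not Mahout's implementation; the function names are hypothetical, and keeping the old centroid for an empty cluster is one possible policy chosen for this sketch.

```python
def squared_distance(a, b):
    return sum((ai - bi) ** 2 for ai, bi in zip(a, b))

def kmeans_iteration(points, centroids):
    """One KMeans step: assign points to nearest centroids, then re-average."""
    dims = len(centroids[0])
    sums = [[0.0] * dims for _ in centroids]
    counts = [0] * len(centroids)
    for point in points:
        # "Map": pick the nearest centroid for this point.
        nearest = min(range(len(centroids)),
                      key=lambda c: squared_distance(point, centroids[c]))
        counts[nearest] += 1
        for d in range(dims):
            sums[nearest][d] += point[d]
    # "Reduce": average per cluster; an empty cluster keeps its old centroid.
    new_centroids = []
    for c, centroid in enumerate(centroids):
        if counts[c] == 0:
            new_centroids.append(tuple(centroid))
        else:
            new_centroids.append(tuple(s / counts[c] for s in sums[c]))
    return new_centroids

points = [(0, 0), (0, 2), (10, 10), (10, 12)]
print(kmeans_iteration(points, [(1, 1), (9, 9)]))  # [(0.0, 1.0), (10.0, 11.0)]
```

The distance computations dominate the cost and are independent per point, which makes the map step a natural fit for the GPU.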

MAHOUT EXAMPLES
KMEANS
  ‒ Finds clusters
FUZZY KMEANS
  ‒ Probabilistic “soft clusters”
PAIRWISE SIMILARITY
  ‒ Recommender pipeline
NAÏVE BAYES
  ‒ Probabilistic classifier
DIRICHLET
  ‒ Finds document topics; clusters via a probability distribution over ‘topics’

EVALUATION
Speedup on AMD A10-7300 95 W APU for 5 Mahout benchmarks
[Figure: bar chart of speedup over Mahout-CPU (axis from 0 to 25) for KMeans, Fuzzy, Dirichlet, Pairwise, and Bayes; annotations: CPU bound, I/O bound]

[Figure: chart plotted against Number of Nodes (axis from 0 to 5); surviving series labels: Pairwise, Bayes]

FUTURE/ONGOING WORK
Evaluation on more Mahout applications, more data sets, more platforms
  ‒ Xiangyu Li, Prof. David Kaeli / Northeastern University: Mahout Recommenders
Evaluate potential power savings
In-depth analysis of the effectiveness of machine learning on performance
Target HSA instead of OpenCL, via Sumatra/APARAPI
Various performance improvements

APACHE SPARK
Fast, MapReduce-like engine
  ‒ In-memory storage abstraction for iterative/interactive queries
  ‒ General execution graphs
  ‒ Up to 100x faster than Hadoop MR
Compatible with Hadoop's storage APIs
  ‒ Can access HDFS, HBase, S3, SequenceFiles, etc.
Great example of ML/Systems/DB collaboration
http://spark.apache.org

SWAT

    val rdd = CLWrapper.cl(sc.objectFile(inputPath))
    val nextRdd = rdd.map(...)

SWAT
[Figure: software stack; labels: SWAT RDD, APARAPI-SWAT (Code Generation), SWAT Serialization, JVM, Native/OpenCL, SWAT-OpenCL Bridge, Memory Allocation and Caching, Device Management, GPU 0, GPU 1, GPU 2]

SWAT
Big wins over HadoopCL:
Built on Spark. And Scala.
No HadoopCL-specific data structures required for representing complex data types
  ‒ User-defined classes (with restrictions), MLlib DenseVector, MLlib SparseVector, Scala Tuple2 for (key, value) pairs, primitives
Simpler semantics for some Spark parallel operations
  ‒ e.g. Spark map() forces one output per input, whereas MapReduce allows an arbitrary number of outputs (though Spark has flatMap())
Better locality: caching of on-device data based on broadcast and RDD IDs
Simplified dynamic memory allocator on the GPU
More stable, nearing a production-ready implementation
Max Grossman / Rice University
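The difference between one-output-per-input and arbitrary-output semantics can be illustrated with two small list helpers in plain Python. These are stand-ins that mimic the shape of Spark's map() and flatMap() on a local list; they are not Spark API calls, and the helper names are hypothetical.

```python
from itertools import chain

def map_like(fn, records):
    """Exactly one output per input, like Spark's map()."""
    return [fn(record) for record in records]

def flat_map_like(fn, records):
    """Zero or more outputs per input, like Spark's flatMap() or a MapReduce mapper."""
    return list(chain.from_iterable(fn(record) for record in records))

lines = ["a b", "", "c"]
print(map_like(str.upper, lines))       # ['A B', '', 'C']
print(flat_map_like(str.split, lines))  # ['a', 'b', 'c']
```

A fixed one-to-one shape means the output buffer size is known before the kernel runs, which is part of why the simpler semantics are easier to support on a GPU.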

SWAT
Current benchmarks: Fuzzy CMeans, KMeans, Neural Net, PageRank, Connected Components
Major challenges:
  ‒ Architected as a third-party JAR, so internal Spark state is hidden (unlike HadoopCL)
  ‒ Garbage collection: allocation patterns of a SWAT program are very different from those of an equivalent Spark version
  ‒ Can we do better auto-scheduling than HadoopCL? Offline? Experiment with other classification algorithms based on IBM work?

CONCLUSION
HadoopCL offers the flexibility, reliability, and programmability of Hadoop accelerated by native, heterogeneous OpenCL threads
Using HadoopCL is a tradeoff: lose parts of the Java language but gain improved performance
Evaluation of KMeans with real-world data sets shows that HadoopCL is flexible and efficient enough to improve the performance of real-world applications
Thanks: Max Grossman, max.grossman@rice.edu

e/aparapi/
HADOOPCL
achments/4425835/hpdic.pdf?version=1&modificationDate=1366561784922&api=v2
HADOOP on star-hadoop-gpu
HADOOPCL presentation
https://www.youtube.com/watch?v=KMpjFsOO4nw

DEEP NEURAL NETWORK (DNN) ACCELERATION ON AMD ACCELERATORS
JUNLI GU, AMD RESEARCH

MACHINE LEARNING BIG DATA INDUSTRY TREND
Why machine learning for Big Data?
  ‒ Original human-defined algorithms don't work well for Big Data
  ‒ Competing in machine learning to understand Big Data
DNN (deep neural networks) is breaking through & leading the direction
  ‒ Large-scale image classification/recognition/search
  ‒ Face recognition, online recommendation, ads
  ‒ Document retrieval, Optical Character Recognition (OCR)
Long list of companies looking for DNN solutions
DNN + Big Data is believed to be the evolutionary trend for apps & HPC systems.

DEEP LEARNING BRINGS CHALLENGES TO SYSTEM DESIGN
[Figure: Big Data input, DNN model, Answer]
Typical scale of data set
  ‒ Image search: 1M
  ‒ OCR: 100M
  ‒ Speech: 10B
  ‒ CTR: 100B
  ‒ Data projected to grow 10X per year
DNN model training time
  ‒ Weeks to months on GPU clusters
  ‒ Trained DNNs are then deployed on the cloud
DNNs are compute- and memory-intensive, thus clusters
System is the final enabler
  ‒ Current platforms run into bottlenecks
  ‒ CPU clusters, CPU + GPU clusters
  ‒ Looking at dGPUs, APUs, FPGAs, ASICs, etc.

WHAT HAVE WE DONE
BASED ON TODAY'S INDUSTRY
Focused on three major industry DNN algorithms
CNN: Convolutional Neural Network (image/video classification)
  ‒ Reference source: Univ. of Toronto CUDA version (http://github.com/bvlc/caffe)
  ‒ AMD implementation open sourced at https://github.com/amd/OpenCL-caffe
Multi-layer Perceptron (voice recognition)
  ‒ Source: publications, interaction with industry experts and ISVs
  ‒ AMD implementation in C++, OpenCL
Auto-encoder + L-BFGS training (image and document retrieval)
  ‒ Reference source: Stanford Univ. Matlab code (http://ai.stanford.edu
