Virtual Machine (VM) for Hadoop Training


© 2012 coreservlets.com and Dima May

Virtual Machine (VM) for Hadoop Training

Originals of slides and source code for examples: http://www.coreservlets.com/hadoop-tutorial/
Also see the customized Hadoop training courses (onsite or at public venues): http://courses.coreservlets.com/

For live customized Hadoop training (including prep for the Cloudera certification exam), please email info@coreservlets.com. Taught by a recognized Hadoop expert who spoke on Hadoop several times at JavaOne, and who uses Hadoop daily in real-world apps. Available at public venues, or customized versions can be held on-site at your organization.

Courses developed and taught by Marty Hall
– JSF 2.2, PrimeFaces, servlets/JSP, Ajax, jQuery, Android development, Java 7 or 8 programming, custom mix of topics
– Courses available in any state or country; Maryland/DC-area companies can also choose afternoon/evening courses

Courses developed and taught by coreservlets.com experts (edited by Marty)
– Spring, Hibernate/JPA, GWT, Hadoop, HTML5, RESTful Web Services

Contact info@coreservlets.com for details.

Agenda

- Overview of the Virtual Machine for Hadoop Training
- Eclipse installation
- Environment variables
- Firefox bookmarks
- Scripts
- Developing exercises
- Well-known issues

Virtual Machine

- In this class we will be using VirtualBox, a desktop virtualization product, to run Ubuntu
  – https://www.virtualbox.org
- The Ubuntu image is provided with Hadoop products pre-installed and configured for development
  – Cloudera Distribution for Hadoop (CDH) 4 is used; the installed products are:
    - Hadoop (HDFS and YARN/MapReduce)
    - HBase
    - Oozie
    - Pig & Hive
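As a quick sanity check after the VM boots, you can confirm the pre-installed products from a terminal (a suggested check, not from the original slides; the exact version strings will vary with the CDH4 build):

    hadoop version    # prints the Hadoop/CDH version
    hbase version     # prints the HBase version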

Installing VirtualBox

- Download the latest release for your specific OS
  – https://www.virtualbox.org/wiki/Downloads
- After the download is complete, run the VirtualBox installer
- Start VirtualBox and import the provided Ubuntu image/appliance
  – File → Import Appliance
- Now that the new image is imported, select it and click 'Start'

VM Resources

- The VM is set up with 3 GB of RAM, 2 CPUs, and 13 GB of storage
- If you can spare more RAM and CPU, adjust the VM settings
  – VirtualBox Manager → right-click the VM → Settings → System → adjust under the Motherboard and Processor tabs
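The same import and resource changes can also be made from the host's command line with VBoxManage, which ships with VirtualBox (a sketch; the appliance file name and VM name below are assumptions, so substitute the ones you actually have):

    # Import the appliance (equivalent to File → Import Appliance)
    VBoxManage import hadoop-training.ova

    # Raise RAM to 4 GB and keep 2 CPUs (the VM must be powered off)
    VBoxManage modifyvm "Hadoop Training VM" --memory 4096 --cpus 2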

Logging In

- Username: hadoop
- Password: hadoop

Desktop Screen

- Command-line terminal
- Eclipse is installed to assist in developing Java code and scripts

Directory Locations

The slide maps directories on the VM to the following roles:
- All the training artifacts; located in the user's home directory
- Installation directory for Hadoop products
- Eclipse installation
- Code, resources, and scripts managed via Eclipse
- Data for exercises
- Hadoop is configured to store its data here
- Java Development Kit (JDK) installation
- Logs are configured to be saved in this directory
- Eclipse plugin to enable highlighting of Pig scripts
- Execute Java code, MapReduce jobs, and scripts from here
- Well-known shell scripts

Eclipse

The Eclipse workspace will contain three projects:
- Exercises – you will implement the hands-on exercises in this project
- Solutions – the solutions to the exercises can be found here
- HadoopSamples – code samples used throughout the slides

Eclipse Project

Projects follow the Maven directory structure:
- /src/main/java – Java packages and classes reside here
- /src/main/resources – non-Java artifacts
- /src/test/java – Java unit tests go here

To learn more about Maven, please visit http://maven.apache.org

Environment Variables

- The VM is set up with various environment variables to assist you with referencing well-known directories
- Environment variables are sourced from
  – /home/hadoop/Training/scripts/hadoop-env.sh
- For example:
  – echo $PLAY_AREA
  – yarn jar $PLAY_AREA/Solutions.jar ...

Environment Variables

- PLAY_AREA = /home/hadoop/Training/play_area
  – Run examples, exercises, and solutions from this directory
  – Jar files are copied here (by Maven)
- TRAINING_HOME = /home/hadoop/Training
  – Root directory for all of the artifacts for this class
- HADOOP_LOGS = $TRAINING_HOME/logs
  – Directory for logs; logs for each product are stored under it
  – ls $HADOOP_LOGS/ → hbase hdfs oozie pig yarn
- HADOOP_CONF_DIR = $HADOOP_HOME/conf
  – Hadoop configuration files are stored here

There is a variable per product referencing its home directory:
- CDH_HOME = $TRAINING_HOME/CDH4
- HADOOP_HOME = $CDH_HOME/hadoop-2.0.0-cdh4.0.0
- HBASE_HOME = $CDH_HOME/hbase-0.92.1-cdh4.0.0
- OOZIE_HOME = $CDH_HOME/oozie-3.1.3-cdh4.0.0
- PIG_HOME = $CDH_HOME/pig-0.9.2-cdh4.0.0
- HIVE_HOME = $CDH_HOME/hive-0.8.1-cdh4.0.0
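Taken together, these definitions imply that /home/hadoop/Training/scripts/hadoop-env.sh looks roughly like the sketch below (a reconstruction from the values on this slide, not the actual file; the real script may export more):

    #!/bin/bash
    # Root of all training artifacts
    export TRAINING_HOME=/home/hadoop/Training
    export PLAY_AREA=$TRAINING_HOME/play_area
    export HADOOP_LOGS=$TRAINING_HOME/logs

    # Per-product home directories under the CDH4 install
    export CDH_HOME=$TRAINING_HOME/CDH4
    export HADOOP_HOME=$CDH_HOME/hadoop-2.0.0-cdh4.0.0
    export HADOOP_CONF_DIR=$HADOOP_HOME/conf
    export HBASE_HOME=$CDH_HOME/hbase-0.92.1-cdh4.0.0
    export OOZIE_HOME=$CDH_HOME/oozie-3.1.3-cdh4.0.0
    export PIG_HOME=$CDH_HOME/pig-0.9.2-cdh4.0.0
    export HIVE_HOME=$CDH_HOME/hive-0.8.1-cdh4.0.0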

Firefox Bookmarks

- Folder with bookmarks to the Javadocs for each product used in this class
- Folder with bookmarks to the documentation packaged with each product used in this class
- Folders with bookmarks to the management web applications for each product; of course, the Hadoop product has to be running for those links to work

Scripts

- Scripts to start/stop ALL installed Hadoop products
  – startCDH.sh – start ALL of the products
  – stopCDH.sh – stop ALL of the products
  – These scripts are located in /home/hadoop/Training/scripts/
  – The scripts are on the PATH, so you can execute them from anywhere
- Start and then stop all of the products; check whether any processes failed to shut down, and if so kill them by PID:
  – startCDH.sh
  – stopCDH.sh
  – ps -ef | grep java
  – kill XXXX
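A minimal cleanup session, assuming a stray daemon survived stopCDH.sh (the PID 4242 below is a stand-in for whatever ps actually reports):

    stopCDH.sh
    ps -ef | grep [j]ava    # the bracket trick keeps grep itself out of the listing
    kill 4242               # replace with the PID of the leftover process
    kill -9 4242            # only if it still will not exit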

Developing Exercises

Proposed steps to develop code for the training exercises:
1. Add code, configuration, and/or scripts to the Exercises project
   – Utilize Eclipse
2. Run mvn package
   – Generates a JAR file with all of the Java classes and resources
   – For your convenience, copies the JAR file to a set of well-known locations
   – Copies scripts to a well-known location
3. Execute your code (a MapReduce job, an Oozie job, or a script)

1: Add Code to the Exercises Project

Write and edit code in Eclipse.
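If you prefer the terminal to Eclipse's pre-configured launcher, step 2 can be run directly (a sketch; the workspace path is an assumption based on the standard Eclipse layout):

    cd ~/workspace/Exercises
    mvn package    # compiles, runs the unit tests, and builds Exercises.jar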

2: Run mvn package

Select a project, then use Eclipse's pre-configured "mvn package" command; messages will appear on the Console view. Notice that it copies the jar file into the play area directory; we will be executing the majority of the code from the play area directory.

3: Execute Your Code

- Utilize the jar produced by step 2; Exercises.jar will reside in the PLAY_AREA directory
- Run your code from the PLAY_AREA directory:
  – cd $PLAY_AREA
  – yarn jar $PLAY_AREA/Exercises.jar \
      mapRed.workflows.CountDistinctTokens \
      /training/data/hamlet.txt \
      /training/playArea/firstJob
    (This is a MapReduce job implemented in the Exercises project and then packaged into a JAR file)
- Clean up after yourself; delete the output directory:
  – hdfs dfs -rm -r /training/playArea/firstJob
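After the job finishes, you can inspect the result before deleting it (a suggested check; part-r-00000 assumes the default single-reducer output naming):

    hdfs dfs -ls /training/playArea/firstJob
    hdfs dfs -cat /training/playArea/firstJob/part-r-00000 | head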

Save VM Option

- Instead of shutting down the OS, you can save the current OS state
  – When you load it again, the saved state will be restored

Well-Known Issues

- If you "save the machine state" instead of restarting the VM, HBase will not properly reconnect to HDFS
  – Solution: shut down all of the Hadoop products prior to closing the VM (run the stopCDH.sh script)
- The current VM allocates 3 GB of RAM; that is really not much given all of the Hadoop and MapReduce daemons
  – Solution: if your machine has more RAM to spare, increase it. When the VM is down, go to Settings → System → Base Memory
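A safe save-state routine that avoids the HBase issue, combining the guest-side script with the host-side VirtualBox CLI (the VM name is an assumption; use the name shown in VirtualBox Manager):

    # Inside the guest, stop everything first
    stopCDH.sh

    # On the host, save the machine state (CLI equivalent of the GUI option)
    VBoxManage controlvm "Hadoop Training VM" savestate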

© 2012 coreservlets.com and Dima May

Wrap-Up

Summary

- We now know more about the Ubuntu VM
- There are useful environment variables
- There are helpful Firefox bookmarks
- Use the management scripts to start/stop the Hadoop products
- Develop exercises utilizing Eclipse and Maven
- Look out for the well-known issues with running Hadoop on top of a VirtualBox VM

© 2012 coreservlets.com and Dima May

Questions?

More info:
- http://www.coreservlets.com/hadoop-tutorial/ – Hadoop programming tutorial
- Customized Hadoop training courses, at public venues or onsite at your organization
- General Java programming tutorial
- Java 8 tutorial
- JSF 2.2 tutorial
- PrimeFaces tutorial
- http://coreservlets.com/ – JSF 2, PrimeFaces, Java 7 or 8, Ajax, jQuery, Hadoop, RESTful Web Services, Android, HTML5, Spring, Hibernate, Servlets, JSP, GWT, and other Java EE training

