

Deep Learning for Robotic Vision: An Introduction
Niko Suenderhauf
Queensland University of Technology
Australian Centre for Robotic Vision

What is Deep Learning?

Artificial Intelligence: Intelligence demonstrated by machines. The study of "intelligent agents": any device that perceives its environment and takes actions that maximize its chance of successfully achieving its goals. Machines that mimic "cognitive" functions that humans associate with the human mind, such as "learning" and "problem solving".

Machine Learning: Machine learning is the scientific study of algorithms and statistical models that computer systems use to perform a specific task without using explicit instructions, relying on patterns and inference instead. Machine learning algorithms build a mathematical model based on sample data, known as "training data", in order to make predictions or decisions without being explicitly programmed to perform the task.

Deep Learning: Deep learning allows computational models that are composed of multiple processing layers to learn representations of data with multiple levels of abstraction. Deep learning discovers intricate structure in large data sets by using the backpropagation algorithm to indicate how a machine should change its internal parameters that are used to compute the representation in each layer from the representation in the previous layer. (LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521, 436–444 (2015))

[Diagram: Artificial Intelligence, with the subfields Knowledge Representation, Reasoning, Logic, Search, Planning, and Machine Learning; Deep Learning is shown within Machine Learning.]

What is Robotic Vision?

                  Output: Images       Output: Data        Output: Actions
Input: Images     Image Processing     Computer Vision     Robotic Vision
Input: Data       Computer Graphics    Data Science

"Computer Vision on a robot?" Not quite: robotic vision maps images to actions.

This is where robotic vision differs from computer vision. For robotic vision, perception is only one part of a more complex, embodied, active, and goal-driven system. Robotic vision therefore has to take into account that its immediate outputs (object detection, segmentation, depth estimates, 3D reconstruction, a description of the scene, and so on) will ultimately result in actions in the real world. In a simplified view, whereas computer vision takes images and translates them into information, robotic vision translates images into actions.

(The Limits and Potentials of Deep Learning for Robotics. Sünderhauf, Brock, Scheirer, Hadsell, Fox, Leitner, Upcroft, Abbeel, Burgard, Milford, Corke. IJRR 2018.)

Supervised (Deep) Learning

Supervised Learning

Supervised learning is the machine learning task of learning a function that maps an input to an output based on example input-output pairs. It infers a function from labeled training data consisting of a set of training examples.

Training examples: (image, label) pairs
X = { ([image], 'dog'), ([image], 'cat'), ([image], 'car'), ... }

Goal: learn a function f: Image → Label, such that f([image]) = 'cat' (if all goes well).

Nearest Neighbor Classifiers

Intuition

Every image can be rearranged into a vector: shape (32,32,3) → shape (1024,1,3) → shape (3072,1).

3072-Dimensional Space
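A minimal sketch of the idea, in plain Python: each image is flattened into a 3072-dimensional vector, and a query image receives the label of the closest training vector. The L1 distance, the helper names, and the constant-colour toy images are assumptions made for illustration; the slides do not specify a distance measure.

```python
def flatten(image):
    """Rearrange a nested (height, width, channels) list into one flat vector."""
    return [value for row in image for pixel in row for value in pixel]

def l1_distance(a, b):
    """Sum of absolute differences between two equally long vectors."""
    return sum(abs(p - q) for p, q in zip(a, b))

def nearest_neighbor(train_vectors, train_labels, query):
    """Return the label of the training vector closest to the query."""
    distances = [l1_distance(v, query) for v in train_vectors]
    best = min(range(len(distances)), key=distances.__getitem__)
    return train_labels[best]

def constant_image(value, size=32, channels=3):
    """Hypothetical toy image: every pixel has the same value."""
    return [[[value] * channels for _ in range(size)] for _ in range(size)]

dark = flatten(constant_image(0.1))
bright = flatten(constant_image(0.9))
query = flatten(constant_image(0.8))

print(len(query))                                                   # 3072
print(nearest_neighbor([dark, bright], ["dark", "bright"], query))  # bright
```

The classifier has no training step at all: it memorises the training set and compares every query against every stored vector.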

Linear Classifiers

Interpret the values of y as class confidences: the bigger y_i, the more confident we are that x is of class i.

We are actually projecting from 2D into 3D!
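As a hedged illustration, assuming the usual form y = Wx + b for a linear classifier (the slides later refer to the parameters as (W,b)), the sketch below maps a 2D input to three class scores. The weights, bias, and input are made-up numbers.

```python
def linear_scores(W, b, x):
    """Compute y = W x + b, with one row of W (and one bias) per class."""
    return [sum(w * v for w, v in zip(row, x)) + bias for row, bias in zip(W, b)]

def predict(W, b, x):
    """Pick the class with the largest score y_i."""
    scores = linear_scores(W, b, x)
    return max(range(len(scores)), key=scores.__getitem__)

# Hypothetical parameters: 2D input, 3 classes, so W has shape (3, 2).
W = [[1.0, 0.0],
     [0.0, 1.0],
     [-1.0, -1.0]]
b = [0.0, 0.0, 0.5]
x = [2.0, 0.5]

scores = linear_scores(W, b, x)
print(scores)            # [2.0, 0.5, -2.0]
print(predict(W, b, x))  # 0
```

With three classes and a two-dimensional input, the 3 x 2 matrix W takes each 2D point to a 3D score vector.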

Softmax

Towards a Neural Network

Every image can be rearranged into a vector: shape (32,32,3) → shape (1024,1,3) → shape (3072,1).

[Figure: the image of shape (32,32,3), rearranged to shape (3072,1), shown alongside the class list Airplane, Car, Bird, Cat, Deer, Dog, Frog, Horse, Ship, Truck.]

Loss Functions (How Good is the Model?)

Loss Function

How good or bad are the current parameters?

Cross-Entropy Loss (Softmax Classifier): Interpret the outputs y as unnormalised log-probabilities for each class, and apply the softmax function to get probabilities. The loss is the negative log of the probability assigned to the true class: L = -log( exp(y_true) / sum_j exp(y_j) ), where y_true is the score assigned to the true class.
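A small sketch of the softmax function and the resulting cross-entropy loss, written in plain Python. The score vector is a made-up example; subtracting the maximum score before exponentiating is a common numerical-stability step and does not change the result.

```python
import math

def softmax(scores):
    """Turn unnormalised log-probabilities into probabilities that sum to 1."""
    shift = max(scores)  # stability: avoids overflow in exp
    exps = [math.exp(s - shift) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(scores, true_class):
    """Negative log of the probability assigned to the true class."""
    return -math.log(softmax(scores)[true_class])

scores = [2.0, 0.5, -2.0]  # hypothetical classifier outputs y
probs = softmax(scores)

loss_if_class_0 = cross_entropy(scores, 0)  # small: class 0 has the top score
loss_if_class_2 = cross_entropy(scores, 2)  # large: class 2 has the lowest score
print(probs, loss_if_class_0, loss_if_class_2)
```

The same scores give a low loss when the true class is the one with the highest score and a high loss otherwise.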

Loss Function Example 1 True class: “0”

Loss Function Example 2 True class: “1”

Cross Entropy Loss Intuition

The log-sum-exp term of the loss approximates a max function!
Minimum loss when: the correct class has the highest score!
Objective: minimize the average loss over all training samples.
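To make the intuition concrete, the softmax cross-entropy loss can be rewritten as log(sum_j exp(y_j)) - y_true. The sketch below, with made-up scores, checks that the log-sum-exp term stays close to the maximum score, so the loss is near zero when the true class has the highest score.

```python
import math

def log_sum_exp(scores):
    """log(sum_j exp(y_j)), computed stably; a smooth approximation of max."""
    m = max(scores)
    return m + math.log(sum(math.exp(s - m) for s in scores))

def loss(scores, true_class):
    """Equivalent to -log(softmax(scores)[true_class])."""
    return log_sum_exp(scores) - scores[true_class]

scores = [10.0, 1.0, -3.0]                # hypothetical scores
gap = log_sum_exp(scores) - max(scores)   # tiny positive number

loss_correct = loss(scores, 0)  # true class has the highest score: loss near 0
loss_wrong = loss(scores, 1)    # true class scores 9 lower: loss near 9
print(gap, loss_correct, loss_wrong)
```

The loss is roughly the amount by which the best score exceeds the true class's score, which is why it is smallest when the correct class comes out on top.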

Training: Finding Good Weights (and Biases)

How do we find the best (W,b)?

Objective: minimize the average loss over all training samples. But how? Some ideas:
Random search: randomly choose (W,b), and remember the best.
Random local search: randomly change (W,b) a little by adding a small increment, and check if that made it better.
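Both ideas can be sketched in a few lines. The function toy_loss is a hypothetical stand-in for the average training loss (with its minimum at w = 1, b = -2), and the search ranges, step size, and trial counts are arbitrary choices for illustration.

```python
import random

def toy_loss(params):
    """Hypothetical stand-in for the average training loss; minimum at (1, -2)."""
    w, b = params
    return (w - 1.0) ** 2 + (b + 2.0) ** 2

def random_search(loss_fn, trials, rng):
    """Randomly choose parameters, and remember the best."""
    best_params, best_loss = None, float("inf")
    for _ in range(trials):
        candidate = [rng.uniform(-5, 5), rng.uniform(-5, 5)]
        candidate_loss = loss_fn(candidate)
        if candidate_loss < best_loss:
            best_params, best_loss = candidate, candidate_loss
    return best_params, best_loss

def random_local_search(loss_fn, start, trials, step, rng):
    """Add a small random increment; keep it only if the loss got better."""
    params, current = list(start), loss_fn(start)
    for _ in range(trials):
        candidate = [p + rng.gauss(0, step) for p in params]
        candidate_loss = loss_fn(candidate)
        if candidate_loss < current:
            params, current = candidate, candidate_loss
    return params, current

rng = random.Random(0)
_, global_loss = random_search(toy_loss, 200, rng)
_, local_loss = random_local_search(toy_loss, [4.0, 4.0], 200, 0.5, rng)
print(global_loss, local_loss)
```

Neither method uses any information about which direction improves the loss, which is what makes them inefficient once the number of parameters grows.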

Gradient Descent

W ← W − η · ∂L/∂W

η: learning rate (step size)
∂L/∂W: derivative of the loss with respect to the weights

Fortunately, automatic differentiation is part of most DL libraries! The same goes for various optimization methods!
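A hedged sketch of gradient descent for a linear softmax classifier, with the gradient written out by hand instead of relying on automatic differentiation. The four 2D training points, the learning rate, and the number of steps are made-up values; for softmax with cross-entropy loss, the derivative with respect to the scores is (probabilities minus one-hot label).

```python
import math

def softmax(scores):
    shift = max(scores)
    exps = [math.exp(s - shift) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def average_loss_and_grads(W, b, data):
    """Average cross-entropy loss and its gradients with respect to W and b."""
    n_classes, n_inputs, n = len(W), len(W[0]), len(data)
    grad_W = [[0.0] * n_inputs for _ in range(n_classes)]
    grad_b = [0.0] * n_classes
    total_loss = 0.0
    for x, label in data:
        scores = [sum(w * v for w, v in zip(row, x)) + bias
                  for row, bias in zip(W, b)]
        probs = softmax(scores)
        total_loss -= math.log(probs[label])
        for i in range(n_classes):
            delta = probs[i] - (1.0 if i == label else 0.0)  # dL/dscore_i
            grad_b[i] += delta / n
            for j in range(n_inputs):
                grad_W[i][j] += delta * x[j] / n
    return total_loss / n, grad_W, grad_b

def gradient_descent(W, b, data, learning_rate, steps):
    """Repeatedly step against the gradient: W <- W - learning_rate * dL/dW."""
    losses = []
    for _ in range(steps):
        loss, grad_W, grad_b = average_loss_and_grads(W, b, data)
        losses.append(loss)
        W = [[w - learning_rate * g for w, g in zip(rw, rg)]
             for rw, rg in zip(W, grad_W)]
        b = [bias - learning_rate * g for bias, g in zip(b, grad_b)]
    return W, b, losses

# Hypothetical training set: two classes in 2D.
data = [([1.0, 2.0], 0), ([2.0, 1.5], 0), ([-1.0, -1.5], 1), ([-2.0, -0.5], 1)]
W, b, losses = gradient_descent([[0.0, 0.0], [0.0, 0.0]], [0.0, 0.0],
                                data, learning_rate=0.1, steps=50)
print(losses[0], losses[-1])
```

With all-zero initial parameters, both classes start at probability 0.5, so the first recorded loss is log 2; the loss then decreases as the steps accumulate.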

Training a simple linear classifier

And Now: Actual Neural Networks

Missing Ingredient: (Nonlinear) Activation Function

Linear models are often overly simple.
A nonlinear activation function enables meaningful "stacking" of layers → deep networks.
Historically: sigmoid function.
Many other choices: tanh(x), Rectified Linear Unit (ReLU): max(0,x).
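The sketch below defines the activation functions named above and illustrates why the nonlinearity matters: two stacked linear layers without an activation collapse into a single linear layer, whereas inserting a ReLU between them does not. The one-dimensional layers and their weights are made-up values.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def tanh(x):
    return math.tanh(x)

def relu(x):
    return max(0.0, x)

# Hypothetical one-dimensional layers.
W1, B1 = 2.0, 1.0
W2, B2 = -3.0, 0.5

def two_linear_layers(x):
    """Second layer applied directly to the first: still a linear function."""
    return W2 * (W1 * x + B1) + B2

def collapsed_layer(x):
    """The single linear layer that the stack above is equivalent to."""
    return (W2 * W1) * x + (W2 * B1 + B2)

def two_layers_with_relu(x):
    """With a ReLU in between, the stack is no longer a linear function."""
    return W2 * relu(W1 * x + B1) + B2

inputs = [-2.0, -1.0, 0.0, 1.0, 2.0]
print([two_linear_layers(x) - collapsed_layer(x) for x in inputs])  # all 0.0
print(two_layers_with_relu(-2.0), collapsed_layer(-2.0))            # 0.5 9.5
```

Without the activation, adding layers adds parameters but no expressive power.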

Deep Networks

[Figure: an image of shape (32,32,3), rearranged to shape (3072,1), shown alongside the class list Airplane, Car, Bird, Cat, Deer, Dog, Frog, Horse, Ship, Truck.]

Convolutional Networks

Kernel (3 x 3):
-1 -1 -1
-1  1 -1
 1  1  1

Image patch (3 x 3):
-1 -1 -1
-1 -1 -1
-1 -1 -1

The filter response is the dot product of the kernel and the image patch.

Input image: 3 channels (RGB), shape (3, 224, 224)

Convolution: Slide the filter over all locations and perform a dot product at each one. A 3 x 11 x 11 filter applied at one location of the 3 x 224 x 224 image yields 1 (scalar) result.
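A minimal sketch of this operation in plain Python, using the 3 x 3 kernel and the all -1 image patch from the slides. The helper names, the 5 x 5 single-channel test image, and the choice of no padding with stride 1 are assumptions. As is common in deep learning, the kernel is not flipped.

```python
def dot(kernel, patch):
    """Dot product of two equally shaped (channels, height, width) nested lists."""
    return sum(k * p
               for k_ch, p_ch in zip(kernel, patch)
               for k_row, p_row in zip(k_ch, p_ch)
               for k, p in zip(k_row, p_row))

def convolve(image, kernel, stride=1):
    """Slide one filter over a (channels, H, W) image, without padding."""
    channels, height, width = len(image), len(image[0]), len(image[0][0])
    k_h, k_w = len(kernel[0]), len(kernel[0][0])
    output = []
    for top in range(0, height - k_h + 1, stride):
        row = []
        for left in range(0, width - k_w + 1, stride):
            patch = [[image[c][top + i][left:left + k_w] for i in range(k_h)]
                     for c in range(channels)]
            row.append(dot(kernel, patch))  # one scalar per location
        output.append(row)
    return output

kernel = [[[-1, -1, -1],
           [-1,  1, -1],
           [ 1,  1,  1]]]                   # one channel, 3 x 3
patch = [[[-1] * 3 for _ in range(3)]]      # the all -1 image patch
print(dot(kernel, patch))                   # 1

image = [[[-1] * 5 for _ in range(5)]]      # hypothetical 1 x 5 x 5 image
response = convolve(image, kernel)
print(response)                             # 3 x 3 map, every entry 1
```

Each filter produces one two-dimensional response map; a layer with many filters stacks these maps as output channels.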

1st Convolutional Layer (AlexNet, ResNeXt)

Input: 3 channels (RGB), shape (3, 224, 224)
AlexNet conv1: 64 filters, size (3, 11, 11)
conv1 result: (64, 55, 55)
conv2 result: (192, 27, 27)
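The output shapes follow from simple arithmetic. The sketch below assumes a 224 x 224 input and the stride, padding, and pooling settings of a widely used AlexNet variant (conv1: stride 4, padding 2; a 3 x 3 max-pool with stride 2; conv2: 5 x 5 kernel, padding 2). These settings are not stated on the slides and are assumptions chosen because they reproduce the shapes shown.

```python
def conv_output_size(input_size, kernel_size, stride, padding):
    """Spatial output size of a convolution or pooling layer (floor division)."""
    return (input_size + 2 * padding - kernel_size) // stride + 1

conv1 = conv_output_size(224, kernel_size=11, stride=4, padding=2)
pool1 = conv_output_size(conv1, kernel_size=3, stride=2, padding=0)  # assumed max-pool
conv2 = conv_output_size(pool1, kernel_size=5, stride=1, padding=2)
print(conv1, pool1, conv2)  # 55 27 27

# Number of weights in conv1: 64 filters, each of size (3, 11, 11).
conv1_weights = 64 * 3 * 11 * 11
print(conv1_weights)        # 23232
```

The channel dimension of each result, 64 and 192, is simply the number of filters in that layer.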

AlexNet

ResNeXt

Input: 3 channels (RGB), shape (3, 224, 224)

Later layers: shape (9216,1) → shape (4096,1) → shape (1000,1), one output per class (1000 classes).

The representations go from super high-dimensional to very high-dimensional to pretty high-dimensional to still high-dimensional: nonlinear projections from one space into another, until the classes are linearly separable.
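A tiny illustration of this idea: the XOR labelling of four 2D points cannot be separated by a single linear classifier, but after one hand-picked nonlinear projection a linear classifier suffices. The hidden-layer weights and the final linear score are hypothetical values chosen by hand, not learned.

```python
def relu(x):
    return max(0.0, x)

def hidden(x1, x2):
    """Hand-picked nonlinear projection from the input space into a new 2D space."""
    return relu(x1 + x2), relu(x1 + x2 - 1.0)

def linear_score(h1, h2):
    """A single linear classifier applied to the projected representation."""
    return h1 - 2.0 * h2 - 0.5

xor_data = {(0, 0): 0, (0, 1): 1, (1, 0): 1, (1, 1): 0}
predictions = {point: int(linear_score(*hidden(*point)) > 0) for point in xor_data}
print(predictions == xor_data)  # True
```

In a trained network the projections are learned from data, but the principle is the same: each layer reshapes the space so that the final linear classifier has an easier job.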

Backpropagation

Forward pass: input, 3 channels (RGB), shape (3, 224, 224) → (64, 55, 55) → (192, 27, 27) → ... → predictions (1, 10) → loss

Parameters to update from the loss: conv1 parameters, conv2 parameters, fc1 parameters

http://cs231n.github.io/optimization-2/
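Backpropagation applies the chain rule from the loss back towards the inputs and parameters. The sketch below uses a deliberately tiny computation, f = (x + y) * z, as a stand-in for a network; the function and the input values are illustrative assumptions, and a numerical gradient serves as a sanity check.

```python
def forward_backward(x, y, z):
    """Forward pass for f = (x + y) * z, then gradients via the chain rule."""
    # Forward pass: keep the intermediate value q.
    q = x + y
    f = q * z
    # Backward pass: start at the output and move towards the inputs.
    df_dq = z             # f = q * z
    df_dz = q
    df_dx = df_dq * 1.0   # q = x + y, so dq/dx = 1
    df_dy = df_dq * 1.0   # dq/dy = 1
    return f, (df_dx, df_dy, df_dz)

def numerical_gradient(func, args, eps=1e-6):
    """Central finite differences, used only to check the analytic gradients."""
    grads = []
    for i in range(len(args)):
        plus, minus = list(args), list(args)
        plus[i] += eps
        minus[i] -= eps
        grads.append((func(*plus) - func(*minus)) / (2 * eps))
    return grads

f, grads = forward_backward(-2.0, 5.0, -4.0)
print(f, grads)  # -12.0 (-4.0, -4.0, 3.0)

check = numerical_gradient(lambda x, y, z: forward_backward(x, y, z)[0],
                           [-2.0, 5.0, -4.0])
print(check)     # approximately [-4.0, -4.0, 3.0]
```

In a deep network the same bookkeeping runs layer by layer, which is how the loss yields gradients for the parameters of every layer.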

[Plot: loss over time for the training and validation data, with the annotations "stop training here" and "overfitting".]
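One common way to turn "stop training here" into a rule is early stopping with a patience parameter. The sketch below is a minimal version under that assumption; the patience value and the validation losses are made up.

```python
def early_stopping_epoch(validation_losses, patience=2):
    """Return the epoch with the best validation loss, stopping the scan once
    the loss has failed to improve for `patience` consecutive epochs."""
    best_epoch, best_loss = 0, float("inf")
    for epoch, loss in enumerate(validation_losses):
        if loss < best_loss:
            best_epoch, best_loss = epoch, loss
        elif epoch - best_epoch >= patience:
            break  # validation loss keeps rising: likely overfitting
    return best_epoch

# Hypothetical validation curve: improves, then rises again.
validation = [1.00, 0.70, 0.55, 0.50, 0.52, 0.58, 0.66]
stop_at = early_stopping_epoch(validation)
print(stop_at)  # 3
```

In practice one would keep a copy of the parameters from the best epoch and restore them after stopping.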

Applications

Image Classification: Image → ConvNet → Representation → Linear Classifier → Class Labels

Semantic Segmentation: Image → ConvNet → Representation → Per-Pixel Class Probabilities

Object Detection: Image → ConvNet → Representation → [x, y, width, height], confidence, class label

Reinforcement Learning: Image → ConvNet → Representation → Distribution over actions

What is your task? Image → ConvNet → Representation → Your Task?

Fine Tuning: Image → ConvNet → Representation → Linear Classifier → Class Labels

Freeze the early layers in the ConvNet (use them as a fixed feature extractor). Re-initialise the last layer(s) and train only them.
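The recipe can be sketched without any deep learning library. Below, a network is a list of layers with a hypothetical "trainable" flag; the layer names, sizes, and placeholder gradients are all made up for illustration and stand in for a real pretrained model.

```python
import random

def make_layer(name, size, rng):
    """Hypothetical layer: a name, a flat weight list, and a trainable flag."""
    return {"name": name,
            "weights": [rng.gauss(0, 0.01) for _ in range(size)],
            "trainable": True}

def prepare_for_fine_tuning(layers, num_new_outputs, rng):
    """Freeze all layers except the last, and re-initialise the last layer."""
    for layer in layers[:-1]:
        layer["trainable"] = False
    layers[-1] = make_layer(layers[-1]["name"], num_new_outputs, rng)
    return layers

def sgd_step(layers, gradients, learning_rate):
    """Gradient step that updates only the trainable layers."""
    for layer, grad in zip(layers, gradients):
        if layer["trainable"]:
            layer["weights"] = [w - learning_rate * g
                                for w, g in zip(layer["weights"], grad)]

rng = random.Random(0)
network = [make_layer("conv1", 4, rng), make_layer("conv2", 4, rng),
           make_layer("fc", 3, rng)]
network = prepare_for_fine_tuning(network, num_new_outputs=2, rng=rng)

frozen_before = [list(layer["weights"]) for layer in network[:-1]]
last_before = list(network[-1]["weights"])
placeholder_grads = [[1.0] * len(layer["weights"]) for layer in network]
sgd_step(network, placeholder_grads, learning_rate=0.1)
```

After the step, only the re-initialised last layer has changed; the frozen layers act as a fixed feature extractor. In a framework such as PyTorch, freezing typically corresponds to setting requires_grad to False on the frozen parameters.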

Tips and Tricks

http://karpathy.github.io/2019/04/25/recipe/
http://cs231n.github.io/neural-networks-3/

