Lecture 4 Fundamentals Of Deep Learning And Neural Networks


Lecture 4: Fundamentals of deep learning and neural networks
Serena Yeung
BIODS 388

Deep learning: Machine learning models based on “deep” neural networks comprising millions (sometimes billions) of parameters organized into hierarchical layers. Features are multiplied and added together repeatedly, with the outputs from one layer of parameters being fed into the next layer, before a prediction is made.

Contrast with linear regression:

Agenda for today
- More on the structure of neural network models
- Machine learning training loop and concept of loss, in the context of neural networks
- Minimizing the loss for complex neural networks: gradient descent and backpropagation

Let’s start by considering again logistic regression, for binary classification.

Nonlinear squashing to (0,1) with sigmoid nonlinearity

Also commonly used in modern neural networks!
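The sigmoid squashing described above can be sketched in a few lines (a minimal illustration, not code from the slides):

```python
import math

def sigmoid(z):
    # Squash any real-valued input into the interval (0, 1)
    return 1.0 / (1.0 + math.exp(-z))

sigmoid(0.0)    # 0.5: no evidence either way
sigmoid(10.0)   # very close to 1
sigmoid(-10.0)  # very close to 0
```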

The logistic regression with sigmoid that we just saw can be considered as a single “neuron” model:
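As a sketch (illustrative, not the lecture’s exact notation), a single sigmoid “neuron” is just a weighted sum of the inputs plus a bias, squashed through the sigmoid — exactly logistic regression:

```python
import math

def neuron(x, w, b):
    # Weighted sum of inputs plus bias, then sigmoid nonlinearity:
    # this single "neuron" is logistic regression on x
    z = sum(wi * xi for wi, xi in zip(w, x)) + b
    return 1.0 / (1.0 + math.exp(-z))

neuron([1.0, 2.0], [0.0, 0.0], 0.0)  # 0.5: zero weights give no preference
```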

A layer of a neural network consists of a set of neurons that each take the same input!
Note: each neuron will have its own set of parameters that it learns, which will produce different outputs.
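A layer can be sketched as a list of such neurons applied to the same input, each with its own parameters (a minimal illustration):

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def layer(x, weights, biases):
    # Every neuron sees the same input x, but each has its own weight
    # row and bias, so each produces a different output
    return [sigmoid(sum(w * xi for w, xi in zip(wrow, x)) + b)
            for wrow, b in zip(weights, biases)]

out = layer([1.0, -1.0], [[0.5, 0.5], [2.0, 0.0]], [0.0, 0.0])
# Same input, two different activations: [0.5, sigmoid(2) ≈ 0.88]
```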

Concatenate the multiple outputs from a layer of a neural network to be the input to the next layer.

Represents an increasingly complex (and hierarchical) function that is being computed!

Fully connected layer: all neurons in the layer take as input the full input to the layer (also called dense layer or linear layer).
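Stacking fully connected layers, with each layer’s outputs concatenated as the next layer’s input, can be sketched like this (sizes and weights chosen only for illustration):

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def dense(x, weights, biases):
    # Fully connected (dense/linear) layer: every neuron takes the full input
    return [sigmoid(sum(w * xi for w, xi in zip(wrow, x)) + b)
            for wrow, b in zip(weights, biases)]

x = [1.0, -2.0]
h = dense(x, [[0.5, -0.5], [1.0, 1.0], [0.0, 2.0]], [0.1, 0.0, -0.3])  # 2 inputs -> 3 hidden
y = dense(h, [[1.0, -1.0, 0.5]], [0.0])                                # 3 hidden -> 1 output
```

Each extra layer composes another round of multiply-add-squash, which is what makes the overall function increasingly complex and hierarchical.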

How do we train neural networks to learn good values of the (many) parameters, to accurately map from inputs to desired outputs?

Optimization step

Periodically use the validation set to measure how the model will do “in the real world”. Save a version of the model if it gives the best validation performance seen so far.

Can also run the entire process for different training configurations, or hyperparameters, to choose the best ones. Referred to as “hyperparameter tuning”.
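The training loop above — optimize on training data, periodically check the validation set, keep the best model seen so far — can be sketched end-to-end on a toy one-parameter model (everything here, data included, is invented for illustration):

```python
import random

# Toy data: y ~ 2x plus noise. The "model" is y_hat = w * x, one parameter.
random.seed(0)
train = [(0.1 * i, 2 * (0.1 * i) + random.gauss(0, 0.1)) for i in range(20)]
val = [(0.05, 0.1), (0.55, 1.1), (1.05, 2.1)]

def mse(w, data):
    return sum((w * x - y) ** 2 for x, y in data) / len(data)

w, lr = 0.0, 0.05
best_w, best_val = w, mse(w, val)
for step in range(200):
    # Optimization step: move w against the gradient of the training loss
    grad = sum(2 * (w * x - y) * x for x, y in train) / len(train)
    w -= lr * grad
    if step % 10 == 0:
        # Periodically measure validation performance ("the real world")
        v = mse(w, val)
        if v < best_val:
            # Save the best model seen so far
            best_w, best_val = w, v
```

Hyperparameter tuning would repeat this whole loop for different choices of `lr` (and, for real networks, layer sizes, etc.) and keep the configuration with the best validation score.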

Agenda
- More on the structure of neural network models
- Machine learning training loop and concept of loss, in the context of neural networks
- Minimizing the loss for complex neural networks: gradient descent and backpropagation

Cross-entropy loss: 0.51

Cross-entropy loss: 0.15
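Binary cross-entropy can be computed directly; loss values of roughly this size arise from moderately vs. highly confident correct predictions (a minimal sketch — the specific probabilities are chosen for illustration, not taken from the slides):

```python
import math

def binary_cross_entropy(p, y):
    # p: predicted probability of the positive class; y: true label in {0, 1}
    return -(y * math.log(p) + (1 - y) * math.log(1 - p))

binary_cross_entropy(0.60, 1)  # ~0.51: mildly confident, correct
binary_cross_entropy(0.86, 1)  # ~0.15: more confident, correct -> lower loss
```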

Agenda
- More on the structure of neural network models
- Machine learning training loop and concept of loss, in the context of neural networks
- Minimizing the loss for complex neural networks: gradient descent and backpropagation

How can we find “good” values of many parameters?

One option: Try all combinations of possible weights and test how good each one is. But this would take forever, since there are infinitely many possibilities and no indication of how best to adjust.

Instead: the trick is that we need to have some idea of which “direction” to adjust the weights to reduce the loss function.

Analogy: the game of Marco Polo!
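The “direction” intuition can be sketched on a one-parameter loss: the derivative tells us which way is “warmer”, so we step the other way (a toy illustration):

```python
def loss(w):
    return (w - 3.0) ** 2     # minimized at w = 3

def grad(w):
    return 2.0 * (w - 3.0)    # derivative: its sign says which way the loss increases

w = 0.0
for _ in range(100):
    w -= 0.1 * grad(w)        # gradient descent: step opposite the gradient
# w is now very close to 3, with no exhaustive search needed
```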

Backpropagation: mathematical technique that breaks down a complex gradient computation into local gradient computations that are then combined together. Secret sauce for allowing us to obtain gradients for large neural network models! (with the help of graphics processing units, or GPUs)
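The local-gradients idea can be sketched on a single sigmoid neuron: compute each node’s local derivative on the backward pass, then multiply them along the chain (the squared loss and the specific values are illustrative choices, not from the slides):

```python
import math

# Forward pass: z = w*x + b, p = sigmoid(z), L = (p - y)**2
w, x, b, y = 0.5, 2.0, -1.0, 1.0
z = w * x + b                      # 0.0
p = 1.0 / (1.0 + math.exp(-z))     # 0.5
L = (p - y) ** 2                   # 0.25

# Backward pass: combine local gradients with the chain rule
dL_dp = 2.0 * (p - y)              # local gradient of the loss:   -1.0
dp_dz = p * (1.0 - p)              # local gradient of the sigmoid: 0.25
dL_dz = dL_dp * dp_dz              # -0.25
dL_dw = dL_dz * x                  # -0.5: how L changes per unit change in w
dL_db = dL_dz                      # -0.25
```

Backpropagation applies exactly this recipe node-by-node through a network of millions of parameters, reusing each intermediate gradient instead of recomputing from scratch.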

Now that we have a deeper understanding of neural networks, let’s look at how they work for common types of input data.

Some case studies of convolutional neural networks.

Gulshan et al. 2016
Task: Binary classification of referable diabetic retinopathy from retinal fundus photographs
Input: Retinal fundus photographs
Output: Binary classification of referable diabetic retinopathy (y in {0,1})
- Defined as moderate and worse diabetic retinopathy, referable diabetic macular edema, or both
Gulshan, et al. Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs. JAMA, 2016.

Gulshan et al. 2016
Dataset:
- 128,175 images, each graded by 3-7 ophthalmologists.
- 54 total graders, each paid to grade between 20 and 62,508 images.
Data preprocessing:
- Circular mask of each image was detected and rescaled to be 299 pixels wide
Model:
- Inception-v3 CNN, with ImageNet pre-training
- Multiple binary cross-entropy losses corresponding to different binary prediction problems, which were then used for final determination of referable diabetic retinopathy
Note: pre-training means training first on a different (usually larger) dataset to learn generally useful visual features as a starting point.
Note: graders provided finer-grained labels which were then consolidated into (easier) binary prediction problems.
Gulshan, et al. Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs. JAMA, 2016.

Gulshan et al. 2016
Results:
- Evaluated using ROC curves, AUC, sensitivity and specificity analysis
Gulshan, et al. Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs. JAMA, 2016.

Gulshan et al. 2016
AUC 0.991
Looked at different operating points:
- High-specificity point approximated ophthalmologist specificity for comparison. Should also use high specificity to make decisions about high-risk actions.
- High-sensitivity point should be used for screening applications.
Gulshan, et al. Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs. JAMA, 2016.

Gulshan et al. 2016
Q: What could explain the difference in trends for reducing # grades / image on training set vs. tuning set, on tuning set performance?
Gulshan, et al. Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs. JAMA, 2016.

Esteva et al. 2017
- Two binary classification tasks on dermatology images: malignant vs. benign lesions of epidermal or melanocytic origin
- Inception-v3 (GoogLeNet) CNN with ImageNet pre-training
- Fine-tuned on dataset of 129,450 lesions (from several sources) comprising 2,032 diseases
- Evaluated model vs. 21 or more dermatologists in various settings
Esteva*, Kuprel*, et al. Dermatologist-level classification of skin cancer with deep neural networks. Nature, 2017.

Esteva et al. 2017
- Train on finer-grained classification (757 classes) but perform binary classification at inference time by summing probabilities of fine-grained sub-classes.
- The stronger fine-grained supervision during the training stage improves inference performance!
Esteva*, Kuprel*, et al. Dermatologist-level classification of skin cancer with deep neural networks. Nature, 2017.
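The fine-to-coarse trick can be sketched as follows; the class names, probabilities, and mapping here are invented for illustration and are not taken from the paper:

```python
# Hypothetical fine-grained softmax outputs (probabilities sum to 1)
class_probs = {"melanoma": 0.30, "basal cell carcinoma": 0.25,
               "benign nevus": 0.35, "seborrheic keratosis": 0.10}

# Hypothetical mapping of fine-grained sub-classes to the binary task
malignant = {"melanoma", "basal cell carcinoma"}

# Binary prediction at inference time: sum the sub-class probabilities
p_malignant = sum(p for c, p in class_probs.items() if c in malignant)
p_benign = 1.0 - p_malignant
```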

Esteva et al. 2017
- Evaluation of algorithm vs. dermatologists
Esteva*, Kuprel*, et al. Dermatologist-level classification of skin cancer with deep neural networks. Nature, 2017.

Rajpurkar et al. 2017
- Binary classification of pneumonia presence in chest X-rays
- Used ChestX-ray14 dataset with over 100,000 frontal X-ray images with 14 diseases
- 121-layer DenseNet CNN
- Compared algorithm performance with 4 radiologists
- Also applied algorithm to other diseases to surpass previous state-of-the-art on ChestX-ray14
Rajpurkar et al. CheXNet: Radiologist-Level Pneumonia Detection on Chest X-Rays with Deep Learning. 2017.

McKinney et al. 2020
- Binary classification of breast cancer in mammograms
- International dataset and evaluation, across UK and US
McKinney et al. International evaluation of an AI system for breast cancer screening. Nature, 2020.

Summary
Today we covered:
- Structure of neural network models
- Machine learning training loop and concept of loss, in the context of neural networks
- Minimizing the loss for complex neural networks: gradient descent and backpropagation
- Neural networks for a common type of input data: images (convolutional neural networks)
Next time: more on deep learning models for different types of input data and prediction tasks

