INFO 1998: Introduction To Machine Learning

3y ago
27 Views
3 Downloads
1.01 MB
28 Pages
Last View : 16d ago
Last Download : 3m ago
Upload by : Mariam Herr
Transcription

INFO 1998: Introduction to Machine Learning

Lecture 9: Clustering and Unsupervised LearningINFO 1998: Introduction to Machine Learning

Recap: Supervised Learning The training data you feed into your algorithm includes desired solutionsTwo types you’ve seen so far: regressors and classifiersIn both cases, there are definitive “answers” to learn fromExample 1: RegressorPredicts valueExample 2: ClassifierPredicts label

Recap: Supervised LearningSupervised learning algorithms we have covered so far: k-Nearest NeighborsPerceptronLogistic RegressionDecision Trees and Random ForestLinear Regression

What are some limitations ofsupervised learning?

Today: Unsupervised Learning In unsupervised learning, the training data is unlabeledAlgorithm tries to learn by itselfAn Example: Clustering

Unsupervised LearningSome types of unsupervised learning problems:1Clusteringk-Means, Hierarchical Cluster Analysis (HCA), Gaussian Mixture Models (GMMs), etc.2Dimensionality ReductionPrincipal Component Analysis (PCA), Locally Linear Embedding (LLE)3Association Rule LearningApriori, Eclat, Market Basket Analysis More

Unsupervised LearningSome types of unsupervised learning problems:1Clusteringk-Means, Hierarchical Cluster Analysis (HCA), Gaussian Mixture Models (GMMs), etc.2Dimensionality ReductionPrincipal Component Analysis (PCA), Locally Linear Embedding (LLE)3Association Rule LearningApriori, Eclat, Market Basket Analysis More

Cluster Analysis

Cluster Analysis Loose definition: Clusters have objects which are “similar in some way” (and“dissimilar to objects in other clusters)Clusters are latent variables (variables that are unknown)Understanding clusters can:- Yield underlying trends in data- Supply useful parameters for predictive analysis- Challenge boundaries for pre-defined classes and variables

Clustering ApplicationRecommender SystemsIntuition: People who are “similar”, will like the same thingsA Bunch of Cool Logos

Clustering ApplicationFinding Population Structure in Genetic Data

Running Example: Recommender SystemsUse 1: Collaborative Filtering “People similar to you also liked X” Use other’s rating to suggest contentProsConsIf cluster behavior is clear,can yield good insightsComputationally expensiveCan lead to dominance of certaingroups in predictions

Running Example: Recommend MOVIES

Running Example: Recommender SystemsUse 2: Content filtering “Content similar to what YOU are viewing” Use user’s watch history to suggest contentProsRecommendations made bylearner are intuitiveScalableConsLimited in scope and applicability

Another Example: Cambridge Analytica Uses Facebook profiles to build psychological profiles,then use traits for target advertisingEx. has personality test measuring openness,conscientiousness, extroversion, agreeableness andneuroticism - different types of ads

How do we actually perform this“cluster analysis”?

Popular Clustering AlgorithmsHierarchicalCluster Analysis(HCA)k-MeansClusteringGaussianMixture Models(GMMs)

Defining ‘Similarity’ How do we calculate proximity of different data points?Euclidean distance: Other distance measures: Squared euclidean distance, manhattan distance

Algorithm 1: Hierarchical ClusteringTwo types: Agglomerative Clustering Creates a tree ofincreasingly large clusters(Bottom-up) Divisive Hierarchical Clustering Creates a tree ofincreasingly small clusters(Top-down)

Agglomerative Clustering Algorithm Steps:- Start with each point in its own cluster- Unite adjacent clusters together- Repeat Creates a tree of increasingly largeclusters

Agglomerative Clustering AlgorithmHow do we visualize clustering?Using dendrograms Each width represents distance betweenclusters before joiningUseful for estimating how many clustersyou haveThe iris dataset that we all love

Demo 1

Popular Clustering AlgorithmsHierarchicalCluster Analysis(HCA)k-MeansClusteringGaussianMixture Models(GMMs)

Algorithm 2: k-Means ClusteringInput parameter: k Starts with k random centroids Cluster points by calculating distancefor each point from centroids Take average of clustered points Use as new centroids Repeat until convergenceInteractive Demo: ans/kmeans.html

Algorithm 2: k-Means Clustering A greedy algorithmDisadvantages: Initial means are randomly selected which can cause suboptimal partitionsPossible Solution: Try a number of different starting points Depends on the value of k

Demo 2

Coming Up Assignment 9 is Optional: Will replace your second lowest score if you submit Due at 5:30pm on December 16th, 2020 Last Lecture: Real-world applications of machine learning (December 16th, 2020) Final Project: Due on December 16th, 2020

Decision Trees and Random Forest Linear Regression. What are some limitations of supervised learning? Today: Unsupervised Learning In unsupervised learning, the training data is unlabeled . An Example: Clustering. Unsupervised Learning 1 3 .

Related Documents:

When Restaurant Manager exports its batch sales transactions, the resulting data is in a "horizontal" structure, similar to the following: Ticket Info Employee Info Sale Info Sale Info Sale Info Sale Info Tender Info Member Info Tip Info For Club Office to be able to use the data, it must be converted into the vertical structure that

decoration machine mortar machine paster machine plater machine wall machinery putzmeister plastering machine mortar spraying machine india ez renda automatic rendering machine price wall painting machine price machine manufacturers in china mail concrete mixer machines cement mixture machine wall finishing machine .

BASPINAL@SHAW.CA (Office Email) Contact Info Contact Info Contact Info Contact Info Contact Info Contact Info Contact Info Contact Info 8 JAG-2013-02024 s.22. FIGR0171 2013-12-04 10:55 AM Business Licences Expiring Between 2013-Dec-

info@wonderware.ru Moscow tel. 7 495 641 1616 info@wonderware.ru Кiev tel. 38 044 495 33 40 info@wonderware.com.ua Helsinki tel. 358 9 540 4940 automation@klinkmann.fi Vilnius tel. 370 5 215 1646 info@wonderware.lt Rīga tel. 371 6738 1617 info@wonderware.lv Мinsk tel. 375 17 200 0876 info@wonderware.by Tallinn tel. 372 668 4500 info .

S. NO. District Facility Name Equipment name Manufacturer Model Installation date (m-yy) 31 BANKURA BPHC/CHHATNA CENTRIFUGE MACHINE REMI C854/6 No Info. 32 BANKURA BPHC/CHHATNA INCUBATOR NARAYAN IND No Info. No Info. 32 BANKURA BPHC/CHHATNA BINOCULAR MICROSCOPE OLYMPUS No Info. No Info. 33 BANKURA BPHC/CHHATNA COLORIMETER BEACON OCTA1 No Info.

work/products (Beading, Candles, Carving, Food Products, Soap, Weaving, etc.) ⃝I understand that if my work contains Indigenous visual representation that it is a reflection of the Indigenous culture of my native region. ⃝To the best of my knowledge, my work/products fall within Craft Council standards and expectations with respect to

Introduction to machine and machine tools Research · April 2015 DOI: 10.13140/RG.2.1.1419.7285 CITATIONS 0 READS 43,236 1 author: . machine and power hacksaws lathe machine, Planer lathe machine, Sloter lathe machine etc. Basics of Mechanical Engineering (B.M.E) Brown Hill College of Engg. & Tech.

trigger stroke inspection g 2-12 1998 trigger sensitivity adjustment g 2-12 1998 safety inspection g 2-13 1998 power regulator inspection g 2-13 1998 speargun sten (pipin line) schematic and parts list g 2-14 1998 speargun trouble shooting g 3-1 1998 index mares service manual contents