INFO 1998: Introduction to Machine Learning
Lecture 9: Clustering and Unsupervised Learning
Recap: Supervised Learning
- The training data you feed into your algorithm includes the desired solutions
- Two types you’ve seen so far: regressors and classifiers
- In both cases, there are definitive “answers” to learn from
- Example 1: Regressor (predicts a value)
- Example 2: Classifier (predicts a label)

Recap: Supervised Learning
Supervised learning algorithms we have covered so far:
- k-Nearest Neighbors
- Perceptron
- Logistic Regression
- Decision Trees and Random Forest
- Linear Regression

What are some limitations of supervised learning?

Today: Unsupervised Learning
- In unsupervised learning, the training data is unlabeled
- The algorithm tries to learn by itself
- An example: clustering
Unsupervised Learning
Some types of unsupervised learning problems:
1. Clustering: k-Means, Hierarchical Cluster Analysis (HCA), Gaussian Mixture Models (GMMs), etc.
2. Dimensionality Reduction: Principal Component Analysis (PCA), Locally Linear Embedding (LLE)
3. Association Rule Learning: Apriori, Eclat, Market Basket Analysis, and more
Cluster Analysis
Cluster Analysis
- Loose definition: clusters contain objects which are “similar in some way” (and “dissimilar” to objects in other clusters)
- Clusters are latent variables (variables that are unknown, i.e. not directly observed in the data)
- Understanding clusters can:
  - Yield underlying trends in data
  - Supply useful parameters for predictive analysis
  - Challenge boundaries for pre-defined classes and variables
Clustering Application
Recommender Systems
- Intuition: people who are “similar” will like the same things
[Image: a bunch of cool logos]

Clustering Application
Finding Population Structure in Genetic Data
Running Example: Recommender Systems
Use 1: Collaborative Filtering
- “People similar to you also liked X”
- Use others’ ratings to suggest content
Pros:
- If cluster behavior is clear, can yield good insights
Cons:
- Computationally expensive
- Can lead to dominance of certain groups in predictions
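As a rough illustration of the "people similar to you also liked X" idea, here is a minimal sketch. The users, titles, and ratings are invented for illustration, and using Euclidean distance over co-rated movies to find the single most similar user is an assumption of this sketch, not something the slides prescribe.

```python
import math

# Hypothetical ratings (1-5), invented for illustration: user -> {movie: rating}
ratings = {
    "ana":  {"Movie A": 5, "Movie B": 4, "Movie C": 1},
    "ben":  {"Movie A": 5, "Movie B": 5, "Movie C": 1, "Movie D": 5},
    "cara": {"Movie A": 1, "Movie B": 2, "Movie C": 5, "Movie E": 5},
}

def rating_distance(u, v):
    """Euclidean distance between two users over the movies both have rated."""
    shared = ratings[u].keys() & ratings[v].keys()
    # Caveat: users with no shared movies get distance 0 here; a real system
    # would need to treat that case separately.
    return math.sqrt(sum((ratings[u][m] - ratings[v][m]) ** 2 for m in shared))

def recommend(user, min_rating=4):
    """Suggest unseen movies that the most similar other user rated highly."""
    others = [u for u in ratings if u != user]
    nearest = min(others, key=lambda u: rating_distance(user, u))
    return sorted(
        movie for movie, score in ratings[nearest].items()
        if movie not in ratings[user] and score >= min_rating
    )

print(recommend("ana"))  # ['Movie D']
```

Every user is compared against every other user here, which hints at why this approach becomes computationally expensive as the user base grows.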
Running Example: Recommend MOVIES
Running Example: Recommender Systems
Use 2: Content Filtering
- “Content similar to what YOU are viewing”
- Use the user’s watch history to suggest content
Pros:
- Recommendations made by the learner are intuitive
- Scalable
Cons:
- Limited in scope and applicability
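Content filtering can be sketched in a similar way, this time comparing items instead of users. The feature vectors below (action, comedy, romance scores) are hypothetical values made up for this example, and ranking by Euclidean distance is just one possible choice.

```python
import math

# Hypothetical genre features, invented for illustration: (action, comedy, romance)
features = {
    "Movie A": (0.9, 0.1, 0.0),
    "Movie B": (0.8, 0.2, 0.1),
    "Movie C": (0.1, 0.9, 0.6),
    "Movie D": (0.0, 0.3, 0.9),
}

def similar_to(title, n=1):
    """Return the n titles whose features are closest to the given title."""
    others = [t for t in features if t != title]
    ranked = sorted(others, key=lambda t: math.dist(features[title], features[t]))
    return ranked[:n]

print(similar_to("Movie A"))     # ['Movie B']
print(similar_to("Movie D", 2))  # ['Movie C', 'Movie B']
```

Only item features are needed, not other users' ratings, which is one reason this approach tends to scale well but can only suggest content resembling what was already watched.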
Another Example: Cambridge Analytica
- Uses Facebook profiles to build psychological profiles, then uses those traits for targeted advertising
- Ex.: has a personality test measuring openness, conscientiousness, extroversion, agreeableness and neuroticism; different personality types get different types of ads

How do we actually perform this “cluster analysis”?

Popular Clustering Algorithms
- Hierarchical Cluster Analysis (HCA)
- k-Means Clustering
- Gaussian Mixture Models (GMMs)
Defining ‘Similarity’
- How do we calculate the proximity of different data points?
- Euclidean distance: $d(p, q) = \sqrt{\sum_{i=1}^{n} (p_i - q_i)^2}$
- Other distance measures: squared Euclidean distance, Manhattan distance
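The three distance measures named above can be written out directly. This is a small sketch using only the standard library; the function names are chosen for this example.

```python
import math

def euclidean(p, q):
    """Straight-line distance: square root of the summed squared differences."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))

def squared_euclidean(p, q):
    """Same ordering of neighbors as Euclidean distance, without the square root."""
    return sum((a - b) ** 2 for a, b in zip(p, q))

def manhattan(p, q):
    """Sum of absolute differences along each coordinate."""
    return sum(abs(a - b) for a, b in zip(p, q))

p, q = (0.0, 0.0), (3.0, 4.0)
print(euclidean(p, q))          # 5.0
print(squared_euclidean(p, q))  # 25.0
print(manhattan(p, q))          # 7.0
```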
Algorithm 1: Hierarchical Clustering
Two types:
- Agglomerative Clustering: creates a tree of increasingly large clusters (bottom-up)
- Divisive Hierarchical Clustering: creates a tree of increasingly small clusters (top-down)
Agglomerative Clustering Algorithm
Steps:
- Start with each point in its own cluster
- Unite the closest (adjacent) clusters
- Repeat until the desired number of clusters, or a single cluster, remains
Creates a tree of increasingly large clusters
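The steps above can be sketched in a few lines. The slides do not say how the distance between two clusters is measured, so this sketch assumes single linkage (the distance between the closest pair of points); other linkage rules would change which clusters get united.

```python
import math
from itertools import combinations

def single_link(a, b):
    """Assumed linkage: distance between the closest pair of points in a and b."""
    return min(math.dist(p, q) for p in a for q in b)

def agglomerative(points, n_clusters=1):
    """Bottom-up clustering; returns the clusters and the distance of each merge."""
    clusters = [[p] for p in points]          # start: each point in its own cluster
    merge_distances = []
    while len(clusters) > n_clusters:
        # find the pair of clusters that are closest to each other
        i, j = min(
            combinations(range(len(clusters)), 2),
            key=lambda ij: single_link(clusters[ij[0]], clusters[ij[1]]),
        )
        merge_distances.append(single_link(clusters[i], clusters[j]))
        clusters[i] = clusters[i] + clusters[j]   # unite them (i < j always holds)
        del clusters[j]
    return clusters, merge_distances

demo_points = [(0, 0), (0, 1), (5, 5), (5, 6)]
clusters, merge_distances = agglomerative(demo_points, n_clusters=2)
print(clusters)         # [[(0, 0), (0, 1)], [(5, 5), (5, 6)]]
print(merge_distances)  # [1.0, 1.0]
```

This brute-force version recomputes all pairwise cluster distances on every pass, so it is only suitable for small examples.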
Agglomerative Clustering Algorithm
How do we visualize clustering? Using dendrograms
- Each width represents the distance between clusters before joining
- Useful for estimating how many clusters you have
[Figure: dendrogram of the iris dataset that we all love]
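One common way to read a cluster count off a dendrogram is to look for the widest gap between successive merge distances and cut there. The sketch below assumes that heuristic; it is not necessarily how the lecture demo estimates the number of clusters, and the function name is made up for this example.

```python
def estimate_n_clusters(merge_distances):
    """Guess the cluster count by cutting at the widest gap between merge distances."""
    heights = sorted(merge_distances)
    n_points = len(heights) + 1               # n points produce n - 1 merges
    if len(heights) < 2:
        return 1                              # too few merges to compare gaps
    gaps = [later - earlier for earlier, later in zip(heights, heights[1:])]
    widest = max(range(len(gaps)), key=gaps.__getitem__)
    merges_kept = widest + 1                  # merges that happen below the cut
    return n_points - merges_kept

print(estimate_n_clusters([1.0, 1.0, 6.4]))            # 2
print(estimate_n_clusters([0.5, 0.6, 0.7, 4.0, 4.2]))  # 3
```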
Demo 1
Popular Clustering Algorithms
- Hierarchical Cluster Analysis (HCA)
- k-Means Clustering
- Gaussian Mixture Models (GMMs)
Algorithm 2: k-Means Clustering
Input parameter: k
- Start with k random centroids
- Cluster the points by calculating the distance from each point to every centroid and assigning the point to the nearest one
- Take the average of the points in each cluster
- Use these averages as the new centroids
- Repeat until convergence
Interactive Demo: ans/kmeans.html
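A minimal sketch of the loop described above, in plain Python. Picking k random data points as the starting centroids, the optional `init` argument, and stopping when the centroids no longer change are choices made for this example.

```python
import math
import random

def kmeans(points, k, init=None, max_iter=100, seed=0):
    """Return (centroids, clusters) after running k-means on a list of tuples."""
    rng = random.Random(seed)
    # Assumption: random starting centroids are drawn from the data points.
    centroids = list(init) if init is not None else rng.sample(points, k)
    clusters = []
    for _ in range(max_iter):
        # Assignment step: each point joins the cluster of its nearest centroid.
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda c: math.dist(p, centroids[c]))
            clusters[nearest].append(p)
        # Update step: each centroid moves to the average of its points
        # (an empty cluster keeps its previous centroid).
        new_centroids = [
            tuple(sum(axis) / len(cluster) for axis in zip(*cluster))
            if cluster else centroids[i]
            for i, cluster in enumerate(clusters)
        ]
        if new_centroids == centroids:        # convergence: nothing moved
            break
        centroids = new_centroids
    return centroids, clusters

demo_points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
centroids, clusters = kmeans(demo_points, k=2, init=[(0, 0), (10, 10)])
print(clusters)  # [[(0, 0), (0, 1), (1, 0)], [(10, 10), (10, 11), (11, 10)]]
```

Calling `kmeans(demo_points, k=2)` without `init` uses the seeded random start instead.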
Algorithm 2: k-Means Clustering
A greedy algorithm
Disadvantages:
- Initial means are randomly selected, which can cause suboptimal partitions
  - Possible solution: try a number of different starting points
- Depends on the value of k
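The "try a number of different starting points" idea can be sketched as follows. Scoring each run by its within-cluster sum of squared distances and keeping the lowest is an assumption of this sketch (it is a common criterion, but the slides do not name one), and the helper names are made up for this example.

```python
import math
import random

def lloyd(points, centroids, max_iter=100):
    """Plain k-means iterations from the given starting centroids."""
    k = len(centroids)
    clusters = []
    for _ in range(max_iter):
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda c: math.dist(p, centroids[c]))
            clusters[nearest].append(p)
        updated = [
            tuple(sum(axis) / len(cl) for axis in zip(*cl)) if cl else centroids[i]
            for i, cl in enumerate(clusters)
        ]
        if updated == centroids:
            break
        centroids = updated
    return centroids, clusters

def within_cluster_ss(centroids, clusters):
    """Sum of squared distances from each point to its cluster's centroid."""
    return sum(
        math.dist(p, c) ** 2 for c, cluster in zip(centroids, clusters) for p in cluster
    )

def best_of_restarts(points, k, n_restarts=10, seed=0):
    """Run k-means from several random starts and keep the lowest-scoring result."""
    rng = random.Random(seed)
    best = None
    for _ in range(n_restarts):
        centroids, clusters = lloyd(points, rng.sample(points, k))
        score = within_cluster_ss(centroids, clusters)
        if best is None or score < best[0]:
            best = (score, centroids, clusters)
    return best

demo_points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
score, centroids, clusters = best_of_restarts(demo_points, k=2)
print(round(score, 3), centroids)
```

Restarts reduce the risk of a poor random start but do not address the other disadvantage: the result still depends on the chosen value of k.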
Demo 2
Coming Up
- Assignment 9 is optional: it will replace your second lowest score if you submit
  - Due at 5:30pm on December 16th, 2020
- Last Lecture: Real-world applications of machine learning (December 16th, 2020)
- Final Project: Due on December 16th, 2020