Visualizing Data using t-SNE: An Intuitive Introduction
Simon Carbonnelle, Université Catholique de Louvain, ICTEAM
12th of May, 2016
Outline:
- Visualization and Dimensionality Reduction
- Intuition behind t-SNE
- Visualizing representations
Visualization is key to understanding data easily.

[Scatter plot: house areas in m² against prices in thousands]

Question: Is the relation linear?
Dimensionality Reduction is a helpful tool for visualization.

- Dimensionality reduction algorithms:
  - Map high-dimensional data to a lower dimension
  - While preserving structure
- They are used for:
  - Visualization
  - Performance
  - The curse of dimensionality
- A ton of algorithms exist
- t-SNE is specialised for visualization, and has gained a lot of popularity
Intuition behind t-SNE
Dimensionality Reduction techniques solve optimization problems.

$X = \{x_1, x_2, \ldots, x_n \in \mathbb{R}^h\} \rightarrow Y = \{y_1, y_2, \ldots, y_n \in \mathbb{R}^l\}$, via $\min_Y C(X, Y)$

Three approaches for Dimensionality Reduction:
- Distance preservation
- Topology preservation
- Information preservation

t-SNE is distance-based, but tends to preserve topology.
SNE computes pair-wise similarities.

SNE converts Euclidean distances to similarities that can be interpreted as probabilities:

$p_{j|i} = \frac{\exp(-\|x_i - x_j\|^2 / 2\sigma_i^2)}{\sum_{k \neq i} \exp(-\|x_i - x_k\|^2 / 2\sigma_i^2)}$

$q_{j|i} = \frac{\exp(-\|y_i - y_j\|^2)}{\sum_{k \neq i} \exp(-\|y_i - y_k\|^2)}$

$p_{i|i} = 0, \quad q_{i|i} = 0$

Hence the name Stochastic Neighbor Embedding.
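These conditional similarities can be sketched in a few lines of NumPy (the function name and the vectorized distance computation are my own; this is an illustrative sketch, not a reference implementation):

```python
import numpy as np

def conditional_p(X, sigmas):
    """Conditional similarities p_{j|i} from pairwise Euclidean distances.

    X: (n, h) high-dimensional data; sigmas: (n,) per-point bandwidths.
    """
    # Squared Euclidean distance matrix ||x_i - x_j||^2
    sq = np.sum(X**2, axis=1)
    D = sq[:, None] + sq[None, :] - 2.0 * X @ X.T
    P = np.exp(-D / (2.0 * sigmas[:, None] ** 2))
    np.fill_diagonal(P, 0.0)           # p_{i|i} = 0 by definition
    P /= P.sum(axis=1, keepdims=True)  # each row becomes a probability distribution
    return P
```

Each row of the returned matrix is the distribution over neighbors of one datapoint, which is exactly what the KL-based cost in the next step compares.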
Pair-wise similarities should stay the same:

$p_{j|i} \approx q_{j|i}$
Kullback-Leibler divergence measures the faithfulness with which $q_{j|i}$ models $p_{j|i}$.

- $P_i = \{p_{1|i}, p_{2|i}, \ldots, p_{n|i}\}$ and $Q_i = \{q_{1|i}, q_{2|i}, \ldots, q_{n|i}\}$ are the distributions on the neighbors of datapoint $i$.
- The Kullback-Leibler (KL) divergence compares two distributions:

$C = \sum_i KL(P_i \| Q_i) = \sum_i \sum_j p_{j|i} \log \frac{p_{j|i}}{q_{j|i}}$

- KL divergence is asymmetric.
- KL divergence is always positive.
- We have our minimization problem: $\min_Y C(X, Y)$
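A direct transcription of this cost into NumPy might look as follows (the `eps` guard and the function name are my additions):

```python
import numpy as np

def sne_cost(P, Q, eps=1e-12):
    """C = sum_i KL(P_i || Q_i) = sum_i sum_j p_{j|i} log(p_{j|i} / q_{j|i}).

    P and Q are (n, n) row-stochastic similarity matrices with zero diagonals;
    eps keeps the logarithm finite where entries are zero.
    """
    return float(np.sum(P * np.log((P + eps) / (Q + eps))))
```

The asymmetry matters: a large $p_{j|i}$ modeled by a small $q_{j|i}$ is penalized heavily, while the reverse is cheap, so minimizing this cost pushes the embedding to preserve local structure first.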
Some remaining questions:

1. Why a radial basis function (exponential)?
   Focus on local geometry. This is why t-SNE can be interpreted as topology-based.
2. Why probabilities?
   Small distance does not mean proximity on the manifold. Probabilities are appropriate to model this uncertainty.
3. How do you choose $\sigma_i$?
The entropy of $P_i$ increases with $\sigma_i$.

Entropy: $H(P) = -\sum_i p_i \log_2 p_i$
Perplexity, a smooth measure of the number of neighbors.

Perplexity: $Perp(P) = 2^{H(P)}$

Examples: an entropy of 1.055 gives a perplexity of 2.078; an entropy of 3.800 gives a perplexity of 13.929.
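In practice each $\sigma_i$ is found by binary search so that the perplexity of $P_i$ matches a user-chosen value. A minimal sketch (the search bounds, tolerance, and function name are my own choices):

```python
import numpy as np

def find_sigma(sq_dists_i, target_perp, tol=1e-5, max_iter=100):
    """Binary-search sigma_i so that Perp(P_i) = 2^{H(P_i)} matches target_perp.

    sq_dists_i: squared distances from point i to every other point.
    Perplexity grows monotonically with sigma, so bisection works.
    """
    lo, hi = 1e-10, 1e10
    for _ in range(max_iter):
        sigma = (lo + hi) / 2.0
        p = np.exp(-sq_dists_i / (2.0 * sigma**2))
        p /= p.sum()
        H = -np.sum(p * np.log2(p + 1e-12))  # entropy in bits
        perp = 2.0 ** H
        if abs(perp - target_perp) < tol:
            break
        if perp > target_perp:  # too many effective neighbors -> shrink sigma
            hi = sigma
        else:
            lo = sigma
    return sigma
```

This is why the user-facing knob of t-SNE is a perplexity value rather than a bandwidth: one search per datapoint adapts $\sigma_i$ to the local density.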
From SNE to t-SNE.

SNE:
- Modelisation: $p_{j|i} = \frac{\exp(-\|x_i - x_j\|^2 / 2\sigma_i^2)}{\sum_{k \neq i} \exp(-\|x_i - x_k\|^2 / 2\sigma_i^2)}$, $\quad q_{j|i} = \frac{\exp(-\|y_i - y_j\|^2)}{\sum_{k \neq i} \exp(-\|y_i - y_k\|^2)}$
- Cost function: $C = \sum_i KL(P_i \| Q_i)$
- Derivatives: $\frac{dC}{dy_i} = 2 \sum_j (p_{j|i} - q_{j|i} + p_{i|j} - q_{i|j})(y_i - y_j)$

Symmetric SNE:
- Modelisation: $p_{ij} = \frac{p_{j|i} + p_{i|j}}{2n}$, $\quad q_{ij} = \frac{\exp(-\|y_i - y_j\|^2)}{\sum_{k \neq l} \exp(-\|y_k - y_l\|^2)}$
- Cost function: $C = KL(P \| Q)$
- Derivatives: $\frac{dC}{dy_i} = 4 \sum_j (p_{ij} - q_{ij})(y_i - y_j)$
- Faster computation

t-SNE:
- Modelisation: $p_{ij} = \frac{p_{j|i} + p_{i|j}}{2n}$, $\quad q_{ij} = \frac{(1 + \|y_i - y_j\|^2)^{-1}}{\sum_{k \neq l} (1 + \|y_k - y_l\|^2)^{-1}}$
- Cost function: $C = KL(P \| Q)$
- Derivatives: $\frac{dC}{dy_i} = 4 \sum_j (p_{ij} - q_{ij})(y_i - y_j)(1 + \|y_i - y_j\|^2)^{-1}$
- Even faster computation
- Better behaviour
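Under the t-SNE definitions, the low-dimensional similarities and the gradient can be written compactly. A sketch (the matrix formulation and names are mine; real implementations add the optimization tricks discussed later):

```python
import numpy as np

def tsne_q_and_grad(Y, P):
    """t-SNE low-dimensional similarities q_ij and gradient dC/dy_i.

    Y: (n, l) embedding; P: (n, n) symmetric joint probabilities p_ij.
    """
    sq = np.sum(Y**2, axis=1)
    # Student-t kernel (1 + ||y_i - y_j||^2)^{-1}
    num = 1.0 / (1.0 + sq[:, None] + sq[None, :] - 2.0 * Y @ Y.T)
    np.fill_diagonal(num, 0.0)
    Q = num / num.sum()  # normalize over all pairs k != l
    # dC/dy_i = 4 sum_j (p_ij - q_ij)(y_i - y_j)(1 + ||y_i - y_j||^2)^{-1}
    PQd = (P - Q) * num
    grad = 4.0 * (np.diag(PQd.sum(axis=1)) - PQd) @ Y
    return Q, grad
```

Note that, unlike SNE's $q_{j|i}$, the matrix $Q$ is normalized by a single sum over all pairs, which is part of what makes the symmetric formulations cheaper per iteration.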
The "Crowding Problem"

There is much more space in high dimensions.
Mismatched tails can compensate for mismatched dimensionalities.

The Student-t distribution has heavier tails.
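The heavier tail is easy to verify numerically: the ratio between the Student-t kernel with one degree of freedom (used by t-SNE in the low-dimensional space) and the Gaussian kernel (used by SNE) grows without bound as the pairwise distance increases:

```python
import numpy as np

d = np.array([1.0, 2.0, 3.0, 5.0])   # pairwise distances
gauss = np.exp(-d**2)                # Gaussian kernel (SNE)
student = 1.0 / (1.0 + d**2)         # Student-t kernel, 1 d.o.f. (t-SNE)
ratio = student / gauss              # grows monotonically with distance
```

Moderately distant points therefore receive proportionally more probability mass in the map than under a Gaussian, which lets them sit further apart and counteracts crowding.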
Last but not least: Optimization.

$\min_Y C(X, Y)$, with $C = KL(P \| Q) = \sum_i \sum_j p_{ij} \log \frac{p_{ij}}{q_{ij}}$

- Non-convex
- Gradient descent with momentum and an adaptive learning rate:

$Y^{(t)} = Y^{(t-1)} - \eta(t) \frac{\delta C}{\delta Y} + \alpha(t)(Y^{(t-1)} - Y^{(t-2)})$

- Two tricks:
  - Early compression
  - Early exaggeration

Illustration: Colah's blog
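The update rule alone can be illustrated on a toy quadratic cost (everything here except the update rule is a stand-in: real t-SNE plugs in the KL gradient from earlier and anneals $\eta$ and $\alpha$, whereas this sketch keeps them constant):

```python
import numpy as np

def descend(grad, Y, eta=0.1, alpha=0.5, iters=200):
    """Gradient descent with momentum:
    Y(t) = Y(t-1) - eta * dC/dY + alpha * (Y(t-1) - Y(t-2))."""
    Y_prev = Y.copy()
    for _ in range(iters):
        Y_new = Y - eta * grad(Y) + alpha * (Y - Y_prev)
        Y_prev, Y = Y, Y_new
    return Y

Y0 = np.array([5.0, -3.0])
Y_star = descend(lambda Y: 2.0 * Y, Y0)  # toy cost ||Y||^2, minimum at 0
```

Early exaggeration multiplies all $p_{ij}$ by a constant (e.g. 4) during the first iterations, forcing tight, well-separated clusters that the later iterations refine.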
Visualizing representations
Mapping raw data to distributed representations.

- Feature engineering is often laborious.
- A newer trend is to automatically learn adequate features or representations.
- Ultimate goal: enable AI to extract useful features from raw sensory data.

t-SNE can be used to make sense of the learned representations!
Using t-SNE to explore a word embedding.

- The system outputs 1 if the central word is in the right context, 0 otherwise.
- The algorithm learns the representation and the classification simultaneously.

From Machine Learning to Machine Reasoning, L. Bottou (2011)

Goal: the representation captures syntactic and semantic similarity.
Using t-SNE to explore a word embedding. (Visualization: http://colah.github.io/)
Exploring a Wikipedia article embedding. (Visualization: http://colah.github.io/)
Exploring game state representations: Google DeepMind plays Atari games.

Playing Atari with Deep Reinforcement Learning, V. Mnih et al.

Goal: learning to play Space Invaders from score feedback and raw pixel values.
Exploring game state representations: Google DeepMind plays Atari games.

- A representation is learned with a convolutional neural network.
- From 84×84×4 = 28,224 pixel values to 512 neurons.
- Predicts the expected score if a certain action is taken.

Human-level control through deep reinforcement learning, V. Mnih et al. (Nature, 2015)
Using t-SNE to explore image representations: classifying dogs and cats.

- Each data point is an image of a dog or a cat.
- Red: cats; blue: dogs.
Using t-SNE to explore image representations: classifying dogs and cats.

Representation: a convolutional net trained for image classification (1000 classes).
Conclusion

- The t-SNE algorithm reduces dimensionality while preserving local similarity.
- The t-SNE algorithm was built heuristically.
- t-SNE is commonly used to visualize representations.
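For reference, this is roughly how t-SNE is invoked off the shelf with scikit-learn (assuming scikit-learn is installed; the dataset and parameter choices here are illustrative):

```python
from sklearn.datasets import load_digits
from sklearn.manifold import TSNE

X = load_digits().data[:300]  # 300 images of 8x8 = 64 raw pixel features
Y = TSNE(n_components=2, perplexity=30.0,
         init="pca", random_state=0).fit_transform(X)
# Y is a (300, 2) embedding, ready for a scatter plot colored by digit label
```

The `perplexity` parameter is exactly the smooth neighbor count discussed above, and is the main knob worth tuning in practice.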