Introduction To Machine Learning & Data Mining

2y ago

28 Views

3 Downloads

7.11 MB

104 Pages

Last View : 2m ago

Last Download : 3m ago

Upload by : Cade Thielen

Report this link

Download PDF

Transcription

Introduction to Machine Learning & Data MiningJennifer NevillePurdue UniversityMay 24, edu/homes/neville/iris.dat

Data miningThe process of identifying valid, novel, potentially useful, andultimately understandable patterns in data(Fayyad, Piatetsky-Shapiro & Smith 1996)Artificial IntelligenceDatabasesVisualizationStatistics

ExampleDuring WWII, statistician Abraham Wald was asked tohelp the British decide where to add armor to their planes

The data revolutionThe last 35 years of research in ML/DM has resulted inwide spread adoption of predictive analytics toautomate and improve decision making.As “big data” efforts increase the collection of data so will the need for new data science methodology.Data today have more volume, velocity, variety, etc.Machine learning research develops statistical tools,models & algorithms that address these complexities.Data mining research focuses on how to scale tomassive data and how to incorporate feedbackto improve accuracy while minimizing effort.

The data mining eddataMachine ining

Overview Task specification Data representation Knowledge representation Learning technique Search scoring Prediction and/or interpretation

Task specification Objective of the person who is analyzing the data Description of the characteristics of the analysis and desired result Examples: From a set of labeled examples, devise an understandable model that willaccurately predict whether a stockbroker will commit fraud in the nearfuture. From a set of unlabeled examples, cluster stockbrokers into a set ofhomogeneous groups based on their demographic information

Exploratory data analysis Goal Interact with data withoutclear objective Techniques Visualization, adhocmodeling

Descriptive modeling Goal Summarize the dataor the underlyinggenerative processBn TechniquesBnFirmBroker (Bk)DisclosureBranch (Bn)BnSize Density estimation,cluster analysis andsegmentationProblemIn eBkAreaBkLayoffsBnOnWatchlistBkBnAlso known as: unsupervised learning

Predictive modeling Goal Learn model to predictunknown class labelvalues given observedattribute valuesBrokerAge 27Current CoWorkerCount 8Current BranchMode(Location) NY Techniques Classification, regression703564Current FirmAvg(Size) 12DisclosureCount(Yr 1995) 0Past CoWorkerCount(Gender M) 1510DisclosureCount 5Current BranchMode(Location) AZ7179218DisclosureCount(Type CC) 0Past FirmAvg(Size) 90200Past FirmMax(Size) 100049Past CoWorkerCount 35Current RegulatorMode(Status) RegBrokerYears In Industry 1639Also known as: supervised learning34249554

Pattern discovery Goal Detect patterns and rulesthat describe sets ofexamples Techniques --- --- ---- Association rules, graphmining, anomaly detection ---- - ---Model: global summary of a data setPattern: local to a subset of the data --

Overview Task specification Data representation Knowledge representation Learning technique Search scoring Prediction and/or interpretation

Data representation Choice of data structure for representing individual and collections ofmeasurements Individual measurements: single observations (e.g., person’s date of birth,product price) Collections of measurements: sets of observations that describe an instance(e.g., person, product) Choice of representation e of interest given knownvalues of other variables Focus on modeling the conditional distribution P( Y X ) or on modelingthe decision boundary for Y

Learning predictive models Choose a data representation Select a knowledge representation (a “model”) Defines a space of possible models M {M1, M2, ., Mk} Use search to identify “best” model(s) Search the space of models (i.e., with alternative structures and/orparameters) Evaluate possible models with scoring function to determine the modelwhich best fits the data

Knowledge representation Underlying structure of the model or patterns that we seek from the data Defines space of possible models for algorithm to search over Model: high-level global description of dataset “All models are wrong, some models are useful”G. Box and N. Draper (1987) Choice of model family determines space of parameters and structure Estimate model parameters and possibly model structure from training data

Classification treeBrokerAge 27Current CoWorkerCount 8Current BranchMode(Location) NY703564Current FirmAvg(Size) 12DisclosureCount(Yr 1995) 0Past CoWorkerCount(Gender M) 1510DisclosureCount 5Current BranchMode(Location) AZ7Model space:all possible decision trees1792189DisclosureCount(Type CC) 0Past FirmAvg(Size) 90200Past FirmMax(Size) 100049Past CoWorkerCount 35Current RegulatorMode(Status) RegBrokerYears In Industry 16334249554

Scoring functions Given a model M and dataset D, we would like to “score” model M withrespect to D Goal is to rank the models in terms of their utility (for capturing D)and choose the “best” model Score function can be used to search over parameters and/ormodel structure Score functions can be diﬀerent for: Models vs. patterns Predictive vs. descriptive functions Models with varying complexity (i.e., number parameters)

Predictive scoring functions Assess the quality of predictions for a set of instances Measures diﬀerence between the prediction M makes for aninstance i and the true class label value of iS(M ) NtestXi 1Sum overexamples d f (x(i); M ), y(i)Distance betweenpredicted and truePredictedclass labelfor item i Trueclass labelfor item i

What space are we searching?Learned model ( 0 0.8, 1 0.4)XModelScoreModel spaceAlex Holehuse, Notes from Andrew Ng’s Machine Learning Class, http://www.holehouse.org/mlclass/01 02 Introduction regression analysis and gr.html

Searching over models/patterns Consider a space of possible models M {M1, M2, ., Mk} with parameters θ Search could be over model structures or parameters, e.g.: Parameters: In a linear regression model, find the regressioncoeﬃcients (β) that minimize squared loss on the training data Model structure: In a decision trees, find the tree structure thatmaximizes accuracy on the training data

Decision trees

Tree models Easy to understand knowledgerepresentation Can handle mixed variables Recursive, divide and conquerlearning method Eﬃcient inference

Tree learning Top-down recursive divide and conquer algorithm Start with all examples at root Select best attribute/feature Partition examples by selected attribute Recurse and repeat Other issues: How to construct features When to stop growing Pruning irrelevant parts of the tree

FraudAgeDegreeStartYrSeries7 22Y2005N-25N2003Y-31Y1995Y-27Y1999Y 24N2006N-29N2003NYchoose split on Series7Score each attribute splitfor these instances:Age, Degree, StartYr, StartYrSeries7-25N2003Y 22Y2005N-31Y1995Y 24N2006N-27Y1999Y-29N2003NYchoose split on Age 28ScoreN each attribute splitfor these instances:Age, Degree, StartYrFr

Data mining process 6 CS590D 12 Data Mining: Classification Schemes General functionality – Descriptive data mining – Predictive data mining Different views, different classifications – Kinds of data to be mined – Kinds of knowledge to be discovered – Kinds of techniqu

Related Documents:

Specification and Price of Automatic Rendering Machine (FOB ... - AR

decoration machine mortar machine paster machine plater machine wall machinery putzmeister plastering machine mortar spraying machine india ez renda automatic rendering machine price wall painting machine price machine manufacturers in china mail concrete mixer machines cement mixture machine wall finishing machine .

15 Views

3m ago

Mathematical Methods in Machine Learning - UMD

Machine learning has many different faces. We are interested in these aspects of machine learning which are related to representation theory. However, machine learning has been combined with other areas of mathematics. Statistical machine learning. Topological machine learning. Computer science. Wojciech Czaja Mathematical Methods in Machine .

26 Views

1y ago

Lecture 1: Machine Learning Problem - University of Adelaide

Machine Learning Real life problems Lecture 1: Machine Learning Problem Qinfeng (Javen) Shi 28 July 2014 Intro. to Stats. Machine Learning . Learning from the Databy Yaser Abu-Mostafa in Caltech. Machine Learningby Andrew Ng in Stanford. Machine Learning(or related courses) by Nando de Freitas in UBC (now Oxford).

36 Views

1y ago

Artificial Intelligence, Machine Learning, Deep Learning ...

Artificial Intelligence, Machine Learning, and Deep Learning (AI/ML/DL) F(x) Deep Learning Artificial Intelligence Machine Learning Artificial Intelligence Technique where computer can mimic human behavior Machine Learning Subset of AI techniques which use algorithms to enable machines to learn from data Deep Learning

175 Views

3y ago

Machine Learning - B. Supervised Learning: Nonlinear Models B.5. A ...

Machine Learning Machine Learning B. Supervised Learning: Nonlinear Models B.5. A First Look at Bayesian and Markov Networks Lars Schmidt-Thieme Information Systems and Machine Learning Lab (ISMLL) Institute for Computer Science University of Hildesheim, Germany Lars Schmidt-Thieme, Information Systems and Machine Learning Lab (ISMLL .

13 Views

1y ago

Machine Learning Algorithms - A Review

supervised machine learning is a combination of supervised and unsupervised machine learning methods. It can be fruit-full in those areas of machine learning and data mining where the unlabeled data is already present and getting the labeled data is a tedious process. With more common supervised machine learning methods, you train

28 Views

1y ago

Craft Council of Newfoundland and Labrador - Webflow

work/products (Beading, Candles, Carving, Food Products, Soap, Weaving, etc.) ⃝I understand that if my work contains Indigenous visual representation that it is a reflection of the Indigenous culture of my native region. ⃝To the best of my knowledge, my work/products fall within Craft Council standards and expectations with respect to

307 Views

2y ago

Flock: Hybrid Crowd-Machine Learning Classiﬁers - Stanford University

with machine learning algorithms to support weak areas of a machine-only classiﬁer. Supporting Machine Learning Interactive machine learning systems can speed up model evaluation and helping users quickly discover classiﬁer de-ﬁciencies. Some systems help users choose between multiple machine learning models (e.g., [17]) and tune model .

52 Views

7m ago

Recent Views

Vietnamese Insurance Market Report - Ditp

Insurance agents TOTAL INSURANCE AGENTS IN VIETNAMESE MARKET 6/2016 Until the end of June 2016, total insurance agents increased by 29.5% compared with same period last year to 437,738 agents. Prudential took the lead with 181,808 agents, followed by Bao Viet life with 94,129 agents and Dai-ichi Life with 53,811 agents. e. The number of new .

1y ago

167 Views

Attorney Registration - Certificates of Insurance Upload for Insurance .

Certificate of Insurance ("COI") upload feature Loginfor insurance agents and insurers. Summary: After self-registering for a username and password, agents and insurers will have access to a portal for the upload of Certificates of Insurance. This Guide is for: Insurance agents and insurers who are authorized to upload Certificates of Insurance

1y ago

179 Views

Insurance Act Insurance Agents Regulations - Prince Edward Island

Section 3 Insurance Act Insurance Agents Regulations Page 4 Updated August 1, 2005 t c Restricted life, accident and sickness insurance agents (3) Notwithstanding subsection (2), the Superintendent may, until July 1, 2006, issue a transitional restricted certificate of authority covering life, accident and sickness insurance to

1y ago

131 Views

Insurance Act 1978 - Bermuda Laws

INSURANCE MANAGERS, BROKERS, AGENTS, INSURANCE MARKETPLACE PROVIDERS AND SALESMEN Insurance managers, agents and insurance marketplace providers to maintain lists of insurers for which they act Insurance broker, agent, salesman or insurance marketplace provider deemed agent of insurer in cert

2y ago

280 Views

THE EFFECT OF INSURANCE AGENTS IN INSURANCE PENETRATION IN KENYA By D61 .

Insurance agents sell exclusively the products of a certain insurance company whereas insurance brokers are legally independent from insurance companies. Insurance brokers are often referred to as the insured's agent (Kogi & Maragia, 2011).

1y ago

182 Views

Personal insurance - Car & Business insurance King Price Insurance

The king's insurance options 5 Things you need to know 7 The stuff you need to do 14 How to claim 16 Our commitment to you 20 Car insurance 22 Car warranty 37 Shortfall cover 45 Scratch and dent 46 Tyre and rim 48 Motorbike insurance 53 Trailer and caravan insurance 64 Watercraft insurance 68 Home contents insurance 77 Buildings insurance 89

1y ago

673 Views

CODE OF CONDUCT FOR LICENSED INSURANCE AGENTS - ia

insurance agents when carrying on regulated activities. Secondly, the Code of Conduct supplements the duties and obligations which licensed insurance agents owe their principals (arising from their principal-agent relationship) by providing that agents should comply with the requirements set out by their

1y ago

130 Views

All about auto insurance - Option Consommateurs

of insurance companies with which they have agreements. Insurance agents: agents work for a specific insurance company. Before you decide to do business with either a broker or an agent, check out prices, the products being proposed and the quality of the service. Buying auto insurance 4 All about auto insurance

1y ago

230 Views

Gold Tier - MAPFRE Insurance

Foy Insurance of MA, LLC 198 Frank Consolati Insurance Agency, Inc. 198 County Insurance Agency, Inc. 198 Woodrow W Cross Agency 214 Woodland Insurance Agency, Inc. 214 Tegeler Insurance Services of CT, Inc. 214 Pantano/VonKahle Insurance Agency, Inc. 214 . Hanson Insurance Agency, Inc. 287 J.H. Slattery Insurance Agency, Inc. 287

1y ago

565 Views

Independent Insurance Agents & Brokers of Louisiana - IIABL

Independent Insurance Agents & Brokers of Louisiana Frequently Requested Louisiana Insurance Statutes Independent Insurance Agents & Brokers of Louisiana 9818 Bluebonnet Blvd. Baton Rouge, La 70810 (225) 819-8007 www.IIABL.com

1y ago

109 Views

Insurance and Indemnification Guidelines for State of .

the Contractor's insurance company issues the required insurance policies or endorses existing policies to match the insurance requirements of the contract. As proof of coverage, most insurance agents and brokers will provide a document called a certificate of insurance. While a certificate is evidence that the Contractor has an insurance policy,

1y ago

151 Views

SPECIAL REPORT Young Agents Survey - Insurance Journal

agents and agency owners in particular — better get ready to step up. This is good news for young professionals working in independent agencies today — those 40 years old and younger. According to Insurance Journal's Young Agents Survey 2015, 82.7 percent of young agents feel very optimistic or optimistic

1y ago

119 Views

Brokers and Agents and Health Insurance Exchanges A

National Association of Insurance Commissioners distinguishes their roles as follows: Brokers act on behalf of the consumer. They can be compensated by the consumer or receive compensation from an insurance company. Agents are loyal to an insurance company and sell, solicit, or negotiate insurance on behalf of the insurer.

1y ago

129 Views

Consumer Guide to Auto Insurance - csimt.gov

consumer guide to auto insurance contents introduction to auto insurance 1 understanding your auto insurance policy 2 required auto insurance 3 optional types of auto insurance 4-5 getting the right coverage 6 accidents and violations 7 how to shop for auto insurance 8 shopping tips 9 frequently asked questions 10-11 insurance complaints/when you have a problem 12

2y ago

805 Views

Industry Observations Insurance Industry

Jun 30, 2019 · 6/17/2019 Commercial Insurance Branch of Extraco Banks, N.A. Higginbotham Insurance Group, Inc. Insurance Brokers NA 6/13/2019 Links Insurance Services, LLC World Insurance Associates LLC Property and Casualty Insurance NA 6/13/2019 Abram Interstate Insurance Services, Inc. Risk Placement Services,

2y ago

619 Views

Introduction To Machine Learning & Data Mining

It looks like you're using an ad-blocker