Data Mining And Machine Learning - Northwest Knowledge

8/29/2015
Data Mining and Machine Learning
Erich Seamon
University of Idaho
www.webpages.uidaho.edu/erichs
erichs@uidaho.edu

I am NOMAD

Data Mining and Machine Learning Outline
Outlining the data mining and machine learning paradigm
– Growth of data
What is data mining & knowledge discovery?
– Knowledge discovery process
– Types of data mining
Machine learning: an aspect of data mining
– What is machine learning
– Training vs. testing
– Supervised vs. unsupervised vs. reinforced
– Types of algorithms
Machine learning examples

Data Growth in 2015
– Walmart handles 1M transactions per hour
– Google processes 24PB of data per day
– AT&T transfers 30PB of data per day
– 90 trillion emails are sent per year
– World of Warcraft uses 1.3PB of storage
Worldwide Data Growth at 7.9EB/Yr in 2015

Understanding Data
In many ways, our ability to comprehend incomplete, disparate, or fragmented data is much more important to the discussion than the growth of data itself (King et al., 2015).
Algorithms that allow us to gain knowledge from this incomplete data are the key.

Data Growth and Machine Learning
Machine Learning is used when
– A pattern exists
– We cannot pin it down mathematically
– We have data on it
Learning techniques are preferred because:
– They reduce time and cost
– They produce results that are comparable to mining an entire data set

Data Mining vs. Machine Learning
Machine learning tends to be focused on performing a known task, whereas data mining is about the search for hidden nuggets of information. For instance, you might use machine learning to teach a robot to drive a car, whereas you would use data mining to learn what types of cars are the safest.
Machine learning algorithms are virtually a prerequisite for data mining, but the opposite is not true. In other words, you can apply machine learning to tasks that do not involve data mining, but if you are using data mining methods, you are almost certainly using machine learning. (Lantz, 2013)

Guthrie, 2014, “Looking backwards, looking forwards: SAS, data mining and machine learning.” ta-mining-and-machine-learning/

Data Mining and Knowledge Discovery
Fawley (1992) defines data mining as “the process of analyzing data from different perspectives and summarizing it into useful information”.
Data mining is typically considered a core step of the knowledge discovery process.
Abu-Mostafa (2013) additionally terms data mining as “a practical field that focuses on finding patterns, correlations, or anomalies in large relational databases”.

Nine steps that define the data mining/knowledge discovery process (Maimon, Rokach, 2006)

Components of Data Mining
– Machine Learning can be considered a subcomponent of Data Mining (Rokach, 2014)
– Data Mining approaches can be divided into Discovery and Verification Systems
– Machine Learning falls under the Discovery area

Supervised and Unsupervised Learning
Supervised Learning discovers patterns in data that relate data attributes to a class. These patterns are then used to predict values of the class in future data instances.
Unsupervised Learning is where data have no class. The intention of unsupervised learning is to explore the data to find its inherent structure, using various statistical methods.

Reinforcement Learning
Reinforcement learning is particularly well suited to problems which include a long-term versus short-term reward trade-off.
– robot control,
– telecommunications,
– backgammon and checkers (Sutton and Barto 1998, Chapter 11).
Monte Carlo Methods are sometimes used
– Monte Carlo integration
– Numerical optimization/iterative simulation
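The slide names Monte Carlo integration without showing it. A minimal sketch of the idea, using only the Python standard library, might look like the following; the function name `monte_carlo_integrate`, the integrand, and the sample count are illustrative assumptions, not taken from the slides.

```python
import random


def monte_carlo_integrate(f, a, b, n_samples=100_000, seed=0):
    """Estimate the integral of f over [a, b] by averaging f at random points.

    Hypothetical helper for illustration only.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    total = 0.0
    for _ in range(n_samples):
        total += f(rng.uniform(a, b))
    # The mean value of f on [a, b], times the interval length, approximates the integral.
    return (b - a) * total / n_samples


# Integral of x^2 on [0, 1]; the exact value is 1/3.
estimate = monte_carlo_integrate(lambda x: x * x, 0.0, 1.0)
print(estimate)
```

The estimate's error shrinks roughly in proportion to one over the square root of the number of samples, which is why the method is usually reserved for problems where deterministic quadrature is impractical.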

Supervised Learning
Classification
– KNN (K nearest neighbor)
  Can be used in regression as well
  The class is whichever one is most common among the K nearest neighbors
  Lazy learning: the function is approximated locally and computation is deferred until classification
– Decision Trees
  Classification and regression approaches
  Data mining trees are built on data, not on the decision
  The output classification tree can be used for decisions
  Random forest and bagging methods output tree results
  Varying decision tree algorithms: CART, CHAID, C4.5, ID3
– Logistic Regression
– Naïve-Bayes (spam, text filtering)
– Support Vector Machines (SVM)
  Classification and regression approaches
  Non-probabilistic binary linear classifier
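To make the KNN description concrete, here is a small sketch in plain Python of majority-vote classification over the K nearest training points. The function name `knn_predict` and the toy data are assumptions made for illustration; the slides do not prescribe an implementation.

```python
import math
from collections import Counter


def knn_predict(train_points, train_labels, query, k=3):
    """Classify `query` by majority vote among its k nearest training points.

    Hypothetical helper: nothing is fitted ahead of time ("lazy learning");
    all distance computation happens at prediction time.
    """
    # Pair each training label with its Euclidean distance to the query.
    distances = sorted(
        (math.dist(point, query), label)
        for point, label in zip(train_points, train_labels)
    )
    nearest_labels = [label for _, label in distances[:k]]
    # The most common label among the k nearest neighbors wins.
    return Counter(nearest_labels).most_common(1)[0][0]


# Toy data: two well-separated groups labeled "a" and "b".
points = [(1.0, 1.0), (1.2, 0.8), (0.9, 1.1), (5.0, 5.0), (5.2, 4.9), (4.8, 5.1)]
labels = ["a", "a", "a", "b", "b", "b"]

print(knn_predict(points, labels, (1.1, 1.0)))
print(knn_predict(points, labels, (5.1, 5.0)))
```

For regression, the same neighbor search could be reused with the vote replaced by an average of the neighbors' numeric targets.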


Unsupervised Learning
Clustering and Dimensionality Reduction
– SVD (Singular Value Decomposition). If you have two variables, one a humidity index and the other the probability of rain, their correlation is so high that the second one does not contribute any additional information useful for a classification or regression task. The singular values in SVD help you determine which variables are most informative and which ones you can do without.
– Principal Components
– K-means
Association Analysis
– Apriori
– FP-Growth
Hidden Markov (related to Hoeffding’s Inequality)

Figures: PCA; K Means
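K-means is listed above without detail. The sketch below shows one common formulation (Lloyd's algorithm) in plain Python: assign each point to its nearest centroid, then move each centroid to the mean of its members. The function name `k_means`, the fixed iteration count, and the choice to pass starting centroids explicitly are assumptions for illustration.

```python
import math


def nearest_centroid(point, centroids):
    """Return the index of the centroid closest to `point`."""
    return min(range(len(centroids)), key=lambda i: math.dist(point, centroids[i]))


def k_means(points, initial_centroids, n_iters=10):
    """Hypothetical helper: cluster `points` with Lloyd's algorithm."""
    centroids = list(initial_centroids)
    for _ in range(n_iters):
        # Assignment step: group points by their nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            clusters[nearest_centroid(p, centroids)].append(p)
        # Update step: move each centroid to the mean of its members.
        for i, members in enumerate(clusters):
            if members:  # leave a centroid in place if it attracted no points
                centroids[i] = tuple(sum(c) / len(members) for c in zip(*members))
    labels = [nearest_centroid(p, centroids) for p in points]
    return centroids, labels


# Toy data: two visibly separate groups, no class labels supplied.
data = [(1.0, 1.0), (1.5, 2.0), (1.0, 1.5), (8.0, 8.0), (9.0, 9.5), (8.5, 9.0)]
centroids, cluster_ids = k_means(data, [data[0], data[3]])
print(centroids)
print(cluster_ids)
```

Results depend on the starting centroids, so practical implementations usually run several random initializations and keep the best clustering.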

20 popular Machine Learning R packages, identified by analyzing the most downloaded R packages from Jan-May 2015. (KDnuggets, Geethika, 2015) learning-packages.html

Examples
– Retail: Data drives prices and recommendations
– Marketing: Market sales and recommendations
– IT Management: IT operational intelligence
– Customer Management: Customer insight
– Operations: Automated response
– Public Safety: Crime hot spot/COMSTAT
– Medical diagnosis
– Climate modeling and downscaling

[...] learning methods may be used to establish a mapping between a suitable representation of a material (i.e., its ‘fingerprint’ or its ‘profile’) and any or all of its properties using known historic, or intentionally generated, data. The material fingerprint or profile can be coarse-level chemo-structural descriptors, or something as fundamental as the electronic charge density, both of which are explored here. Subsequently, once the profile → property mapping has been established, the properties of a vast number of new materials within the same subclass may then be directly predicted (and correlations between properties may be unearthed) at negligible computational cost, thereby completely bypassing the conventional laborious approaches towards material property determination alluded to above (Pilania, 2013)

Mapping chemical properties

Other topics
Generalization/approximation tradeoffs
Numerical optimization/simulation
Hoeffding’s Inequality
– In probability theory, Hoeffding's inequality provides an upper bound on the probability that the sum of bounded, independent random variables deviates from its expected value by more than a given amount.
Vapnik–Chervonenkis dimension
– The VC dimension has utility in statistical learning theory, because it can predict a probabilistic upper bound on the test error of a classification model.
– VC(H) is the size of the largest finite subset of X shattered by H (the hypothesis space)
– If arbitrarily large finite sets of X can be shattered by H, then VC(H) = ∞
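For reference, the standard textbook statement of Hoeffding's inequality can be written as follows. The symbols $X_i$, $a_i$, $b_i$, $S_n$, and $t$ are notation introduced here and do not appear on the slide.

```latex
% Assumptions: X_1, ..., X_n are independent and a_i <= X_i <= b_i almost surely.
% S_n is their sum and t > 0 is the size of the deviation.
\[
  S_n = \sum_{i=1}^{n} X_i, \qquad
  \Pr\bigl( \lvert S_n - \mathbb{E}[S_n] \rvert \ge t \bigr)
  \;\le\; 2 \exp\!\left( - \frac{2 t^{2}}{\sum_{i=1}^{n} (b_i - a_i)^{2}} \right).
\]
```

In learning theory the bound is typically applied to the gap between a hypothesis's training error and its true error, which is how it connects to the generalization topics listed on this slide.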

Questions
– Why use machine learning techniques?
– What is the value scientifically, financially?
– How does machine learning stack up to historical information?
– How does data mining relate to machine learning?
– Can machine learning techniques be used in everyday practice?

FINIT
