Mathematical Methods In Machine Learning - UMD

1y ago

25 Views

2 Downloads

696.60 KB

26 Pages

Last View : 10d ago

Last Download : 3m ago

Upload by : Jerry Bolanos

Report this link

Download PDF

Transcription

Lecture 1: Motivation and OverviewMathematical Methods in Machine LearningWojciech CzajaUMD, Spring 2016Wojciech CzajaMathematical Methods in Machine Learning

Lecture 1: Motivation and OverviewOutline1Lecture 1: Motivation and OverviewWojciech CzajaMathematical Methods in Machine Learning

Lecture 1: Motivation and OverviewIntroductionThere is an abundance of available data. This data is often large,high-dimensional, noisy, and complex, e.g., geospatial imagery.Typical problems associated with such data are to cluster,classify, or segment it; and to detect anomalies or embeddedtargets. Regression and dimensionality reduction are other typesof typical examples of problems that we want to deal with.Our proposed approach to deal with these problems is bycombining techniques from harmonic analysis and machinelearning:Harmonic Analysis is the branch of mathematics that studies therepresentation of functions and signals.Machine Learning is the branch of computer science concernedwith algorithms that allow machines to infer rules from data.Wojciech CzajaMathematical Methods in Machine Learning

Lecture 1: Motivation and OverviewMachine LearningMachine learning has many different faces. We are interested inthese aspects of machine learning which are related torepresentation theory. However, machine learning has beencombined with other areas of mathematics.Statistical machine learning.Topological machine learning.Computer science.Wojciech CzajaMathematical Methods in Machine Learning

Lecture 1: Motivation and OverviewMachine LearningAnother way to classify machine learning is by the type of tasks itdeals with, depending on the nature of the learning (training orfeedback) available to a learning system.Supervised learning: The computer is presented with exampleinputs and their desired outputs, given by a ”teacher”, and thegoal is to learn a general rule that maps inputs to outputs.Semisupervised learning: Between supervised andunsupervised learning is semi-supervised learning, where theteacher gives an incomplete training signal: a training set withsome (often many) of the target outputs missing.Unsupervised learning: No labels are given to the learningalgorithm, leaving it on its own to find structure in its input.Unsupervised learning can be a goal in itself (discovering hiddenpatterns in data) or a means towards an end (feature learning).Reinforcement learning: An area of machine learning inspiredby behaviorist psychology, concerned with how software agentsought to take actions in an environment so as to maximize somenotion of cumulative reward.Wojciech CzajaMathematical Methods in Machine Learning

Lecture 1: Motivation and OverviewMotivation for Machine Learning - Big Data“Big Data” refers to the exponential growth data, with manychallenging tasks of analyzing and efficiently finding the importantinformation that is given in this complex setting.The roots of big data are in the data storage, databasemanagement, and data analytics for, both, commercial andnon-profit applications.The integration of many large datasets is a primary source of bigdata problems present in the modern scientific and researchenvironment, as is evident in applications ranging from ‘omics’data analysis for cancer research, to studies of social networks.Another source of big data problems are large andheterogeneous dynamic data sets, such as those arising in thecontext of climate change analysis, or for the analysis of networktraffic patterns.Wojciech CzajaMathematical Methods in Machine Learning

Lecture 1: Motivation and OverviewBig Data CharacteristicsIn view of the above, big data can be identified by the following:volume;heterogeneity;dynamics.In addition to the above major characteristics, we can add: ambiguity,complexity, noise, variability, etc.Wojciech CzajaMathematical Methods in Machine Learning

Lecture 1: Motivation and OverviewBig Data Example 1Large Eddy simulation (LES) around an Eppler foil at Re 10,000. Aseries of high fieldity LES of the flow around Eppler airfoils has beenconducted to generate a comprehensive data base. Reynoldsnumbers vary from 10,000 to 120,000 and the angle of attach variesfrom 0 to 20 degrees.Courtesy of Prof. Elias Balaras (GWU), via US Air Force contract FA9550-12-C-058 (2012): Learning from Massive Data Sets Generatedby Physics Based SimulationsWojciech CzajaMathematical Methods in Machine Learning

Lecture 1: Motivation and OverviewBig Data Example 2A simplified case of the previous LES for a 3-dimensional flow over adimpled plate.Courtesy of Prof. Elias Balaras (GWU), via US Air Force contract FA9550-12-C-058 (2012): Learning from Massive Data Sets Generatedby Physics Based SimulationsWojciech CzajaMathematical Methods in Machine Learning

Lecture 1: Motivation and OverviewBig Data Example EstimationLet us provide a small numerical estimation:2,000 x 1,000 x 1,000 2 x 109 grid points;Each grid point characterized by 3 spatial coordinates and 3velocity components, pressure, plus possibly some otherparameters;Flow simulation for 200 time steps;One way to look at it: 2 x 109 points in a space of dimension1,400;As an example, think of computing PCA for M points in Ndimensional space. The cost is O(MN 2 ) O(N 3 );In our case this results in a problem with complexity on the orderof 4 x 1015 4 petaFLOPs;Lawrence Livermore National Laboratory’s IBM Sequoia reaches16 petaFLOPS (16 x 1015 floating point operations per second) it was considered to be the fastest computer in 2012, it runs 1.57million PowerPC cores, costs approx. 250M USD.Wojciech CzajaMathematical Methods in Machine Learning

Lecture 1: Motivation and OverviewAnother Big Data ExampleConsider the human genome. First estimates pointed at 100,000genes. Nowadays this number has been scaled down toappprox. 45,000.There are many ways of representing genes. One of the morepopular is by means of base pairs: approx. three billion DNAbase pairs represent human genome.Alternatively, we could consider gene expressions (think of it as afunction). There are many ways of assembling such expressions,and they are different for different individuals. Hence resulting ina much larger data set.Wojciech CzajaMathematical Methods in Machine Learning

Lecture 1: Motivation and OverviewHA and Big Data to-dateHarmonic analysis ideas have been used in many problems dealingwith large and heterogeneous data. Some relevant examples include:Multiscale methodsCompressive sensingSparse representationsGeometric and graph-based methodsScattering transformsAmong those listed, multiscale methods are historically the oldest(though not old) class of approaches. They have been successfullyused in image compression applications. We can view the JPEG2000 as a prototypical “dimension reduction” attempt for a large dataclass.Wojciech CzajaMathematical Methods in Machine Learning

Lecture 1: Motivation and OverviewMultiscale representationsMultiscale representation (Multiresolution analysis (MRA),pyramid algorithms) can be described as a class of designmethods in representation theory, where the input is subject torepeated transformation (filtering) in order to extract featuresassociated with different scales.In image processing and computer graphics the concept ofmultiscale representations can be traced back to P. Burt and E.Adelson, and J. Crowley.In mathematics, it is associated with wavelet theory and MRA asintroduced by Y. Meyer and S. Mallat.S. Mallat, “A theory for multiresolution signal decomposition: the wavelet representation”, IEEE TPAMI, 1989, Vol. 11, pp. 674–693.Multiscale representations found many applications to imageprocessing and remote sensing: compression, feature detection,segmentation, classification, but also in registration and imagefusion.G. Pajares and J. Cruz, “A wavelet-based image fusion tutorial”, Pattern Recognition, 2004, Vol. 37(9), pp. 1855–1872.Wojciech CzajaMathematical Methods in Machine Learning

Lecture 1: Motivation and OverviewFiltersIn signal processing, a filter is typically understood as a deviceor a process that removes from a given signal an unwantedcomponent or feature.Originally, electronic filters were entirely analog and passiveand consisted of resistance, inductance and capacitance.Nowadays, digital filters are much more common. They operateon signals represented in digital form. The essence of a digitalfilter is that it directly implements a mathematical algorithm,corresponding to the desired filter transfer function.In practice, a digital filter system often contains ananalog-to-digital and a digital-to-analog converter together with amicroprocessor and some peripheral components (such asmemory to store data). In this talk we shall consider digitalfiltering as a signal transform, i.e., a mathematical procedure.Wojciech CzajaMathematical Methods in Machine Learning

Lecture 1: Motivation and OverviewFilter characteristicsDigital filters can be discrete or continuous.Digital filters may be linear or nonlinear.Digital filters may be time-independent or may depend on time.Digital filters may depend of the Fourier transform, the Laplacetransform, a state-space representation, or any other representationsystem.The filter should have a specific impulse response.The filter should be causal.The filter should be stable.The computational complexity of the filter should be low.The filter should be hardware or software implementable.Wojciech CzajaMathematical Methods in Machine Learning

Lecture 1: Motivation and OverviewExample of a filter designLet {x(n), n Z} denote the input signal and let {y(n), n Z} denotethe output. A filter F is a transformation:F : x 7 y.If we assume that the principle of superposition holds, i.e., that thefilter is linear, then combining any two inputs x1 and x2 (with individualoutputs y1 and y2 , resp.) as αx1 βx2 , results in an output of the form:F : αx1 βx2 7 αy1 βy2 .If, in addition, we assume that our filter is time-independent, then thebehavior of the filter does not change with time, i.e., a delayed versionof any input xd (n) x(n d), results in an output with acorresponding delay yd (n) y(n d):F : xd 7 yd .Wojciech CzajaMathematical Methods in Machine Learning

Lecture 1: Motivation and OverviewExample of a filter design, cont’dLet δ denote the unit impulse at the origin (δ(0) 1 and δ(n) 0 forn 6 0). Let h denote the response of δ (F (δ) h).Under the above assumptions, we can now assert that the output of ageneral input signal:Xx(n) x(k )δ(n k )k Ztakes the form of:F (x) Xx(k)h(n k ) x ? h(n).k ZThis is a convolution.Wojciech CzajaMathematical Methods in Machine Learning

Lecture 1: Motivation and OverviewExample: Gaussian filterGaussian filter is a filter whose impulse response is a Gaussianfunction, or an approximation to it.Mathematically, a Gaussian filter modifies the input signal byconvolution with a Gaussian function; this transformation is alsoknown as the Weierstrass transform.Source of imagery: WikipediaWojciech CzajaMathematical Methods in Machine Learning

Lecture 1: Motivation and OverviewFIlter BanksA filter bank is an array (collection) of band-pass filters that splits theinput signal into multiple components, each one carrying a singlefrequency sub-band of the original signal.A complete filter bank consist of the analysis and synthesis side. Theanalysis filter bank divides an input signal to different subbands withdifferent frequency spectrums. The synthesis part reassembles thedifferent subband signals and generates a reconstruction signal.F : x 7 H1 (x), . . . , Hn (x) 7 G1 (H1 (x)), . . . Gn (Hn (x)) F (x)In filter bank design one often makes use of properties of decimation(downsampling) and interpolation (expansion).The filter bank has perfect reconstruction if F (x) x for all inputsignals x. Equivalently, imperfect reconstruction means that thesynthesis bank is the left inverse of the analysis bank, GH Id.Wojciech CzajaMathematical Methods in Machine Learning

Lecture 1: Motivation and OverviewOrthogonality and Conjugate Quadrature FIltersFilter F (G, H) is orthogonal if the transformation it generatesis orthogonal, i.e., FF T F T F Id.A finitely supported filter F is a Conjugate Quadrature Filter isa filter that satisfies for every m ZXFn Fn 2m δ(m).2n ZOrthogonal Conjugate Quadrature FIlters are, in mathematicalnomenclature, MRA wavelets.Wojciech CzajaMathematical Methods in Machine Learning

Lecture 1: Motivation and OverviewWaveletsWe say that a function ψ L2 (R) is an orthonormal wavelet if it canbe used to define a basis, that is a complete orthonormal system, forthe Hilbert space L2 (R), of the formψj,k (x) 2j/2 ψ(2j x k ),where j, k Z. We call these operations dyadic dilations andtranslations.Wavelet transform is an operation of convolving input signals withthe elements of the wavelet basis.Wavelet transforms can be discrete or continuous. We shall focus onthe latter one.Wojciech CzajaMathematical Methods in Machine Learning

Lecture 1: Motivation and OverviewDiscrete wavelet transformThe first DWT was discovered by Hungarian mathematicianAlfréd Haar in 1909. We now know it as the Haar wavelet ψ s.t.: x [0, 0.5) 1ψ(x) 1 x [0.5, 1) 0otherwiseAlfréd Haar, “Zur Theorie der orthogonalen Funktionensysteme”:Ph.D. Thesis at Georg-August-Universitaet Goettingen 1909; published in Mathematische Annalen 69 (3), pp. 331–371.The concept of wavelets (derived from a French word ondelette,meaning ”small wave”) was introduced by Morlet and Grossmannin the early 1980s. The theory was then developed by Y. Meyer.The most commonly used set of discrete wavelet transforms wasformulated by the Belgian mathematician Ingrid Daubechies in1988.I. Daubechies, “Orthonormal bases of compactly supported wavelets”, Comm. Purr Appl. Math. vol. 41 (1988), pp. 909–996.Wojciech CzajaMathematical Methods in Machine Learning

Lecture 1: Motivation and OverviewDWT as a filter bankExample of the analysis stage of 1D DWT up to level 3 decompositionwith low-pass filter (g) and high-pass filter (h). The synthesis stage issymmetric and is automatically derived from the OCQF conditions.Source of imagery: WikipediaWojciech CzajaMathematical Methods in Machine Learning

Lecture 1: Motivation and OverviewAdvantages of waveletsThe major advantage of wavelets over Fourier techniques in generalis that wavelets are localized in both time and frequency whereas thestandard Fourier transform is only localized in frequency.The following is an illustration of the frequency domain decompositioncorresponding to the above DWT.Source of imagery: WikipediaWojciech CzajaMathematical Methods in Machine Learning

Lecture 1: Motivation and OverviewLimitations of traditional wavelet representationsWavelets provide optimal representations for 1-dimensionalsignals in the sense of measuring asymptotic error with N largestcoefficients in wavelet expansion, and are superior toFourier-type representations.However, in dimensions higher than 1, wavelets are known to besuboptimal for representing objects with curvilinear singularities(edges), even though they outperform Fourier methods.D. Donoho et al., “Data compression and harmonic analysis”, IEEE TIT, 1998, Vol. 44, pp. 2435–2476.A number of techniques have been proposed since theintroduction of wavelets to address this issue, and to find betterdescription of geometric features in images.L. Jacques et al., “A panorama on multiscale geometric representations, intertwining spatial, directional and frequency selectivity”,Signal Processing, 2011, Vol. 91, pp. 2699–2730.Wojciech CzajaMathematical Methods in Machine Learning

Lecture 1: Motivation and OverviewSumaryWe have generally described the area of machine learning thatwill be of interest to us in this lecture.We have motivated the need for machine learning using theconcept of Big Data.We have given a brief overview of traditional multiscale/wavelettechniques used for data compression.Wojciech CzajaMathematical Methods in Machine Learning

Machine learning has many different faces. We are interested in these aspects of machine learning which are related to representation theory. However, machine learning has been combined with other areas of mathematics. Statistical machine learning. Topological machine learning. Computer science. Wojciech Czaja Mathematical Methods in Machine .

Related Documents:

Specification and Price of Automatic Rendering Machine (FOB ... - AR

decoration machine mortar machine paster machine plater machine wall machinery putzmeister plastering machine mortar spraying machine india ez renda automatic rendering machine price wall painting machine price machine manufacturers in china mail concrete mixer machines cement mixture machine wall finishing machine .

15 Views

3m ago

Machine Learning Algorithms - A Review

supervised machine learning is a combination of supervised and unsupervised machine learning methods. It can be fruit-full in those areas of machine learning and data mining where the unlabeled data is already present and getting the labeled data is a tedious process. With more common supervised machine learning methods, you train

28 Views

1y ago

Lecture 1: Machine Learning Problem - University of Adelaide

Machine Learning Real life problems Lecture 1: Machine Learning Problem Qinfeng (Javen) Shi 28 July 2014 Intro. to Stats. Machine Learning . Learning from the Databy Yaser Abu-Mostafa in Caltech. Machine Learningby Andrew Ng in Stanford. Machine Learning(or related courses) by Nando de Freitas in UBC (now Oxford).

36 Views

1y ago

Machine Learning - B. Supervised Learning: Nonlinear Models B.5. A ...

Machine Learning Machine Learning B. Supervised Learning: Nonlinear Models B.5. A First Look at Bayesian and Markov Networks Lars Schmidt-Thieme Information Systems and Machine Learning Lab (ISMLL) Institute for Computer Science University of Hildesheim, Germany Lars Schmidt-Thieme, Information Systems and Machine Learning Lab (ISMLL .

13 Views

1y ago

Mathematical Metaphysics - Carnegie Mellon University

mathematical metaphysics for a human individual or society. 1 What Mathematical Metaphysics Is Quite simply, the position of mathematical metaphysics is that an object exists if and only if it is an element of some mathematical structure. To be is to be a mathematical o

48 Views

2y ago

Mathematical Modeling: A way of life - IJSER

So, I say mathematical modeling is a way of life. Keyword: Mathematical modelling, Mathematical thinking style, Applied 1. Introduction: Applied Mathematical modeling welcomes contributions on research related to the mathematical modeling of e

77 Views

2y ago

Applied Problems, Mathematical Modeling, Mathematical ...

The need to develop a mathematical model begins with specific questions in a particular application area that the solution of the mathematical model will answer. Often the mathematical model developed is a mathematical “find” problem such as a scalar equation, a system o

82 Views

2y ago

MICRO MACHINING PROCESSES

Abrasive Jet Micro Machining (AJMM) is a relatively new approach to the fabrication of micro structures. AJMM is a promising technique to three-dimensional machining of glass and silicon in order to realize economically viable micro-electro-mechanical systems (MEMS) It employs a mixture of a fluid (air or gas) with abrasive particles. In contrast to direct blasting, the surface is exposed .

114 Views

3y ago

Recent Views

SHORT RUSSIAN PHRASEBOOK FOR ENGLISH-SPEAKING TRAVELERS .

Russian words in English. Version 4.0 December 2011 English-Russian phrases for trips to Belarus (Russia) compiled by Andrei Burdenkov, a certified Minsk guide a@minskguide.travel 2 Common Russian phrases. Numerals Russian phrase Say it in Russian English translation 0 – ноль nol' zero 1 – один Odin one 2 – два Dva two 3 – три Tri three

3y ago

209 Views

UTOMOTIVEEMC - unitest

AUTOMOTIVEEMC SAFETY&EMC2011 Table3ISO11452-2severitylevels SeverityLevel Field/(V/m) I25 II50 III75 IV100 V(opentotheusersofthestandard) a

3y ago

38 Views

DELHI TECHNOLOGICAL UNIVERSITY

4 ME104 Basic Mechanical Engineering AEC 4 4 0 0 3 0 25 - 25 50 - 5 ME106 Workshop Practice AEC 2 0 0 3 0 3 - 50 - - 50 6 HU102 Communication Skills HMC 3 3 0 0 3 0 25 - 25 50 - Total 21 16 1 7. ME-9 II Year: Odd Semester S.No. Code Title Area Cr L T P TH PH CWS PRS MTE ETE PRE 1. PE251 Engineering Materials & Metallurgy AEC 4 3 0 2 3 0 15 15 30 40 2. ME201 Mechanics of Solids DCC 4 3 0 2 3 0 .

3y ago

49 Views

Director Due Diligence

what to consider before nominating a potential director onto a board and should not be considered as a checklist in deciding whether to accept a potential candidate or not. From a potential director’s perspective, the paper aims to guide the individual on what to consider about a company prior to accepting an appointment. Terminology used in the paper Whilst the terms “company” and .

3y ago

154 Views

Hierarchical Level and Leadership Style

ORGANIZATIONAL BEHAVIOR AND HUMAN PERFORMANCE 18, 131--145 (1977) Hierarchical Level and Leadership Style ARTHUR G. JAGO AND VICTOR H. VROOM School of Organization and Management, Yale University This research investigates the relationship between the hierarchical level of managerial personnel and individual differences in their leadership styles, specifically the degree to which they are .

3y ago

64 Views

EFEKTIVITAS MEDIA PEMBELAJARAN E-LEARNING

EFEKTIVITAS MEDIA PEMBELAJARAN E-LEARNING TERHADAP PRESTASI BELAJAR PENDIDIKAN AGAMA ISLAM SISWA DI SMA NEGERI 1 YOGYAKARTA SKRIPSI Diajukan Kepada Fakultas Ilmu Tarbiyah dan Keguruan Universitas Islam Negeri Sunan Kalijaga Yogyakarta Untuk Memenuhi Sebagian Syarat Memperoleh Gelar Sarjana Strata Satu Pendidikan Islam Disusun oleh: Aldila Siddiq Hastomo (09410111) PENDIDIKAN AGAMA ISLAM .

3y ago

179 Views

USAHA PENGERINGAN EMPON-EMPON BAHAN OBAT HERBAL DI .

Keywords: empon-empon, herbal medicine, production, management, marketing. PENDAHULUAN Indonesia merupakan salah satu negara penghasil komoditi obat-obatan yang potensial. Aneka ragam jenis tanaman obat telah diproduksi sebagai bahan baku obat modern maupun obat tradisional (jamu). Prospek pengembangan produksi tanaman obat cukup cerah antara lain karena berkembangnya industri obat modern dan .

3y ago

92 Views

Sara Cristina Oliveira A Guitarra clássica Marques Almeida .

caracterização técnica, estilística e estética do repertório para guitarra de Carlo Domeniconi. Associados a este ponto fundamental, apresenta-se uma abordagem histórica e analítica dos recursos técnicos e estilísticos da guitarra, e, de uma forma sucinta dada a natureza do trabalho, os movimentos estéticos da vanguarda e a sua reflexão na criação artística, que constituem um .

3y ago

36 Views

UNCLASSIFIED//FOR OFFICIAL USE ONLY DELIBERATIVE DOCUMENT .

to provide intelligence and information on transnational organized crime, terrorism, cyber-threat actors, counterintelligence vulnerabilities, economic security, and other developing threats that pose a critical danger to the Nation’s security and our democratic way of life.

3y ago

89 Views

2020 Asia Pacific Virtual Summit Agenda Overview

Cyber Threat Hunting: Resourcing & Methods Sindhu HS, Vice President, Goldman Sachs y To effectively and efficiently execute cyber threat missions leveraging collaborative Tiger Teams to address resource constraints common in every organization y To proactively identify previously undetected malicious activity and

3y ago

28 Views

Corporate Governance and Controls: The Federal Reserve’s .

The Federal Reserve has proposed new supervisory guidance on corporate governance (Governance Proposal) that would apply to large U.S. financial institutions . its internal risk management, its internal controls and, for U.S. G-SIBs, its recovery

3y ago

20 Views

Cross Stitch Tutorial - allstitches4you

cross stitch kit. It is better to cut and fasten off your thread at the back of the needle work as normal, and start again at the new area of the design. Half Cross Stitch Many projects now have areas worked in half cross stitch, for example to give a "soft focus" background.

3y ago

32 Views

Service Manual for WP6 Diesel Engine

Service Manual for WP6 Diesel Engine Preface WP6 mechanical pump series diesel engine has the features of compact structure, reliable

3y ago

121 Views

From Eye to Insight

medaka, and Xenopus, are often used in molecular and developmental biology. An adult zebrafish is shown below. Adult zebrafish (Danio rerio). In molecular and developmental biology, these aquatic vertebrate model organisms are widely applied to study molecular processes of development and as disease models. To study these molecular

3y ago

25 Views

Geometry SIG Update - POSC Caesar

ISO 15926 P&ID/3D Information Models and Proteus Schema Mapping Project yCurrent Team Members – Adrian Laud (Lead), Keith Willshaw, Andrew Prosser, Manoj Dharwadkar yDocument P&ID and 3D Models in terms of Classes and Templates yMapping the Proteus Schema to the ISO 15926 P&ID/3D Information Model

3y ago

45 Views

Mathematical Methods In Machine Learning - UMD

It looks like you're using an ad-blocker