Structured Prediction - Svivek

3y ago

38 Views

2 Downloads

3.76 MB

41 Pages

Last View : 2d ago

Last Download : 3m ago

Upload by : Jerry Bolanos

Report this link

Download PDF

Transcription

Structured PredictionFinal wordsCS 6355: Structured Prediction1

A look back What is a structure? The machine learning of interdependent variables2

Recall: A working definition of a structureA structure is a concept that can be applied to any complex thing, whether itbe a bicycle, a commercial company, or a carbon molecule. By complex, wemean:1.It is divisible into parts,2.There are different kinds of parts,3.The parts are arranged in a specifiable way, and,4.Each part has a specifiable function in the structure of the thing as awholeFrom the book Analysing Sentences: An Introduction to English Syntax by Noel Burton-Roberts, 1986.3

An example task: Semantic ParsingFind the largest state in the USSELECT expression FROM table WHERE conditionMAX (numeric list)ORDERBY predicateDELETE FROM table WHERE conditionUS CITIES US STATESnamenamepopulation populationsizestatecapitalSELECT expression FROM tableExpression 1 Expression 24

A plausible strategy to build the queryFind the largest state in the USSELECT expression FROM table WHERE conditionMAX numeric listORDERBY predicateDELETE FROM table WHERE conditionSELECT expression FROM tableExpression 1 Expression 2US CITIESUS 5

A plausible strategy to build the queryFind the largest state in the USSELECT expression FROM table WHERE conditionSELECT expression FROM table WHERE conditionMAX numeric listORDERBY predicateDELETE FROM table WHERE conditionSELECT expression FROM tableExpression 1 Expression 2US CITIESUS 6

A plausible strategy to build the queryFind the largest state in the USSELECT expression FROM table WHERE conditionUS STATESSELECT expression FROM table WHERE conditionMAX numeric listORDERBY predicateDELETE FROM table WHERE conditionSELECT expression FROM tableExpression 1 Expression 2US CITIESUS 7

A plausible strategy to build the queryFind the largest state in the USSELECT expression FROM table WHERE conditionnameUS STATESSELECT expression FROM table WHERE conditionMAX numeric listORDERBY predicateDELETE FROM table WHERE conditionSELECT expression FROM tableExpression 1 Expression 2US CITIESUS 8

A plausible strategy to build the queryFind the largest state in the USSELECT expression FROM table WHERE conditionnameUS STATESExpression 1 Expression 2SELECT expression FROM tableSELECT expression FROM table WHERE conditionMAX numeric listORDERBY predicateDELETE FROM table WHERE conditionSELECT expression FROM tableExpression 1 Expression 2US CITIESUS 9

A plausible strategy to build the queryFind the largest state in the USSELECT expression FROM table WHERE conditionnameUS STATESExpression 1 Expression 2SELECT expression FROM tableMAX numeric listSELECT expression FROM table WHERE conditionMAX numeric listORDERBY predicateDELETE FROM table WHERE conditionSELECT expression FROM tableExpression 1 Expression 2US CITIESUS 10

A plausible strategy to build the queryFind the largest state in the USSELECT expression FROM table WHERE conditionnameUS STATESExpression 1 Expression 2SELECT expression FROM tableMAX numeric listUS STATESSELECT expression FROM table WHERE conditionMAX numeric listORDERBY predicateDELETE FROM table WHERE conditionSELECT expression FROM tableExpression 1 Expression 2US CITIESUS 11

A plausible strategy to build the queryFind the largest state in the USSELECT expression FROM table WHERE conditionnameUS STATESExpression 1 Expression 2SELECT expression FROM tablesizeOr perhaps population?MAX numeric listsizeSELECT expression FROM table WHERE conditionMAX numeric listORDERBY predicateDELETE FROM table WHERE conditionSELECT expression FROM tableExpression 1 Expression 2US STATESUS CITIESUS 12

A plausible strategy to build the queryFind the largest state in the US At each step many, many decisions to makeSELECT expression FROM table WHERE condition Somenamedecisions aresimply not allowedUS STATESExpression 1 Expression 2- A query has to be well formed!sizeSELECT expression FROM table Even so, many possible options- Why doesto SELECT? MAX numeric listOr“Find”perhapsmappopulation?- Largest by size/population/population of capital?sizeSELECT expression FROM table WHERE conditionMAX numeric listORDERBY predicateDELETE FROM table WHERE conditionSELECT expression FROM tableExpression 1 Expression 2US STATESUS CITIESUS 13

Standard classification tools can’t predictstructuresX: “Find the largest state in the US.”Y: SELECT nameFROM us statesWHERE size (SELECT MAX(size) FROM us states)Classification is about making one decision–Spam or not spam, or predict one label, etcWe need to make multiple decisions– Each part needs a label Should “US” be mapped to us states or us cities?Should “Find” be mapped to SELECT or DELETE?– The decisions interact with each other If the outer FROM clause talks about the table us states, then the inner FROM clause should not talkabout utah counties– How to compose the fragments together to create the whole structure? Should the output consist of a WHERE clause? What should go in it?14

How did we get here?Binary classification Learning algorithms Prediction is easy – Threshold Features (?)Multiclass classification Different strategies One-vs-all, all-vs-all Global learning algorithms One feature vector per outcome Each outcome scored Prediction highest scoring outcomeStructured classification Global models or local models Each outcome scored Prediction highest scoring outcome Inference is no longer easy! Makes all the difference15

Structured output is A graph, possibly labeled and/or directedRepresentation– Possibly from a restricted family, such as chains, trees, etc.– A discrete representation of input– Eg. A table, the SRL frame output, a sequence of labels etc A collection of inter-dependent decisionsProcedural– Eg: The sequence of decisions used to construct the output The result of a combinatorial optimization problemFormally– argmaxy 2 all outputsscore(x, y)16

Challenges with structured output Two challenges1. We cannot train a separate weight vector for each possibleinference outcome For multiclass, we could train one weight vector for each label1. We cannot enumerate all possible structures for inference Inference for binary/multiclass is easy Solution– Decompose the output into parts that are labeled– Define how the parts interact with each other how labels are scored for each part an inference algorithm to assign labels to all the parts17

Multiclass as a structured output A structure is Multiclass– A graph (in general,hypergraph), possibly labeledand/or directed– A graph with one node andno edges– A collection of interdependent decisions– Can be composed via multipledecisions– The output of a combinatorialoptimization problem– Winner-take-allargmaxi wTÁ(x, i)argmaxy 2 all outputsscore(x, y) Node label is the output18

Multiclass is a structure: Implications1. A lot of the ideas from multiclass may be generalized tostructures– Not always trivial, but useful to keep in mind2. Broad statements about structured learning must applyto multiclass classification–Useful for sanity check, also for understanding3. Binary classification is the most “trivial” form ofstructured classification–Multiclass with two classes19

Structured PredictionThe machine learning of interdependent variables20

Computational issuesData annotationdifficultyHow to train themodel?Model definitionWhat are the parts of the output?What are the inter-dependencies?Backgroundknowledge aboutdomainHow to do inference?Semisupervised/indirectlysupervised?21

What does it mean to define the model?Say we want to predict four output variables from some inputxy1y2y3y423

What does it mean to define the model?Say we want to predict four output variables from some inputRecall: Each factor is alocal expert about allthe random variablesconnected to itxy1y2y3y4i.e. A factor can assigna score to assignmentsof variables connectedto itOption 1: Score each decision separatelyPro: Prediction is easy, each y independentCon: No consideration of interactions24

What does it mean to define the model?Say we want to predict four output variables from some inputRecall: Each factor is alocal expert about allthe random variablesconnected to itxy1y2y4y3i.e. A factor can assigna score to assignmentsof variables connectedto itOption 2: Add pairwise factorsPro: Accounts for pairwise dependenciesCons: Makes prediction harder,ignores third and higher orderdependencies25

What does it mean to define the model?Say we want to predict four output variables from some inputRecall: Each factor is alocal expert about allthe random variablesconnected to itxy1y2y4y3i.e. A factor can assigna score to assignmentsof variables connectedto itOption 3: Use only order 3 factorsPro: Accounts for order 3 dependenciesCons: Prediction even harder.Inference should consider alltriples of labels now26

What does it mean to define the model?Say we want to predict four output variables from some inputRecall: Each factor is alocal expert about allthe random variablesconnected to itxy1y2y4y3i.e. A factor can assigna score to assignmentsof variables connectedto itOption 4: Use order 4 factorsPro: Accounts for order 4 dependenciesCons: Basically no decompositionover the labels!27

Some aspects to consider Availability of supervision– Supervised algorithms are well studied; supervision is hard (orexpensive) to obtain Complexity of model– More complex models encode complex dependencies betweenparts; complex models make learning and inference harder Features– Most of the time we will assume that we have a good featureset to model our problem. But do we? Domain knowledge– Incorporating background knowledge into learning andinference in a mathematically sound way29

Training structured models Inference in training makes all the difference from multiclass/binaryclassification Empirical risk minimization principle– Minimize loss over the training data– Regularize the parameters to prevent overfitting We have seen different training strategies falling under this umbrella– Conditional Random Fields– Structural Support Vector Machines– Structured Perceptron (doesn’t have regularization) Different algorithms exist– We saw stochastic gradient descent in some detail31

Training considerations Train globally vs train locallyGlobal: Train according to your final modelxy1y2y3y4Pro: Learning uses all the available informationCon: Computationally expensive32

Training considerations Train globally vs train locallyLocal: Decompose your model into smaller ones and train each one separatelyFull model still used at prediction timey2y1xy4y3y2y1y2y3y4y1y3y1y3y2y4y4Pro: Easier to trainCon: May not capture global dependencies33

Training considerations Local vs global– Local learning Learn parameters for individual components independently Learning algorithm not aware of the full structure– Global learning Learn parameters for the full structure Learning algorithm “knows” about the full structureHow do we choose?– Depends on inference complexity– Jury still out on which one is better– Depends on size of available data too34

Inference What is inference? The prediction step– More broadly, an aggregation operation on the space of outputs for anexample: max, expectation, sample, sum– Different flavors: MAP, marginal, loss augmented. Many algorithms, solution strategies– Combinatorial optimization, one size doesn’t fit all– Graph algorithms, integer linear programming, heuristics, Monte Carlomethods, .How do we choose? Some tradeoffs––––Programming effortExact vs inexactIs the problem solvable with a known algorithm?Do we care about the exact answer?36

How does background knowledge affect yourchoices? Background knowledge biases your predictor in several ways– What is the model? Maybe third order factors are not needed etc– Your choices for learning and inference algorithms– Feature functions– Constraints that prohibit certain inference outcomes38

Data and how it influences your model Annotated data is a precious resource– Takes specialized expertise to generate– Or: very clever tricks (like online games that make data as a sideeffect) Important directions– Learning with latent representations, indirect supervision, partialsupervision– In all these cases Learning is rarely a convex problem Modeling choices become very important! Bad model will hurt40

Looking ahead Big questions (a very limited and biased set)– Representations Can we learn the factorization? Can we learn feature functions?– Dealing with the data problem for new applications Clever tricks to get data Taming latent variable learning– Applications How does structured prediction help you? Gathering importance as computer programs have to deal withuncertain, noisy inputs and make complex decisions41

Global learning algorithms One feature vector per outcome Each outcome scored Prediction highest scoring outcome Structured classification Global models or local models Each outcome scored Prediction highest scoring outcome Inference is no longer easy! Makes all the difference

Related Documents:

Enriched global horizontal irradiance prediction using novel ensemble ...

generic performance capability. The comparative analysis imparts the proposed prediction model results improved GHI prediction than the existing models. The proposed model has enriched GHI prediction with better generalization. Keywords: Ensemble, Improved backpropagation neural network, Global horizontal irradiance, and prediction.

15 Views

1y ago

Structured Settlements - Ringler Associates

Key takeaway: After being educated on the difference between a lump-sum and a structured settlement, 73 percent of Americans would choose a structured settlement payout when they received their settlement in a personal injury case. Chose structured settlement Chose lump sum CHART 4 - REASONS FOR CHOOSING A STRUCTURED SETTLEMENT

14 Views

1y ago

Graphical Models - svivek

Probabilistic Graphical Models Languages that represent probability distributions over multiple random variables –Directed or undirected graphs . Example from Daphne Koller 12. Independence Assumptions of a BN Example from Daphne

9 Views

2y ago

Prediction Of Student Performance Using Weka Tool

Prediction models that include all personal, social, psychological and other environmental variables are necessitated for the effective prediction of the performance of the students [15]. The prediction of student performan

127 Views

2y ago

Prediction of Uterine Contractions Using Knowledge ...

A post-prediction process is also proposed to further enhance the prediction results. The framework conducts the prediction in real time. To the best of our knowledge, this is the ﬁrst study that addresses the potential application of a sequenti

21 Views

2y ago

Prediction, Judgment and Complexity

prediction; that is, providing a forecast (or nowcast) of a variable of interest from available data. In some cases, prediction has enabled full automation of tasks – for example, self-driving vehicles where the process of data collection, prediction of behavior and surroundings, a

67 Views

2y ago

User's Guide - CALYPSO

Crystal structure prediction via particle-swarm optimization, Phys. Rev. B 82, 094116 (2010). Cluster Structure Prediction: Jian Lv, Yanchao Wang, Li Zhu, and Yanming Ma* Particle-Swarm Structure Prediction on Clusters, J. Chem. Phys. 137, 084104 (2012). Two-Dimensional Layer Structure Prediction: 1.

30 Views

2y ago

Day 2 0830-0915 IECEx Dubai Area Classif final Leroux P

API RP 505 «API RP 505 « Recommended Practice for classification of locations for ElectricalRecommended Practice for classification of locations for Electrical Installations at Petroleum facilities classified as Class I, zone 0, zone1, zone2 » Foreword states : « API publications may be used by anyone desiring to do so. Every effort has been made by the Institute to assure the accuracy and .

54 Views

3y ago

Recent Views

PHONE NO. CONTACT TOPIC/SUBTOPIC ORGANIZATION #A

651-757-2762 Deborah Klooz MPCA Paralegal: 651-757-2631 Jean Coleman MPCA Staff Attorney: 651-757-2791 Adonis Neblett MPCA Staff Attorney: 651-757-2017 Carmen Netten MPCA Staff Attorney: 651-757-2759 David Stellmach MPCA Staff Attorney: 651-757-2247 Joseph Dammel MPCA Staff Attorney: 651-757-2545 Michelle Janson MPCA Staff Attorney: #ATTORNEY .

2y ago

403 Views

Local Prosecutors and The Attorney General

Attorney General of Iowa Other Members iii Honorable Arthur K. Bolton Attorney General of Georgia Honorable Chauncey H. Browning, J 1'. Honorable John C. Danforth Attorney General of Missouri Honorable J olm P. Moore Attorney General of Colorado Attorney General of West Virginia Honorable Larry Derryberry Attorney General of Oklahoma

1y ago

178 Views

30th Annual Anti-Fraud Conference Tentative Schedule

Apr 30, 2019 · Jill Nerone, Supervising Deputy District Attorney, Alameda County District Attorney’s Office Laura Meyers, Assistant District Attorney, San Francisco County District Attorney’s, Office Nicole Pantaleo, Deputy District Attorney, Marin County District Attorney’s Office, Insurance F

2y ago

150 Views

Shannon McClellan Hon. Diane O. Leasure Ellery M. “Rick .

Attorney at Law Hon. Pamila J. Brown BOG Liaison District Court, Howard County Alan S. Carmel Attorney at Law Sarah Dawn Cline Attorney at Law Adam Sean Cohen Attorney at Law Delegate Kathleen M. Dumais District 15 Suzanne K. Farace Attorney at Law Barry L. Gogel Attorney at Law Michael I. Gordon

2y ago

142 Views

Powers of Attorney Act 2003 A Commentary - Law Society of New South Wales

POWERS OF ATTORNEY ACT 2003: A COMMENTARY 6 POWERS OF ATTORNEY ACT 2003: COMMENTARY The commentary is provided in black text. Reference to the "Act" is a reference to the Powers of Attorney Act 2003 as amended. Reference to the "Regulation" is a reference to the Powers of Attorney Regulation 2011, recently amended by the Powers of Attorney Amendment Act 2013 and the Powers of

7m ago

94 Views

California Safe Drinking Water and Toxic Enforcement Act .

District Attorney of Madera County 209 West Yosemite Avenue Madera, CA 93637 District Attorney of Marin County 3501 Civic Center Drive, Rm. 130 San Rafael, CA 94903 District Attorney of Mariposa County P.O. Box 730 Mariposa, CA 95338 District Attorney of Mendocino County P.O. Box 1000 Ukiah, CA 95482 District Attorney of Merced County

3y ago

163 Views

IN THE UNITED STATES COURT OF APPEALS FOR THE FIRST

Mar 06, 2020 · Attorney General of New Jersey Assistant Attorney General Counsel of Record Attorney for Amicus Curiae JOHN T. PASSANTE State of New Jersey Deputy Attorney General New Jersey Attorney General’s Office Richard J. Hughes Justice Complex 25 Market Street Trenton, NJ 086

2y ago

128 Views

ATTORNEY HANDBOOK - United States Courts

e. Each attorney's or pro se litigant's name must be typed and signed on the last page of the complaint, with: (1) his/her address (2) telephone number (3) if a Pennsylvania attorney, his/her Pennsylvania Attorney ID Number f. To file a complaint, the attorney must have an electronic signature on the complaint and must have an electronic

1y ago

124 Views

Power of Attorney - FedEx

Show the date the Power of Attorney is signed. Corporation Power of Attorney Partnership 1 10 9 8 7 6 5 4 3 2 12 11 1 10 9 8 7 6 5 4 3 2 12 11 1 10 9 8 7 6 5 4 3 2 12 11 Rev 6/13 The number preceding each instruction corresponds to the same number on the example of the power of attorney form. Customs Power of Attorney, Designation as Export .

1y ago

157 Views

Powers of Attorney - Ontario

attorney, a family member or friend may have to apply to be appointed as guardian. Powers of attorney that were properly made under previous laws of Ontario remain legally valid. The forms for a Continuing Power of Attorney for Property and a Power of Attorney for Personal Care contained in this booklet were revised on March 29, 1996 in accordance

1y ago

155 Views

STATUTORY POWER OF ATTORNEY - eForms

repudiated the power of attorney; and the power of attorney still is in full force and effect. 5. I/we make this affidavit for the purpose of inducing _ to accept delivery of the above described instrument, as executed by me/us in my/our capacity of attorney(s)-in-fact for the Principal. _, Attorney-in-fact

1y ago

118 Views

John J. Hoffman Acting Attorney General of New Jersey

JOHN J. HOFFMAN ACTING ATTORNEY GENERAL OF NEW JERSEY Division of Law 124 Halsey Street — 5th Floor P.O. Box 45029 Newark, New Jersey 07101 Attorney for Plaintiffs By: Jah-Juin Ho - #033032007 Deputy Attorney General 973-648-2500 JOHN J. HOFFMAN, Acting Attorney General of the State of New Jersey, and ERIC T.

1y ago

89 Views

Options in Oregon to Help Another Person Make Decisions

Power of Attorney A “Power of Attorney” is a legal document that allows a person to give another person (called an “agent”) the right to act on the person’s behalf. A “Power of Attorney” in Oregon can only be used for financial decisions. The way a “Power of Attorney” is written is important. The authority given to the agent can

3y ago

134 Views

- fcdfa

FRESNO COUNTY SUPERIOR COURT By DEPT.402 JAN SCULLY District Attorney, County of Sacramento RUTH YOUNG, State Bar No. 133606 Deputy District Attorney 906 G Street, Suite 700 Sacramento, CA 95814 Telephone: (916) 874-6174 JACKIE LACEY District Attorney, County of Los Angeles STUART C. LYTTON, State Bar No. 114241 Deputy District Attorney

3y ago

136 Views

Non-Attorney E-File Registration

your motion for e-filing access. Instructions to submit the Non-Attorney E-File Registration: 1. Register for a Non-Attorney Filer Account on the PACER website at www.pacer.uscourts.gov. If you already have a PACER Account, login to Manage My Account, select Non-Attorney E-File Re

2y ago

181 Views

Structured Prediction - Svivek

It looks like you're using an ad-blocker