CSE 5525: Foundaons Of Speech And Language Processing .

2y ago

6 Views

2 Downloads

6.02 MB

71 Pages

Last View : 1m ago

Last Download : 3m ago

Upload by : Kaleb Stephen

Report this link

Download PDF

Transcription

CSE 5525: Founda.ons ofSpeech and Language ProcessingDialogueHuan Sun (CSE@OSU)Slides were largely adapted from Prof. Greg DurreH @ UT Aus.n.Some images/examples were from the two textbooks by (1) Jurafsky and Mar.n and (2) Eisenstein.

This Lecture‣ Review QA & connect QA with dialogue‣ Classic chatbots‣ Neural chatbots‣ Task-oriented dialogue

Recall: Seman.c Parsing‣ Parsing ques.on x intoderiva,on d:Berant et al. (2013)

Recall: Seman.c Parsing‣ Parsing ques.on x intoderiva,on d:1. What approaches cannot be used for this task?(1) Feature extraction log-linear model(2) Seq2seq model(3) Classification of x with fixed categoriesBerant et al. (2013)

Recall: SQuAD dataset2. This dataset is sometimes regarded as *not* realistic. Why?(1) Questions are synthesized by computers(2) Answers are not accurate(3) It cannot be used to develop information retrieval models(4) Humans created questions when looking at the paragraphRajpurkar et al. (2016)

Recall: Mul.-hop dataset HotpotQADoc 1Ques%on: What government posi,on was held by the woman who portrayedCorliss Archer in the ﬁlm Kiss and Tell ?Example picked from HotpotQA [Yang et al., 2018]Shirley Temple Black was an American actress, businesswoman, and singer As an adult, she served as Chief of Protocol of the United States Same en.tyDoc 2Kiss and Tell is a comedy ﬁlm in which 17-year-old Shirley Temple acts as Corliss Archer .Doc 3Same en.tyMeet Corliss Archer is an American television sitcom that aired on CBS 3. Which one below is NOT a drawback of this dataset?(1) High lexical word overlap between a question and a single Doc(2) Questions are too long and may involve multiple relations.(3) Answer spans may be found without context considered

Recall: BERT model for QA4. Which QA dataset(s) is the above formulation more often used for?(1) SQuAD(2) HotpotQA(3) GeoQueryDevlin et al. (2019)

Dialogue8

Search/QA as Dialogue‣ Dialogue is a very natural way to ﬁnd informa.on from a search engineor a QA system

Search/QA as Dialogue

Search/QA as Dialogue‣ “Has Chris PraH won an Oscar?” / “Has he won an Oscar”

QA as Dialogue‣ Dialogue is a very natural way to ﬁnd informa.on from a search engineor a QA systemIyyer et al. (2017)

QA as Dialogue‣ Conversa.onal machine reading:answer ques.ons based on apassage‣ Interes.ng and poten.ally usefulideaSaeidi et al. (2018)

This Lecture‣ QA as dialogue‣ Classic chatbots‣ Neural chatbots‣ Task-oriented dialogue

Chatbots

ELIZA‣ Created 1964-1966 atMIT, heavily scriptedEliza responds like a psychotherapistWeizenbaum (1966)

ELIZA‣ Created 1964-1966 atMIT, heavily scriptedFeatured in Young Sheldon: https://www.youtube.com/watch?v 2V9skeVray4image source: https://web.njit.edu/ ronkowit/eliza.htmlWeizenbaum (1966)

ELIZA‣ PaHern matching and subs.tu.on, using transforma.on rule(.*) you (.*) meWhy do you think I 2 you?

ELIZA‣ PaHern matching and subs.tu.on, using transforma.on ruleMy (.) (.*)What else comes to mind when youthink of your 1?

ELIZA‣ Iden.fy keyword, iden.fy context, apply transforma.on rule(.*) you (.*) meWhy do you think I 2 you?My (.) (.*)What else comes to mind when youthink of your 1?‣ Very liHle need to generate new content, but can only have one type ofconversa.on; not coherent

Cleverbot‣ Carpenter (1986), online system built in 2006

Cleverbot‣ “Nearest neighbors”: A user says statement A - Cleverbot ﬁnds ahuman response to statement A in human-human or human-computerUser: Hi there cleverbot.chats - repeat thatCleverbot: How are you?User: Doing okay. what plans do you have today?Cleverbot: I do revision.User: What are you revising for?Cleverbot: Maths history.User: How old are you?Cleverbot: That's none of your business.User: What is maths history?Cleverbot: I don’t know, farming maybeCheck ta-for-machine-learning/

Cleverbot‣ Carpenter (1986), online system built in 2006‣ “Nearest neighbors”: human says statement A, ﬁnd a human responsein human-human or human-computer chats to statement A, repeat that‣ Can ouen give sensibleanswers, but the bot doesn’treally impose high-leveldiscourse structureUser: Hi there cleverbot.Cleverbot: How are you?User: Doing okay. what plans do you have today?Cleverbot: I do revision.User: What are you revising for?Cleverbot: Maths history.User: How old are you?Cleverbot: That's none of your business.User: What is maths history?Cleverbot: I don’t know, farming maybe

Data-Driven Approacheson TwitterRiHer et al. (2011)

Data-Driven Approaches‣ Can treat as a machine transla.on problem: “translate” from currentuHerance to next oneObservation: structural relationship between Stimulus and ResponseRiHer et al. (2011)

Data-Driven Approaches‣ Can treat as a machine transla.on problem: “translate” from currentuHerance to next one‣ How to decide larger phrase (not just word-word) alignment?RiHer et al. (2011)

Data-Driven Approaches‣ Can treat as a machine transla.on problem: “translate” from currentuHerance to next one‣ How to decide larger phrase (not just word-word) alignment?‣ use sta.s.cal measures to ﬁlter out phrase pairs with low correla.onRiHer et al. (2011)

Data-Driven Approachesproposed method phrase alignmentis weakerRiHer et al. (2011)

What other approaches can you think of?RiHer et al. (2011)

Neural Chatbots

Seq2seq modelsIWhat areyou doingam going home [STOP] s ‣ Just like the machine transla.on task men.oned earlier‣ can train seq2seq models for this task

Sub.tles Data for Model Developmentdo you want to meet your sponsor for the last 10 years ?of course ! but he doesn’t want to see me !and where had you been just before ?i' d been to the palace of the legion of honor , the art gallery .yeah , we were just going to hit up taco bell .well , it' s my pleasure .

Sub.tles Datado you want to meet your sponsor for the last 10 years ?of course ! but he doesn’t want to see me !and where had you been just before ?i' d been to the palace of the legion of honor , the art gallery .yeah , we were just going to hit up taco bell .well , it' s my pleasure .‣ Why might this model perform poorly? What might it be bad at?

Lack of Diversity‣ Why?Li et al. (2016)

Lack of Diversity‣ Training to maximize likelihood gives a system that prefers commonresponses:max P(response input)Li et al. (2016)

Lack of Diversity‣ Solu.on: mutual informa.on criterion; response R should bepredic.ve of user uHerance U as well‣ Standard condi.onal likelihood: log P (R U )P (R, U )‣ Mutual informa.on: log log P (R U )P (R)P (U )log P (R)‣ log P(R) probabili.es under a language modelLi et al. (2016)

Lack of Diversity(maximum mutual info)‣ OpenSub.tles dataLi et al. (2016)

Another idea: Speciﬁcity‣ Train a speciﬁcity classiﬁer on labeled dataI don’t know SPECIFICITY 1Going to the store SPECIFICITY 3Ko, DurreH, Li (2019)

Speciﬁcity‣ Train a speciﬁcity classiﬁer on labeled dataI don’t know SPECIFICITY 1Going to the store SPECIFICITY 3‣ When training the decoder, condi.on on the automa&cally predictedspeciﬁcity of the responseIWhat aredon’t know [STOP]you doing s SPECIFICITY 1 (nonspeciﬁc)Ko, DurreH, Li (2019)

Speciﬁcity‣ At test .me, setthe speciﬁcitylevel higher toget less genericresponsesIWhat aredon’t know [STOP]you doing s SPECIFICITY 1 (nonspeciﬁc)Going toWhat areyou doingthe store [STOP] s SPECIFICITY 4 (speciﬁc)Ko, DurreH, Li (2019)

Speciﬁcityissues in the generated responses‣ Can use other models to try to ﬁx these issues. But the facts are s.llall made up, even if they make senseKo, DurreH, Li (2019)

PersonaChatZhang et al. (2018)

State of Chatbots‣ Can force chatbots to give consistent answers with a persona, buts.ll probably not very interes.ng‣ “Wizard of Wikipedia:” chatbot that can discuss topics by retrievingfrom Wikipedia [Dinan et al., 2019]‣ XiaoIce: Microsou chatbot in Chinese; according to [1], 660 milliononline users, over 30 billion conversa.ons people do seem to liketalking to them ?[1] ons-andwins-over-millions-of-fans/

Task-Oriented Dialogue

Task-Oriented Dialogue‣ Personal assistants / API front-ends:Siri, ﬁnd me a good sushirestaurant in ChelseaSushi Seki Chelsea is a sushirestaurant in Chelsea with 4.4 starson Google

Task-Oriented Dialogue‣ Personal assistants / API front-ends:Siri, ﬁnd me a good sushirestaurant in ChelseaSushi Seki Chelsea is a sushirestaurant in Chelsea with 4.4 starson GoogleHow expensive is it?Entrees are around 30 eachFind me something cheaper

Task-Oriented Dialogue‣ Customer service:Hey Alexa, why isn’t my Amazonorder here?Let me retrieve your order.Your order was scheduled to arriveat 4pm today.It never cameOkay, I can put you through tocustomer service.

Air Travel Informa.on Service (ATIS)‣ Given an uHerance, predict a domain-speciﬁc seman.c interpreta.on‣ Can formulate as seman.c parsing, but simple “slot-ﬁlling” solu.ons(classiﬁers) work well tooDARPA (early 1990s), Figure from Tur et al. (2010)

Intents‣ 29 diﬀerent intentswhich ﬂights go from cleveland to indianapolis on april ﬁuhIntent: ﬂightdoes tacoma airport oﬀer transporta.on from the airport to thedowntown areaIntent: ground servicewhat days of the week do ﬂights from san jose to nashville ﬂy onIntent: day namewhat meals are served on american ﬂight 811 from tampa to milwaukeeIntent: meal

Joint Intent Classiﬁca.on and Tagging‣ RNN jointlypredicts intentand slot tags?Liu and Lane (2016)

Joint Intent Classiﬁca.on and Tagging‣ RNN jointlypredicts intentand slot tagswhat is c here?

Joint Intent Classiﬁca.on and Tagging‣ RNN jointlypredicts intentand slot tagsLiu and Lane (2016)

Air Travel Informa.on Service (ATIS)as conversa.onal seman.c parsingSuhr et al. (2018)

Air Travel Informa.on Service (ATIS)as conversa.onal seman.c parsing‣ Need to use dialogue context to do the right thing. Here we’reappending American Airlines as a constraint to the previous query‣ seq2seq model mapping to query with copy mechanismSuhr et al. (2018)

Full Dialogue Task‣ Parsing / language understandingis just one piece of a systemYoung et al. (2013)

Full Dialogue Task‣ Parsing / language understandingis just one piece of a system‣ User uHerance - update dialogue state - take an ac.on decided bypolicy (e.g., query the restaurant database) - say somethingYoung et al. (2013)

Full Dialogue Task‣ Parsing / language understandingis just one piece of a system‣ Dialogue state: reﬂects anyinforma.on about theconversa.on (e.g., searchhistory)‣ User uHerance - update dialogue state - take an ac.on decided bypolicy (e.g., query the restaurant database) - say something‣ Much more complex than chatbots!Young et al. (2013)

‣ Full dialogue: User uHerance - update dialogue state - take an ac.ondecided by policy (e.g., query the restaurant database) - say something“Find me a good sushi restaurant in Chelsea”restaurant type - sushilocation - Chelseacurr result - execute search()“Sushi Seki Chelsea is a sushi restaurant in Chelsea with4.4 stars on Google”“How expensive is it?”get value(cost, curr result)“Entrees are around 30 each”

POMDP-based Dialogue Systems‣ POMDP: user is the “environment,” an uHerance is a noisy signal of state‣ Dialogue model: can look like a parser or any kind of encoder model‣ Generator: use templates or seq2seq modelReinforcement learning type of approachYoung et al. (2013)

Reward for comple.ng task?Find me a good sushi restaurant in Chelsearestaurant type - sushilocation - Chelseacurr result - execute search()Sushi Seki Chelsea is a sushi restaurant in Chelsea with4.4 stars on GoogleHow expensive is it? Okay make me a reserva.on! 1 make reservation(curr result)

Reward for comple.ng task?Find me a good sushi restaurant in Chelsearestaurant type sushiVery indirect signallocation - Chelseaof what shouldhappen up herecurr result - execute search()Sushi Seki Chelsea is a sushi restaurant in Chelsea with4.4 stars on GoogleHow expensive is it? Okay make me a reserva.on! 1 make reservation(curr result)

User gives reward?Find me a good sushi restaurant in ChelseaHow does the userknow the rightsearch happened?restaurant type - sushilocation - Chelseacurr result - execute search()Sushi Seki Chelsea is a sushi restaurant in Chelsea with 14.4 stars on GoogleHow expensive is it?get value(cost, curr result) 1 Entrees are around 30 each

Wizard-of-Oz‣ Learning fromdemonstra.ons: Assume“wizard” can demonstratewhat the dialogue systemshould do (e.g., how toupdate its state and takeac.ons)Kelley (early 1980s), Ford and Smith (1982)

Full Dialogue TaskFind me a good sushi restaurant in Chelseawizard entersthesewizard types thisout or invokestemplates{{restaurant type - sushilocation - Chelseacurr result - execute search()Sushi Seki Chelsea is a sushi restaurant in Chelsea with4.4 stars on Google‣ Wizard can be a trained expert and know exactly what the dialoguesystems is supposed to do

Learning from Sta.c Traces‣ Using either wizard-of-Oz or other annota.ons, can collect sta.c tracesand train from theseBordes et al. (2017)

Other issues: Language Ambiguity & Model Adap.vityFind me a good sushi restaurant in Chelsearestaurant type - sushilocation - Chelseastars - 4 curr result - execute search()‣ User asked for a “good” restaurant — does that mean we should changeour model to ﬁlter by star ra.ng? What does “good” mean?‣ Hard to change system behavior if training from sta.c traces, especially ifsystem capabili.es or desired behavior change

Goal-oriented Dialogue‣ Tons of industry interest!‣ Dozens of startups medium-sized companies in this space‣ Big Companies: Apple Siri, Google Assistant, Amazon Alexa, MicrosouCortana, Facebook, Samsung Bixby, Tencent WeChat, ASAPP‣ Lots of cool work that’s not public yet

Takeaways‣ Some decent chatbots, but unclear how to make these moresophis.cated than they are right now‣ Task-oriented dialogue systems are growing in scope and complexity —really exci.ng systems on the way‣ More and more problems are being formulated as dialogue —interes.ng applica.ons but challenging to get working well

CSE 5525: Foundaons of Speech and Language Processing Dialogue Huan Sun (CSE@OSU) Slide

Related Documents:

TO THE NEW- 2020 Passing Out Batch (Only for Unplaced ...

92 vipul sharma it 93 rishabh jain cse 94 manik arora cse 95 nishant bhardwaj cse . 96 rajit shrivastava it 97 shivansh gaur cse 98 harsh singh cse 99 shreyanshi raj cse 100 rahul bedi cse 101 pallavi anand cse 102 divya cse 103 nihal raj it 104 kanak

108 Views

2y ago

ROLL NO NAME FATHER NAME CSE-001 AAISHA MAKKAR …

cse-148 kuriakose jijo george n t george cse-149 kusum joshi ramesh chandra joshi cse-150 m mithun bose n k mohandasan cse-151 madhuri yadav rajbir yadav cse-152 malini shukla r s sharma cse-153 manisha khattar sunil kumar khattar cse-154 m

57 Views

2y ago

CSE 474 – Introduction to Embedded Systems

1 CSE 474 Introduction 1 CSE 474 – Introduction to Embedded Systems n Instructor: q Bruce Hemingway n CSE 464, Office Hours: 11:00-12:00 p.m., Tuesday, Thursday n or whenever the door is open n bruceh@cs.washington.edu q Teaching Assistants: q Cody Ohlsen, Kendall Lowrey and Ying-Chao (Tony) Tung CSE 474 Introduction 2

103 Views

3y ago

CSE 440: Introduction to HCI - University of Washington

CSE 440: Introduction to HCI CSE 441: Advanced HCI CSE 510: Advanced Topics in HCI CSEP 510: Human-Computer Interaction CSE 332: Data Structures. Who We Are You Computing. Who We Are Eunice Jun Prefer: Eunice / She / Her Background: BS,Cognitive Studies & Computer Science Vanderbilt, 2016

35 Views

1y ago

Speech Therapy (speech) - Medi-Cal

speech 1 Part 2 – Speech Therapy Speech Therapy Page updated: August 2020 This section contains information about speech therapy services and program coverage (California Code of Regulations [CCR], Title 22, Section 51309). For additional help, refer to the speech therapy billing example section in the appropriate Part 2 manual. Program Coverage

111 Views

3y ago

Digital Speech Processing - UC Santa Barbara

speech or audio processing system that accomplishes a simple or even a complex task—e.g., pitch detection, voiced-unvoiced detection, speech/silence classification, speech synthesis, speech recognition, speaker recognition, helium speech restoration, speech coding, MP3 audio coding, etc. Every student is also required to make a 10-minute

126 Views

3y ago

1) Speech articulation and the sounds of speech. 2) The ...

9/8/11! PSY 719 - Speech! 1! Overview 1) Speech articulation and the sounds of speech. 2) The acoustic structure of speech. 3) The classic problems in understanding speech perception: segmentation, units, and variability. 4) Basic perceptual data and the mapping of sound to phoneme. 5) Higher level influences on perception.

127 Views

3y ago

Guide de préparation des demandes d'autorisation ...

additif alimentaire ainsi que d’une nouvelle utilisation pour un additif alimentaire déjà permis. Les dispositions réglementaires pour les additifs alimentaires figurent à la partie B du titre 16 du RAD. L’article B.16.001 énumère les exigences relatives à l’étiquetage des additifs alimentaires. En particulier, l’article B.16.002 énumère la liste des critères qui doivent .

68 Views

3y ago

Recent Views

Finance Management for Schools Bromcom eFinance, powered .

eFinance. The Bromcom Financial Accounting System (FAS) is a purpose designed configuration of one of the world's leading financial management solutions now available to UK maintained schools, academies and multi academy trusts (MATs). Known as eFinance, at its core is a suite of modules from Unit4 Business World.

1y ago

104 Views

eFinance Budget Entry Schools

eFinance Plus Entry The boom poron of the "Expendi ture Budget Process" window will be accessible on your screen. You are now ready to enter your budget for next ﬁscal year. Enter the amount you want to allocate for your next ﬁscal year's budget in the Requested

1y ago

122 Views

Siebel eFinance for Teller Connector to IBM WebSphere .

12 Siebel eFinance for Teller Connector to IBM WebSphere Business Component Composer Guide Version 7.0, Rev. H Siebel Teller Architecture The Siebel Connector for Teller extends the functionality of the Siebel Connector for IFX XML to provide Teller-specific data exchange between Siebel and other systems.

1y ago

101 Views

Siebel eFinance ガイドバージョン6.0

siebelﬁ ebusiness applications siebel efinance ガイド siebel 2000 バージョン6.0.2 2000 年7 月 6jpa1-fb00-06020 sfsbank.book 1 ページ 2001年5月29日火曜日午後5時42分

1y ago

97 Views

1 2 4 5 7 8 9 10

The eFinance Plus Accounting, Human Resources and Payroll System are supported by D&N. This system is an online interactive package designed to handle all phases of K-12 school business. ESU#3/D&N is supporting a new time clocking system called Time Clock Plus. This clocking system will integrate with eFinance Plus as well

1y ago

105 Views

IHRE ONLINE FINANZIERUNG: eFINANCE

eFinance bietet Ihnen die Möglichkeit, Finanzierungs-produkte ab sofort ganz einfach online zu beantragen. In einem transparenten und strukturierten Prozess können Sie die notwendigen Dokumente sicher übermitteln, mit uns verhandeln, und auch elektronisch unterzeichnen. Außerdem können Sie mit Ihrem Kunden-

1y ago

144 Views

Relatório Anual 2014

Prêmio efinance 2014 O Sicredi foi o vencedor da categoria Plataforma de Canais do XIII Prêmio efinance com o case Plataforma Multicanal. A Plataforma Multicanal foi desenvolvida para renovar a tecnologia utilizada nos canais de relacionamento da instituição financeira cooperativa com os associados. Julho

1y ago

102 Views

eFinance Travel Voucher Guide - National Defense University

filling out the travel voucher (CONUS-CONUS). - If it is your current address, check the box. 5 America’s Airmen Dependents - Add all dependents. - If the individual will be claimed on the voucher, click “auto-claim this dependent” before adding them. 6 America’s Airmen

2y ago

102 Views

E-Finance in the Philippines: Status and Prospects for Digital .

the role of digital technology in financial inclusion has not been studied in detail. There has been very limited information available in the existing literature that examines the role of efinance in achieving the objective- of inclusive growth. This paper is an attempt to study the contribution of technology towards financial inclusion in

1y ago

100 Views

Wiener Processes and Ito's Lemma - efinance .cn

Categorization of Stochastic Processes Discrete time; discrete variable Random walk: if can only take on discrete values Discrete time; continuous variable

1y ago

102 Views

AIC eServices for Financing Schemes (eFASS) Navigation Guide

Schemes (eFASS) platform at https://eFinance.aic.sg For detailed steps, refer to page 3 of this navigation guide. Yes, you can apply on behalf of someone in your family.

1y ago

113 Views

2016-2017 Financial Services Guidelines

Receiving POs in eFinance 41 Staff Travel 42 Student Travel 43 Accounts Payable Forms and Instructions 43 TRAVEL PROCEDURES GUIDELINES 45 Required Documentation and Steps 46 Step 1 - Conference Approval 46 Step 2 - Conference Requisition Request 46 . 4 Step 3 - Conference Purchase Order/Payment 46 .

1y ago

103 Views

Introducing the New and Revised Data Points in HMDA

added two e numerations ( "cash -out r efinance" an d " other p urpose") t o Loan P urpose, an d s plit the "non-owner o ccupied" category o f Occupancy Type i nto " se cond r esidence" a nd " in vestment propert y." In ad dition, un der t he 20 15 H MDA R ule, ap plicants h ave t he o ption t o s elf -identify

1y ago

93 Views

Data Point: 2018 Mortgage Market Activity and Trends

The number of r efinance o riginations declined from 2.5 million in 2017 to 1.9 million in 2018. The number of reported home improvement loans declined from 549 ,000 in 2017 to 183,000 in 2018 , a drop that resulted primarily from a change in reporting requirements that excluded unsecured home improvement loans . 5

1y ago

96 Views

Ankeny Community Schools 306 Sw School St. Fixed Asset Inventory and .

reconciliation. ACSD is currently using the Fixed Assets Module of eFinance Plus software to track assets. Vendor will perform all labor to conduct a comprehensive inventory at ACSD site locations. During the inventory process, all of the following information will be captured for each item Asset Identification Information

1y ago

121 Views

CSE 5525: Foundaons Of Speech And Language Processing .

It looks like you're using an ad-blocker