From Linguist In NLP To Humanist In AI: How A Linguist's .


From Linguist in NLP to Humanist in AI: How a Linguist's Perspective on Data Has Informed My Work on Ethics in NLP
Emily M. Bender
University of Washington
Widening NLP @ ACL 2019
28 July 2019
@emilymbender

My journey into computational linguistics
- Discovered linguistics freshman year of university; AB (UC Berkeley), MA, PhD (Stanford), all in Linguistics
- First programming language: Logo (4th grade)
- First programming class: CS 60A @ Cal, in Scheme
- Concurrently: Morphology with Sharon Hargus & TA David A. Peterson
- First compling project: Bantu morphological analyzer in Scheme

My journey into computational linguistics
- Grad school: Introduction to computational linguistics (Martin Kay), phenomenology (Terry Winograd)
- RAship in grammar engineering, with Ivan Sag and Dan Flickinger
- Dissertation (2001): Syntactic Variation and Linguistic Competence: The Case of AAVE Copula Absence
- No luck on the job market as syntactician or sociolinguist
- Short stint in industry (YY Technologies) as a grammar engineer for Japanese
- Laid off in first dot-com bust @ 7 months pregnant

My journey into computational linguistics
- While at YY, started the Grammar Matrix, in connection with Project DeepThought
- After a couple more years of temporary positions, hired by UW Linguistics to start the CLMS program
- At the time: strong language group in EE working on MT & ASR (Mari Ostendorf, Jeff Bilmes, Katrin Kirchhoff)
- CSE had AI/IE folks, who worked with language data
- Language per se vs. information encoded in language

Outline
- The field as I found it in 2003 and where it is now
- Current issues
- What a linguist can bring
- There’s more to NLP than SOTA
- Towards more interdisciplinary, multilingual and ethical NLP
- How we can do better

A linguist’s eye view of the recent history of NLP
- 2000-2015: Machine learning “versus” rule-based systems (aka linguistics)
  - Machine learning in the service of building better NLP applications
  - But also and increasingly: NLP as a proving ground for ML
  - Role for linguistics in feature engineering
- 2015-now: Deep learning
  - NLP as proving ground for DL
  - No need for feature engineering: off-load understanding how to represent data to the machines
  - End-to-end everything
  - But also: work asking what is it that the big models are learning?

What’s the problem?
- End-to-end, task-focused research entails always looking through the window
- Miss that language itself has structure
- A language is a general purpose communication tool; a whole pile of systems trained on end-to-end tasks won’t be

What’s the problem?
- Languages have structure which varies (within bounds) across languages
- Linguistically naïve != language independent (Bender 2009)
- See also Typ-NLP Workshop on Thursday

Aside: the #BenderRule
- “Always state the name of the language you are working on, even if it is English”
- Coined by (at least) Nathan Schneider, Yuval Pinter, Rob Munro & Andrew Caines

Aside: the #BenderRule
- I invite you to join me in asking authors, if they don’t specify, which language(s) they tested their systems on
- Why does this matter, if we always know it’s English unless otherwise specified?
- Status quo: Work on non-English is “language specific”, work on English is “NLP”
- But English is just one language, like any other, and not representative of all!
- A window with its own specific pattern of raindrops

How is English non-representative?
- It’s a spoken language, not a signed language
- It has a well-established, long-used, roughly phone-based orthographic system
  - with white space between words
  - using (mostly) only lower-ascii characters
- It has relatively little morphology and thus fewer forms of each word
- It has relatively fixed word order
- English forms might ‘accidentally’ match database field names, ontology entries, etc.
- It has massive amounts of training data available (like the 3.3B tokens used to train BERT (Devlin et al 2019))

What’s the problem?
- If we’re always looking through the window, we miss the variation within languages
- Sociolinguists have found that variation correlates with speaker demographic characteristics, speech situation & more (e.g. Labov 1966)
- Speakers attach social meaning to linguistic variation and use it to construct & project identities (e.g. Eckert and Rickford 2001)
- Sociolinguistically naïve NLP will miss these realms of meaning
- Sociolinguistically naïve NLP won’t work equally well for all users (even in high resource languages)

What’s the problem?
- If we’re always looking through the window, we risk mistaking the scene on the other side for “ground truth”
- Work on learning world knowledge or “common sense” from corpora conflates what people say about the world with ground truth
  - “Black sheep” problem (Meg Mitchell, pc)
  - Poor performance of sentiment analyzers because of toxic discourse about immigration in the US (Speer, 2017)

How can linguists/linguistics help? Understanding the structure of language
- Not just for rule-based systems!
- Feature engineering (where applicable)
- Design of ancillary tasks (see Smith 2017)
- Error analysis
- Design of annotation schemes and expert annotation: Without it, we can’t know if we’ve solved the problem
- The field should value this work (see Heinzerling 2019)

How can linguists/linguistics help? Understanding variation in language
- Where might our assumptions fail for a different language?
- How do we ensure that deployed models work equitably
  - For all users
  - For all indirect stakeholders (see Friedman & Hendry 2019)

How can linguists/linguistics help? Understanding relationship between form & meaning
- Form: text, speech, sign (plus paralinguistic information like gesture or tone)
- Conventional/standing meaning: logical form (or equivalent) that the linguistic system pairs with that form
- Communicative intent of the speaker: what they are publicly committed to by uttering that form (plus additional plausibly deniable inferences)
- Relationship between communicative intent & the world, e.g.:
  - True assertion, mistaken assertion, lie, accidentally true assertion, social act related to construction of social world, question about the interlocutor’s beliefs, ...

How can linguists/linguistics help? Stepping off the SOTA treadmill
- Linguistics encourages us to:
  - understand our data
  - be interested in the linguistic form itself, and see the raindrops as distinct from the view on the other side
- Language is always changing, but on a very different time scale than current NLP leaderboards!

When the SOTA is all that counts
- SOTA chasing encourages a frenetic pace, especially in combination with arXiv
- We lose researchers who can’t just drop everything to stay up working all night
- We don’t have time for “research slow” (see Kan 2018), or to understand how and why systems work as they do (Niven & Kao 2019)
- Which SOTA? Just for English? Multilingual? Reproducible? (see Fokkens 2017)

Interdisciplinarity
- NLP/CL is at the intersection of: linguistics, CS, statistics, EE, ...
- NLP/CL connects with: biomedical informatics, computational social science, data science, ...
- Being interdisciplinary is about cooperation, not competition
- We are working on problems that require multiple kinds of expertise to solve, and we’ll get there by learning from each other

Towards promoting interdisciplinarity in NLP
- Tutorials at NAACL 2012 and ACL 2018: “100 things you always wanted to know about linguistics, but were afraid to ask for fear of being told 1000 more”
- Linguistic Fundamentals for Natural Language Processing: 100 Essentials from Morphology and Syntax (Morgan & Claypool, 2013)
- Linguistic Fundamentals for Natural Language Processing II: 100 Essentials from Semantics and Pragmatics (Morgan & Claypool, forthcoming 2019)

Towards promoting interdisciplinarity in NLP
- COLING 2018 PC activities (with Leon Derczynski)
  - Paper types, including “computer assisted linguistic analysis”
  - Review forms emphasizing error analysis and hypothesis testing
  - 9 Best Paper awards, across different categories
- For details, see the COLING 2018 PC blog: http://coling2018.org/category/pc-blog/

Towards promoting interdisciplinarity in NLP
- UW’s Computational Linguistics Master of Science (CLMS) curriculum design:
  - 3 of 9 courses are in linguistics (exceptions for those who already have ling degrees)
  - cross-cutting themes emphasize multilinguality, ambiguity resolution and ethical considerations
  - recruit cohorts with diverse training and promote collaborative learning
  - prerequisite: introduction to linguistics

Towards more multilingual NLP
- Bender 2009 “Linguistically naïve != language independent”
- Bender 2011 Dos & don’ts for language independent NLP, including:

Towards more ethical NLP: Data Statements (Bender & Friedman 2018)
- CLMS advisory board member Lesley Carmichael suggested we should include ethics in the curriculum (late 2015 or early 2016)
- After trying & failing to find someone to teach it, decided to try myself:
  - Ling 575: Ethics and NLP, WI 2017 http://faculty.washington.edu/ebender/2017_575/
- While preparing that course, fortuitously met Batya Friedman (UW iSchool)
- Guest lecture by Friedman on value sensitive design (https://vsdesign.org/)

Data Statements for NLP
Proposed Schema: Long Form
A. Curation Rationale
B. Language Variety
C. Speaker Demographic
D. Annotator Demographic
E. Speech Situation
F. Text Characteristics
G. Recording Quality
H. Other
I. Provenance Appendix
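
One possible way to make the long-form schema machine-checkable is sketched below. It is only an illustration, not part of the talk or of Bender & Friedman (2018): the class name `DataStatement`, the field names (derived from sections A through I of the slide), the helper `missing_sections`, and the example values are all hypothetical.

```python
from dataclasses import dataclass, fields
from typing import List


@dataclass
class DataStatement:
    """Hypothetical container with one free-text field per schema section (A-I)."""

    curation_rationale: str = ""      # A
    language_variety: str = ""        # B
    speaker_demographic: str = ""     # C
    annotator_demographic: str = ""   # D
    speech_situation: str = ""        # E
    text_characteristics: str = ""    # F
    recording_quality: str = ""       # G
    other: str = ""                   # H
    provenance_appendix: str = ""     # I

    def missing_sections(self) -> List[str]:
        """Return the names of sections that are still empty or whitespace-only."""
        return [f.name for f in fields(self) if not getattr(self, f.name).strip()]


# Illustrative values only; a real data statement would describe a real dataset.
statement = DataStatement(
    curation_rationale="Product reviews sampled to study sentiment analysis",
    language_variety="English (illustrative placeholder description)",
)
print(statement.missing_sections())
```

A reviewer or dataset consumer could use a check like `missing_sections()` to see at a glance which parts of a data statement have not yet been written.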

Why NLP Needs Data Statements
- Systems trained on naturally occurring text learn the biases held by the authors of the text (pre-existing bias)
  - Word embeddings pick up gender (e.g. Bolukbasi et al 2016) and race/ethnicity bias (e.g. Speer 2017)
  - Machine learning systems can amplify the biases they learn (e.g. Zhao et al 2017)
- Systems trained on one subpopulation don’t work as well for others (emergent bias)
  - POS tagging (Hovy and Søgaard, 2015; Jørgensen et al., 2015); ASR engines (Tatman, 2017)

How do data statements help?
- Emergent bias: Procurers, consumers and advocates can check whether a system is trained on appropriate data for its deployed use case
- Emergent bias: As a field, we can track what speaker populations are underserved
- Pre-existing bias: Knowing what kind of texts a system is trained on can be key to working out the source of bias, as in Speer’s (2017) study of word embeddings and sentiment analysis

Data statements alone won’t ‘solve’ bias, but if we do not make a commitment to data statements or a similar practice for making explicit the characteristics of datasets, then we will single-handedly undermine the field's ability to address bias.

Suggested actions
- Write (and look for) data statements :)
- As a reviewer, value work that
  - Explores NLP for lower resource languages
  - Provides careful error analysis
  - Provides careful success analysis
- Value the interdisciplinary nature of our field
  - Learn enough of the other pillars to engage in meaningful collaboration

Suggested actions
- Step off the SOTA treadmill
  - If you’re worried about being scooped, there’s probably a more interesting question you could be pursuing
  - But how do we change the field, so that we can succeed as individuals with fewer, more thoughtful publications?

Suggested actions
- Where you get the opportunity, value analytical work in addition to (or even above) ‘SOTA’
- Avoid using ‘technical’ to mean ‘involves math/programming’
- Advocate for reviewing structures that value crosslinguistic and/or analytical work (see COLING 2018)
- When people don’t state the language they’re working on, ask :)
  - Feel free to blame this awkward asking-the-obvious question on me
- Engage broadly with emerging conversation about ethics and NLP and ethics and AI

Thank you!
- Slides available online: http://faculty.washington.edu/ebender/slides.html
- Twitter: @emilymbender
