Exploring Word Order Universals: A Probabilistic Graphical Model Approach

1 Introduction

Previous statistical methods in the study of word order universals have yielded interesting results, but they make strong assumptions and require a considerable amount of data preprocessing to make the data fit the statistical model (Greenberg, 1963; Hawkins, 1983; Dryer, 1989; Nichols, 1986; Justeson & Stephens, 1990). Recent studies using probabilistic models are much more flexible and handle noise and uncertainty better (Daumé & Campbell, 2007; Dunn et al., 2011). However, these models still rely on strong theoretical assumptions and heavy data treatment, such as using only two values of word order pairs while discarding the others, purposefully selecting a subset of the languages to study, or selecting only partial data with complete values. In this paper we introduce a novel approach that uses a probabilistic graphical model (PGM) to study word order universals. With this model we obtain a graphical representation of the structure of language as a complex system composed of linguistic features, and the relationships among these features can be quantified as probabilities.

2 Method

Probabilistic graphical models combine a graphical representation with a complex distribution over a high-dimensional space (Koller & Friedman, 2009). There are two advantages to using such a model to study word order universals. First, the graphical structure can reveal a much finer structure of language as a complex system. We assume there is a meta-language that has the universal properties of all the languages in the world, and we want a model that can represent this meta-language and make inferences about the linguistic properties of new languages. This system is composed of multiple sub-systems such as phonology, morphology, and syntax, which correspond to the subfields of linguistics; in this paper we focus on the word order sub-system only. The other advantage of a PGM is that it enables us to quantify the relationships among word order features. A PGM of the word order subsystem encodes a joint probability distribution over all word order feature pairs, so probabilities can describe our degree of confidence about the uncertain nature of word order correlations.

The WALS data pose a difficulty for statistical methods because the languages are not independent and identically distributed, due to relatedness in genealogy or geography. To address the problem of limited data we use model averaging over bootstrap replicates; to address the dependence among languages we select each subset randomly and learn a DAG (directed acyclic graph) structure for that subset. First we use the bootstrap to create a resample from the original dataset. Then we randomly divide the samples into four groups with equal numbers of languages and learn the DAG structure and conditional probabilities for each subset. Finally, using the graph fusion algorithm (Matzkevich & Abramson, 1992), we combine all the graphs into a final consensus DAG structure and use the original data to learn its parameters.
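As a rough illustration of this pipeline, the sketch below uses the Python library pgmpy, which is not the toolchain used in the paper (inference in the paper is done in SamIam, and the structure-learning algorithm is not specified). The hill-climbing search with a BIC score and the simple cycle-avoiding edge union that stands in for the topological fusion step are assumptions made for illustration only.

```python
# Sketch: bootstrap resampling, per-subset structure learning, a simple
# graph-combination step, and parameter learning on the original data.
import numpy as np
import pandas as pd
import networkx as nx
from pgmpy.estimators import HillClimbSearch, BicScore, MaximumLikelihoodEstimator
from pgmpy.models import BayesianNetwork

def learn_consensus_dag(data: pd.DataFrame, n_groups: int = 4, seed: int = 0):
    # 1. Bootstrap resample of the languages (rows).
    resample = data.sample(n=len(data), replace=True, random_state=seed)

    # 2. Split the resample into equal-sized random groups and learn one DAG per group.
    shuffled = resample.sample(frac=1.0, random_state=seed)
    groups = np.array_split(shuffled, n_groups)
    edge_sets = [set(HillClimbSearch(g).estimate(scoring_method=BicScore(g)).edges())
                 for g in groups]

    # 3. Combine the subset DAGs. The paper uses the topological fusion of
    #    Matzkevich & Abramson (1992); a plain edge union that skips
    #    cycle-creating edges stands in for it here.
    consensus = nx.DiGraph()
    consensus.add_nodes_from(data.columns)
    for edges in edge_sets:
        for u, v in edges:
            consensus.add_edge(u, v)
            if not nx.is_directed_acyclic_graph(consensus):
                consensus.remove_edge(u, v)

    # 4. Learn the conditional probability tables on the original data.
    model = BayesianNetwork(list(consensus.edges()))
    model.add_nodes_from(data.columns)
    model.fit(data, estimator=MaximumLikelihoodEstimator)
    return model
```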
The final consensus DAG structure is shown in Figure 1. From this graph we can see that the word order features sit on different tiers of the hierarchy. The root S O V “dominates” all the other features; O V is an important node, since it directly “dominates” three other branches of nodes; the noun modifiers and noun are in the middle tier, while Neg V, AdSub Cl, IntPhr and Num N are the leaf nodes, the least important features in terms of their contribution to the word order properties of a language.

Figure 1. DAG for our PGM model

3 Results

We use SamIam¹ to run probabilistic inference queries, since it has an easy-to-use interface for such queries. Figure 2 gives an example: when we know that a language is SV and NegV, we can obtain the probabilities of all values of the other features of this language.

Figure 2. Query example in SamIam

The other type of query is MAP, which aims to find the most likely assignment to all of the unobserved variables. For example, when we only know that a language is VO, we can use a MAP query to find the combination of values with the highest probability (0.0032, as shown in Table 1).

¹ http://reasoning.cs.ucla.edu/samiam/
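The two query types described above could be run on a fitted model like the one in the earlier sketch; the following is a hedged pgmpy equivalent rather than the SamIam workflow actually used in the paper. The CSV path and the feature and state names ("SV", "NegV", "VO", ...) are hypothetical placeholders, not the exact WALS codes.

```python
# Sketch: a conditional probability query and a MAP query, using pgmpy instead
# of SamIam. learn_consensus_dag() is the function from the earlier sketch.
import pandas as pd
from pgmpy.inference import VariableElimination

data = pd.read_csv("wals_word_order_features.csv")   # hypothetical extract of WALS
model = learn_consensus_dag(data)
infer = VariableElimination(model)

# Conditional probability query: P(other feature | SV, NegV), as in Figure 2.
posterior = infer.query(
    variables=["OrderOfObjectAndVerb"],
    evidence={"OrderOfSubjectAndVerb": "SV", "OrderOfNegativeMorphemeAndVerb": "NegV"},
)
print(posterior)

# MAP query: the most likely joint assignment of all unobserved features
# given that the language is VO, as in Table 1.
map_assignment = infer.map_query(
    variables=[v for v in model.nodes() if v != "OrderOfObjectAndVerb"],
    evidence={"OrderOfObjectAndVerb": "VO"},
)
print(map_assignment)
```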

Table 1. MAP query example

One more useful function is to calculate the likelihood of a language in terms of its word order properties. If the values of all 13 features of a language are known, then the probability (likelihood) of having such a language can be calculated. We calculated the likelihood of eight languages and obtained the results shown in Figure 3.

Figure 3. Likelihood of eight languages in terms of word order properties

As we can see, English has the highest likelihood to be a language, while Hak Chinese has the lowest. German and French have similar likelihoods; Portuguese and Spanish are similar to each other but lower than German and French. In other words, English is a typical language with regard to word order properties, while Hak Chinese is an untypical one.
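A sketch of this likelihood computation on the model and data from the earlier sketches: the joint probability of a complete feature assignment is the product, over all nodes, of each feature's conditional probability given its parents in the DAG. The helper name and the use of pgmpy CPDs are our own illustration, not the paper's procedure.

```python
import math

def language_log_likelihood(model, assignment):
    """Log-probability of a complete word order profile under the fitted PGM.

    assignment maps every feature (node) in the model to its observed value;
    the joint factorizes as the product over nodes of P(node | parents).
    """
    logp = 0.0
    for node in model.nodes():
        cpd = model.get_cpds(node)
        # cpd.variables lists the node itself plus its parents in the DAG.
        states = {var: assignment[var] for var in cpd.variables}
        logp += math.log(cpd.get_value(**states))
    return logp

# Example: score one language's row of feature values, e.g. the first row of data.
profile = data.iloc[0].to_dict()
print(math.exp(language_log_likelihood(model, profile)))
```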

4 Evaluation and Implications

We carried out a qualitative evaluation by comparing our results with well-known findings in word order correlation studies: those of Greenberg, Dryer, and Daumé and Campbell.

Comparison with Greenberg's and Dryer's work

Universals                     Dependencies                           UNIV
U2: ADP NP - N G               POST - GN                              88.51
                               PRE - NG                               74.63
                               GN - POST                              80.11
                               NG - PRE                               86.08
U3: VSO - PRE                  VSO - PRE                              83.61
U4: SOV - POST                 SOV - POST                             90.88
U5: SOV & NG - NA              SOV & NG - NA                          69.36
U9: PoQPar - ADP NP            Initial - PRE                          43.12
                               Final - POST                           50.81
                               PRE - Initial                          15.07
                               POST - Final                           13.99
U10: PoQPar - VSO              all values of PoQPar - VSO below 10%
U11: IntPhr - VS               Initial - VS
U12: VSO - IntPhr              VSO - Initial
                               SOV - Initial
                               SOV - not initial
U17: VSO - A N                 VSO - A N
U18&19: A N - Num N, Dem N     AN - NumN
                               AN - DemN
                               NA - NNum
                               NA - NDem
U24: RN - POST (or AN)         RN - POST
                               RN - AN

Table 2. Comparison with Greenberg's work

                               OV - UNIV              VO - UNIV
Correlated pairs
postposition                   90.88                  preposition            83.66
GenN                           87.92                  NGen                   67.83
RelN                           39.11                  NRel                   94.36
SQ (final Q)                   35.69                  QS                     15.13
S-AdSub                        30.82                  AdSub-S                84.69
"wh" phrase in situ            67.86                  initial "wh" phrase    32.02
Non-correlated pairs
A N                            30.11                  N A                    66.53
DEM N                          54.41                  N DEM                  55.66
NUM N                          48.81                  N NUM                  55.10
DEG A                          45.77                  A DEG                  37.47
NEG V                          31.05                  V NEG                  16.93

Table 3. Comparison with Dryer's work
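The figures in Tables 2 and 3 are conditional probabilities between word order feature values read off the model. A minimal sketch of one such query on the pgmpy model from the earlier sketches; the feature and state names used here are assumed placeholders, not the actual WALS codes.

```python
# Sketch: a conditional probability of the kind reported in Tables 2 and 3,
# e.g. the probability of postpositions given SOV order (cf. the SOV - POST row).
u4 = infer.query(
    variables=["OrderOfAdpositionAndNP"],
    evidence={"OrderOfSubjectObjectVerb": "SOV"},
)
print(u4)  # the "Postpositions" entry corresponds to the SOV - POST row
```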

Comparison with Daumé and Campbell's work

We compared the probabilities of single value pairs for the top ten universals with Daumé and Campbell's results, shown in the following graphs (p(true) is the probability of having the particular implication; prob is the probability calculated in a different way, which is not specified²).

² http://www.umiacs.umd.edu/~hal/WALS/

Figure 4. Comparison with Daumé and Campbell's DIST model (p(true), prob and PGM probabilities for the first ten universals)

Figure 5. Comparison with Daumé and Campbell's HIER model (p(true), prob and PGM probabilities for the first ten universals)

It can be seen that our model yields moderate values that fall between the two probabilities in Daumé and Campbell's results. In Figure 4 the four universals with the biggest gaps are 1) VS - VO, 2) OV - SV, 8) Noun-Genitive - Initial subordinator word, and 9) Noun-Genitive - Prepositions; in Figure 5 the two universals with the biggest gaps are 6) Prepositions - VO and 7) Genitive-Noun - Postpositions. Our model shows that the word order pair S V and O V has a higher dependency than in the DIST model, and that the pair ADP NP and G N has a lower dependency than in both models.

Probabilistic graphical modeling provides solutions to the problems we have noted in current studies of word order universals, summarized in the following table:

Problem                                        Solution
only deal with individual features             take language as a complex system
hard to quantify strength of relationships     probabilities can measure the strength of dependencies
interaction between features not clear         nodes are connected to each other in different ways
direction and flow of influence                arrows in the graph
preprocessing of data                          none
remove values of features                      very little
Null Hypothesis Significance Testing           probability theory

Table 5. Summary of advantages of PGM for the study of word order universals

References

Bickel, B. 2010a. Absolute and statistical universals. In Hogan, P. C. (ed.), The Cambridge Encyclopedia of the Language Sciences, 77-79. Cambridge: Cambridge University Press.
Bickel, B. 2010b. Capturing particulars and universals in clause linkage: a multivariate analysis. In Bril, I. (ed.), Clause-hierarchy and clause-linking: the syntax and pragmatics interface, 51-101. Amsterdam: Benjamins.
Croft, William. 2003. Typology and universals. 2nd edn. Cambridge: Cambridge University Press.
Chickering, D. M., D. Heckerman & C. Meek. 1997. A Bayesian approach to learning Bayesian networks with local structure. In Proceedings of the Thirteenth Conference on Uncertainty in Artificial Intelligence (UAI'97).
Koller, Daphne & Nir Friedman. 2009. Probabilistic Graphical Models: Principles and Techniques (Adaptive Computation and Machine Learning). Cambridge, MA: MIT Press.
Dryer, M. S. 1989. Large linguistic areas and language sampling. Studies in Language 13. 257-292.
Dryer, Matthew S. & Martin Haspelmath (eds.). 2011. The World Atlas of Language Structures Online. München: Max Planck Digital Library.
Dryer, Matthew S. 2011. The evidence for word order correlations. Linguistic Typology 15. 335-380.
Dunn, Michael, Simon J. Greenhill, Stephen C. Levinson & Russell D. Gray. 2011. Evolved structure of language shows lineage-specific trends in word-order universals. Nature 473. 79-82.
Jaynes, E. T. 2003. Probability Theory: The Logic of Science. Cambridge: Cambridge University Press.
Greenberg, J. H. 1963. Some universals of grammar with particular reference to the order of meaningful elements. In Greenberg, J. H. (ed.), Universals of Language, 73-113. Cambridge, MA: MIT Press.
Greenberg, Joseph H. 1966. Synchronic and diachronic universals in phonology. Language 42. 508-517.
Greenberg, J. H. 1969. Some methods of dynamic comparison in linguistics. In Substance and Structure of Language, 147-203.
Daumé, H. & L. Campbell. 2007. A Bayesian model for discovering typological implications. In Annual Meeting - Association for Computational Linguistics, vol. 45, no. 1, 65.
Hawkins, John A. 1983. Word Order Universals. Academic Press.
Justeson, J. S. & L. D. Stephens. 1990. Explanations for word order universals: a log-linear analysis. In Proceedings of the XIV International Congress of Linguists, vol. 3, 2372-2376.
Leray, P. & O. Francois. 2004. BNT structure learning package: documentation and experiments.
Maslova, Elena & Tatiana Nikitina. 2010. Language universals and stochastic regularity of language change: evidence from cross-linguistic distributions of case marking patterns. Manuscript.
Maslova, Elena. 2000. A dynamic approach to the verification of distributional universals. Linguistic Typology 4. 307-333.
Matzkevich, I. & B. Abramson. 1992. The topological fusion of Bayes nets. In Proceedings of the Eighth Conference on Uncertainty in Artificial Intelligence, 191-198. Morgan Kaufmann.
Singh, Moninder. 1997. Learning Bayesian networks from incomplete data. In Proceedings of the 14th National Conference on Artificial Intelligence (AAAI). Menlo Park: AAAI Press.
Murphy, K. 2001. The Bayes Net Toolbox for MATLAB. Computing Science and Statistics 33(2). 1024-1034.
Perkins, Revere D. 1989. Statistical techniques for determining language sample size. Studies in Language 13. 293-315.
Croft, William, Tanmoy Bhattacharya, Dave Kleinschmidt, D. Eric Smith & T. Florian Jaeger. 2011. Greenbergian universals, diachrony and statistical analyses [commentary on Dunn et al., Evolved structure of language shows lineage-specific trends in word-order universals]. Linguistic Typology 15. 433-453.
