BNC Sampler: XML Edition - [bnc] British National Corpus


BNC Sampler: XML edition
July 31, 2008

1 What is the BNC Sampler?

The BNC Sampler is a subset of the British National Corpus (BNC). This document offers a short introduction to the BNC Sampler corpus, with an outline of the areas where the Sampler differs from the BNC XML Edition. Detailed information about the design and construction of the BNC can be found in the Reference Guide for the British National Corpus (XML Edition) at http://www.natcorp.ox.ac.uk/XMLedition/URG/.

The BNC Sampler consists of two collections of written and spoken material of about one million words each, originally compiled to mirror the composition of the full BNC as far as possible. The BNC Sampler was initially used for a tagging enhancement project at Lancaster University. It was annotated with a more detailed set of part-of-speech tags than the BNC and the annotation was manually checked and post-edited. The results of this tagging enhancement project fed into the development of the second edition of the BNC (BNC World, 2001).

The BNC Sampler is thus likely to be of interest to:

- those who wish to use a smaller corpus with equal amounts of written and spoken material;
- those who want a resource with highly detailed and accurate part-of-speech annotation.

1.1 Composition of the BNC Sampler

As noted above, one motivation for the original design of the BNC Sampler was that it should reflect as far as possible the full variety of text types in the full BNC, even though the BNC itself was not yet complete at the time the texts were selected. The texts were chosen and classified according to the full range of selection criteria defined at the start of the BNC project, and thus form an interesting example of ‘balanced’ corpus design.
The following tables demonstrate the composition of the Sampler with respect to all of the original BNC design criteria.

[Table columns: Size in Kbyte, Size in w-units, Size in s-units]

Text type
                                words             s-units          texts
Spoken demographic              ...52 (24.77%)    52144 (41.06%)   47
Spoken context-governed         496852 (24.92%)   24174 (19.03%)   51
Written books and periodicals   888522 (44.57%)   44852 (35.32%)   69
Written-to-be-spoken            18121 (0.90%)     1056 (0.83%)     3
Written ...                     96178 (4.82%)     4750 (3.74%)     14
Total                           496852            24174            184

1.1.1 Spoken texts

Table 3: Domain for context-governed spoken ...
                                words             s-units          texts
...                             80463 (16.19%)    7322 (30.28%)    9
...                             134275 (27.02%)   5673 (23.46%)    13
...itutional                    145508 (29.28%)   4816 (19.92%)    14
Leisure                         136606 (27.49%)   6363 (26.32%)    15
Total                           493852            52144            51

Table 4: Age band of demographic respondent
                                words             s-units          texts
0-14                            22387 (4.53%)     1254 (2.40%)     2
15-24                           64652 (13.09%)    7471 (14.32%)    6
25-34                           135973 (27.53%)   11640 (22.32%)   12
35-44                           97834 (19.81%)    13724 (26.31%)   10
45-59                           107112 (21.68%)   12619 (24.20%)   11
60+                             65894 (13.34%)    5436 (10.42%)    6
Total                           493852            52144            47

Table 5: Social class of demographic respondent
                                words             s-units          texts
AB                              164933 (33.39%)   13383 (25.66%)   16
C1                              98700 (19.98%)    9641 (18.48%)    9
C2                              137686 (27.88%)   18619 (35.70%)   14
DE                              92533 (18.73%)    10501 (20.13%)   8
Total                           493852            52144            47

Table 6: Sex of demographic respondent
                                words             s-units          texts
Male                            241493 (48.89%)   24183 (46.37%)   23
Female                          252359 (51.10%)   27961 (53.62%)   24
Total                           990704            76318            47

Table 7: Spoken interaction type
                                words             s-units          texts
Monologue                       167714 (16.92%)   7196 (9.42%)     18
Dialogue                        822990 (83.07%)   69122 (90.57%)   80
Total                           990704            76318            98

Table 8: Region where spoken
                                words             s-units          texts
Unknown                         54129 (5.46%)     1164 (1.52%)     6
South                           375312 (37.88%)   27688 (36.27%)   37
Midlands                        199666 (20.15%)   14988 (19.63%)   19
North                           361597 (36.49%)   32478 (42.55%)   36
Total                           1002821           50658            98

1.1.2 Written texts

Table 9: Author age band
                                words             s-units          texts
Unknown                         935786 (93.31%)   49128 (96.97%)   81
35-44                           26550 (2.64%)     0 (0%)           1
45-59                           7629 (0.76%)      232 (0.45%)      2
60+                             32856 (3.27%)     1298 (2.56%)     2
Total                           1002821           50658            86

Table 10: Author sex
                                words             s-units          texts
Unknown                         405633 (40.44%)   20091 (39.66%)   39
Male                            396786 (39.56%)   22145 (43.71%)   35
Female                          195581 (19.50%)   8142 (16.07%)    11
Unknown                         4821 (0.48%)      280 (0.55%)      1
Total                           1002821           50658            86

Table 11: Author type
                                words             s-units          texts
Corporate                       79369 (7.91%)     4285 (8.45%)     9
Multiple                        368323 (36.72%)   17458 (34.46%)   32
Sole                            550136 (54.85%)   28446 (56.15%)   43
Unknown                         4993 (0.49%)      469 (0.92%)      2
Total                           1002821           50658            86

Table 12: Audience age
                                words             s-units          texts
Child                           23700 (2.36%)     2326 (4.59%)     3
Teenager                        30110 (3.00%)     3673 (7.25%)     4
Adult                           946106 (94.34%)   44449 (87.74%)   78
Any                             2905 (0.28%)      210 (0.41%)      1
Total                           1002821           50658            86

Table 13: Domain for written texts
                                words             s-units          texts
Imaginative                     233774 (23.31%)   21332 (42.10%)   18
natural & pure science          35456 (3.53%)     774 (1.52%)      5
applied science                 106193 (10.58%)   5494 (10.84%)    10
social science                  76211 (7.59%)     3438 (6.78%)     10
world affairs                   306921 (30.60%)   9201 (18.16%)    23
commerce & finance              60270 (6.01%)     3613 (7.13%)     6
arts                            58318 (5.81%)     3049 (6.01%)     3
belief & thought                43626 (4.35%)     1225 (2.41%)     4
leisure                         82052 (8.18%)     2532 (4.99%)     7
Total                           1002821           50658            86

Table 14: Audience level
                                words             s-units          texts
Unknown                         9505 (0.94%)      363 (0.71%)      1
Low                             172777 (17.22%)   11564 (22.82%)   22
Medium                          568876 (56.72%)   29136 (57.51%)   44
High                            251663 (25.09%)   9595 (18.94%)    19
Total                           1002821           50658            86

Table 15: Written medium
                                words             s-units          texts
Book                            616213 (61.44%)   31927 (63.02%)   45
Periodical                      272309 (27.15%)   12925 (25.51%)   24
Miscellaneous – published       59145 (5.89%)     3368 (6.64%)     8
Miscellaneous – unpublished     37033 (3.69%)     1382 (2.72%)     6
To-be-spoken                    18121 (1.80%)     1056 (2.08%)     3
Total                           1002821           50658            86

Table 16: Place of ...
                                words             s-units          texts
...                             ...99 (8.17%)     3950 (7.79%)     11
...                             258098 (25.73%)   11855 (23.40%)   23
...                             8580 (0.85%)      1493 (2.94%)     1
...                             18749 (1.86%)     0 (0%)           1
...                             635395 (63.36%)   33360 (65.85%)   50
Total                           1002821           50658            86

Table 17: Written sample type
                                words             s-units          texts
Unknown                         430623 (42.94%)   22146 (43.71%)   43
Whole text                      187955 (18.74%)   7987 (15.76%)    15
Beginning sample                170767 (17.02%)   12078 (23.84%)   14
Middle sample                   151063 (15.06%)   7772 (15.34%)    11
End sample                      26550 (2.64%)     0 (0%)           1
Composite                       35863 (3.57%)     675 (1.33%)      2
Total                           1002821           50658            86
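The percentages in these tables can be re-derived from the raw counts. The following is a minimal sketch in Python (the language and the function name `truncated_percentages` are choices made for this illustration, not part of the BNC distribution) that recomputes the word-count shares of Table 15 (Written medium). It assumes the published percentages are truncated to two decimal places rather than rounded. That is an inference from the figures themselves (59145 of 1002821 is about 5.898%, printed as 5.89%), not something the documentation states.

```python
# Word counts per written medium, copied from Table 15.
TABLE_15_WORDS = {
    "Book": 616213,
    "Periodical": 272309,
    "Miscellaneous – published": 59145,
    "Miscellaneous – unpublished": 37033,
    "To-be-spoken": 18121,
}


def truncated_percentages(counts):
    """Return each count's share of the total as a percentage string.

    Shares are truncated (not rounded) to two decimal places, which is an
    assumption about how the published figures were produced.
    """
    total = sum(counts.values())
    shares = {}
    for label, count in counts.items():
        # Integer arithmetic avoids floating-point surprises at the cut-off.
        hundredths = count * 10000 // total
        shares[label] = f"{hundredths // 100}.{hundredths % 100:02d}"
    return shares


if __name__ == "__main__":
    total = sum(TABLE_15_WORDS.values())
    for label, share in truncated_percentages(TABLE_15_WORDS).items():
        print(f"{label:30s}{TABLE_15_WORDS[label]:>8d} ({share}%)")
    print(f"{'Total':30s}{total:>8d}")
```

Because of the truncation, the printed percentages in a column can sum to slightly less than 100%. The same check can be run against any other table by swapping in its counts.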

Table 18: Written reception status
                                words             s-units          texts
Unknown                         262460 (26.17%)   12646 (24.96%)   24
Low                             226382 (22.57%)   9537 (18.82%)    19
Medium                          256448 (25.57%)   13622 (26.89%)   19
High                            257531 (25.68%)   14853 (29.32%)   24
Total                           1002821           50658            86

Table 19: Target audience sex
                                words             s-units          texts
Unknown                         280387 (27.95%)   14950 (29.51%)   28
Male                            20002 (1.99%)     0 (0%)           1
Female                          40288 (4.01%)     2227 (4.39%)     3
Mixed                           662144 (66.02%)   33481 (66.09%)   54
Total                           1002821           50658            86

Table 20: Written text time period
                                words             s-units          texts
1975-1993                       1002821 (100%)    50658 (100%)     86

2 Format of the BNC Sampler

The first edition of the BNC Sampler (1997) was distributed in SGML format; this version has been automatically converted to XML but no other changes have been made to the files. The original documentation for the SGML version is available from the BNC website at http://www.natcorp.ox.ac.uk/corpus/sampler/, and includes full information about the CLAWS part-of-speech tagging applied to the Sampler, including a description of the CLAWS system itself.

There are several differences between the format of the BNC Sampler (in its second edition) and the BNC XML Edition.

tagset for linguistic annotation
    As noted above, the BNC Sampler has been manually annotated with a more detailed tagset than the BNC XML Edition. More information about the tagset (CLAWS 7) used in the Sampler is available from the BNC website (http://www.natcorp.ox.ac.uk/corpus/sampler/guide C7.htm).

lemmatization
    Unlike the BNC XML Edition (and BNC Baby), the BNC Sampler has not been annotated with lemmatized forms of each word, nor does it include simplified part-of-speech tags (pos).

tokenization
    In the BNC Sampler, the words forming a multi-word unit are tagged together as one word. In the BNC XML Edition, multi-word units are marked using a <mw> element to enclose sequences of orthographic words, which are also tagged individually. For example, in the BNC Sampler the sequence ‘of course’ is marked up as follows:

        <w type="RR">of course</w>

    The multi-word unit is analyzed as an adverb and given one part-of-speech tag (RR). In the BNC XML Edition, however, the same sequence would be marked up as follows:

        <mw c5="AV0"><w c5="PRF" hw="of" pos="PREP">of </w><w c5="NN1" hw="course" pos="SUBST">course </w></mw>

    The multi-word unit is analyzed as an adverb (AV0) and marked using a <mw> element. Its two component parts are analyzed separately, and each is annotated with a detailed part-of-speech code (in the c5 attribute), a headword or lemma (in the hw attribute) and a simplified part-of-speech code (in the pos attribute).

structural markup
    There are several differences in the way XML markup has been applied between the BNC Sampler and the BNC XML Edition. The treatment of overlapping speech is different; the elements used to mark up structural divisions are different; the value ranges and names of some attributes have changed; many header elements have changed their names; and so on. We do not itemize these differences here, since the elements and attributes are fully documented in the complete User Reference Guides for the two corpora.

Most of the discussion of XML-specific matters in the documentation for BNC Baby applies equally to the XML version of the BNC Sampler. Most of the sample scripts provided, however, need modification.

3 Source Texts

Bibliographic details [1] of the files included in the BNC Sampler are as follows:

[1] The word counts given here are for the corresponding version of this text in the BNC XML Edition, and may thus vary somewhat from the count in the BNC Sampler itself.

[A7V] 8802 words from The Guardian, electronic edition of 1989-11-08: Foreign news pages. Guardian Newspapers Ltd London 1989
[A87] 11070 words from The Guardian, electronic edition of 1989-11-11: Foreign news pages. Guardian Newspapers Ltd London 1989
[A8J] 8071 words from The Guardian, electronic edition of 1989-11-23: Foreign news pages. Guardian Newspapers Ltd London 1989
[A8W] 10362 words from The Guardian, electronic edition of 1989-12-07: Foreign news pages. Guardian Newspapers Ltd London 1989
[A95] 10204 words from The Guardian, electronic edition of 1989-12-08: Foreign news pages. Guardian Newspapers Ltd London 1989
[A9E] 18288 words from The Guardian, electronic edition of 1989-12-10: Foreign news pages. Guardian Newspapers Ltd London 1989
[A9M] 11338 words from The Guardian, electronic edition of 1989-12-11: Foreign news pages. Guardian Newspapers Ltd London 1989
[A9V] 7203 words from The Guardian, electronic edition of 1989-12-13: Foreign news pages. Guardian Newspapers Ltd London 1989
[AA4] 8282 words from The Guardian, electronic edition of 1989-12-20: Foreign news pages. Guardian Newspapers Ltd London 1989

[AAB] 9884 words from The Guardian, electronic edition of 1989-12-21: Foreign news pages. Guardian Newspapers Ltd London 1989
[AAK] 9498 words from The Guardian, electronic edition of 1989-12-22: Foreign news pages. Guardian Newspapers Ltd London 1989
[AAT] 6977 words from The Guardian, electronic edition of 1989-12-31: Foreign news pages. Guardian Newspapers Ltd London 1989
[AEA] 26515 words from Tomorrow. Taylor, Elizabeth Russell Peter Owen Publishers London 1991 52-137
[ALS] 4149 words from Captain Pugwash and the huge reward. Ryan, John Gungarden Books Rye, East Sussex 1991 4-43
[AP6] 1879 words from Monster Raving Loony Party’s draft manifesto for General Election 1992. u.p.
[APJ] 3440 words from Report on visit to Peto Institute. Eccleshall, J Davis, J u.p.
[B2E] 25386 words from Oh! sister I saw the bells go down. Saunders-Veness, Frances The Book Guild Ltd Lewes, East Sussex 1989 7-73
[BMJ] 14268 words from Channel tunnel. Grayson, Leslie The British Library Board London 1990 1-103
[BP6] 6553 words from Welcome to Somerset. u.p.
[C9C] 14796 words from The Gardener. Maxwell Consumer Magazines London 1992-12, 1991-03
[CAA] 6295 words from New Millennium summer holidays. u.p.
[CBB] 18566 words from The myths and legends of Stamford in Lincolnshire. Smith, Martin Paul Watkins Stamford, Lincs 1991 15-108
[CCD] 39460 words from The child bride. Wiat, Philippa Robert Hale Ltd London 1990
[CDH] 9416 words from Hair Flair. Shaws Publications Ltd London 1992 4-58
[CEL] 20709 words from Today. News Group Newspapers Ltd London 1992-12
[CF5] 3393 words from East Anglian Daily Times. East Anglian Daily Times Company Ipswich 1993-03
[CF6] 5958 words from East Anglian Daily Times. East Anglian Daily Times Company Ipswich 1993-03
[CF7] 480 words from East Anglian Daily Times. East Anglian Daily Times Company Ipswich 1993-03
[CF8] 12806 words from East Anglian Daily Times. East Anglian Daily Times Company Ipswich 1993-03
[CF9] 41587 words from East Anglian Daily Times. East Anglian Daily Times Company Ipswich 1993-03
[CHP] 14040 words from Queen Mary’s dolls’ house. Stewart-Wilson, Mary The Bodley Head London 1989 10-190
[CHR] 10339 words from Return of the red nose joke book. Green, Rod Boxtree London 1991
[CL8] 8431 words from Computergram international. u.p.

[CN4] 21989 words from The Artist: a magazine giving instruction in all branches of art. The Artist Publishing Company Ltd Tenterden 1992 7-49
[DCH] 15274 words from Amnesty International meeting
    DCHPS000 unspecified
    DCHPS001 unspecified
    DCHPS002 unspecified
    DCHPS003 unspecified
    DCHPS004 unspecified
    DCHPS005 unspecified
    DCHPS006 unspecified
    DCHPS007 unspecified
    DCHPSUNK Unknown speaker, other
    DCHPSUGP Group of unknown speakers, other
[EAP] 6292 words from [New Oxford English Dictionary procedures documents] u.p.
[EBK] 6533 words from Action. World Assoc for Christian Comm 1991-07/1993-02
[EVR] 11176 words from Egyptian gods and myths. Thomas, Angela P Shire Publications Ltd UK 1989 6-60
[EVY] 16693 words from Manpower solutions. Dean, Derek J Scutari Projects Ltd UK 1987 1-60
[EW4] 27383 words from Proportional representation: which system? Sykes, Leslie The Hornbeam Press Leicester 1990 1-76
[EX7] 3948 words from Dear Green Place [from Truth, dare or promise] Riley, Denise Virago Press Ltd London 1985 237-248
[F71] 6772 words from A poet’s response to the pictures of Gauguin: [Picture appreciation lesson]
    F71PS000 unspecified
    F71PS001 unspecified
    F71PS002 unspecified
    F71PS003 unspecified
    F71PS004 unspecified
    F71PS005 unspecified
    F71PS006 unspecified
    F71PS007 unspecified
    F71PSUNK Unknown speaker, other
    F71PSUGP Group of unknown speakers, other
[F77] 4930 words from [Etching lesson]
    PS1L3 46, Andrew, teacher
    PS1L4 45, teacher
    PS1L5 14, Kevin, student
    F77PSUNK Unknown speaker, other
    F77PSUGP Group of unknown speakers, other
[F7G] 5736 words from [Teachers’ conference: discussing assessment procedures]
    PS1M4 40 , Andrew, teacher
    PS1M5 30 , Angela, teacher
    PS1M6 30 , Paul, teacher
    PS1M7 40 , Rod, teacher
    PS1M8 50 , Don, teacher
    PS1M9 40 , Alan, teacher
    PS1MA 30 , Terry, teacher
    F7GPSUNK Unknown speaker, other
    F7GPSUGP Group of unknown speakers, other
[F7J] 11165 words from [COHSE/NALGO/NUPE/meeting]
    F7JPS000 unspecified
    F7JPS001 unspecified
    F7JPS002 unspecified
    F7JPSUNK Unknown speaker, other
    F7JPSUGP Group of unknown speakers, other
[F86] 9128 words from [Church of Scotland: Meeting on rules and regulations]
    PS1NE Hugh, moderator
    PS1NF Mr Boyd
    PS1NG Mr Torrence
    PS1NH Mr Forrester
    PS1NJ Mr McGilvery
    F86PS000 unspecified
    F86PS001 unspecified
    F86PS002 unspecified
    F86PS003 unspecified
    F86PSUNK Unknown speaker, other
    F86PSUGP Group of unknown speakers, other
[F98] 5602 words from Computers and the humanities. Kenny, A u.p.
[F9M] 8430 words from City psalms. Zephaniah, B Bloodaxe books ltd Newcastle upon Tyne 1992 11-64
[FA4] 7294 words from Further developments of the electronic book. Feldman, Tony BNBR London 1991
[FB4] 21440 words from The history of Siberia: from Russian conquest to revolution. Wood, Alan Routledge & Kegan Paul plc London 1991 1-91
[FCF] 3625 words from The Weekly Law Reports 1992 Volume 3. u.p.

[FEJ] 17099 words from Model financial statements for public and private companies. Stoy Hayward Butterworth & Company (pub) Ltd UK 1990 1-115
[FL6] 4863 words from Eating disorders: television discussion
    FL6PS000 unspecified
    FL6PS001 unspecified
    FL6PS002 unspecified
    FL6PS003 unspecified
    FL6PS004 unspecified
    FL6PS005 unspecified
    FL6PS006 unspecified
    FL6PS007 unspecified
    FL6PS008 unspecified
    FL6PSUNK Unknown speaker, other
    FL6PSUGP Group of unknown speakers, other
[FLK] 5302 words from Young women in Scotland: television discussion
    FLKPS000 unspecified
    FLKPS001 unspecified
    FLKPS002 unspecified
    FLKPS003 unspecified
    FLKPS004 unspecified
    FLKPS005 unspecified
    FLKPS006 unspecified
    FLKPS007 unspecified
    FLKPS008 unspecified
    FLKPSUNK Unknown speaker, other
    FLKPSUGP Group of unknown speakers, other
[FLS] 10830 words from General Portfolio health and safety meeting
    PS1PT 38, Roger, first aid representative
    PS1PU 47, Roger, first aid representative
    PS1PV 36, Peter, first aid representative
    PS1PW 32, Katie, first aid representative
    PS1PX 24, Dianne, first aid representative
    PS1PY 28, Suzanne, first aid representative
    PS1R0 58, Norman, first aid representative
    PS1R1 33, Carmel, first aid representative
    PS1R2 26, Steve, first aid representative
    FLSPSUNK Unknown speaker, other
    FLSPSUGP Group of unknown speakers, other
[FLU] 4243 words from Albert Gunter: sermon
    PS1RD Albert, minister
[FLY] 6227 words from 11th year science lesson on chemistry of metal processing
    PS1RS 43, Tony, teacher
    FLYPSUNK Unknown speaker, other
    FLYPSUGP Group of unknown speakers, other
[FM4] 11203 words from Tutorial lesson: GCSE maths tutoring session
    PS1S9 50, John, tutor
    PS1SA 16, Andrew, student
    FM4PSUNK Unknown speaker, other
    FM4PSUGP Group of unknown speakers, other
[FM7] 11058 words from Strangers - talk by PC Bruce: Talk/presentation
    PS1SF pc bruce, police officer
    FM7PSUNK Unknown speaker, other
    FM7PSUGP Group of unknown speakers, other
[FMP] 15376 words from Planning and development in York: greenbelt planning - public enquiry
    PS1TW 55, John, department of the environment adjudicator
    PS1TX 58, Harry, deputy chairman
    PS1TY 64, George, barrister
    PS1U0 46, barrister
    PS1U1 30, barrister
    PS1U2 40, barrister
    FMPPS000 unspecified
    FMPPS001 unspecified
    FMPPS002 unspecified
    FMPPSUNK Unknown speaker, other
    FMPPSUGP Group of unknown speakers, other
[FMS] 11933 words from Legal advice: pre-retirement course
    PS1UD 50, solicitor
    FMSPSUNK Unknown speaker, other
    FMSPSUGP Group of unknown speakers, other
[FR2] 21677 words from An introduction to rural geography. Gilg, A Routledge & Kegan Paul plc London 1989 67-137
[FRY] 9266 words from The railway children: Oxford Bookworms edition. Nesbit, E Escott, John Oxford University Press Oxford 1993
[FSB] 8817 words from The star zoo. Gilbert, H Oxford University Press Oxford 1992 1-55
[FU0] 16355 words from Dog-whelks: an introduction to the biology of nucella. Crothers, J H Field Studies Council UK 1985

[FU6] 23216 words from Rosencrantz and Guildenstern are dead. Stoppard, Tom Faber & Faber Ltd London 1986 9-93
[FU7] 8185 words from Revolt in Roundhay [excerpt from Truth, Dare or Promise] Rowbotham, Sheila Virago Press Ltd London 1985
[FU9] 2961 words from Chaos. Muhamad, M A Holden, M V Manchester University Press Manchester 1987 15-33
[FUG] 11104 words from Management training course
    PS1U3 50 , Gordon, training manager
    PS1U4 Brain
    PS1U5 Mike
    PS1U6 unspecified
    PS1U7 Philip
    PS1U8 Anthony
    PS1U9 Thomas
    PS1UA Jane
    FUGPS000 unspecified
    FUGPSUNK Unknown speaker, other
    FUGPSUGP Group of unknown speakers, other
[FUH] 11792 words from Tutorial lesson: junior-level maths
    PS1UE 50 , John, tutor
    PS1UF 9, Kerry, student
    FUHPSUNK Unknown speaker, other
    FUHPSUGP Group of unknown speakers, other
[FUT] 8564 words from Presentation on consumer rights
    PS1VF 60, retired trading standards officer
    FUTPSUNK Unknown speaker, other
    FUTPSUGP Group of unknown speakers, other
[FUU] 8033 words from Talk on fire prevention
    PS1VG 55, Jack, retired fire prevention officer
    FUUPS000 unspecified
    FUUPSUNK Unknown speaker, other
    FUUPSUGP Group of unknown speakers, other
[FX5] 9730 words from Radio Forth: radio broadcast
    PS223 David, disc jockey, Other participants are radio listening phone ins.
    FX5PS000 unspecified
    FX5PS001 unspecified
    FX5PS002 unspecified
    FX5PS003 unspecified
    FX5PS004 unspecified
    FX5PS005 unspecified
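
The tokenization difference described in section 2 can be made concrete with a short script. The following is a minimal sketch in Python using the standard library's `xml.etree.ElementTree`; it is not one of the sample scripts distributed with the corpus. The function names `sampler_units` and `xml_edition_units` are hypothetical, the two fragments are the 'of course' examples from section 2, and each fragment is wrapped in an `<s>` element here only so that it parses as a single-rooted document.

```python
import xml.etree.ElementTree as ET

# Markup fragments taken from the 'of course' example in section 2.
SAMPLER_FRAGMENT = '<w type="RR">of course</w>'
XML_EDITION_FRAGMENT = (
    '<mw c5="AV0">'
    '<w c5="PRF" hw="of" pos="PREP">of </w>'
    '<w c5="NN1" hw="course" pos="SUBST">course </w>'
    '</mw>'
)


def _text(element):
    """Return an element's own text with surrounding whitespace removed."""
    return (element.text or "").strip()


def sampler_units(fragment):
    """Sampler style: each <w> is one unit, tagged via its type attribute."""
    root = ET.fromstring(f"<s>{fragment}</s>")  # wrapper added for parsing
    return [(_text(w), w.get("type")) for w in root.iter("w")]


def xml_edition_units(fragment):
    """XML Edition style: <mw> groups individually tagged <w> elements.

    Returns (text, c5 tag, component words) for each top-level unit.
    """
    root = ET.fromstring(f"<s>{fragment}</s>")  # wrapper added for parsing
    units = []
    for child in root:
        if child.tag == "mw":
            parts = [(_text(w), w.get("c5")) for w in child.iter("w")]
            text = " ".join(word for word, _ in parts)
            units.append((text, child.get("c5"), parts))
        elif child.tag == "w":
            units.append((_text(child), child.get("c5"), []))
    return units


if __name__ == "__main__":
    print(sampler_units(SAMPLER_FRAGMENT))
    print(xml_edition_units(XML_EDITION_FRAGMENT))
```

Both functions yield a single unit for 'of course', but only the XML Edition version also exposes the component words with their own tags. A script written for one corpus therefore needs its element and attribute names adjusted (`type` versus `c5`, and handling of `<mw>`) before it can be used on the other.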
