INTERNATIONALIZED DOMAIN NAMES-MALAYALAM -

2y ago
22 Views
2 Downloads
862.64 KB
17 Pages
Last View : 3d ago
Last Download : 3m ago
Upload by : Axel Lin
Transcription

Policy DocumentForINTERNATIONALIZEDDOMAINNAMESLanguage: KASHMIRI(Perso-Arabic)

Contents1. PERSO-ARABIC SCRIPTS: GENERAL INTRODUCTION . 31.1.OVERVIEW . 31.2. GENERAL STRATEGY FOR KASHMIRI . 32. RESTRICTION RULES . 73. LANGUAGE TABLE : KASHMIRI. 94. NOMENCLATURAL DESCRIPTION TABLE OF KASHMIRI LANGUAGETABLE . 105. VARIANT TABLE FOR KASHMIRI . 146. EXPERTISE/BODIES CONSULTED . 167. COUNTRY CODE TOP LEVEL DOMAIN (ccTLD) FOR KASHMIRI . 172

1. PERSO-ARABIC SCRIPTS: GENERALINTRODUCTION1.1.OVERVIEWThree languages in India use the Perso-Arabic script. These are Urdu, Sindhi andKashmiri1.Unlike Brahmi derived languages which are abujidas i.e. syllable driven, PersoArabic driven languages are abjads i.e. character based. The concept of the ISCIIsyllable has therefore no pertinence insofar as languages derived from the persoArabic script are concerned. Therefore, unlike Hindi or Tamil for example,Kashmiri has no Augmented Backus Naur Formalism (ABNF). HoweverKashmiri does admit restriction rules as given in Section 5 below. The templatefor Perso-Arabic derived languages admits only the Code-chart with the pertinentcharacters marked in yellow, the corresponding nomenclatural table as well as thevariant list.1.2. GENERAL STRATEGY FOR KASHMIRIOf all the Indian languages, the Perso-Arabic script represents the greatest amountof difficulties and also chances of spoofing and phishing. This is because of theintrinsic nature of the script which has a large degree of homographs and also thefact that Unicode Code block U 600 caters to a large number of scripts and thereis a large degree of resemblance between two or more characters.To simplify the problem and ensure that as far as possible spoofing, pharming andphishing will be reduced to a bare minimum, following principles have been laiddown:1.2.1. MAPPING IN CONSONANCE WITH THE POLICY LAID DOWNBY GOVT. OF INDIA HTTP://www. will always remain in English. The domain names and ccTLDwhich will remain in KASHMIRI. It is assumed that the Bidi algorithm built into the browser used should handlethe problem of English and KASHMIRI efficiently. Script vs. Language: Unicode Code block U 600 caters to a large number oflanguages. Only the pertinent character set for KASHMIRI shall be used.1Kashmiri is also written in the Devanagari script.3

No mixing of two languages will allowed inside the ccTLDPunctuation markers of Code-page KASHMIRI shall NOT be permitted.Thus, the Latin full-stop shall be used instead of the correspondingKASHMIRI punctuation marker.Similarly English Hyphen will be used and not the correspondingKASHMIRI hyphen.All digits will be from the Latin Set i.e. 0,1,2,3,4,5,6,7,8,9 and not the IndoArabic digit set as prescribed in the Code-page for Arabic.ZWJ and ZWNJ shall not be permitted.Space (a major issue in Perso-Arabic scripts) shall not be permitted within theURL.1.2.2. DIRECTIVE PRINCIPLES SPECIFIC TO KASHMIRI:1. PRINCIPLE I: The permissible Character SetThe KASHMIRI code-set will be defined and isolated from the Arabic pagei.e. only those characters which are permissible in KASHMIRI will beretained. Since Unicode Code block U 600 is highly liable to spoofing, thechoice of the character-set pertinent to KASHMIRI alone will reduce spoofingand phishing.2. PRINCIPLE II: Identification of Characters liable to Spoofing.Characters liable to cause spoofing shall be identified and treated as variants.These will also include normalization.3. PRINCIPLE III: Diacritics reduced to a bare minimumAs far as possible, all diacritics will be eliminated from the set. Only the mostimportant and pertinent diacritics shall be retained. These are the shadda andthe sub-script and super-script aleph.ٖ ٰAlif Madd and Hamza Characters (both normal and wavy) most frequentlyused in KASHMIRI are as under and these will be admitted to the permissibleset. إ ۓ ؤ ئ آ ۂ أ ٲ Their corresponding combinations shall be treated as variant. Thus 0622 آ canalso be entered as )0627( ا followed by ٓ(0653) in some KASHMIRI keyboards4

and it is to resolve this alternative mode of entry that such normalization ispermitted in the shape of a variant.4. PRINCIPLE IV: EZAFATA serious issue will be that of the ezafat in words such as bazm-e-adab or ab-ehayat or bu-e-gul. As a palliative suggestion, it is suggested that the ezafat berepresented by ye baree or he-gol hamza separated by a hyphen as in the examplesbelow: خدا - ے - یاد - ے ہوا - و - آب - و فجر - سورۂ - ۂ 5. PRINCIPLE V: Visual Identity of the Word: The case of Space between twowords within a URL.Since a large number of characters in Perso-Arabic can join together unlessseparated by a space, Space is a cardinal issue in all Perso-Arabic drivenlanguages. Space ensures visual identity. Since Space is not permissiblewithin a URL, visual identity where two words constitute a URL constitutes amajor issue.A palliative to this issue would be the use of the hyphen to separate two wordsand thereby ensure legibility.Thus in the case of a site for a mango pickle: aam achar which when writtentogether would be illegible. آمآچار The solution would be to separate out the two words with a hyphen as shownbelow. آچار - آم 6. PRINCIPLE VI: Use of Naskh instead of Nastalique in the URL5

Nashq is more visually clear and reduces also spoofing and pharming becauseof clear legibility of the joining characters.6

2. RESTRICTION RULESThe following restriction rules specific to Kashmiri shall apply. By restriction rules ismeant combinations which though theoretically possible in the language, aredisallowed by the norms of the language.Kashmiri admits the following constraints:1. Kashmiri does not permit voiced aspirated forms for /b/ /t/ /g/ i.e. bi-labial, dentaland velar plosives. This means that ARABIC LETTER HEH DOACHASHMEE (06BE - )ھ shall not be followed by either ofa. ARABIC LETTER BEH ( 0628 -these characters - )ب b. ARABIC LETTER DAL ( 062F - )د c. ARABIC LETTER GAF ( 06AF - )گ d. ARABIC LETTER MEEM ( 0645 - )م e. ARABIC LETTER NOON ( 0646 - )ن 2. Following characters shall be permitted to come only after ARABIC LETTERALEF ( 0627 - )ا a. ARABIC MADDAH ABOVE ( 0653 b. ARABIC HAMZA BELOW ( 0655 -ٰ)ٰ)c. ARABIC WAVY HAMZA BELOW (065F - ٟ )d. ARABIC HAMZA ABOVE ( 0654 -ٔ )3. Following characters only shall be permitted to come before ARABIC HAMZA7

ABOVE ( 0654 -ٰ)a. ARABIC LETTER ALEF ( 0627 - )ا b. ARABIC LETTER WAW ( 0648 - )و c. ARABIC LETTER HEH GOAL ( 06C1 - )ہ d. ARABIC LETTER YEH BAREE ( 06D2 e. ARABIC LETTER FARSI YEH ( 06CC -8 )ے )ی

3. LANGUAGE TABLE : KASHMIRI22 Characters marked in yellow are not applicable to the language9

4. NOMENCLATURAL DESCRIPTION TABLE OFKASHMIRI LANGUAGE TABLEThe following are basic alphabetic characters for KASHMIRI, and will therefore beallowed.UnicodeValueCharacter NameCharacter0627ARABIC LETTER ALEF ا 0628ARABIC LETTER BEH ب 062AARABIC LETTER TEH ت 062BARABIC LETTER THEH ث 062CARABIC LETTER JEEM ج 062DARABIC LETTER HAH ح 062EARABIC LETTER KHAH خ 062FARABIC LETTER DAL د 0630ARABIC LETTER THAL ذ 0631ARABIC LETTER REH ر 0632ARABIC LETTER ZAIN ز 0633ARABIC LETTER SEEN س 0634ARABIC LETTER SHEEN ش 0635ARABIC LETTER SAD ص 10

0636ARABIC LETTER DAD ض 0637ARABIC LETTER TAH ط 0638ARABIC LETTER ZAH ظ 0639ARABIC LETTER AIN ع 063AARABIC LETTER GHAIN غ 0641ARABIC LETTER FEH ف 0642ARABIC LETTER QAF ق 0644ARABIC LETTER LAM ل 0645ARABIC LETTER MEEM م 0646ARABIC LETTER NOON ن 0648ARABIC LETTER WAW و 0679ARABIC LETTER TTEH ٹ 067EARABIC LETTER PEH پ 0688ARABIC LETTER DDAL ڈ 0691ARABIC LETTER RREH ڑ 0698ARABIC LETTER JEH ژ 06A9ARABIC LETTER KEHEH ک 06AFARABIC LETTER GAF گ 11

06BEARABIC LETTER HEH DOACHASHMEE ھ 06C1ARABIC LETTER HEH GOAL ہ 06C3ARABIC LETTER TEH MARBUTA GOAL ۃ 06C4ARABIC LETTER WAW WITH RING ۄ 06CCARABIC LETTER FARSI YEH ی 06D2ARABIC LETTER YEH BARREE ے 06EAARABIC EMPTY CENTRE LOW STOPٰThe following combinations of base character and diacritic as a single character will alsobe allowed:UnicodeValue0622Character NameCharacterARABIC LETTER ALEF WITH MADDA ABOVE آ 0623ARABIC LETTER ALEF WITH HAMZA ABOVE أ 0624ARABIC LETTER WAW WITH HAMZA ABOVE ؤ 0625ARABIC LETTER ALEF WITH HAMZA BELOW إ 0626ARABIC LETTER YEH WITH HAMZA ABOVE ۂ 0651ARABIC SHADDAٰ0656ARABIC SUBSCRIPT ALEFٰٖ065FARABIC WAVY HAMZA BELOWٟ0670ARABIC LETTER SUPERSCRIPT ALEFٰ0672ARABIC LETTER ALEF WITH WAVY HAMZA ٲ 12

06C206D3ABOVEARABIC LETTER HEH GOAL WITH HAMZAABOVEARABIC LETTER YEH BARREE WITH HAMZAABOVE13 ۂ ۓ

5. VARIANT TABLE FOR KASHMIRIThe following variants are based on a single character combination which can be alsoentered as a combination of two characters. It should be noted that these variants havebeen admitted to accommodate keyboards where a single character representing acombination such as alif madd آ is not available and the user has to enter alif andmadd separately3VARIANTS ہ ۃ 06C106C3 آ ا ٰ06220627 0653 أ ا ٰ06230627 0654 ؤ و ٰ06240648 0654 إ ا ٰ06250627 0655 ۂ ی ٰ062606CC 0654 ۂ ہ ٰ06C206C1 0654 ۓ ے ٰ06D306D2 065414

Caveats Other characters distinguished by a single Nukta such as suad zuad have notbeen included, since this would have made the attribution of URL’s toorestrictive. All other cases are handled by the exclusive character set for KASHMIRI andabsence of diacritics.15

6. EXPERTISE/BODIES CONSULTEDExpertise provided by expert of KASHMIRI language from University of Kashmir.16

7. COUNTRY CODE TOP LEVEL DOMAIN (ccTLD)FOR KASHMIRIIndia (Bhārat) localized in Kashmiri بارت –(xn--mgbbh1a)17

This means that ARABIC LETTER HEH DOACHASHMEE ( 06BE - ھ) shall not be followed by either of these characters - a. ARABIC LETTER BEH ( 0628 - ب) b. ARABIC LETTER DAL ( 062F - د) c. ARABIC LETTER GAF ( 06AF - گ) d. ARABIC LETTER MEEM ( 0645 - م) e. ARABIC LETTER NOON ( 0646 - ن) 2. Follo

Related Documents:

Proposal to encode the END OF TEXT MARK for Malayalam January 7, 2018 1 Introduction This is a proposal to encode the following character in the ‘Malayalam block in the BMP: Glyph Code Character name 0D53 MALAYALAM END OF TEXT MARK 2 Description The END OF TEXT MARK is written at the end of texts, sections and documents. It signifies the

Malayalam novels shaped the cultural memory of colonial modernity in Kerala. Keywords: literary history, Malayalam novel, Indulekha, politics of canon, colonial modernity in Kerala Introduction In the legacy of Malayalam literature, Indulekha (1889) by O. Chandu Menon holds the coveted position of the work that marks the birth of Malayalam novel.

141 510 english infotainment nat geo people yes 142 521 general hindi entertainment dd india yes . 171 607 regional malayalam amrita tv yes 172 609 regional malayalam kairali yes 173 611 regional malayalam asianet plus yes 174 612 regional malayalam kairali people yes

Malayalam Indic Input 3 supports various types of Keyboards. Click Settings Icon and select Keyboard. Choose keyboard layout of your choice form list of available keyboards. 4.1. Malayalam Transliteration User can type his message in Roman using the Standard English keyboard, which is transliterated on-the-fly to Malayalam.File Size: 748KB

knowing better English is essential (iv) further examinations are conducted in English (v) for continuing higher education and easy for higher education. The students 2 hailing from lower income prefers learning in Malayalam is easy and (ii) considers Malayalam is the mother tongue ,so there is a need for providing materials in Malayalam . 5.

Domain Cheat sheet Domain 1: Security and Risk Management Domain 2: Asset Security Domain 3: Security Architecture and Engineering Domain 4: Communication and Network Security Domain 5: Identity and Access Management (IAM) Domain 6: Security Assessment and Testing Domain 7: Security Operations Domain 8: Software Development Security About the exam:

mar thoma sleeha syro malabar cathedral issue 39 of 2016, year of mercy september 25, 2016 chicago sunday 8:00 am holy qurbana - malayalam 9:30 am holy qurbana - english 11:00 am holy qurbana - malayalam 5:30 pm holy qurbana - malayalam weekdays & saturday

Mechanical Engineering Aerodynamic Design of Vehicles Compute Aided Analysis & Design Analysis & Design. Mechanical Engineering Wind Energy Fuel Cell Energy. Mechanical Engineering UAV Space Shuttle Air & Space. Mechanical Engineering Robotics Utilities Systems. Civil Engineering Foundation Structural Analysis Analysis & Design. Civil Engineering Bridge Skyscraper Tunnel Construction . Civil .