Kanji Stories - Fu-berlin.de

1y ago

26 Views

3 Downloads

1.11 MB

50 Pages

Last View : 14d ago

Last Download : 3m ago

Upload by : Esmeralda Toy

Report this link

Download PDF

Transcription

漢字Kanji StoriesDevelopment of an Application for Learning Kanji CharactersNils GrabenhorstBachelor ThesisInstitut für InformatikFachbereich Mathematik und InformatikFreie Universität BerlinSupervisor: Prof. Dr. Raúl RojasBerlin, January 18, 2012

AbstractThis Bachelor Thesis presents an attempt to develop an application fortouch screen devices that helps students of the Japanese language to memorize Japanese Kanji characters. The character drawn by the student onthe touch screen will be veriﬁed by the application. The typical approachto Japanese character recognition is to use Nearest Neighbor Classiﬁers,while Hidden Markov Models are used less frequently. Since the purpose ofthe application under development is to teach the writing of the kanji, theimportant aspects of the feature vector have to be strictly checked, otheraspects of the character shape are not essential. The approach chosen forthe recognition algorithm is therefore to explicitly deﬁne each aspect to bechecked. The data available to the recognizer will also give future versionsof the application the opportunity to provide detailed feedback about theparticular aspect violated that caused a match rejection.

Contents1 Introduction22 333436Japanese Writing SystemKanji, Hiragana, Katakana, RōmajiReadings . . . . . . . . . . . . . . . .The Anatomy of Kanji Glyphs . . .2.3.1 Strokes . . . . . . . . . . . .2.3.2 Glyph Components . . . . . .3 Recognizing Characters3.1 A Brief Overview of Recognition Strategies3.2 Data Available . . . . . . . . . . . . . . . .3.3 Variance . . . . . . . . . . . . . . . . . . . .3.4 Avoiding Frustration . . . . . . . . . . . . .3.5 Character Composition . . . . . . . . . . .4 Stroke Shapes4.1 Angles, Curvatures . . . . . . . . . . . . . . .4.2 Recognizing Stroke Shapes: Angle Automaton4.3 Straight Strokes . . . . . . . . . . . . . . . . .4.4 Deﬁning Stroke Shapes . . . . . . . . . . . . .4.5 Adding Glyph Components to a Glyph . . . .5 Component Locations5.0.1 Addressing Strokes5.1 Stroke Sections . . . . . .5.2 Target Polygon . . . . . .5.3 (Very) Big Finger Assistant.6 Evaluation406.0.1 Additional Future Work . . . . . . . . . . . . . . . . . 447 Conclusion441

1 IntroductionThe Japanese language is largely considered very diﬃcult to learn. It is verydiﬀerent from western languages and has some unique grammatical constructions that are unfamiliar. The cultural background the language is embedded within is also quite foreign. The Interagency Language Roundtable ofthe U.S. government has classiﬁed languages according to their diﬃculty fornative english speakers to learn them, ranging from the “Category I languages that closely cognate with English” such as Spanish and French toCategory III languages including Chinese and Japanese that are “Exceptionally diﬃcult for native English speakers to learn”. According to the U.S.Foreign Service Institute, “Limited working proﬁciency”1 can be achieved forCategory I languages after 575-600 class hours. To achieve the same levelof proﬁciency in Japanese would take approximately 2200 class hours.[JK],[Rub98]Apart from unfamiliar grammar, vocabulary, idioms, and cultural background, the Japanese writing system poses a signiﬁcant challenge to anystudent willing to learn the language. While the “alphabets” Hiragana andKatakana are quite easy to master, the logographic Kanji characters are aformidable obstacle in itself. Thousands of seemingly abstract glyphs haveto be memorized, both for writing and reading. Any help breaking down thetask into manageable pieces and oﬀering memory aids will be greatly appreciated. By guiding the student through the kanji in an order that teachesbasic kanji ﬁrst, then kanji that are assembled from previously memorizedkanji, the task of memorizing them becomes much more manageable thanclassic methods of learning the glyphs by rote and repetition.Many kanji look very similar, and for instance once the shape of 未has been memorized, another glyph will inevitably come up causing greatconfusion, such as 末. The meanings are very diﬀerent, the former is un-,not yet, hitherto, still, even now, sign of the ram; the latter is end, close,tip, powder, posterity. The subtle variance in shape is only noticeable ifboth glyphs can be compared next to each other: the length of the topmosthorizontal stroke relative to the one below is diﬀerent. Having memorized theﬁrst character days or even weeks before encountering the second character,1“Able to satisfy routine social and limited oﬃce needs and to read short typewrittenor printed straightforward texts.”[JK]2

the latter will inevitably overpower the memorized ﬁrst character becausethe student will unlikely notice the subtle diﬀerence in shape and, obliviousto the problem at hand, will likely not develop a learning aid to keep theconfusing characters apart. A good teacher will have to point out pitfallssuch as these.The learning technique outlined in [Hei01] deconstructs each kanji intoits primitive elements; these are either strokes, or simpler kanji that havebeen memorized before. These primitive elements are then reassembledby creating a composite ideogram, which is an image or a story tying theprimitive elements of the character together. Rather than memorizing thecharacter itself, its ideogram is memorized. The more imaginative, vivid,charming, disgusting or shocking the ideogram is, the more memorable itwill be. Recalling the ideogram will lend itself to reconstruct the writingof the associated glyph. This system does not yet take the readings intoaccount, but associating each kanji with its meaning is a large step forwardin achieving Japanese literacy.Each time a more complex character is studied, all of its components arereviewed as well, further solidifying previously acquired knowledge. Practicing to write each glyph while thinking about its ideogram will solidifythe connection between the two. Many students use ﬂash cards for studying kanji using variations of the Leitner system, a basic spaced repetitionlearning system where several stacks of ﬂash cards are used to keep trackof the proﬁciency. Stacks with new material is reviewed more often thanthose that represent the better known material. After a card has been reviewed successfully it can proceed to the next stack, the ones the student hasdiﬃculties with are returned to the ﬁrst stack. Each card has to progressthrough each stack at least once. While this works very well if done meticulously, sloppyness may cause problems such as memorizing the wrong strokeorder or diminished eﬃciency due to not drawing each glyph upon review.Merely visualizing a glyph is by far inferior to actually drawing it.Much research has gone into ﬁnding a good timing for spacing the intervals of reviewing information in order to maximize memorization eﬃciency.[JA08]Flash cards can be simulated as a computer program. The beneﬁts areobvious, since the computer can serve out the cards accurately, preventingcheating. Progress can be automatically kept track of even for each individ3

ual character so that characters the student has shown diﬃculty with maybe served more often for review than others. Characters that have not beenreviewed for a long time may be inserted higher up in the review queue.Many people nowadays have a computer they take anywhere they go: theirsmartphone. If there is a kanji studying application installed, idle timesduring the day can be used conveniently to quickly review a few kanji.On the iOS platform, numerous kanji learning applications exist. Someare direct implementations of ﬂashcards systems2 , others oﬀer little gamesasking the student to select one of the shown glyphs matching the supplied meaning3 . Some applications allow the student to write charactersby tracing them on-screen or drawing them on an empty screen ﬁrst, thencomparing them to the correct character4 . One application lets the studenttrace a character, then the program checks whether the stroke order wascorrect5 .As of December 2011, no application seems to exist on iOS that oﬀersa spaced repetition system-based kanji study application using the touchscreen for kanji drawing with character recognition. The aim of this bachelor thesis is to study the feasibility of developing such an application and toimplement a prototype. The emphasis will be on implementing a characterrecognition algorithm which will serve two purposes, one technical and onepsychological. The technical advantage is that the results of the characterrecognition can be applied to determine the spacing and frequency of thecharacter review, presenting characters that the student previously had dif2 ‘StickyStudy: Japanese’ by Justin Nightingale ‘Kanji’ by Lima Sky ‘Kanji Flip’ by Proﬃtt Ink3 ‘Kanji Pop’ by Lima Sky ‘JLTP Study’ by Mathias Navne4 ‘Kanji LS Touch’ by Jan Bogner ‘Remembering the Kanji’ by Mirai LLP5 ‘iKanji touch’ by ThinkMac Software4

ﬁculties with earlier and more often. On the other hand trying to draw acharacter with almost instant feedback will likely be more enjoyable than astatic ﬂashcard-based system.The scope of the application is as follows:1. The application presents an area for drawing kanji characters using aﬁnger on a capacitive touch screen.2. The application asks the student to draw a speciﬁc kanji by displayingthe keyword meaning of the kanji.3. If the kanji is presented to the student for the ﬁrst time, a diﬀerentview is shown, displaying an animation of the writing of the glyph. Ifthe kanji is composed of other kanji, the keyword meanings for theseare also shown. The student will have the opportunity to enter anideogram that is helpful in memorizing the kanji.4. After the student has drawn the glyph, the software checks if the inputis correct. Each kanji object stores a level value. The number of thelevel value correlates to the number of times the kanji has been drawnsuccessfully in a row. If the input is correct, the kanji gets promotedby incrementing the level value, otherwise it will be set back to 1.5. If the student does not know how to write the kanji, he or she maypress a “peek” button to see the same view as in 3 above. Upon doingso, the level of the character is set back to 1.6. Characters of levels 0 to 5 are active and presented to the student.Characters of level 0 are new and will be introduced as described in3. Characters of level 6 are considered “known” and are currently notpresented.7. If the number of active characters drops under a threshold, new onesare taken into the active set. New characters will never contain components that have not been introduced before. Additionally a few ofthe known characters are randomly chosen and added to the active setby setting their levels to 5. This will ensure they are reviewed fromtime to time. Instead of choosing them randomly, more elaborate waysof selecting them may be introduced later, such as the longest time5

since the last review, or the least favorable ratio of successful to failedreviews.2 The Japanese Writing System2.1 Kanji, Hiragana, Katakana, RōmajiThe Japanese writing system uses four distinct sets of characters: Kanji, Hiragana, Katakana and Rōmaji. The following sentence6 is a simple exampleof how each set of characters is used for a speciﬁc �This sentence means, “I went to Tokyo by train with Mr. Miller.”:kinyōbimi金曜日rasantodeミラさんとMr.MillerFridaywithTōkyōeiJRで東京へJR byTokyo tokimashita行きました。wentJapanese writing rarely makes use of spaces to separate words, however theuse of diﬀerent character sets helps breaking sentences apart.RōmajiRōmaji are letters of the Roman alphabet. They are used to transliterateJapanese to target non-Japanese readers. Occasionally Rōmaji is used inconjunction with the other Japanese character, typically for acronyms andcompany or brand names as is the case here for JR, the Japanese ��た。KanaKana are two sets of syllabic characters, Katakana and Hiragana, containing 48 basic characters each. While both have been derived from Kanji,none of these characters carry any meaning. Instead, each character is a6The small type above each Japanese glyph is a transliteration to Roman writing.Sometimes similar transliterations to hiragana glyphs can be found above rare and obscurekanji. This reading aid is known as Furigana or Ruby.6

phonetic code not unlike the characters of the Roman alphabet. Being syllabic however, each character roughly equates a phonetic syllable, with theoccasional second or third character used to modify the pronunciation of thesyllable. The correlation between sound and symbol is in fact even closerthan between letters and sounds of the English or German language. [Got05,p.81]KatakanaKatakana are most commonly used for loan words and foreign names.Other uses are for onomatopoeia7 or for adding emphasis not unlike the useof italics in western writing. In the sample sentence, katakana is used forwriting an english surname, Miller. Since the amount of available syllables in Japanese is considerably lower than in European languages, perfecttransliterations rarely exist as can be observed in this �HiraganaParticles and inﬂections are written using hiragana. Hiragana are phonetically equivalent to katakana. Occasionally nouns and the stems of adverbs,adjectives and verbs of Japanese origin are also written using hiragana, either if no kanji exists, or if the characters are rare and �KanjiKanji are logographic characters. Each kanji is composed of a number ofstrokes and denotes meanings or concepts rather than a phonetic property.Sometimes one character correlates to one word, but they are often combined with one or even more other kanji to form a compound word. Thekanji system was borrowed from China in the 6th century and the charactershave largely remain unchanged, however there are a few rare cases of char7Words imitating sounds7

acters unique to Japan (Kokuji). While many thousands of kanji exist8 , amuch smaller subset is in common use. The Japanese Ministry of Educationhas assembled a list of 2,136 jōyō kanji 9 for regular use. The jōyō kanji areC0C06CJK Strokes31EFthe ones that Japanese students have to learn in elementary, junior highand high school. The 1,006 characters taught in elementary school comprise!"#!" !"%about 90 percent of the characters found in newspapers. [Got05, p.82]&耹 p !"# kinyō!". bimi!"/ �JRで東京へ行きました。"耺 q !"#"'!"/"耻 r 2.2 Readings!"#%!!"."!".%!"/%耼 s Kun’yomiand On’yomi!"#!!".!!"/!Each Kanji character typically has several readings, of which on’yomi are(耽 tthereadingsborrowed from the Chinese language; kun’yomi are the Japanese!"#&!".&readings.CJKStrokes)!"#*for a particular instance of a character耿 u The reading to be chosen31EFis !"#'to be!".'determined by the reader taking into account the group of kanji and!" that!"% form the kanji compound for a word. Context may also play akanav聂p 耹 role. 聃 w Anatomy of Kanji Glyphsq The耺 2.3, 聆 x2.3.1 Strokes耻 r y glyph is composed of strokes that can be further broken down to聈 kanji- Each shapes and more complexs耼CJKbasicstrokestrokes. The basic stroke shapes are:Strokes31EFCJK聉Strokes31EFz.Dot A very short dasht耽!"#!" !"%/ 聊!"#!" {!"%A stroke that is written from left to rightu Horizontal耿 p耹 p耹# 聋CJK Strokes31EF!"#(&!"# "!"#"'!"#%!!"#!(!"#&)&&*"" '',!!((.))/**# , ,!".(!". !"/ !"#)!".)!"."!"/"!"#*!".*!".%!"/%!"# !". !".!!"/!!"#,!".,!".&!"#-!".-!".'!". !". !"##!"/ !"/ %!"#%!".)!".%!".%!"# !"#/!"/%!"/%!". !"./!"/ /"!"# !"#&!"#&!". !".&9!".&!"#%!"#'!"# !"# v Vertical聂 q耺A stroke that is written downward q耺聍!"# }!" !"%w Rise聃 r耻 r耻%& 聎Written left to right, rising upp耹 x Press聆 s耼 s耼0" 耺qDown Written left to right, falling iten lists more than 50,000 kanji characters.[Wik]t耽 r耻' UnicodeTheCopyright 1991-2010 Unicode, Inc. All rights reserved.8as of!".%2010!"/%zs 聉u耿! 耼u耿{t聊v聂v聂( *!"#.!"#*!".*!".!".*!"#(!".( u聋w聃w聃) 耿x聆}v聍x聆* 聂!"/!8

!"# C06!"#""'!". !"."CJK!"/ !"/"Strokes31EF耺 q Strokes 耻CJKr!"#"!"."!"#!".%31EF!"/"!" !"/% !"%Strokes31EF耻 rCJK !"# 耹!" p!"% ! 耼& s! 耼 s !" !"%p & 耹 !"#" 耺 q ( 耽 tt( 耽p & 耹q " 耺' 耻 r ) )耿u Away Written right to left, falling.耿u' 耻 r Throwq "! 耺耼 s31C0More complex strokes are combinations of basic* 聂 vs 耼v* !聂耽 tshapes,using four ways to combine them:r 耻'( stroke 聃 w( 耽 t 聃) wu A sharp turn, most often at a right耿 Break! 耼 s , 聆 x angle. The direction is either to the right or) 耿 uv The stroke to the left is composed of聂 down., 聆* x( 耽y t- 聈* 聂 v three basic strokes with two breaks. 聃 w- 聈 y Hook Similar to the break, this is a sharpz. 聉) 耿w u 聃x The direction is either to the left or, 聆 turn. /聉{聊z, 聆 x down. The hook shown is attached to a vervstroke, therefore it points to the left.*- 聂y聈 tical# 聋 y聈{/ -聊z聉 Bendw A stroke section typically curving . 聃聍 } counter-clockwise to the left or clockwise to. 聉 z# 聋/ 聊 { right.x, 聆 the% 聎 / 聊 { A stroke section typically curving聋 Slant 聍# }聈0 蒉 y# 聋 clockwise to the left or counter-clockwise to 聍 }the right.% 聎 The Unicode Standard 6.0, Copyright 1991-2010 Unicode, Inc. All rights reserved. 聉} z 聍聎 1 % Fig.lists the basic and compound strokes used10 characters. Even though the number of 0 %蒉for CJK{聎/0 聊蒉 is very large, they are composed of a es.ThestrokesThe UnicodeCopyright Inc. Allrightsreserved. are 6.0,number0 蒉StandardThe Unicode Standard 6.0, Copyright 1991-2010 Unicode, Inc. All rights reserved.!"#%'!"#%!"#!!"#!!"# !"#&!"#&!"#"!"#'!"#'!"#%!"#(!"#!!"#(!".%!"/%!"# !". !".!!".!!". !"/!!"/ !"#"!"."!"#)!"#&!".&!"."!"# !"/"!". !"#%!".%!".'!".'!".%!"/%!"#"!"#!!".!!".(!"#*!"# !"#(!"/!!"#&!".&!"#,!"#)!"#*!".*!"#&!". !".(!"##!"# * !"# !".0!".#-!"./.!".!".0聋 /!".#聍 }2.3.2 Glyph Components!"#.!".(!".)!".*!". !".,聊 {!"#-#!".- !".聎蒉 deﬁned order, even if writing them in a diﬀerent!".0order would result in a glyph 316of the same shape.0!".聎 !"#/!"./to other strokes. The strokes are written in a well!"#0%!".#聍 }!"#.Each stroke is in a well deﬁned location relative!"#/聋 !"##Each kanji is composed of a number of strokes.0!".'!".0conceptually very regularly shaped and distinctive.!"##%!".&聉 z!"#,!".0!"./!"/!聈 y!"# !".,!".!聆 x!"#*!".!"./!".!"#,,!". !"/%聃 w!"#)!".#!".%聂 v!"#(!".-!"/"耿 u!"#'!".,!"."耽 t!"#&)!"/ 耼 s !"#!!".(!". !". 耻 r !"#%!".*The Unicode Standard 6.0, Copyright 1991-2010 Unicode, Inc. All rights reserved. '!"%耺 q !"#"!".)!".!"#!"#0!"#0"!".(!" 31EF耹 p !"# !"./!".-!"#/!"#0!"#/!"/!!".!".,!"#.!"#/!".!!".#!". K Strokes!"/"!".!".,!"#)!"#.!"#,!"/!!". !"#(!"# !"#-!".%!"/ )!"# !"."!".!!".(!"#!!"#*!"#'!"/ !".&!"#%!"#)!"/!!"./蒉 !"#0!".0The Unicode Standard 6.0, Copyright 1991-2010 Unicode, Inc. All rights reserved.The UnicodeStandard6.0, Copyright 1991-2010All esare Unicode,alwaysInc.written1: CJK Strokes.31E3 is not used into-right, vertical strokes downward. Combined modern Japanese kanji;1031CB and 31CC areCJK is short for Chinese, Japanese and Korean.counted as two strokeseach. [uni10]9

strokes such as angled strokes or hooked strokes have to be written in onego. One might argue that not adhering to these rules may result in a glyphthat looks exactly the same. However, complying with these rules not onlyensures eﬃcient handwriting; the stroke order rules also increase legibilityin hand-written characters. Experienced writers will not always fully raisethe pen or brush from the paper when moving from the end of one stroke tothe beginning of the next one, hence generating an extraneous line or curvesconnecting the two (connective strokes). If the writer would not observe thestroke order, these extraneous curves would appear in a conﬁguration thatis unexpected to the reader, therefore obscuring the character written.In order to brieﬂy introduce an example for the composition of kanjicharacters, the character for mouth is口 and written using three strokes:The ﬁrst stroke is the vertical one on the left-hand side. After that, an angledstroke starting horizontally at the top left, then proceeding vertically to thebottom right corner is drawn. The horizontal stroke on the bottom comeslast. In order to memorize this character, one can view this as a pictographof a mouth wide open.As another example, consider the character for to say,言.Since strokesare usually written in the order left-to-right, top-to-bottom, the ﬁrst fourstrokes can be written intuitively beginning at the top, working downwards.The bottom part is actually a character that has been memorized before:口.This can be used to a great eﬀect in helping to memorize many characters,since complex characters are composed of simpler characters. Here we canliterally see four sound waves rising out of a talking mouth.Characters often appear as glyph components in more than one other語 is言, 五, and 口.character.the character for word, composed of three components:If one has learned these characters before, a “story” canbe attached to the character and its meaning, stringing the componentstogether. The more memorable the story is, the easier it is to memorizemeaning and character. The stroke order of the compound character isalmost always keeping the stroke order of the components intact. Comparedto less structured learning approaches, the eﬀort to learn a large number ofcharacters can be signiﬁcantly reduced by studying them in an order thattakes the fact that most kanji are composed of simpler components intoaccount. By attaching memorable stories to each character, the retentionrate can be further improved. [Hei01]10

3 Recognizing Characters3.1 A Brief Overview of Recognition StrategiesToday, two dominant approaches exist for on-line handwritten characterrecognition. For languages based on the roman alphabet or similar alphabets, most recognition engines use a Hidden Markov Model for each character to be recognized. Recognition engines for Japanese characters predominantly use nearest neighbor classiﬁers. These calculate the distance of theinput data to all of the reference data sets stored in a database. The shorterthe distance is, the more likely a match has been found. The distance tothe reference data, or template, is determined using a set of features, thefeature vector. Geometrical aspects of data such as length, angle, curvaturemay be extracted from the input data in order to obtain a feature vector.A popular feature is the directional feature, where local directional information is extracted from the data for the four major directions: horizontal,vertical, and the two diagonals. The area of the training data is then dividedinto a regular pattern of regions such as squares. For each region and eacheach major direction, the number of directional features in that region iscounted. The result is a histogram of training data that can be comparedto the histogram of input data.[JLN]Other features include the directional segment strength feature, the strokecount feature, as well as feature points of interest including intersections,endpoints and points of maximum curvature, among others. [Hil93]Elastic matching may be used to further improve the recognition rate.Elastic matching is an optimization problem in trying to best match the input data to the template data by warping the input data in both dimensions.This process allows to compute a distance that is invariant to deformation.[US05]Using Hidden Markov Models (HMMs) for recognizing kanji characters ispossible and has been done successfully, but the sheer amount of charactersposes a challenge in that for each character an HMM has to be generatedand supplied with suﬃcient training data. The feasibility of HMMs improves if each kanji character is considered a ‘word’ composed of a numberof strokes that act as ‘characters’. The number of diﬀerent stroke shapes isvery manageable. The strategy is to have a HMM for each possible strokeshape instead of one HMM out of thousands for each character. For typical11

l:Features:Histograms:Figure 2: Directional Features are often used in kanji recognizers. The directionalfeatures are the converted into a vector of histograms that can be compared against.Normalizing during preprocessing has to take care of ensuring correct positioningand proportioning of the input data.applications, challenges are the variance of stroke order and stroke count injapanese input that is much higher than the variance in letter order and lettercount in western writing11 . Stroke count and stroke order are an importantfeature in the scope of this thesis, since the purpose of the application is toteach correct writing. [JLN]The approach chosen for the recognizer is not to have a machine learningapproach. Instead, the feature data is entered explicitly. This makes itpossible to ensure that a particular feature that is very subtle and may bemissed by a learning algorithm due to in-class variance being considerablylarger than the between-class variance. The fact that stroke shapes are verywell deﬁned raises the hope that manually constructing recognizers for eachglyph is feasible, if laborious.The option of manually constructing HMM parameters was dismissedbecause the states of an HMM cannot be inspected by design. Instead, astate machine was developed that allows inspection in order to ﬁnd out thereason for a stroke not matching. This information is currently logged to theconsole, but may be used in the future to provide more ﬁne-grained feedback11A variance in letter order or letter count is a misspelling. A variance in stroke orderor count in a kanji may result a perfectly legible glyph shape not discernible from a kanjiwritten correctly.12

to the student as for the reason why the character drawn is wrong12 .Neither handwriting nor calligraphy is a scope of this project, hencestrokes are to be strictly separate from each other, simplifying the recognition problem signiﬁcantly.3.2 Data AvailableIn typical OCR applications the data available to the recognition algorithmis limited to a raster image of color values, usually black and white pixels.Other applications have access to on-line data recorded by a touchscreen,graphic tablet or similar device, simplifying the preprocessing signiﬁcantlyand also adding temporal data to the input. This project has access toon-line data due to touchscreen input method processing a series of touchevents. Touch events are received in the order they are drawn and can easilybe attributed to a certain stroke, since each stroke consists of exactly onetouch-down, exactly one touch-up and optionally one or more touch eventsin between.The goal of the recognizing algorithm is not to recognize a character,in fact the algorithm already knows the character the student is supposedto draw. Instead, the algorithm has to check the character drawn by thestudent against the template character and decide whether they match ornot.The data available is therefore,1. The geometry of each stroke, given in the order drawn, and2. The deﬁned properties of the template character.3.3 VarianceRecognizing Kanji characters can be challenging due to the fact that twodistinct characters may have a small between-class variance, but the withinclass variance between the same character handwritten twice or characterswritten in diﬀerent fonts can be very high. Compare the entirely diﬀerent12Feedback may include: The angle of a section of a stroke is wrong. A hook is missing on a particular stroke. A certain point on a stroke should align with another stroke.13

characters 未 and 末, yet 未 and 未 are both the same character in diﬀerentfonts. It will be essential for the recognizer to be aware of subtle diﬀerencessuch as these since it is the goal to teach the student to take care about suchimportant aspects when writing kanji. In the above example, if the studentis asked to write 未, yet the top horizontal stroke written extends beyondeither side of the horizontal stroke below, the recognizer must detect thisand report that the glyph does not match.3.4 Avoiding FrustrationThe recognition algorithm needs to be tuned to avoid causing frustration.Each time a character is recognized as incorrect is a very jarring experiencefor the user. An input that was correct but nonetheless causing the algorithm to return a wrong verdict will be a signiﬁcant cause of frustration,particularly since the user does not understand why the verdict was wrongand what part of the input triggered the error. The user will lose faith inthe accuracy of the application, even if the accuracy is in fact quite good.Therefore the accuracy needs to be as close to 100% as possible for correctinput.Incorrect input has to be detected in most cases as well in order forthe application to be useful. However, for incorrect input it is not quiteas important to achieve an extremely high success rate of recognizing acharacter as being wrong, since the application can give the user feedback byother means, such as displaying the correct character alongside the charact

2. The application asks the student to draw a speciﬁc kanji by displaying the keyword meaning of the kanji. 3. If the kanji is presented to the student for the ﬁrst time, a diﬀerent view is shown, displaying an animation of the writing of the glyph. If the kanji is composed of other kanji, the keyword meanings for these are also shown.

Related Documents:

Use of Mrs.Kanji Web Application to Enhance Japanese ... - Atlantis Press

110 Kanji, and Shokyuu Hyouki II (Writing Basic Kanji II) books (The Department of Japanese Language Education FPBS UPI, 2015) amounted to 119 kanji, after which they were screened and grouped thematically with kanji criteria which can be illustrated and given a mnemonic so that kanji are collected. The list of thematically selected kanji is as .

28 Views

1y ago

THE FIRST 103 KANJI

The Japanese government limited the kanji used in ofﬁcial publications to the 1945 touyou kanji with about 4000 readings. The Japanese Language Proﬁciency Test (JLPT) is based on these 1945 touyou kanji. A Japanese with average education knows around 3000 kanji and it is estimated that around 4000 kanji are used in Japanese literature.

606 Views

3y ago

The Kanji Learner's Course Graded Reading Sets - keystojapanese.com

KLC-GRS Series Introduction The Kanji Learner's Course Graded Reading Sets contain over 30,000 parallel text segments distributed as reading practice for all 2,300 kanji in the course. The exercises for each kanji contain only kanji previously introduced, and are designed to give you contextualized practice with reading the kanji-based vocabulary

23 Views

1y ago

JLPT N5 Vocabulary List - 802 words

307 words from the old "Japanese Language Proficiency Test Level 4") and 46 greetings words for JLPT N5. Kanji Kanji from No.1 to 120 (for N5) in the MLC original Kanji book "Basic Kanji 320 (for N5 and N4)" are shown in the Kanji section. Freq. Frequency The number of times the word appeared in the

1.0K Views

2y ago

Toyota Language Center - Japan Society

course covers the kanji necessary for taking the JLPT N4 and N5 tests. This course is conducted almost entirely in Japanese. Kanji 2 or JLPT N3 (Comparable in difficulty to level 10, 10.5 or 11) For students who have already mastered approximately 300 kanji, and are looking to learn 680 total kanji (taken from the JLPT N3 preparatory books .

335 Views

3y ago

Introduction to Kanji A brief history of Kanji

Introduction to Kanji A brief history of Kanji Chinese characters, along with the Chinese culture, came to Japan in the fourth or fifth century, at a time

122 Views

2y ago

Remembering the Kanji (4th Edition) - kanji 1 to 2042

Remembering the Kanji (4th Edition) Kanji 1 to 2042 series 2 page 1 infancy rut from detailed comrade ford squeeze snapshot erect hearing separate long north wherefore ball raise task suck dike camphor tree explanation ratio horse brain shout grate anti- tears model garden white watcht

54 Views

2y ago

Joining Technologies for Automotive Battery Systems ...

High speed, less thermal input, non-contact process, easy Automation High initial cost, additional shielding system may required Need good joint ﬁt-up (intimate contact), high reﬂective materials 8 Magnetic pulse welding Solid state process, able to join dissimilar materials, high joint strength, dissimilar materials Potential large distortion,

33 Views

3y ago

Recent Views

Chapter 15 Rooming Houses - MassLegalHelp

Individual renters usually have their own separate room and their own agreement with the landlord. For example, you may stay for just a few days, but another renter may stay for 3 months. Rooming houses with 4 or more renters at the same time must be licensed. Some cities and towns have local protections for renters in rooming houses. Rooming House

2y ago

356 Views

Americans rent, buy, sell and think about home.

median rent among Generation X is 1,062 per month. The youngest renters, Generation Z, are typically paying the least at 882 per month.9 This echoes the notion that Generation Z renters are opting to rent the smallest apartments or homes, which translates to lower monthly rental payments. Approximately half of renters (47 percent) are paying for

1y ago

174 Views

Disaster assistance process overview

A guide through the post-disaster recovery process. KEY ASSISTANCE SOURCES TIPS HOMEOWNERS/RENTERS INSURANCE If you have homeowners or renters insurance, this provides you funds to repair or replace property damaged as a result of covered perils during a disaster. Additional types of insurance, such as auto or other peril-specific

1y ago

109 Views

Personal insurance - Car & Business insurance King Price Insurance

The king's insurance options 5 Things you need to know 7 The stuff you need to do 14 How to claim 16 Our commitment to you 20 Car insurance 22 Car warranty 37 Shortfall cover 45 Scratch and dent 46 Tyre and rim 48 Motorbike insurance 53 Trailer and caravan insurance 64 Watercraft insurance 68 Home contents insurance 77 Buildings insurance 89

1y ago

673 Views

Texas Demographic Trends and Projections and the 2020 Census

Income disparities place African Americans and Latinos at greater risk during times of income loss. Renters, renters w/low incomes, Blacks, and households w/children face greater risk of eviction. Persistently low health insurance coverage in the state increases vulnerability of Texans with employer based insurance.

1y ago

137 Views

Gold Tier - MAPFRE Insurance

Foy Insurance of MA, LLC 198 Frank Consolati Insurance Agency, Inc. 198 County Insurance Agency, Inc. 198 Woodrow W Cross Agency 214 Woodland Insurance Agency, Inc. 214 Tegeler Insurance Services of CT, Inc. 214 Pantano/VonKahle Insurance Agency, Inc. 214 . Hanson Insurance Agency, Inc. 287 J.H. Slattery Insurance Agency, Inc. 287

1y ago

565 Views

Texas - milestonepnc

State Auto - Homeowners TEXAS 05/2017 State Auto Insurance Company UG-1.0 I - UNDERWRITING GUIDELINES A. Entire State Eligibility Guidelines Premier Protection Plus Standard Available Forms HO0004 - Renters HO0005 - Homeowner Expanded HO0006 - Condominium HO0003 - Homeowner HO0004 - Renters HO0005 - Homeowner Expanded

1y ago

112 Views

Consumer Guide to Auto Insurance - csimt.gov

consumer guide to auto insurance contents introduction to auto insurance 1 understanding your auto insurance policy 2 required auto insurance 3 optional types of auto insurance 4-5 getting the right coverage 6 accidents and violations 7 how to shop for auto insurance 8 shopping tips 9 frequently asked questions 10-11 insurance complaints/when you have a problem 12

2y ago

805 Views

Industry Observations Insurance Industry

Jun 30, 2019 · 6/17/2019 Commercial Insurance Branch of Extraco Banks, N.A. Higginbotham Insurance Group, Inc. Insurance Brokers NA 6/13/2019 Links Insurance Services, LLC World Insurance Associates LLC Property and Casualty Insurance NA 6/13/2019 Abram Interstate Insurance Services, Inc. Risk Placement Services,

2y ago

619 Views

Life Insurance Buyer's Guide Life Insurance - National Association of .

Life Insurance uers uide Naional ssociaion of Insurance Commissioners Compare the Different Types of Insurance Policies There are many types of life insurance pol-icies. You should choose a policy with fea-tures that fit your individual needs. Some things to consider are: Term Insurance vs. Cash Value In-surance. Term insurance is intended to

1y ago

520 Views

your guide to understanding auto ins in nh - New Hampshire

Hampshire Insurance Department does not mandate or set Auto Insurance Rates. Auto Insurance Rates will vary by insurance company. This guide is intended to give New Hampshire consumers basic information on auto insurance. It suggests ways to: Lower the cost of your auto insurance, shop for Auto insurance and, file an auto insurance claim.

1y ago

449 Views

18.01.41 - REPLACEMENT OF LIFE INSURANCE AND ANNUITIES - Idaho

Department of Insurance Replacement of Life Insurance and Annuities. Page 3. 04. Existing Life Insurance or Annuity. "Existing Life Insurance or Annuity" means any life insurance or annuity in force, including life insurance under a binding or conditional receipt or a lif e insurance policy or annuity that is within an unconditional refund period.

1y ago

407 Views

EXAMINATION REPORT OF THE ADMIRAL INSURANCE COMPANY AS OF . - Delaware

Berkley Regional Specialty Insurance Comp 31295 DE Carolina Casualty Insurance Company 10510 IA Clermont Insurance Company 33480 IA Continental Western Insurance Company 10804 IA Firemen's Insurance Com pany of Wash, D.C. 21784 DE Gemini Insurance Company 10833 DE Great Divide Insurance Company 25224 ND

1y ago

258 Views

American International Group, Inc. - Federal Reserve

American General Life Insurance Company AGL U.S. Life Insurance Company AGC Life Insurance Company AGC Life U.S. Life Insurance Company The United States Life Insurance Company in the City of New York U.S. Life U.S. Life Insurance Company The Variable Annuity Life Insurance Company VALIC U.S. Life Insurance Company

1y ago

269 Views

Japan's Insurance Market - Toa Re

with 61.6% of net premiums written, of which automobile insurance totaled 48.8% and compulsory automobile liability insurance totaled 12.8%. Fire insurance accounted for 13.7%, miscellaneous casualty insurance including liability insurance accounted for 11.6%, accident insurance accounted for 9.8%, and marine insurance accounted for 3.2%.

1y ago

179 Views

Kanji Stories - Fu-berlin.de

It looks like you're using an ad-blocker