Project Scribble
By Team Four-Play

Acknowledgments

1. The Tech Council, for giving us this wonderful opportunity to utilise our summers productively.
2. Mayank Kumar Singh, our mentor, who guided us throughout our journey.
3. All the developers of the open-source libraries without which this project would have been inconceivable.

A special mention for our friend Rohan Chandratre, who not only produced all the 3D CAD models of our pen, but also gave us some interesting ideas which we ended up implementing in our project!

Index

- Acknowledgments
- Index
- Original idea
- On-line vs Off-line recognition
- Why we chose this project
- Original Approach
- Where it all went wrong
- Time for CHANGE
- Here's what we came up with
  - Hardware
  - Software
- Possible improvements and future scope
- References

Original idea:

Making a device (a 'smart pen') which will enable us to convert handwritten script into editable text format in real time. The writer should get a soft copy of whatever he/she writes on the paper, in the form of a text document, without the requirement of scanning the paper.

On-line vs Off-line recognition:

In general, handwriting recognition is classified into two types: off-line and on-line. In off-line recognition, the user feeds the computer a scanned copy of the handwritten document; the recognition system applies several algorithms and finally outputs the recognised text. On-line character recognition (also called dynamic character recognition) uses information about how the characters are written by a person to identify which characters have been written. The information generated by the handwriting process is captured by the computer in real time, via the use of a digitising tablet, and is usually presented to the recognition system as a time-ordered sequence of x and y co-ordinates.

Why we chose this project:

On-line handwriting recognition is considered superior to off-line character recognition for many reasons. One major reason is that it is a real-time recogniser and outputs the recognised character immediately after the user writes it. Moreover, the segmentation of letters is not a problem in on-line recognition since we can use 'PenUP' and 'PenDOWN' information. However, one major drawback of most existing systems is that the writer cannot use ordinary paper, but instead requires a tablet or some kind of pressure-sensitive surface. We wanted to overcome this problem and enable the user to write on any kind of paper.
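To make that representation concrete, here is a small illustrative sketch of the kind of time-ordered input an on-line recogniser consumes (the field names and values are our own illustration, not taken from any particular system):

    # Each sample: (timestamp in seconds, x, y, pen_state)
    # pen_state is True while the pen touches the paper ('PenDOWN').
    stroke = [
        (0.00, 12.0, 40.0, True),   # pen touches down: start of a stroke
        (0.02, 12.5, 38.1, True),
        (0.04, 13.2, 35.9, True),
        (0.06, 13.2, 35.9, False),  # 'PenUP': segment boundary between letters
    ]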

Original Approach:

After consulting numerous research papers (see References 1, 2), we decided that using an IMU sensor, specifically the MPU 6050, in combination with a Recurrent Neural Network would be an ideal solution for our task.

The MPU 6050 is an IMU sensor which has both an accelerometer and a gyroscope embedded in the chip. The plan was to detect the position of the tip of the pen using accelerometer measurements along the x and y axes. We do not need the measurement along the z axis since the movement is only in the 2-dimensional plane of the paper. These measurements would then be double-integrated with respect to time to get the displacement:

    x(t) = x(0) + v(0)*t + ∫∫ a dt dt   (and similarly for the y axis)

Before the double integration, some filters had to be applied in order to reduce the noise. Also, a pressure switch was required to detect the 'PenUP' and 'PenDOWN' states.

Once we had the displacement readings, they were to be fed to a trained RNN. The RNN performs the task of recognising the written character. The following flowchart depicts the working of our original plan:

[Figure: Flowchart depicting the control loop of our proposed program]
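The double integration itself is simple to perform numerically. Here is a minimal sketch, assuming the calibrated acceleration samples and their sampling interval are already available (the data below is synthetic and the function name is our own):

    import numpy as np

    def double_integrate(acc, dt):
        """Integrate acceleration samples twice to get displacement.

        acc : 1-D array of calibrated acceleration values (m/s^2) along one axis
        dt  : sampling interval in seconds
        Returns displacements (m), assuming zero initial velocity and position.
        Any residual bias in `acc` grows quadratically after two integrations,
        which is exactly the drift problem described in the next section.
        """
        vel = np.cumsum(acc) * dt    # first integration: acceleration -> velocity
        pos = np.cumsum(vel) * dt    # second integration: velocity -> displacement
        return pos

    # Example: 2 seconds of synthetic, slightly biased data sampled at 100 Hz
    dt = 0.01
    acc_x = np.random.normal(loc=0.02, scale=0.1, size=200)
    disp_x = double_integrate(acc_x, dt)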

Where it all went wrong:

The above approach, especially the MPU 6050 chip, had sounded very promising in the beginning because many research papers had claimed that it was feasible. However, three weeks into our ITSP, we decided to discard it altogether as we were unable to make any reasonable progress.

We hooked up the MPU to an Arduino UNO using the I2C protocol. We used the I2C library made by Jeff Rowberg (see Reference 5) to get the raw values of the accelerometer. These values were exported from the serial monitor to an Excel sheet using an Excel add-on called 'PLX-DAQ'. To get the values in SI units we performed some calibration, that is, calculating the Scale Factor and Bias. Once we have those, Calibrated Value = (Original Value - Bias) / Scale Factor.

We wrote a Python program which imported the calibrated values from the Excel sheet using the XLRD module. We applied a moving-average filter to these readings. A moving-average filter replaces a particular sample by the mean of its neighbouring values. We used 5 preceding and 5 succeeding values, thus reducing the high-frequency noise in the raw data. The filtered plots are strikingly smooth as compared to the raw acceleration-time plots. Here is a comparison of the 'Before' and 'After' plots, plotted using Matplotlib:

[Figure: Acceleration-time plots before and after applying the filter]

To further increase the accuracy and reduce noise, we also implemented a 'Kalman smoother'. This resulted in substantial noise reduction.
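The calibration and the centred moving average are only a few lines of code each. A minimal sketch, with our own function names, of what such a filter looks like (not the exact script we used):

    import numpy as np

    # Calibrated Value = (Original Value - Bias) / Scale Factor
    def calibrate(raw, bias, scale_factor):
        return (np.asarray(raw, dtype=float) - bias) / scale_factor

    # Replace each sample by the mean of itself and its k preceding and
    # k succeeding neighbours (we used k = 5), trimming at the ends.
    def moving_average(samples, k=5):
        samples = np.asarray(samples, dtype=float)
        out = np.empty_like(samples)
        for i in range(len(samples)):
            lo, hi = max(0, i - k), min(len(samples), i + k + 1)
            out[i] = samples[lo:hi].mean()
        return out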

Although the acceleration-time plots looked satisfactory, this was not reflected in the displacement-y vs displacement-x plots. We believe this was because integrating over time introduces significant 'drift' (deviation from the true value), caused by the accumulation of small deviations over a long time; the double integration must have amplified it further. So even small errors in the acceleration values led to incorrect displacement values. Following is a Y vs X plot obtained when we moved the sensor along the trajectory of the letter 'b':

[Figure: The Y vs X plot of the letter 'b'. Clearly, it is not even close to the actual trajectory of the letter 'b'.]

And that was the best we got. Most plots simply resembled a y = x or y = -x line. So we had no choice but to abandon this approach.

Time for CHANGE:

After experimenting with sensors which gave inaccurate readings, we decided it was time to change our approach. We decided to use image processing to track the nib of the pen. While this adds some constraints to our project, the tracking was so accurate that we felt the pros far outweigh the cons. To name a few:

1. Training a CNN model for character recognition is a matter of a few lines of code, thanks to the abundance of machine learning libraries such as Keras and PyTorch.
2. There was no need for any constraints on the tilt of the pen. A user could now hold the pen naturally.

Here's what we came up with:

In this section, we describe our completely revamped approach. It can be broadly classified into two sections - Hardware and Software. We describe each in detail below.

Hardware:

For the pressure switch, our first approach was to use an Infrared or Ultrasonic sensor to detect a change in the distance between the receiver and the paper. But the required precision could not be attained using such sensors: the distance between the paper and the pen during the 'PenUP' state is in the range of a few millimetres, while the IR/Ultrasonic sensors could detect a change only in the range of centimetres. As a result, we had to resort to a mechanical alternative.

Our hardware setup consists of an LED mounted on a specially designed 3D-printed pen and a webcam placed in a plastic enclosure. We describe each of the components in more detail below.

The plastic cuboidal enclosure has a webcam placed on the bottom surface. Three cardboard slits at the top ensure that the paper does not shake while the user is writing. White chart-paper is used to prevent light composed of unwanted colours from reaching the webcam.

Our prediction model uses keystroke data to plot the letter. As a result, we had to devise a mechanism which could distinguish between two distinct states - 'PenUP' and 'PenDOWN'. This is mandatory because keeping the LED always on would lead to two issues. Firstly, it would quickly drain the battery; more importantly, the OpenCV program would also detect random keystrokes when the pen is not in use, for example when the user is simply holding the pen in his/her hand. To overcome this issue we tried various methods, and we came up with a simple yet effective solution after our first approach of IR/Ultrasonic sensors failed.

[Figure: Our pen being 3D-printed using the FlashForge Guider II. It took around 2 hours 13 minutes to print. 12 grams of ABS plastic was required for the entire assembly.]

We 3D-printed a pen whose refill housing was a fraction longer than the length of the refill. At the top, a push-button was housed in an enclosure with the button facing downward. The LED was connected to the battery, with the push-button acting as a switch. When the pen was not in contact with the paper, the refill did not press against the button and the circuit was open. As soon as the pen was pressed against the paper, the refill pushed the button, the circuit was completed and the LED glowed! Pre-existing pen bodies couldn't be modified to house the pressure switch, so we had no option but to 3D-print the pen. After the pen was printed, we inserted a standard ball-point pen refill into the chamber. The battery and the LED were taped to the body of the pen and the connections were soldered to ensure proper functioning.

[Figure: The body of the pen, which houses the refill]
[Figure: The cap of our pen, which acts as a compartment for the pressure switch]

For detecting the LED movement above the paper, we use a Logitech webcam. For mechanical support, the paper is kept on the top surface of a transparent plastic cuboid with the webcam placed on the bottom surface. The webcam can accurately track the centre of the LED glow as the pen moves across the surface.

[Figure: The final assembly consists of the pressure switch resting against the refill. A green LED and a 3V battery are taped to the body of the pen and the connections are soldered. The battery is connected to the appropriate terminals of the LED through the switch. As described previously, the circuit gets completed when the refill is pressed against any surface and the LED lights up.]

Software:

We now reach the final step, which is predicting the letter that was just drawn by the user. Since we had collected the (x, y) coordinates of the keystroke data, plotting them using Matplotlib was a simple task; a sketch is given below. Once an image was generated, using a Convolutional Neural Network (CNN) was a no-brainer, considering its high accuracy when it comes to classifying images. Other approaches such as Dynamic Time Warping (DTW) were ruled out due to the large variation in the keystroke data generated by different users.
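For illustration, here is a minimal sketch of how a list of tracked (x, y) points can be turned into a small 28x28 greyscale image with Matplotlib. The function name, file name and figure settings are placeholders of our own, not the exact script we used:

    import matplotlib
    matplotlib.use("Agg")              # render straight to a file, no GUI needed
    import matplotlib.pyplot as plt

    def save_letter_image(points, path="letter.png"):
        """points: list of (x, y) coordinates recorded while the LED was lit."""
        xs = [p[0] for p in points]
        ys = [p[1] for p in points]
        fig = plt.figure(figsize=(1, 1), dpi=28)   # 1 inch x 28 dpi -> 28x28 pixels
        plt.plot(xs, ys, color="black", linewidth=2)
        plt.gca().invert_yaxis()                   # image coordinates grow downwards
        plt.axis("off")
        fig.savefig(path, dpi=28)
        plt.close(fig)

    save_letter_image([(10, 10), (12, 30), (14, 50), (20, 40), (26, 50)])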

The EMNIST dataset (see Reference 2) is a gigantic collection of 15 lakh training examples. We hoped that a deep network trained on this dataset would give accurate predictions, but we ran into a few problems, as described below.

Firstly, the training data folder for most examples had both upper-case and lower-case samples in the same folder, and separating them manually was not feasible, considering each folder had approximately 15,00,000 / 26 ≈ 57,000 examples.

Secondly, the images generated by the keystroke data of our pen had minor noise issues which we were not able to eliminate. Such peculiar noise was not present in the EMNIST dataset, and hence the letters differed considerably in their shapes and curves.

[Figure: Shape generated using our pen vs an example from the EMNIST dataset]

Thirdly, all the EMNIST samples had a thick border on all four sides and the actual size of the letter varied greatly. Centre-cropping the samples would sometimes chop off the extremes of some examples. On the other hand, the images generated by our pen were centred, of the same size, and with a thin border on all four sides.

Considering the above issues, we decided to generate our own dataset. Thanks to an accurate ball-tracking program written by Adrian Rosebrock (see Reference 3), we simply tweaked the script to our own specifications. We added a few lines of code which dumped a pickle file after every batch of, say, 30 examples. Once all the examples were stored as lists of (x, y) coordinate data, the next script took these coordinates and plotted them using Matplotlib. Each image was resized to 28x28, since most pre-existing architectures have been implemented for this size. Each example was stored under a subdirectory for that particular letter, under a folder named 'Train'. Each folder contained around 300 examples. 30 samples for each letter were randomly chosen and moved to a directory named 'Test', which had the same hierarchy as that of the 'Train' folder.

[Figure: The Validation folder hierarchy]
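For context, here is a minimal sketch of this collection step: colour-thresholding the LED glow (the general idea behind the tracking tutorial in Reference 3) and pickling a batch of coordinate lists. The HSV range, camera index, batch size and function name are our own placeholders and would need tuning for the actual LED and webcam:

    import pickle
    import cv2

    # Illustrative HSV range for a green LED glow; must be tuned in practice.
    GREEN_LOWER = (40, 80, 80)
    GREEN_UPPER = (90, 255, 255)

    def track_one_letter(camera, max_frames=200):
        """Collect (x, y) centres of the LED glow while it is visible."""
        points = []
        for _ in range(max_frames):
            ok, frame = camera.read()
            if not ok:
                break
            hsv = cv2.cvtColor(frame, cv2.COLOR_BGR2HSV)
            mask = cv2.inRange(hsv, GREEN_LOWER, GREEN_UPPER)
            # OpenCV 4 return signature: (contours, hierarchy)
            contours, _ = cv2.findContours(mask, cv2.RETR_EXTERNAL,
                                           cv2.CHAIN_APPROX_SIMPLE)
            if contours:                                  # LED on: 'PenDOWN'
                c = max(contours, key=cv2.contourArea)
                (x, y), _radius = cv2.minEnclosingCircle(c)
                points.append((x, y))
        return points

    camera = cv2.VideoCapture(0)
    batch = [track_one_letter(camera) for _ in range(30)]   # one batch of ~30 examples
    with open("batch.pkl", "wb") as f:
        pickle.dump(batch, f)
    camera.release()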

But 300 - 30 = 270 samples per class is insufficient for any neural network to learn the true features of a letter; it may result in overfitting even in the case of a simple feed-forward network. To tackle this issue, we used a Python package called 'Augmentor'. It added random distortions and small rotations to each example and, in the end, increased the number of training examples from 270 to 5,000 per class.

[Figure: The Augmentor package, which helped us generate more training data]

Once our training and testing datasets were ready, we explored various CNN architectures. We experimented with the kernel size, the number of feature maps in every convolution layer, dropout methods and the number of neurons in the fully-connected layers at the end of our network. After training around 8 different models, we finalised the following model due to its high test accuracy and a monotonic decrease of the cost function, which indicates minimal overfitting:

[Figure: The final architecture of our CNN model]
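Since the architecture figure is not reproduced here, the following is only an illustrative Keras sketch of this kind of pipeline: an Augmentor pass over one letter's 'Train' subdirectory, followed by a small CNN for 28x28 greyscale letter images with 26 classes. All Augmentor parameters, layer sizes and hyperparameters below are placeholders, not the finalised model:

    import Augmentor
    from keras.models import Sequential
    from keras.layers import Conv2D, MaxPooling2D, Dropout, Flatten, Dense

    # Random distortions plus small rotations, then sample 5,000 images per class.
    p = Augmentor.Pipeline("Train/a")
    p.random_distortion(probability=0.8, grid_width=4, grid_height=4, magnitude=2)
    p.rotate(probability=0.6, max_left_rotation=5, max_right_rotation=5)
    p.sample(5000)

    # A small CNN of the kind described above (sizes are illustrative).
    model = Sequential([
        Conv2D(32, kernel_size=(3, 3), activation="relu", input_shape=(28, 28, 1)),
        MaxPooling2D(pool_size=(2, 2)),
        Conv2D(64, kernel_size=(3, 3), activation="relu"),
        MaxPooling2D(pool_size=(2, 2)),
        Dropout(0.25),
        Flatten(),
        Dense(128, activation="relu"),
        Dropout(0.5),
        Dense(26, activation="softmax"),
    ])
    model.compile(optimizer="adam", loss="categorical_crossentropy",
                  metrics=["accuracy"])
    # model.fit(...) would then be called on the augmented 'Train' images.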

The 'Master' script starts an infinite loop. Inside the loop, OpenCV detects the LED glow and saves the keystroke data to a file. This file is then read by another script and converted into a JPEG containing the drawn letter. Here are a few examples of the JPEGs plotted using keystroke data generated by the pen:

[Figure: Letters generated using our pen]

Each image is then fed to the above model, and the predicted letter is appended to a string called 'finalstring'. Spaces and line breaks are also handled. A package called Tkinter is used to update the label of a GUI window. For more details, please go through the code.

Possible improvements and future scope:

Owing to time constraints, we couldn't implement a few features which could have made the writing experience more natural. To name a few:

1. Add capital letters, digits and punctuation characters to our dataset.
2. Add one more physical switch near the grip of the pen. This switch can be used to delete the previously predicted character in case of an incorrect prediction or if the user changes his/her mind.
3. Make a compartment for the battery and the LED so that replacing them becomes easy.
4. Improve the design of the enclosure so as to allow the maximum field of view for the camera. A perfectly designed enclosure can act as a substitute for the foldable study tables which are so popular these days.

Future students are more than welcome to pick up right where we left off and enhance the writing experience by adding their own features.

References:

1. Maria Atiq Mirza and Nayab Gul Warraich, "Application of Motion Sensors in Hand Writing Conversion", Student Research Paper Conference, Vol. 2, No. 3, Department of Electrical Engineering, Institute of Space Technology, Islamabad, July 2015.
2. Gregory Cohen, Saeed Afshar, Jonathan Tapson, and Andre van Schaik, "EMNIST: an extension of MNIST to handwritten letters", The MARCS Institute for Brain, Behaviour and Development, February 2017.
3. Adrian Rosebrock, "OpenCV Track Object Movement", https://www.pyimagesearch.com/.
4. Sung-Do Choi, Alexander S. Lee, and Soo-Young Lee, "On-Line Handwritten Character Recognition with 3D Accelerometer", Proceedings of the 2006 IEEE International Conference on Information Acquisition, August 20-23, 2006, Weihai, Shandong, China.
5. Jeff Rowberg, I2C device class (I2Cdev) demonstration Arduino sketch for the MPU6050 class.
