Elements: Software For Image Editing Over Voice Commands - IJISRT

1y ago

9 Views

1 Downloads

988.84 KB

7 Pages

Last View : 23d ago

Last Download : 3m ago

Upload by : Oscar Steel

Report this link

Download PDF

Transcription

Volume 5, Issue 5, May – 2020 International Journal of Innovative Science and Research Technology ISSN No:-2456-2165 Elements: Software for Image Editing Over Voice Commands Akash Johnny Kunnath, Albin Saji, Benjamin G Nechicattu, Done Maria James, Suma R St. Joseph’s College of Engineering and Technology, Palai Abstract:- Image editing is simply the processes of manipulating images, in whatever format they are, for instance, digital photographs, old photo-chemical photographs, or illustrations. Traditional analogue image manipulation involves photo retouching, using mechanical tools like an airbrush to modify photographs else by editing illustrations with any traditional art techniques. Graphic software programs can be broadly categorized into vector graphics editors, raster graphics editors, and 3D modellers. This software is the primary tool with which a user may manipulate, enhance, and transform images. Many image editing programs are also used to render or create computer art from scratch over voice commands and direct interaction. I. INTRODUCTION Image editing from scratch has become a timeconsuming process for non-professionals as well as for upgrading professionals. Learning chunks of shortcuts and completely accessing editing tools via mouse and keyboard has become difficult, time-consuming and particularly overhead for at least a few. Here, we introduce an image editing interface that comprises of vocal command recognizer, image editing is difficult to perform with voice alone. For flexible and easy editing-control we use both voice and manual editing interaction, using mouse and keyboard. Selecting an object or a layer within the workspace has become easier. The editing panel is a grid fashion workspace and x-y axes (rulers) are scaled for selection of points at the workspace where an image is to be edited. This application contains an image storage directory linked to the desktop so that importing images becomes easy which is already in the application storage. There are combinations of filters that provide a professional touch to the images. Elements like adding texts and formatting, colour-comb are add-on features. The functions with varying values can be adjusted in percentage/values by saying it while specifying the arguments. Voice interface makes complex tasks easier and accessible as they Allow users to simply state goals without learning an interface. IJISRT20MAY903 Fig 1:- Info Screen II. INTENDED AUDIENCE This software concentrates on people who are involved in image editing. Professionals, freelancers, tutors, students and even people who love to make images better and look good. Moreover, it especially concerns people who are differently-abled who are involved in image editing, with good craftsmanship, may be physically challenged. ‘Elements’ a user-friendly voice-enabled editing platform uses Voice commands to alter images. Out of numerous editing software that are available currently, none of them provide voice capability for editing which in turn makes these editors much more difficult. One of the major add on benefits is that this can be used by the differently-abled persons too mainly those who have some physical disability which makes ‘Elements’ adaptable to a large range of users. All you have to do is just say what you have to do. This makes it easier to use and simply you don’t require by heart commands anymore. Image editing software like Photoshop, Fotos, GIMP, etc. have a wide variety of codes and commands which are a bit difficult to learn, and without these commands by hearted no one can work on any of these platforms with increased efficiency and full accuracy. By using ‘elements ’the same Performance accuracy can be achieved without even by hearting a single piece of command. You can do what you need to do just by saying what is to be done. www.ijisrt.com 1853

Volume 5, Issue 5, May – 2020 International Journal of Innovative Science and Research Technology ISSN No:-2456-2165 III. TECHNOLOGIES USED V. Python Core Programing is based on Python Programing language which is more convenient flexible and fast. Python is more understandable as well readable. Execution and complexity of the program is comparatively easier and less respectively. Python is an interpreter language which helps in sequential execution if the program. Visual Studio We used visual studio programing platform for the development of the project. WPF along with C# framework makes it easier to integrate all the functions and make the software Tkinter Python GUI Python have Tkinter GUI which makes combining the scripts together. This makes it executable on any machines that have python with in thus making the program cross platform. Pillow Library The Python Imaging Library[iii] (PIL/pillow) adds image processing capabilities to the Python interpreter. Basically every operations on image can be done using this pillow library. This gives wide file format support, an efficient internal representation, and fairly powerful image processing capabilities. PhotoImage and BitmapImage interfaces helps to show the image. Pillow library supports image resizing, rotation and arbitrary affine transforms. DESIGN AND IMPLEMENTATION CONSTRAINTS It makes use of Google Speech recognizer. Google Speech Recogniser works over a fast internet connection. Hence the application requires a fast internet connection. Access to mic and recording is a must. VI. ASSUMPTIONS AND DEPENDENCIES It makes use of Google Speech recognizer to take in voice commands from the user. Hence Speech recognition library is needed to be installed along as a dependency. Pythons’ PyAudio package is needed to be taken into consideration since the software involves the usage of the system mic. Python’s Pillow library is a powerful image processing support. Image manipulation and Enhancement can easily be done using the Pillow package. Hence Python Pillow Package has to be considered as a major dependency. Python GUI Tkinter[vii][viii]is used for the development of the project. VII. An image editing project requires a 4 GB recommended Memory. So the system must provide the recommended space. The recommended processor speed of an average 2.0 GHz is suggested. All since it involves a continuous rendering of the image. A graphics card is recommended, if not available, process. VIII. Natural Language toolkit Natural Language toolkit[iv] [vi] is used in order to get the speech and convert it to a machine understandable form so that the machine can make meaning from it. Every commands that is given to the system is tokenized by the NLTK and this enables the system to find out what operation is to be done on the image taken. SYSTEM FEATURES USER INTERFACES The user interface is made simpler so that everyone can use it even without any prior knowledge of how the software works. Got a clean interface than that of any other UI provided by similar products. Google Speech Recognition Engine Google speech recognition engine[x] coverts the speech that is captured to corresponding text[i]. This text is then used by the Natural Language toolkit (NLTK). The Speech is recorded by the system and acquires Google API for speech recognition and uploads the speech to generate the corresponding text[ii]. IV. PLATFORM The software works on Windows PC. Tkinter GUI[vii] can be run on any devices that supports python. Thus can be installed on Windows and Linux Machines and mac OS too. Thus making it a cross platform program. IJISRT20MAY903 www.ijisrt.com Fig 2:- Screenshot of the Workspace with filters 1854

Volume 5, Issue 5, May – 2020 International Journal of Innovative Science and Research Technology ISSN No:-2456-2165 The in-app file browser makes it easier to browse through the image files in your system over voice commands. You can either start with an image or with a board (fig 4). Board feature requires more customisation yet to be done to work with voice which is not available in this version. In later versions board will be as easy as drawing on paper with voice. Workspace imports images to work with. Fig 5 shows the voice activated saving screen. Fig 3:- Add Text Option An example of the UI is shown above (fig 2). Tools are arranged on one side which makes it easier to access. The workspace[viii] is equipped with Suggestions and finetuning sliders (fig 3 text options) for precision. Tools are probably less accessed since the project works over voice commands. Fig 5:- Save options With great precision, images can be edited in the workspace. IX. HARDWARE/COMMUNICATION INTERFACES Hardware interface involves a microphone which comes inbuilt with most of the Personal Computers. If not available, an external mic can be plugged in. A microphone is a must, since we need to gather the voice commands. Fig 4:- Create board X. PROGRAM EXECUTION FLOW Fig 6:- Sequence diagram IJISRT20MAY903 www.ijisrt.com 1855

Volume 5, Issue 5, May – 2020 International Journal of Innovative Science and Research Technology ISSN No:-2456-2165 The Program begins when the application is open. The application welcomes the user with a splash screen. Soon after the application files and libraries are loaded it checks for the internet access. If an internet access is available the mic gets activated and listen for a ‘do’ command. After a ‘do’ audio fingerprint is detected you can say any command to be performed in the image. Selecting the image is much easier with an in app file browser which shows the images with in the PC. All you have to do is to say the name of the image or select the image manually. Selected image is brought to the workspace window. Where you can perform image editing. Now we need to say what operation had to be performed on the image. It is a command. The command is then converted to its corresponding text via Googles Speech Recogniser API. API returns the corresponding text. The command is now tokenised to tokens. For every token, compares to a keyword in the keyword file. If token found, calls the corresponding function and performs the action. Else if no token is found in the keyword file, token is compared with similar file, to avoid miss predictions. If a similar keyword is found. Then the corresponding function to the ‘similar keyword’ is called and then performs the action on to the image. For some functions, arguments are needed to be passed. For instance, say angle for rotation. When a rotation function is called, an argument has to be passed, angle. Now it’s turn for the argument to be listened and is passed to the function. We can perform enough actions on the image until a save or quit command appears. Save command confirms the edited image and saves the image in new name and a new extension as the user prefer. Quit command quits the image editing window without saving the changes. XI. IMAGE MANIPULATION OPERATIONS The usage of pillow library brings up a large space for image editing[v]. Among them few are loaded in to the application. Rotation. Rotation operation rotates the image in the workspace window. The user can specify the angle to which the image has to be rotated. Say rotate to activate command over voice, keyboard press r, or select rotate. Brightness. User can now change the brightness by saying change brightness. Brightness too needs a parameter. A floating point number is passed as the argument for brightness which in turn increase the brightness by that much. Say Brightness to activate command over voice, keyboard B, or select brightness. Contrast. Contrast increases the contrast of the image by a floating point value. Say contrast to activate command over voice, keyboard c, or select contrast. IJISRT20MAY903 Saturation. This operation Increase the saturation of the image or reduces the saturation. Say saturation to activate command over voice, keyboard S, or select saturation. Flip. Flips the image left to right or up to down. This makes the image looks like mirrored or tilted upside down. Say flip right/ flip up to activate command over voice, keyboard F/f, or select flip. Warmth. Increases the red in every pixel and makes the image warmer. Say warmth to activate command over voice, keyboard w, or select warmth. Text. Now we can add text in to an image. When the text function is called, it asks for the sentence to be added in the image. Varity of fonts are also incorporated along. Font type has to be specified by specifying the font name. Position of the text in the image has to be specified by passing the x, y co-ordinates or by saying how much to right or down. Say add text to activate command over voice, keyboard T, or select text. Black and white filter. Now a days black and white images brings nostalgic feel to the image. This can be brought to the image by a call and specify the strength of the filter. Say black and white to activate command over voice, or select black and white. Sharpness. Increases the sharpness of the image and makes the image crispier and less blurred. Say sharpness to activate command over voice, or select sharpness. Detail. Increases the details and the structure of the objects in the image. Say details to activate command over voice, keyboard d, or select details. Crop. Crop the image by passing four arguments corresponding to left, upper, right and lower of the image. Say crop to activate command over voice, keyboard C, or select crop. Blur. Blurs the image or reduces the sharpness of the image which gives a spread appearance to the image. Say blur to activate command over voice, or select blur. Contour. This can select the pixels with same intensity and find out the edges in an image. Say contour to activate command over voice, or select contour. Edge Enhance. Edge enhance enhances the edges of objects in an image. For more edge enhance use more edge enhance. Say edge to activate command over voice, or select edge enhance. Emboss. Creates an emboss effect to the given input image. Say emboss to activate command over voice, or select emboss. www.ijisrt.com 1856

Volume 5, Issue 5, May – 2020 International Journal of Innovative Science and Research Technology ISSN No:-2456-2165 Smooth. Just like blurring, this function smoothens the objects in the image. For more smooth use more Smooth. Say smooth to activate command over voice, or select smooth. Resize. This function reduce the image size, resolution thus reducing the memory space occupied by the image XII. while saving. Say resize to activate command over voice, or select resize in save. Save. Save function saves the image in a specific name and extension format as the user specify in a user space. Say save to activate command over voice. BLOCK REPRESENTATION Fig 7:- Block Diagram XIII. Output RESULT We designed our evaluation to solve few tasks. How does our proposed multimodal interface compare with a traditional image editing interface? Success rate for both interfaces were identical. Even though the multimodal interface slightly shows more attraction. Figures fig 8 is a test image. Following figures (Fig 9–23) are the output of individual operations on image. Fig 9:- Rotated Image Fig 8:- Test Image Fig 10:- brightened Image IJISRT20MAY903 www.ijisrt.com 1857

Volume 5, Issue 5, May – 2020 International Journal of Innovative Science and Research Technology ISSN No:-2456-2165 Fig 11:- contrast varied Image Fig 17:- sharpenned Image Fig 12:- saturated Image Fig 18 : detailed Image Fig 13:- flip vertical Fig 19 : cropped Image Fig 14:- flip horizontal Fig 20 : blured Image Fig 15:- Adding text Fig 21 : edge enhanced Image Fig 16:- Black and white IJISRT20MAY903 www.ijisrt.com 1858

Volume 5, Issue 5, May – 2020 International Journal of Innovative Science and Research Technology ISSN No:-2456-2165 REFERENCES Fig 22:- embossed Image Fig 23:- smoothed Image XIV. CONCLUSION Here we introduced, ELEMENTS, A multimodal interface system to enhance image editing tasks through voice and conventional direct manipulation. Other than editing functionalities "Elements" is enabled with browsing of an image as well as saving an image after editing. We can browse our file manager or even internet by using appropriate voice commands. After editing procedure is complete user can save image using the "save" command and we can specify appropriate location as well as name in which image is to be saved. Thereby implementing each functionalities with voice. Coming to the editing functionalities we have implemented all the features that are essential for an editing tool. Features include brightness, Contrast, crop, rotate, a total of 9 filters etc., and all these using voice commands. "Elements" have an add on functionality of image compression. Image that we select for editing maybe of larger size and we can compress them after according to our requirement, compression ratio is on a scale of 0-100. The key feature that makes "Elements" unique from other editing tools is that it is voice enabled, as it is voice controlled it can be used by the "differently abled people". With the board facilities in later versions makes it more advanced. Voice commands are less complex than shortcuts and is has a user-friendly UI which all makes it easy to use. So now editing is no more a complex task just tell what to do and it’s done. IJISRT20MAY903 [1]. Research on Speech Recognition Technology and Its Application, Youhao Yo international conference on computer science and Electronics engineering. 2012 [2]. Speech Recognition System : A Review Sandheep Sharma, Nithin Washani , International Journal of computer Applications. April 2015 [3]. Pillow 7.1.2, https://pypi.org/project/Pillow/ , 2020 [4]. NLTK 3.5 documentation, https://www.nltk.org/ , 2020 [5]. Python Working with the Image Data Type in pillow, geeks for geeks he-image-data-type-in-pillow/ , 2020 [6]. Natural Language ProcessingPython, tutorials point https://www.tutorialspoint.com/natural language pro cessing/natural language processing python.htm, 2020 [7]. PythonGUI Programming (Tkinter), tutorials point https://www.tutorialspoint.com/python/python gui pr ogramming.htm, 2020 [8]. tk-tools 0.12.0, https://pypi.org/project/tk-tools/, 2020 [9]. pocketsphinx, https://github.com/cmusphinx/pocketsphinx , 2020 [10]. SpeechRecognition3.8.1, https://pypi.org/project/Spe echRecognition/ ,2020 www.ijisrt.com 1859

overhead for at least a few. Here, we introduce an image editing interface that comprises of vocal command recognizer, image editing is difficult to perform with voice alone. For flexible and easy editing-control we use both voice and manual editing interaction, using mouse and keyboard. Selecting an object or a layer within the

Related Documents:

Bruksanvisning för bilstereo Bruksanvisning for bilstereo ... - Jula

Bruksanvisning för bilstereo . Bruksanvisning for bilstereo . Instrukcja obsługi samochodowego odtwarzacza stereo . Operating Instructions for Car Stereo . 610-104 . SV . Bruksanvisning i original

373 Views

1y ago

10 tips och tricks för att lyckas med ert sap-projekt

10 tips och tricks för att lyckas med ert sap-projekt 20 SAPSANYTT 2/2015 De flesta projektledare känner säkert till Cobb’s paradox. Martin Cobb verkade som CIO för sekretariatet för Treasury Board of Canada 1995 då han ställde frågan

734 Views

2y ago

Nordens 25 största medieföretag efter omsättning

service i Norge och Finland drivs inom ramen för ett enskilt företag (NRK. 1 och Yleisradio), fin ns det i Sverige tre: Ett för tv (Sveriges Television , SVT ), ett för radio (Sveriges Radio , SR ) och ett för utbildnings program (Sveriges Utbildningsradio, UR, vilket till följd av sin begränsade storlek inte återfinns bland de 25 största

330 Views

1y ago

SS 02 52 68 Ljudklassning av utrymmen i byggnader - byggtjanst.se

Hotell För hotell anges de tre klasserna A/B, C och D. Det betyder att den "normala" standarden C är acceptabel men att motiven för en högre standard är starka. Ljudklass C motsvarar de tidigare normkraven för hotell, ljudklass A/B motsvarar kraven för moderna hotell med hög standard och ljudklass D kan användas vid

356 Views

1y ago

Apple Developer Program License Agreement (Swedish)

LÄS NOGGRANT FÖLJANDE VILLKOR FÖR APPLE DEVELOPER PROGRAM LICENCE . Apple Developer Program License Agreement Syfte Du vill använda Apple-mjukvara (enligt definitionen nedan) för att utveckla en eller flera Applikationer (enligt definitionen nedan) för Apple-märkta produkter. . Applikationer som utvecklas för iOS-produkter, Apple .

343 Views

1y ago

How Video Editing Works

from: howstuffworks.com Inside This Article 1. Introduction to How Video Editing Works 2. Digital Camcorders 3. Video-Editing Computers 4. Video Editing: Basic Concepts 5. Running Adobe Premiere 6. Editing a Video: Capture and Clips 7. Editing a Video: Timeline and Transit

34 Views

2y ago

Practice Workbook Answers

L2: x 0, image of L3: y 2, image of L4: y 3, image of L5: y x, image of L6: y x 1 b. image of L1: x 0, image of L2: x 0, image of L3: (0, 2), image of L4: (0, 3), image of L5: x 0, image of L6: x 0 c. image of L1– 6: y x 4. a. Q1 3, 1R b. ( 10, 0) c. (8, 6) 5. a x y b] a 21 50 ba x b a 2 1 b 4 2 O 46 2 4 2 2 4 y x A 1X2 A 1X1 A 1X 3 X1 X2 X3

65 Views

2y ago

B. Sc. BOTANY - gscgadchiroli.ac.in

23. Sharma, P. D. [1991] : The Fungi (Rastogi & Co. Meerut) 24. Vasishta, B. R. [1990] : Fungi (S. Chand & Co. New Delhi) 25. Sharma, O. P. : Fungi (TMH)

115 Views

3y ago

Recent Views

MERRILL ALABAMA CAPITOL SECRETARY OF STATE

Aug 24, 2018 · State House 38 Brian McGee state House 40 Pamela Jean Howard State House 41 Emily Anne Marcum State House 43 Carin Mayo State House 45 Jenn Gray state House 46 Felicia Stewart State House 4 7 1Jim Toomey State House 48 IAlli Summerford State House 51 Veronica R. Johnson State House 52 John W. Rogers, Jr. State House 53 Anthony Daniels

2y ago

375 Views

Personal insurance - Car & Business insurance King Price Insurance

The king's insurance options 5 Things you need to know 7 The stuff you need to do 14 How to claim 16 Our commitment to you 20 Car insurance 22 Car warranty 37 Shortfall cover 45 Scratch and dent 46 Tyre and rim 48 Motorbike insurance 53 Trailer and caravan insurance 64 Watercraft insurance 68 Home contents insurance 77 Buildings insurance 89

1y ago

673 Views

Gold Tier - MAPFRE Insurance

Foy Insurance of MA, LLC 198 Frank Consolati Insurance Agency, Inc. 198 County Insurance Agency, Inc. 198 Woodrow W Cross Agency 214 Woodland Insurance Agency, Inc. 214 Tegeler Insurance Services of CT, Inc. 214 Pantano/VonKahle Insurance Agency, Inc. 214 . Hanson Insurance Agency, Inc. 287 J.H. Slattery Insurance Agency, Inc. 287

1y ago

565 Views

Consumer Guide to Auto Insurance - csimt.gov

consumer guide to auto insurance contents introduction to auto insurance 1 understanding your auto insurance policy 2 required auto insurance 3 optional types of auto insurance 4-5 getting the right coverage 6 accidents and violations 7 how to shop for auto insurance 8 shopping tips 9 frequently asked questions 10-11 insurance complaints/when you have a problem 12

2y ago

805 Views

Industry Observations Insurance Industry

Jun 30, 2019 · 6/17/2019 Commercial Insurance Branch of Extraco Banks, N.A. Higginbotham Insurance Group, Inc. Insurance Brokers NA 6/13/2019 Links Insurance Services, LLC World Insurance Associates LLC Property and Casualty Insurance NA 6/13/2019 Abram Interstate Insurance Services, Inc. Risk Placement Services,

2y ago

619 Views

Life Insurance Buyer's Guide Life Insurance - National Association of .

Life Insurance uers uide Naional ssociaion of Insurance Commissioners Compare the Different Types of Insurance Policies There are many types of life insurance pol-icies. You should choose a policy with fea-tures that fit your individual needs. Some things to consider are: Term Insurance vs. Cash Value In-surance. Term insurance is intended to

1y ago

520 Views

your guide to understanding auto ins in nh - New Hampshire

Hampshire Insurance Department does not mandate or set Auto Insurance Rates. Auto Insurance Rates will vary by insurance company. This guide is intended to give New Hampshire consumers basic information on auto insurance. It suggests ways to: Lower the cost of your auto insurance, shop for Auto insurance and, file an auto insurance claim.

1y ago

449 Views

18.01.41 - REPLACEMENT OF LIFE INSURANCE AND ANNUITIES - Idaho

Department of Insurance Replacement of Life Insurance and Annuities. Page 3. 04. Existing Life Insurance or Annuity. "Existing Life Insurance or Annuity" means any life insurance or annuity in force, including life insurance under a binding or conditional receipt or a lif e insurance policy or annuity that is within an unconditional refund period.

1y ago

407 Views

EXAMINATION REPORT OF THE ADMIRAL INSURANCE COMPANY AS OF . - Delaware

Berkley Regional Specialty Insurance Comp 31295 DE Carolina Casualty Insurance Company 10510 IA Clermont Insurance Company 33480 IA Continental Western Insurance Company 10804 IA Firemen's Insurance Com pany of Wash, D.C. 21784 DE Gemini Insurance Company 10833 DE Great Divide Insurance Company 25224 ND

1y ago

258 Views

American International Group, Inc. - Federal Reserve

American General Life Insurance Company AGL U.S. Life Insurance Company AGC Life Insurance Company AGC Life U.S. Life Insurance Company The United States Life Insurance Company in the City of New York U.S. Life U.S. Life Insurance Company The Variable Annuity Life Insurance Company VALIC U.S. Life Insurance Company

1y ago

269 Views

Japan's Insurance Market - Toa Re

with 61.6% of net premiums written, of which automobile insurance totaled 48.8% and compulsory automobile liability insurance totaled 12.8%. Fire insurance accounted for 13.7%, miscellaneous casualty insurance including liability insurance accounted for 11.6%, accident insurance accounted for 9.8%, and marine insurance accounted for 3.2%.

1y ago

179 Views

List of Insurance Companies by Insurance Manager - Cayman Islands dollar

2447 Batan Insurance Company SPC, Ltd. 29-Sep-03 1307714 BBG Insurance Services, Ltd. 09-Aug-16 1254 BCHS Insurance, Ltd. 07-Oct-98 1168 Bearacuda Re 01-Aug-97 2639 Bedrock Insurance Limited 24-Nov-05 2150 Bom Ambiente Insurance Company 14-Jun-00 2565 Boundless Insurance Company, Ltd. 01-Dec-04 769 Bucap Limited 03-Mar-89

1y ago

293 Views

Insurance Certificate 713705-3 and Assistance Program

Name of insurance product: Purchase Protection and Travel Insurance for National Bank of Canada Mastercard credit cards, group insurance policy no. 713705 (Schedule A Certificate number 3)/713705-3 Type of insurance product: Purchase insurance and extended warranty and travel insurance (group insurance) Assistance provider contact information

3m ago

54 Views

Policy - Kiwibank

House Insurance is provided by The Hollard Insurance Company Pty Ltd. The Hollard Insurance Company Pty Ltd is the only organisation responsible for claims under this cover. Administration of House Insurance and claims handling services are managed by Ando Insurance Group Limited on behalf of The Hollard Insurance Company Pty Ltd.

1y ago

133 Views

House insurance - Tower

insurance in New Zealand. We've included limits and exclusions to your house cover throughout this policy wording and on your certificate of insurance. What your house policy does and does not cover What we cover We cover your house, meaning the domestic buildings you own at the situation shown on your certificate of insurance including its: 1.

1y ago

145 Views

Elements: Software For Image Editing Over Voice Commands - IJISRT

It looks like you're using an ad-blocker