Share Email Print
cover

Proceedings Paper

Open source OCR framework using mobile devices
Author(s): Steven Zhiying Zhou; Syed Omer Gilani; Stefan Winkler
Format Member Price Non-Member Price
PDF $14.40 $18.00

Paper Abstract

Mobile phones have evolved from passive one-to-one communication device to powerful handheld computing device. Today most new mobile phones are capable of capturing images, recording video, and browsing internet and do much more. Exciting new social applications are emerging on mobile landscape, like, business card readers, sing detectors and translators. These applications help people quickly gather the information in digital format and interpret them without the need of carrying laptops or tablet PCs. However with all these advancements we find very few open source software available for mobile phones. For instance currently there are many open source OCR engines for desktop platform but, to our knowledge, none are available on mobile platform. Keeping this in perspective we propose a complete text detection and recognition system with speech synthesis ability, using existing desktop technology. In this work we developed a complete OCR framework with subsystems from open source desktop community. This includes a popular open source OCR engine named Tesseract for text detection & recognition and Flite speech synthesis module, for adding text-to-speech ability.

Paper Details

Date Published: 27 February 2008
PDF: 6 pages
Proc. SPIE 6821, Multimedia on Mobile Devices 2008, 682104 (27 February 2008); doi: 10.1117/12.766588
Show Author Affiliations
Steven Zhiying Zhou, National Univ. of Singapore (Singapore)
Syed Omer Gilani, National Univ. of Singapore (Singapore)
Stefan Winkler, National Univ. of Singapore (Singapore)


Published in SPIE Proceedings Vol. 6821:
Multimedia on Mobile Devices 2008
Reiner Creutzburg; Jarmo H. Takala, Editor(s)

© SPIE. Terms of Use
Back to Top