Share Email Print

Proceedings Paper

Translation lexicon acquisition from bilingual dictionaries
Author(s): David Scott Doermann; Huanfeng Ma; Burcu Karagol-Ayan; Douglas William Oard
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

Bilingual dictionaries hold great potential as a source of lexical resources for training automated systems for optical character recognition, machine translation and cross-language information retrieval. In this work we describe a system for extracting term lexicons from printed copies of bilingual dictionaries. We describe our approach to page and definition segmentation and entry parsing. We have used the approach to parse a number of dictionaries and demonstrate the results for retrieval using a French-English Dictionary to generate a translation lexicon and a corpus of English queries applied to French documents to evaluation cross-language IR.

Paper Details

Date Published: 18 December 2001
PDF: 12 pages
Proc. SPIE 4670, Document Recognition and Retrieval IX, (18 December 2001); doi: 10.1117/12.450737
Show Author Affiliations
David Scott Doermann, Univ. of Maryland/College Park (United States)
Huanfeng Ma, Univ. of Maryland/College Park (United States)
Burcu Karagol-Ayan, Univ. of Maryland/College Park (United States)
Douglas William Oard, Univ. of Maryland/College Park (United States)

Published in SPIE Proceedings Vol. 4670:
Document Recognition and Retrieval IX
Paul B. Kantor; Tapas Kanungo; Jiangying Zhou, Editor(s)

© SPIE. Terms of Use
Back to Top