
Proceedings Paper

Improving face image extraction by using deep learning technique
Author(s): Zhiyun Xue; Sameer Antani; L. Rodney Long; Dina Demner-Fushman; George R. Thoma

Paper Abstract

The National Library of Medicine (NLM) has made a collection of over 1.2 million research articles containing 3.2 million figure images searchable using the Open-i℠ multimodal (text+image) search engine. Many images are visible light photographs, some of which contain faces (“face images”). Some of these face images are acquired in unconstrained settings, while others are studio photos. To extract the face regions in the images, we first applied one of the most widely used face detectors, a pre-trained Viola-Jones detector implemented in Matlab and OpenCV. The Viola-Jones detector was trained for unconstrained face image detection, but its results on the NLM database included many false positives, which resulted in very low precision. To improve this performance, we applied a deep learning technique that reduced the number of false positives and, as a result, improved the detection precision significantly. (For example, on a test set, the accuracy of classifying whether the face regions output by the Viola-Jones detector are true positives is about 96%.) By combining these two techniques (Viola-Jones and deep learning), we were able to increase the system precision considerably while avoiding the need to construct a large training set by manual delineation of the face regions.
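
The two-stage idea in the abstract (a Viola-Jones detector proposes face regions, then a learned classifier rejects false positives) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not describe the network, so the verifier is represented by a hypothetical `face_score` callable, and the cascade file, `scaleFactor`, `minNeighbors`, and the 0.5 threshold are assumed values chosen only for illustration.

```python
from typing import Callable, Collection, List, Sequence, Tuple

Box = Tuple[int, int, int, int]  # (x, y, width, height)


def detect_candidates(image_path: str) -> List[Box]:
    """Stage 1: propose face regions with OpenCV's pre-trained Haar cascade."""
    import cv2  # imported lazily so the rest of the sketch runs without OpenCV

    # Assumption: the frontal-face cascade bundled with the opencv-python package.
    cascade = cv2.CascadeClassifier(
        cv2.data.haarcascades + "haarcascade_frontalface_default.xml"
    )
    image = cv2.imread(image_path)
    if image is None:
        raise FileNotFoundError(image_path)
    gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
    # Illustrative parameter values, not taken from the paper.
    boxes = cascade.detectMultiScale(gray, scaleFactor=1.1, minNeighbors=5)
    return [tuple(int(v) for v in box) for box in boxes]


def filter_candidates(
    boxes: Sequence[Box],
    face_score: Callable[[Box], float],
    threshold: float = 0.5,
) -> List[Box]:
    """Stage 2: keep only the candidates that the verifier scores as faces.

    `face_score` is a hypothetical stand-in for the deep learning classifier;
    it should return a value in [0, 1] for each candidate region.
    """
    return [box for box in boxes if face_score(box) >= threshold]


def precision(kept: Sequence[Box], true_faces: Collection[Box]) -> float:
    """Fraction of kept detections that are true faces (0.0 if none are kept)."""
    if not kept:
        return 0.0
    hits = sum(1 for box in kept if box in true_faces)
    return hits / len(kept)
```

Because the second stage only removes detections, it can raise precision but cannot recover faces that the first stage missed. Comparing `precision` on the raw candidates with `precision` on the filtered list against a small labeled set is one way to measure the effect.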

Paper Details

Date Published: 25 March 2016
PDF: 11 pages
Proc. SPIE 9789, Medical Imaging 2016: PACS and Imaging Informatics: Next Generation and Innovations, 97890J (25 March 2016); doi: 10.1117/12.2216278
Author Affiliations
Zhiyun Xue, National Library of Medicine (United States)
Sameer Antani, National Library of Medicine (United States)
L. Rodney Long, National Library of Medicine (United States)
Dina Demner-Fushman, National Library of Medicine (United States)
George R. Thoma, National Library of Medicine (United States)

Published in SPIE Proceedings Vol. 9789:
Medical Imaging 2016: PACS and Imaging Informatics: Next Generation and Innovations
Jianguo Zhang; Tessa S. Cook, Editor(s)

© SPIE.