Share Email Print

Proceedings Paper

Solving computer vision tasks with diffractive neural networks
Author(s): Tao Yan; Jiamin Wu; Tiankuang Zhou; Hao Xie; Feng Xu; Jingtao Fan; Lu Fang; Xing Lin; Qionghai Dai
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

Modern computer vision tasks are achieved by first capturing and storing large-scale images and then performing the processing electronically, the paradigm of which has the fundamentally limited speed and power efficiency with the continuous increase of the data throughput and computational complexity. We propose to build the all-optical artificial intelligent for light-speed computing, which performs advanced computer vision tasks during the imaging so that the detector can directly measure the computed results. The proposed method uses light diffraction property to build the optical neural network, where the neuron function is achieved by tuning the optical diffraction with a nonlinear threshold. Since every target scene has different frequency components, the proposed diffractive neural network is trained to perform various filtering on different frequency components and achieves different transform functions for the target scenes. We demonstrate the proposed approach can be used for high-speed detecting and segmenting visual saliency objects of the microscopic samples and macroscopic scenes as well as performing the task of object classification. The low power consumption, light-speed processing, and high-throughput capability of the proposed approach can serve as significant support for high-performance computing and will find applications in self-driving automobile, video monitoring, and intelligent microscopy, etc.

Paper Details

Date Published: 18 November 2019
PDF: 8 pages
Proc. SPIE 11187, Optoelectronic Imaging and Multimedia Technology VI, 111870T (18 November 2019); doi: 10.1117/12.2545609
Show Author Affiliations
Tao Yan, Tsinghua Univ. (China)
Jiamin Wu, Tsinghua Univ. (China)
Tiankuang Zhou, Tsinghua Univ. (China)
Hao Xie, Tsinghua Univ. (China)
Feng Xu, Tsinghua Univ. (China)
Jingtao Fan, Tsinghua Univ. (China)
Lu Fang, Tsinghua Univ. (China)
Xing Lin, Tsinghua Univ. (China)
Qionghai Dai, Tsinghua Univ. (China)

Published in SPIE Proceedings Vol. 11187:
Optoelectronic Imaging and Multimedia Technology VI
Qionghai Dai; Tsutomu Shimura; Zhenrong Zheng, Editor(s)

© SPIE. Terms of Use
Back to Top
Sign in to read the full article
Create a free SPIE account to get access to
premium articles and original research
Forgot your username?