Share Email Print
cover

Proceedings Paper

Reinforcement learning and design of nonparametric sequential decision networks
Format Member Price Non-Member Price
PDF $14.40 $18.00
cover GOOD NEWS! Your organization subscribes to the SPIE Digital Library. You may be able to download this paper for free. Check Access

Paper Abstract

In this paper we discuss the design of sequential detection networks for nonparametric sequential analysis. We present a general probabilistic model for sequential detection problems where the sample size as well as the statistics of the sample can be varied. A general sequential detection network handles three decisions. First, the network decides whether to continue sampling or stop and make a final decision. Second, in the case of continued sampling the network chooses the source for the next sample. Third, once the sampling is concluded the network makes the final classification decision. We present a Q-learning method to train sequential detection networks through reinforcement learning and cross-entropy minimization on labeled data. As a special case we obtain networks that approximate the optimal parametric sequential probability ratio test. The performance of the proposed detection networks is compared to optimal tests using simulations.

Paper Details

Date Published: 11 March 2002
PDF: 8 pages
Proc. SPIE 4739, Applications and Science of Computational Intelligence V, (11 March 2002); doi: 10.1117/12.458718
Show Author Affiliations
Emre Ertin, Battelle Memorial Institute (United States)
Kevin L. Priddy, Battelle Memorial Institute (United States)


Published in SPIE Proceedings Vol. 4739:
Applications and Science of Computational Intelligence V
Kevin L. Priddy; Paul E. Keller; Peter J. Angeline, Editor(s)

© SPIE. Terms of Use
Back to Top