Proceedings Paper

Computational modeling of trust factors using reinforcement learning

Paper Abstract

As machine-learning algorithms continue to expand their scope and approach more ambiguous goals, they may be required to make decisions based on data that is often incomplete, imprecise, and uncertain. The capabilities of these models must, in turn, evolve to meet the increasingly complex challenges associated with the deployment and integration of intelligent systems into modern society. Historical variability in the performance of traditional machine-learning models in dynamic environments makes it unclear how far decisions made by such algorithms can be trusted. Consequently, the objective of this work is to develop a novel computational model that effectively quantifies the reliability of autonomous decision-making algorithms. The approach implements a neural-network-based reinforcement learning paradigm known as adaptive critic design to model an adaptive decision-making process that is regulated by a quantitative measure of the risk associated with each possible decision. Specifically, this work expands on the risk-directed exploration strategies of reinforcement learning to obtain quantitative risk factors for an automated object recognition process in the presence of imprecise data. Accordingly, this work addresses the challenge of automated risk quantification based on the confidence of the decision model and the nature of the given data. Additionally, further analysis of risk-directed policy development for improved object recognition is presented.

Paper Details

Date Published: 6 September 2019
PDF: 9 pages
Proc. SPIE 11136, Optics and Photonics for Information Processing XIII, 111360J (6 September 2019)
Author Affiliations
C. M. Kuzio, Old Dominion Univ. (United States)
A. Dinh, Old Dominion Univ. (United States)
C. Stone, Old Dominion Univ. (United States)
L. Vidyaratne, Old Dominion Univ. (United States)
K. M. Iftekharuddin, Old Dominion Univ. (United States)


Published in SPIE Proceedings Vol. 11136:
Optics and Photonics for Information Processing XIII
Khan M. Iftekharuddin; Abdul A. S. Awwal; Victor H. Diaz-Ramirez; Andrés Márquez, Editor(s)

© SPIE.