Share Email Print
cover

Proceedings Paper

Managing training data from untrusted partners using self-generating policies
Author(s): Dinesh Verma; Seraphin Calo; Shonda Witherspoon; Irene Manotas; Elisa Bertino; Amani Abu Jabal; Geeth de Mel ; Ananthram Swami; Greg Cirincione; Gavin Pearson
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

When training data for machine learning is obtained from many different sources, not all of which may be trusted, it is difficult to determine which training data to accept and which to reject. A policy-based approach for data curation, where the policies are generated after examining the properties of the offered data, can provide a way to only accept selected data for creating a machine learning model. In this paper, we discuss the challenges associated with generating policies that can manage training data from different sources. An efficient policy generation scheme needs to determine the order in which information is received, must have an approach to determine the trustworthiness of each partner, must have an approach to decide how to quickly assess which data subset can add value to a complex model, and must address several other issues. After providing an overview of the challenges, we propose approaches to solve them and study the properties of those approaches.

Paper Details

Date Published: 10 May 2019
PDF: 15 pages
Proc. SPIE 11006, Artificial Intelligence and Machine Learning for Multi-Domain Operations Applications, 110060P (10 May 2019); doi: 10.1117/12.2519682
Show Author Affiliations
Dinesh Verma, IBM Thomas J. Watson Research Ctr. (United States)
Seraphin Calo, IBM Thomas J. Watson Research Ctr. (United States)
Shonda Witherspoon, IBM Thomas J. Watson Research Ctr. (United States)
Irene Manotas, IBM Thomas J. Watson Research Ctr. (United States)
Elisa Bertino, Purdue Univ. (United States)
Amani Abu Jabal, Purdue Univ. (United States)
Geeth de Mel , IBM United Kingdom Ltd. (United Kingdom)
Ananthram Swami, U.S. Army Research Lab. (United States)
Greg Cirincione, U.S. Army Research Lab. (United States)
Gavin Pearson, United Kingdom Ministry of Defence (United Kingdom)


Published in SPIE Proceedings Vol. 11006:
Artificial Intelligence and Machine Learning for Multi-Domain Operations Applications
Tien Pham, Editor(s)

© SPIE. Terms of Use
Back to Top
PREMIUM CONTENT
Sign in to read the full article
Create a free SPIE account to get access to
premium articles and original research
Forgot your username?
close_icon_gray