Title | ||
---|---|---|
Model, Data and Reward Repair: Trusted Machine Learning for Markov Decision Processes |
Abstract | ||
---|---|---|
When machine learning (ML) models are used in safety-critical or mission-critical applications (e.g., self driving cars, cyber security, surgical robotics), it is important to ensure that they provide some high-level guarantees (e.g., safety, liveness). We introduce a paradigm called Trusted Machine Learning (TML) for making ML models more trustworthy. We use Markov Decision Processes (MDPs) as the underlying dynamical model and outline three TML approaches: (1) Model Repair, wherein we modify the learned model directly; (2) Data Repair, wherein we modify the data so that re-learning from the modified data results in a trusted model; and (3) Reward Repair, wherein we modify the reward function of the MDP to satisfy the specified logical constraint. We show how these repairs can be done efficiently for probabilistic models (e.g., MDP) when the desired properties are expressed in some appropriate fragment of logic such as temporal logic (for example PCTL, i.e., Probabilistic Computation Tree Logic), first order logic or propositional logic. We illustrate our approaches on case studies from multiple domains, e.g., car controller for obstacle avoidance, and a query routing controller in a wireless sensor network. |
Year | DOI | Venue |
---|---|---|
2018 | 10.1109/DSN-W.2018.00064 | 2018 48th Annual IEEE/IFIP International Conference on Dependable Systems and Networks Workshops (DSN-W) |
Keywords | Field | DocType |
trusted machine learning,model repair,data repair,reward repair | Data modeling,Computer science,Probabilistic CTL,Markov decision process,Propositional calculus,First-order logic,Artificial intelligence,Probabilistic logic,Temporal logic,Machine learning,Liveness | Conference |
ISSN | ISBN | Citations |
2325-6648 | 978-1-5386-6708-8 | 1 |
PageRank | References | Authors |
0.37 | 10 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Shalini Ghosh | 1 | 121 | 10.61 |
Susmit Jha | 2 | 460 | 33.61 |
Ashish Tiwari | 3 | 1630 | 106.62 |
Patrick Lincoln | 4 | 2535 | 241.90 |
Xiaojin Zhu | 5 | 3586 | 222.74 |