• Aucun résultat trouvé

Expert in the loop: Human machine interaction to build safe AI based systems

N/A
N/A
Protected

Academic year: 2022

Partager "Expert in the loop: Human machine interaction to build safe AI based systems"

Copied!
32
0
0

Texte intégral

(1)

Expert in the loop: Human- machine interaction to build safe AI-based systems

Maria A. Zuluaga

(2)

Reliable AI: Healthcare as a use case

06/2011

IBM Watson wins at Jeopardy

1

12/2012

Watson to be trained for oncological patient treatment

2

02/2013

Lung cancer treatment assistant: Watson’s first commercial product

3

05/2015

Why Watson is taking so long?

4

06/2018

Layoffs at Watson Health are linked to

“problems with AI”

5

1 IEEE Spectrum -http://disq.us/t/dyp9wt; 2IEEE Spectrum -http://disq.us/t/gki3ri; 3Forbes -https://bit.ly/2TBIYJi

4IEEE Spectrum -http://disq.us/t/1q91ji9; 5IEEE Spectrum -http://disq.us/t/33p3080

(3)

The challenges

1. Data is inherently complex and expensive to collect

Molecular & -omics data Biomedical images Clinical records

(4)

The challenges

1. Data is inherently complex and expensive to collect

2. High-impact, high-risk decisions with little tolerance to errors

“…the rate of agreement was about 33 percent — so the hospital decided not to buy the system ”

STAT, Boston Globe Media - Sept 5, 2017

source: https://www.mskcc.org/locations/directory/memorial-hospital

source: https://bit.ly/2xOq7QK

(5)

The challenges

1. Data is inherently complex and expensive to collect

2. High-impact, high-risk decisions with little tolerance to errors

There is a need for robust and reliable methods that can be

safely deployed in practice

(6)

Expert in the loop

ML System Expert Actions

Status

Output

(7)

Expert in the loop

ML System Expert Actions

Status

Adaptation

Challenge 1:

Interactivity

Output

(8)

Expert in the loop

ML System Expert Actions

Status

Adaptation

Monitoring and awareness

Challenge 1:

Interactivity

Challenge 2:

Feedback

Output

(9)

INTERACTIVITY TO DEAL WITH DATA

Challenge 1:

(10)

Low quality training data

Human placenta: 3D reference and view from the 3 standard anatomical planes

adapted: [Wang et al. MICCAI 2015]

Challenges: Poor 3D image quality and varying locations

State of the art

Large number of interactions

Constrained interactions

Limited learning ability of the underpinning model

(11)

Low quality training data

Human placenta: 3D reference and view from the 3 standard anatomical planes

adapted: [Wang et al. MICCAI 2015]

Challenges: Poor 3D image quality and varying locations

State of the art

Large number of interactions

Constrained interactions

Limited learning ability of the underpinning model

Minimally interactive Flexible

Robust

Works in 2D & 3D

Human placenta:

yz plane

(12)

A Deep Interactive Framework for Image Segmentation

1. I

NTERACTION FOR DATA AUGMENTATION

2. I

NTERACTION TO DEFINE A PRIOR FOR SMOOTHER RESULTS

G. Wang, MA. Zuluaga, et al. IEEE TPAMI(41) 2019

(13)

A Deep Interactive Framework for Image Segmentation

G. Wang, MA. Zuluaga, et al. IEEE TPAMI(41) 2019

The original image I and the initial segmentation are concatenated with the geodesic distance maps and used as input of the refinement net (R-Net).

1. I

NTERACTION FOR DATA AUGMENTATION

(14)

A Deep Interactive Framework for Image Segmentation

G. Wang, MA. Zuluaga, et al. IEEE TPAMI(41) 2019

The original image I and the initial segmentation are concatenated with the geodesic distance maps and used as input of the refinement net (R-Net).

1. I

NTERACTION FOR DATA AUGMENTATION

(a)

(b) Image I (c) Segmentation (d) Foreground GM (e) Background GM (a) Image with scribbles

(15)

A Deep Interactive Framework for Image Segmentation

G. Wang, MA. Zuluaga, et al. IEEE TPAMI(41) 2019

Trainable conditional random field (CRF) using freeform functions as pair- wise potentials

2. I

NTERACTION TO DEFINE A PRIOR FOR SMOOTHER RESULTS

Scores from P- or R-Net

Mean-field approximation Q to infer the CRF 𝝍𝑷

(16)

A Deep Interactive Framework for Image Segmentation

G. Wang, MA. Zuluaga, et al. IEEE TPAMI(41) 2019

(17)

Currently: Deal with small objects

• Understand the effects of “bad” interactions

P. Mathur

(18)

FEEDBACK

Challenge 2:

(19)

Hidden Technical Debt of ML Systems

Sculley et al. NeurIPS 2015

(20)

Hidden Technical Debt of ML Systems

ML Code

Sculley et al. NeurIPS 2015

(21)

Hidden Technical Debt of ML Systems

Our idea:

ML Systems to monitor ML Systems

Sculley et al. NeurIPS 2015

(22)

Model monitoring: Probe principle

R. Candela, MA. Zuluaga, P. Michiardi, M. Filippone (under review)

(23)

Model monitoring: Probe principle

R. Candela, MA. Zuluaga, P. Michiardi, M. Filippone (under review)

Monitored System Monitoring System

Hypothesis: We can train a model to monitor the performance of other models over time

(24)

Model monitoring: Empirical study

Materials

• Data: 23.4K time series from the M4 benchmark

• Monitoring models: LSTM, CNN, Bayesian CNN, GP

• Monitored models: 10 different methods

R. Candela, MA. Zuluaga, P. Michiardi, M. Filippone (under review)

(25)

Model monitoring: Empirical study

Materials

• Data: 23.4K time series from the M4 benchmark

• Monitoring models: LSTM, CNN, Bayesian CNN, GP

• Monitored models: 10 different methods

Method

• Measure the capacity of the monitoring models to estimate the forecast prediction error of the monitored models

R. Candela, MA. Zuluaga, P. Michiardi, M. Filippone (under review)

𝑠𝑀𝐴𝑃𝐸 = 1 ℎ ෍

𝑡=1

2 |𝑦

𝑡

− ො 𝑦

𝑡

|

𝑦

𝑡

+ | ො 𝑦

𝑡

|

(26)

Results: Monitoring

R. Candela, MA. Zuluaga, P. Michiardi, M. Filippone (under review)

(27)

Results: Monitoring

• Further direction: Can we use it for model selection?

R. Candela, MA. Zuluaga, P. Michiardi, M. Filippone (under review)

(28)

Results: Model Selection

R. Candela, MA. Zuluaga, P. Michiardi, M. Filippone (under review)

(29)

Error monitoring vs meta-learning

R. Candela, MA. Zuluaga, P. Michiardi, M. Filippone (under review)

• Competitive results using simpler “base” learners

(30)

Summary

• Interactive ML is a promising way to achieve robustness and reliability by keeping experts involved and informed

• Demonstrations in two aspects:

– Efficient user input to deal with training data – System feedback to alert the expert

ML System Expert

Output

(31)

Acknowledgments

Prof. P. Michiardi Prof. M. Filippone R. Candela P. Mathur

EURECOM

University of Electronic Science and Technology of China

Dr. G. Wang

KCL

Prof. S. Ourselin Prof. T. Vercauteren

UCL

Prof. A. David, MD Dr. R. Pratt

Dr. T. Doel Dr. R. Rodionov

KU Leuven

Prof. J. Deprest, MD Dr. M. Aertsen

GOSH

Dr. P.A. Patel, MD

Amadeus

(32)

THANK YOU

Expert in the loop: Human-machine interaction to build safe AI-based systems

Références

Documents relatifs

Design error: The machine designer could intuitively imagine a change of the machine logic, for instance simplifying the behaviour of pair 2: nullifying the autopilot mode reset.

Energy based control for safe human-robot physical interaction 5 Post contact establishment : For S p , the safety criterion represents the maximum amount E p limit of potential

Figure 2 shows our general approach based on query rewriting techniques and summarised as follows: "given an initial query, provide sets of queries that can help data

KFs are a set frames learned automatically offline that consist: 2D features, encoded through Speeded Up Robust Features (SURF) descriptors; 3D information, captured by Fast

In addition, most of the process steps are computed automatically: behavioural mismatch, adaptor generation, adaptor evaluation, updates of the system in case of suppression. To

However, this research relies on a large dataset of labelled P&ID symbols[5], so applying this rich library of techniques to EWDs requires either a manually intensive amount

With this in mind, we propose a new manipulation plan- ning method that aims to increase the feasible space for on-line reactive control based on the idea of constrained

It is now commonly accepted that there is a continuum between perception and action, so that the perception of sound can be a motor process for a listener. […] there is