Machine Learning for risk ranking automation in IRSN level 2 PSA

(1)

HAL Id: hal-02870748

https://hal.archives-ouvertes.fr/hal-02870748

Submitted on 16 Jun 2020

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

Copyright

Machine Learning for risk ranking automation in IRSN level 2 PSA

Guillaume Kioseyian, Marine Marcilhac Fradin

To cite this version:

Guillaume Kioseyian, Marine Marcilhac Fradin. Machine Learning for risk ranking automation in

IRSN level 2 PSA. International Conference on Probabilistic Safety Assessment and Management,

PSAM, Jun 2020, VENISE, Italy. �hal-02870748�

(2)

Machine Learning for Risk Ranking Automation in IRSN Level 2 PSA

Guillaume Kioseyian

Severe Accident Department, IRSN, France. E-mail: guillaume.kioseyian@irsn.fr Marine Marcilhac-Fradin

Severe Accident Department, IRSN, France. E-mail: marine.marcilhacfradin@irsn.fr

Level 2 PSA producing more and more output data, IRSN has been developing computing tools since 2017 to perform effective post-treatments and to provide in-depth analysis.

These tools have allowed the automation of numerous steps in the post-treatment of release categories generated by the accident progression event tree. The L2 PSA post-treatment is a crucial part to determine and analyse the risk ranking, and consequently to identify critical severe accident scenarios. The analysis of these specific accidental sequences points out the safety improvements to bring about, such as addition or modification of procedures or safety devices.

The toolbox developed has been recently completed with a Machine Learning algorithm-based-tool, using Regression Trees method. Thus, for future L2 PSA, a fully automated post-treatment of release categories is available, and the Farmer diagram can be automatically generated for each risk metric.

The results obtained with this new method are very satisfactory since the risk ranking automatically obtained is similar to that obtained manually. Moreover, the calculation time for this automatic grouping is about fifteen minutes whereas it is about eight days when done manually.

This application paves the way for other automations or process improvements of L2 PSA with the use of Machine Learning approaches.

Keywords: Machine Learning, L2 PSA, risk ranking, severe accident, radiological consequences.

1. Introduction

Level 1 Probabilistic Safety Assessment (L1 PSA) aims at identifying accidental sequences leading to core damage and at quantifying their frequency, whereas Level 2 PSA (L2 PSA) objective is to assess then the frequency and amplitude of radionuclides releases to the environment resulting from the progression of such accidents, so-called severe accidents.

For several years, IRSN, the technical and scientific support organization of the French Nuclear Safety Authority (ASN), has been developing L2 PSA for the Nuclear Power Plants (NPP) operated in France, with significant efforts to integrate a realistic modelling of the severe accident progression. L2 PSA is notably used at IRSN for safety review activities. It is carried out with a set of software tools, all designed by IRSN engineers and researchers, Guigueno et al. (2016). Thanks to these tools, each L1 PSA accidental sequence is extended by L2 PSA severe accident scenarios, with an assessment of radioactive releases and radiological consequences.

The main outcome of L2 PSA is a risk

“frequency versus radiological conséquences”

ranking of potential accidental sequences in a facility. Such a graph is called a Farmer diagram.

This step constitutes a key step in L2 PSA because the risk ranking obtained is decisive for identifying the sequences that deserve operator to modify or add safety devices or procedures.

Updating this risk ranking should highlight the gain obtained. Moreover, this risk ranking is used to communicate clearly and concisely towards non-L2 PSA specialists.

During the last years, the time required to complete this step has increased significantly.

Firstly, the number of severe accident scenarios whose radioactive releases and radiological consequences are assessed is today much more important. For instance, the last internal initiating events L2 PSA for the French 900 MWe Pressurised Water Reactors (PWR) has generated about 500,000 severe accident scenarios grouped in about 50,000 release categories (RC) in which the amplitude and kinetics of releases are supposed to be of the same order of magnitude. Secondly, the set of radiological risk metrics has been extended, Kioseyian et al. (2018). Today, four Farmer diagrams are available at the end of the L2 PSA:

- two metrics to assess the short-term radiological consequences (the effective dose and the thyroid equivalent dose);

Proceedings of the 30th European Safety and Reliability Conference and the 15th Probabilistic Safety Assessment and Management Conference.

Edited by Piero Baraldi, Francesco Di Maio and Enrico Zio

Copyright © 2020 by ESREL2020 PSAM 15 Organizers. Published by Research Publishing, Singapore

ISBN: 981-973-0000-00-0 :: doi: 10.3850/981-973-0000-00-0 esrel2020psam15-paper

(3)

2 Guillaume Kioseyian and Marine Marcilhac-Fradin - one metric to evaluate the long-term

radiological consequences (the contamination of soils with caesium-137);

one metric to facilitate the understanding of the results towards non-L2 PSA specialists (the International Nuclear and radiological Event Scale, INES).

In addition, it is planned to complete the current long-term risk metric in the coming years, with the French new indicators and guidance values for defining post-accident zoning, Kioseyian et al. (2018).

Due to the large amount of L2 PSA output data, developments have been made to automate as much as possible the realization of these four risk rankings. Nevertheless, at the time of those developments, one step could not be automated and was therefore performed “manually” by IRSN L2 PSA engineers. This is a time- consuming task. For that reason, important developments have recently been carried out to implement an automated process by means of Machine Learning (ML) algorithms.

This paper provides an overview of these developments. Then, the method used to automate the remaining step is described.

2. Farmer diagram building steps

At the end of the Accident Progression Event Tree (APET) quantification, the available results are the frequency of each release category and the radiological consequences they induced. At this step, the different Farmer diagrams mentioned in the introduction are available but not exploitable as they contain an excessive number of points. For instance, as explained above (cf. part 1), the L2 PSA for the French 900 MWe PWR has generated about 50,000 release categories.

Five steps are necessary in order to build usable Farmer diagrams with a reduced number of points. These points correspond to consistent groupings of RC. The proper realization of these steps is crucial, since they determine the Farmer diagram and therefore the final risk ranking acquired.

2.1 Step 1: containment failure modes for each release category evaluation

The first step consists in evaluating the different containment failure modes for each release category. This step is independent of the radiological risk metrics. Thus, whatever the number of risk metrics, this step has to be done only once.

This first post-processing step of the APET quantification has been automated thanks to software tools developed in R language. These tools consist in associating the release categories with the containment failures modes.

2.2 Step 2: evaluation of the dominating containment failure modes for each release category by using release pathways

For each release category, the software tools designed by IRSN provide the contribution of all possible release pathways of the PWR (containment, steam generator, filtered containment venting system, base-mat, etc.) to the radiological risk metrics values, which depend on the fission products groups. For instance, the current long-term risk metric, that is the contamination of soils with caesium-137, solely depends on caesium-137 whereas the thyroid equivalent dose, one of the short-term risk metrics, is essentially focused on iodine. To any release pathway is associated a retention coefficient for each radionuclide. Thus, the contribution of the release pathways is specific to each radiological risk metric.

For each release category, the second step consists in assessing the contribution to the given risk metric value of the containment failure modes by exploiting the release pathways.

Indeed, each containment failure mode is associated with one release pathway.

Containment failure modes with a contribution above a given threshold (set by the user) are then retained.

This step is repeated as many times as there are radiological risk metrics.

This second post-processing step of the APET quantification has been automated thanks to software tools developed in R language.

2.3 Step 3: grouping of the release categories by

dominating containment failure modes

The third step consists in grouping the release

categories by dominating containment failure

modes. As the latter are specific to the

radiological risk metrics, this step must be

repeated for each of them.

(4)

This post-processing step has been automated by means of tools in R language.

After this third step, the number of points on the Farmer diagram moves on from about 50,000 to about 100.

2.4 Step 4: splitting of groups with large risk metric value dispersion

For each group of each radiological risk metric obtained at the end of step 3, the dispersion of the metric value is analysed. If this dispersion is too large (over one decade), step 4 consists in splitting the group in order to reduce the variability, and make it acceptable in each sub- group.

At the time of the L2 PSA post-processing developments, the automation of this step could not be performed, and it was therefore done manually. This step is complex. Indeed, it requires assessing the influence of the available parameters to the risk metric value so as to retain the parameters that reduce the dispersion as efficiently as possible while having a physical meaning. To illustrate this last point, for the L2 PSA for the French 900 MWe PWR, many groups have been split according to the safety systems operating and their impact on the physical phenomena.

2.5 Step 5: keeping the groups with significant contribution to the global frequency or to the overall risk

At the end of the step 4, for each risk metric, the number of points on the Farmer diagram moves on from about 100 to about 150. This number is too large to be displayed in the different Farmer diagrams. For this reason, step 5 retains the groups with significant contribution to the global frequency or to the overall risk (product of the frequency by the risk metric value). This step is repeated for each radiological risk metric.

This fifth post-processing step of the APET quantification has been automated thanks to software tools developed in R language.

3. Use of Machine Learning to automate step 4

For the last L2 PSA for the French 900 MWe PWR, this step required at least two days of work to be done manually for each radiological risk metric. As IRSN considers four risk metrics, eight days were necessary at least to execute step 4.

As this step is costly and repetitive, IRSN has recently developed new tools based on Machine

Learning algorithms in order to automate the step 4.

3.1 Machine Learning algorithm specifications First of all, an in-depth review was undertaken of the available Machine Learning algorithms. This preliminary study allows to select the algorithm that best suits the objectives and expectations.

Firstly, as a reminder, the fourth step aims at reducing the radiological risk metric variability in a group. To this end, the release categories groups with large metric dispersions are split.

Because a target value is considered (i.e. the risk metric value), ML “supervised learning”

algorithms were preferred to “unsupervised learning” algorithms. In addition, as the target value is a numeric data, supervised learning algorithms that can be applied to regression learning tasks, in opposition to classification learning tasks, must be kept.

Taking into account this first requirement, the selected ML groups of algorithms are:

Regression Trees, Model Trees, Artificial Neural Networks and Support Vector Machines, Lantz (2019).

Secondly, the goal is not to obtain a predictive model. Indeed, high-performance tools, based on simplified physical models, are already available, Guigueno et al. (2016). They can assess in a few seconds the radioactive releases and radiological consequences induced by each release category. The second prerequisite is to use a ML algorithm that produces an understandable descriptive model.

Such a model will allow the user to observe at a glance what are the parameters used by the machine to split efficiently a group of RC. This requirement is very important because, as explained above (cf. part 2.4), the split of a group has to be performed while keeping a physical meaning. Thus, it is helpful for the user to facilitate access to information created by the machine in order to check the split building. For these reasons, Artificial Neural Networks and Support Vector Machines are excluded.

3.2 Regression Trees and Model Trees

Regression Trees and Model Trees offer a great interpretability since they use tree-like graph, well-known by PSA specialists, to explain the relationship between features (input variables) and a numeric outcome.

Regarding a group of RC that needs to be

split, the partition strategy is identical for both

Regression Trees and Model Trees,

Lantz (2019). Beginning at the “root node”,

(5)

4 Guillaume Kioseyian and Marine Marcilhac-Fradin these ML algorithms consist in separating the RC. The split is done according to the feature improving the best the homogeneity that results from the split outcome. This homogeneity is measured by statistics such as variance, which gives a measure of how the data is distributed from the mean. The criterion for stopping the split is met when the overall statistical improvement of the tree, induced by the addition of a new branch, is less than a threshold (set by the user in advance). The trees end with

“terminal nodes”, also known as “leaf nodes”

(they are simply called “leaf’ in the following paper).

The main difference between Regression Trees and Model Trees is in the value returned by the leaf nodes. In Regression Trees, this value corresponds to the average value of the sample that reaches the leaf. By contrast, in Model Trees, the leaf nodes do not terminate in a numeric prediction, but in a linear model.

Indeed, in Model Trees, a regression model is built at each leaf from the sample that reaches the leaf. Because of this, Model Trees are considered as an improvement of Regression Trees.

Model Trees may result in a more predictive accurate model. However, as explained above (cf. part 3.1), one of the goals is not to obtain a predictive model but a descriptive model that highlights the physical parameters influencing the most the risk metric value and displays the resulting structure in a human-readable format.

To this end, the use of Regression Trees and Model Trees is similar.

Finally, Regression Trees have been chosen as the ML algorithm to automate step 4.

Applications of Regression Trees are wide.

For example, they are used in finance, Andriyashin (2005), or medicine, Pouliakis et al.

(2015).

3.3 Implémentation on the L2 PSA for the French 900 MWe PWR

The use of Regression Trees to automate step 4 has thus been implemented on the L2 PSA for the French 900 MWe PWR.

The R package called rpart, providing an implementation of Regression Trees, has been used.

It should be noted that the goal of the paper is to demonstrate the effectiveness of the fully automated post-treatment method of the release categories, and not to present the French 900 MWe PWR L2 PSA results that would have to be fully described with the assumption retained and contextualized. Thus, all the values

presented in the following paper are standardized.

3.3.1 First application on a targetedgroup of release categories

To verify if Regression Trees are relevant to automate step 4, one specific group of release categories has been targeted. As this group presents a too large radiological risk metric dispersion value, it requires to be split after step 3. This group results from the realization of the third step for the effective dose metric. It brings RC whose only dominating containment failure mode is a containment leak via penetrations (that is a failure of containment isolation). This group has been selected due to its great complexity to be split manually for the French 900 MWe PWR L2 PSA results presentation.

This targeted group is composed of 5,000 RC. Its standardized effective dose dispersion extends from 1 to 250. The variability is therefore not acceptable and the group has to be split.

At this point, a matrix of 5,000 lines is available. Each line corresponds to a release category. In addition, this matrix is composed of about 50 columns (parameters) that define a release category (containment break size, containment pressure, containment spray system operating, pH of the sump water, state of the venting system, etc.), plus an additional one giving the value of the effective dose. Parameters are categorical variables with several modalities.

As mentioned above (cf. part 2.4), the split of a group has to ensure a physical meaning. For this reason, only about 15 parameters out of 50 were selected by expert judgement. Furthermore, to prepare the data for the learning process, those selected categorical variables have been converted into binary variables (values are 0 or 1). For instance, as illustrated in Table 1, for one given categorical variable, called X, with four modalities, four binary variables have been created.

Table 1. Transformation of categorical variables ,X

4 3 1 2

X 1 X 2 X 3 X 4

0 0 0 1

0 0 10

10 0 0

0 10 0

Then, a regression tree model has been built

from this updated matrix. This model,

represented in Figure 1, ends with four leaf

(6)

nodes. In each of them, the average standardized effective dose value of the release categories that have reached the leaf is showed, and the proportion of the total sample is specified. For instance, 49 % of the RC of the targeted group are in the fourth leaf, and the average standardized effective dose value in this sample is equal to 83.

\ PH_SUMP_WATER_POSTVR_2

Fig. 1. First regression tree model obtained

Based on the regression tree model built, the pH of the sump water during the post-reactor pressure vessel (RPV) rupture phase is the parameter that provides the best split (among those previously selected by expert judgment).

When the pH is acidic (modality 1 of the PH_SUMP_WATER_POSTVR_2 binary variable), the machine considers that no other split is necessary. By contrast, when the pH is non-acidic (modality 0 of the PH_SUMP_WATER_POSTVR_2 binary variable), the machine makes a further split according to the containment spray system operating before the RPV rupture. When it is not started during this phase (modality 1 of the SPRAY_PREVR_4 binary variable), a distinction is made in accordance with the occurrence of a radioactive liquid leak outside the containment.

This first regression tree model is quite interesting and provides a split having a physical meaning. However the dispersion of effective doses within the leaf 1 (average standardized value equal to 6) remains too important, since the values do not fit on one decade. The standardized effective doses extend from 1 to 50 in this leaf. For this reason, the tree model obtained is not satisfactory.

As the criterion of homogeneity used by the machine for the splits is variance, which

measures the deviations from the average, the method leads to more splits at higher effective doses. To solve the problem encountered at low values while keeping the efficiency obtained at high values, effective doses have undergone a log-transformation.

The new regression tree model obtained after this operation is visible in Figure 2. Now, the best split at the root node informs if the containment spray system was started before the RPV rupture. More, this new tree model ends with six leaf nodes versus four for the previous one. At this point, it can be noted that some splits are not required. For instance, the split realized by the SPRAY_PREVR_1 variable is not necessary. Indeed, union of release categories gathered in leaf nodes number 2 and 3 would conduct to a set with an acceptable effective dose dispersion for the expert.

Fig. 2. Second regression tree model obtained

Finally, the splits carried out by Regression Trees on this targeted group provide sets in which the dispersion of the metric value is acceptable. Indeed, in each set, the standardized effective doses fit on one decade. More, the physical meaning of the model is ensured since each set can be defined by a state of the spray system and by the pH acidity of the sump water.

In addition, the sets are very close to those operated manually. Thus, results of the test are very promising.

3.3.2 Extension of the method on the global scale

of the L2 PSA for the French 900 MWe PWR

After the conclusive test executed in part 3.3.1,

the method has been spread out to all groups and

(7)

6 Guillaume Kioseyian and Marine Marcilhac-Fradin all radiological risk metrics successfully. Thus, step 4 is completely automated.

3.3.3 Risk rankings obtained when step 4 is done by ML algorithm compared to when step 4 is done manually

In 2017, in the framework of the French 900 MWe PWR L2 PSA results presentation, to ensure the overall consistency of the splits made manually, a graph showing the distribution of the core damage frequency as a fonction of the risk metric value was plotted. Two curves were presented in this graph: one resulting from all the 50,000 RC, the other resulting from groups of RC obtained at the end of step 4, which was done manually.

Now, a third curve can be added in this graph: one building from groups of RC obtained at the end of step 4 when performed by means of Machine Learning algorithm. Figure 3 provides the three graphs for the effective dose metric.

Fig. 3. Distribution of the core damage frequency as a function of the effective dose

Figure 3 shows that the distribution of the core damage frequency when step 4 is done by Machine Learning algorithm (red curve) sticks the curve building from all the release categories (black curve) as well as when step 4 is done manually (blue curve). Besides, it should be noted that the number of groups generated at the end of step 4 is greater when this step is done by ML algorithm. As said earlier (cf. part 3.3.1), some groups carried out by ML algorithm are useless. Some works might be done to fix it.

Besides, for each radiological risk metric, the risk ranking provided by the Farmer diagram can be compared according to the way of executing step 4 (by ML algorithm or manually). Figure 4 provides a superposition of the two Farmer diagrams obtained for the effective dose metric (risk “frequency versus effective dose metric”

ranking). The orange dots are those obtained when step 4 is done manually. The dots in blue are those obtained when step 4 is done by ML algorithm. For each point on the Farmer diagram, the product of the frequency by the risk metric value (effective dose in Figure 4) gives a contribution to the risk. The risk ranking for the considered metric is thereby built. Thus, the sequences that deserve operators to modify or add safety devices or procedures appear. These sequences are the most significant in frequency and / or the most significant in radiological consequences.

The orange frames give the first four groups of the risk ranking for the considered metric when step 4 is done manually. The dots in blue are numbered according to their position in the risk ranking when step 4 is done by ML algorithm.

Fig. 4. Superposition of the two Farmer diagrams obtained for the effective dose metric according to the manner of executing step 4

Figure 4 shows that the first two groups in the risk ranking are identical whatever the manner of executing step 4. Actually, this analysis is not very surprising as those groups do not require to be split during step 4. Indeed, the dispersion of the risk metric value within those groups is already acceptable at the end of step 3.

Thus, the two blue dots (R1 and R2) overlap

(8)

perfectly with the two orange dots. This observation makes it possible to ensure that the automation of step 4 is correctly coded anyway.

Then, it appears in Figure 4 that the third group in the risk ranking when step 4 is done manually is split into two different groups (R3 and R4) when step 4 is done by ML algorithm.

The third contributor when step 4 is done manually corresponds to severe accident with a failure of containment isolation (group presented in part 3.3.1) with no start-up of the containment spray system before the RPV rupture. As seen previously in Figure 2, when step 4 is carried out by means of ML algorithm, those situations have been split in two groups according to the pH of sump water after the vessel rupture. Figure 5 takes figure 2 to show how R3 and R4 are obtained.

Fig. 5. Definition of R3 and R4

Thus, this observation confirms the fact that some groups carried out by ML algorithm are useless from the expert point of view since the dispersion of risk metric values within the union of those groups would be acceptable, but ML algorithm splits them due to the improvement of variance (cf. part 3.2). R3 and R4 might be therefore gathered in the same group as when step 4 is done manually. Actually, the frequency of the third orange dot corresponds to the sum of the frequencies of R3 and R4, and its effective dose value is equal to that of R4 (the effective dose displayed for a group of RC in a Farmer diagram is the one of the RC with the greatest product of the frequency by the risk metric value). It can be noticed anyway that the automation of step 4 leads to observe, at the third position of risk ranking, the same dominating

containment failure mode with the same state of the containment spray system.

The previous analysis can be renewed when presenting the fourth group in the risk ranking when step 4 is manually executed. Indeed, ML algorithm splits this group in two (R8 and R9).

Actually, the frequency of the fourth orange dot corresponds to the sum of the frequencies of R8 and R9, and its effective dose value is equal to that of R8. However, due to this split, the frequency has been shared and, consequently, the group has been downgraded. It is the perverse effect of doing too much splits. As said earlier, some works might be done to fix it.

4. Conclusion

A Machine Learning algorithm-based-tool is now implemented in R language within the toolbox developed by IRSN. It automates the last remaining step, which consists in splitting the release categories groups with a high risk metric dispersion. Regression Trees method is used to perform this automation.

The groups carried out by the machine are consistent with the requirements. First, the dispersion of radiological consequences in each group is acceptable since it fits on one decade.

Second, the physical meaning of each group is ensured by the selection of variables of interest on the basis of expert judgement. Third, the construction of groups is accessible since Regression Trees resulting structure is a tree-like graph model.

Thus, a fully automated post-treatment of release categories generated by the accident progression event tree is available for future L2 PSA. Consequently, the Farmer diagram of each metric can be automatically generated.

The risk ranking provided when this step is automated with Machine Learning algorithm is similar to that obtained manually. Besides, the calculation time for this automatic grouping is about fifteen minutes whereas it is about eight days when done manually. The method can be further improved by automatically gathering together the groups that the machine has created in a not useful manner from the expert point of view.

Given the very promising look of the work

done, automation or improvement of others

processes of L2 PSA are currently considered

together with the use of Machine Learning

approaches. The latter could valorize the

physical analyses supporting the L2 PSA

development for others purposes including

emergency preparedness and response.

(9)

References

Andriyashin, A. (2005). Financial Applications of Classification and Regression Trees. Master thesis, CASE, Humboldt University, Berlin.

Guigueno, Y. et al (2016). Software tools and methodologies for level 2 PSA development available at IRSN. PSAM13.

Kioseyian, G. et al (2018). Feedback on the use of risk metrics for level 2

p

S

as

. PSAM 14.

Lantz, B. (2019). Machine Learning with R. Packt Publishing.

Pouliakis, A. et al (2015). The Application of Classification and Regression Trees for the Triage of Women for Referral to Colposcopy and the Estimation of Risk for Cervical Intraepithelial Neoplasia. Hindawi Publishing Corporation.

8 Guillaume Kioseyian and Marine Marcilhac-Fradin