• Aucun résultat trouvé

Detecting Electricity Theft

N/A
N/A
Protected

Academic year: 2021

Partager "Detecting Electricity Theft"

Copied!
1
0
0

Texte intégral

(1)

Detecting Electricity Theft

Patrick GLAUNER

Interdisciplinary Centre for Security, Reliability and Trust, University of Luxembourg

patrick.glauner@uni.lu

Electrical power grids are the backbone of today’s society. Losses during generation and distribution cause major problems, including financial losses to electricity providers and a decrease of stability and reliability. They can be classified into technical losses and non-technical losses. Technical losses are naturally occurring and mainly include losses to power dissipation in electrical components, such as in generators, transformers and transmission lines due to internal electrical resistance. They are possible to detect and control given a knowledge of the quantities of loads.

Non-technical losses (NTL) faced by electricity providers include, but are not limited to, electricity theft by rewiring or manipulating meters. Other types include faulty meters and errors in meter readings and billing. There are different estimates of the financial losses caused by NTLs and they can range up to 40% of the total electricity distributed in countries such as Brazil, India, Malaysia or Lebanon. They are also of relevance in developed countries, for example estimates of NTLs in the US range from USD 1-6 billion.

Fig. 1: That is what electricity theft looksa.

In order to detect NTLs, inspections of customers are carried out, based on predictions whether there may be a NTL at a customer. The inspection results are then used in the learning of algorithms in order to improve predictions. However, carrying out inspections is expensive, as it requires physical presence of technicians. It is therefore important to make accurate predictions in order to reduce the number of false positives.

This project is in cooperation with CHOICE Technologies Holding Sàrl, which provides real data and domain expertise.

ahttp://extra.globo.com/incoming/13321838-a74-9d3/w448/Eletrotraficante-Rio-das-Pedras.jpg

Introduction

Detecting NTLs is challenging and includes the following factors:

Wide range of possible causes of NTLs, such as different fraudulent types of customers.

Imbalance of the data, meaning that there are significantly more regular customers than customers with NTLs.

Inspection labels may be false-negative, because technicians got bribed or threatened.

The inspection sample is biased and does not represent the population of all customers.

Varying levels of NTLs in different cities/countries.

Not only poor people steal, even the government does.

Challenges

NTL detection can be treated as a special case of fraud detection. It highlights two approaches as key methods to detect fraudulent behavior in credit card fraud, computer intrusion and telecommunications fraud:

1. Expert systems that represent domain knowledge in order to make decisions typically using hand-crafted rules.

2. Data mining or machine learning techniques that employ statistics to learn patterns from sample data in order to make decisions for future unseen data.

Both approaches have their justification and neither is generally better or worse than the other one in artificial intelligence.

Most methods in the literature use supervised learning, please find a comprehensive dis-cussion in our first paper.

It must be noted that most NTL detection methods are supervised. Anomaly detection - a superclass of NTL - is generally challenging to learn in a supervised manner.

Related work

The data used in this paper is from an electricity provider in Brazil. It consists of three parts: (i) 700K customer data, such as location, type, etc., (ii) 31M monthly consumption data from January 2011 to January 2015 such as consumption in kWh, date of meter reading and number of days between meter readings and (iii) 400K inspection data such as presence of fraud or irregularity, type of NTL and inspection notes.

Most inspections do not find NTLs, making the classes highly imbalanced. In order for the models to be applied to other regions or countries, they must be assessed on different NTL proportions. Each sample contains 100K inspection results.

10 -1 10 0 10 1 10 2

NTL proportion [%]

0.44 0.46 0.48 0.50 0.52 0.54 0.56 0.58

A

U

C

Change of AUC for different NTL proportions

Boolean Fuzzy Fuzzy SGD Fuzzy SGD 30% SVM SVM 60%

Fig. 2: Comparison of optimized classifiers tested on different NTL proportions.

NTL detection

We are planning to evaluate unsupervised methods, in particular deep learning, in order to detect NTL more accurately by finding hidden correlations in the data. Furthermore, we are planning to use other features in our models, such as the location, latent features and hand-crafted features that put consumption patterns in relation to similar customers and the past. We are also planning to investigate cost-based optimization in order to maximize the total electricity recovered through inspections. Also, we are planning to make our implementations faster and more scalable using Apache Spark.

Next steps

The initial Boolean and fuzzy models perform worse than random guessing and are there-fore not suitable for real data, as they trigger too many inspections while not many of them will lead to NTL detection. Optimized fuzzy and SVM models trained on 30% and 60% NTL proportion, respectively, result in significantly greater AUC scores. However, both perform very differently, as the optimized fuzzy system is more conservative in NTL prediction. In contrast, the optimized SVM is more optimistic, leading also to a higher FPR. In general, neither can be named better than the other one, as picking the appropriate model from these two is subject to business decisions.

Conclusions

P. Glauner, A. Boechat, L. Dolberg, R. State, F. Bettinger, Y. Rangoni and D. Duarte: Large-Scale Detection of Non-Technical Losses in Imbalanced Data Sets. arXiv:1602.08350. 2016. Submitted to The Seventh IEEE Conference on Innovative Smart Grid Technologies (ISGT 2016).

First paper

Patrick GLAUNER is a PhD student in machine learning at the University of Luxembourg in collaboration with CHOICE Technologies Holding, under the supervision of Dr. Radu STATE. He graduated as valedictorian in computer science from Karlsruhe University of Applied Sciences and obtained his MSc in machine learning from Imperial College London. He was a CERN Fellow, worked at SAP and is an alumnus of the German National Aca-demic Foundation (Studienstiftung des deutschen Volkes). His current interests include artificial intelligence, machine learning, deep learning, anomaly detection and big data.

Références

Documents relatifs

They take as an example the classical bridge problem: "The capacity (investment) decision is made on the basis of the consumer's marginal valuation of a unit of capacity under

Lorsque le parc, confronté à une plus faible demande, tend à être en surcapacité, une logique de court terme prévaut pour faire baisser le prix de l'électricité au niveau du

Pour rendre plus tangible cette difficulté à mobiliser les investisseurs, on reprendra ici un chiffre cité par l’UEMOA 5 : entre 1990 et 2006, la zone UEMOA n’a attiré que 0,4 %

- Вносът на електроенергия, въпреки че е най-скъп в сравнение с останалите възможни за изграждане нови генериращи източници (0.05 $/кВтч), през тази и следващата

● In the high case, nuclear electrical generating capacity is projected to increase by about 20% by 2030 and more than double by 2050 compared with 2020 capacity.. ● In the low

● In the high case, nuclear electricity production is expected to increase almost 40% from 2019 levels by 2030 and eventually more than double by 2050.. The share of nuclear in

● In the high case, it is assumed that several of the nuclear power reactors scheduled for retirement will be given a life extension, so that only 49 GW(e) of nuclear electrical

● In the high case, it is assumed that several of the nuclear power reactors scheduled for retirement will be given a life extension, so that only 55 GW(e) of nuclear electrical