HAL Id: hal-01558385
https://hal-mines-paristech.archives-ouvertes.fr/hal-01558385
Submitted on 7 Jul 2017
HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from
L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de
A novel method for decomposing electricity feeder load into elementary profiles from customer information
Alexis Gerossier, Thibaut Barbier, Robin Girard
To cite this version:
Alexis Gerossier, Thibaut Barbier, Robin Girard. A novel method for decomposing electricity feeder
load into elementary profiles from customer information. Applied Energy, Elsevier, 2017, 203, pp.752
- 760. �10.1016/j.apenergy.2017.06.096�. �hal-01558385�
A novel method for decomposing electricity feeder load into elementary proles from customer information
Alexis Gerossier ∗ Thibaut Barbier ∗ Robin Girard
{alexis.gerossier},{thibaut.barbier}@mines-paristech.fr
∗ Both authors equally contributed to this paper
MINES ParisTech, PSL Research University, Center for processes, renewable energy and systems (PERSEE), 1 rue Claude Daunesse, 06904 Sophia Antipolis, France
June 2017
Abstract
To plan a distribution grid involves making a long-term forecast of sub-hourly demand, which requires modeling the demand and its dynamics with aggregated measurement data. Distribution system operators (DSOs) have been recording electricity sub-hourly demand delivered by their medium-voltage feeders (around 1,00010,000 customers) for several years. Demand proles dier widely among the various considered feeders. This is partly due to the varying mix of customer categories from one feeder to another. To overcome this issue, elementary demand proles are often associated with customer categories and then combined according to a mix description. This paper presents a novel method to estimate elementary proles that only requires several feeder demand curves and a description of customers. The method relies on a statistical blind source model and a new estimation procedure based on the augmented Lagrangian method. The use of feeders to estimate elementary proles means that measurements are fully representative and continuously updated. We illustrate the proposed method through a case study comprising around 1,000 feeder demand curves operated by the main French DSO Enedis. We propose an application o that uses the obtained proles to evaluate the contribution of any set of new customers to a feeder peak load. We show that proles enable a simulation of new unmeasured areas with errors of around 20%. We also show how our method can be used to evaluate the relevancy of dierent customer categorizations.
1 Introduction
1.1 Motivation
Electricity represented 18% of total nal energy consumption in 2013 [3] and is expected to con- stitute a quarter of nal energy consumption by 2040 [1]. 42% of global CO2 emissions in 2012,
i.e. 13.8 gigatons of CO2, are due to electricity and heat production [2]. To reduce CO2 emis- sions due to electricity, many states are devel- oping energy transition strategies. This kind of transition involves signicant changes to electric- ity ows in the distribution network (with e.g.
decentralized production, improved eciency of
buildings and appliances, new uses and demand
Nomenclature
a f Consumption trend of feeder f relative to temperature
b f Temperature threshold of feeder f B Matrix of demand proles of
customer categories
β Column vector associated with B c f k Annual consumption of category k
for feeder f
d f Demand of feeder f
d k Elementary prole of customer category k
ε f Residual term for modeling feeder demand f
F Number of feeders f Feeder
K Number of customer categories k Customer category
m k Average demand share of a given category k
p f k Share of electricity used by category k for feeder f
σ 2 k Empirical variance of p 1 k , . . . , p F k T Number of instants
t Instant
T f Outside temperature of feeder f u Vector (1, . . . , 1) | of length K v Vector (T −1 , . . . , T −1 ) | of length T V inter Inter group variability
V tot Total variance
X Matrix of feeder demands
x Column vector associated with X
y Year
⊗ Kronecker product
response enabling energy consumption manage- ment [18]).
These changes impact the planning process of distribution system operators (DSOs). The cur- rent network planning process considers the two most extreme situations [16], i.e. maximum de- mand with minimum supply, and maximum sup- ply with minimum demand. While planning with such a method does not require a deep model- ing of the dierent dynamics and their correla- tions, it does not take into account the aggre- gation eect between supply and demand [15].
The above-mentioned changes make it necessary to model all of the aggregated demand dynamics.
1.2 Literature review
In this section we present two kinds of existing approach for modeling aggregated demand. The rst is bottom-up, and uses individual customer proles, which are summed to obtain aggregated demand. The second is a global approach in which the aggregated load curve is directly mod- eled using aggregated measurement data.
1.2.1 Bottom-up approaches
Measuring the electricity demand of individual
electricity customers is a simple way to establish
their load proles and dynamics, and therefore
a necessary step in bottom-up modeling. The
current smart-meter roll-out in Europe will pro-
vide precise measurements of individual demand
proles. Around 80% of customers are sched-
uled to receive a smart-meter by 2020 [28]. How-
ever, this massive deployment is hindered by cost
and privacy issues [21]. In 2014, only 23% of
smart-meters in the European Union were in-
stalled in localized areas for private customers
[13]. In some countries, this share is still insu-
cient to be representative, and the corresponding deployment is too recent to adequately cover long periods. To deal with the lack of individual mea- surements and characterize the behavior of elec- tricity customers, researchers have attempted to classify them into dierent categories.
The classication of electricity demand pro- les is a ourishing research topic (see reviews [19], [25]). Researchers use individual measure- ments from smart-meters as input and apply dif- ferent clustering methods [31]. This reduces the dimension, which makes it easier to manipulate data [22]. With the resulting classication, each customer is associated with a cluster and its cor- responding load prole [26]. The classication and the obtained load proles can be used for a number of applications.
First, a ne classication can be made in or- der to help decision-makers design personalized policies for specic customers [7].
Secondly, the classication allows a DSO to plan its network and anticipate its investments [23, 27]. For example, the French DSO uses a model named "Bagheera" combining about 50 customer categories to plan its low-voltage net- work [16]. Classication is combined with the evolution of category distributions to forecast ag- gregated demand in prospective scenarios [5].
Last, classication and load proles allow us to understand the contribution made by each cate- gory to aggregated demand [27].
Large measurement campaigns are necessary with these methods since a representative set of customers is required. This constraint makes continuous updating of the proles dicult, which is an issue since it remains necessary to adapt the proles to the changing consumption habits [4, 26].
1.2.2 Global approaches
In global approaches, models forecast aggregated electricity demand with past measurements and explanatory variables, such as expected temper- ature or sometimes economic progress [30].
In order to obtain past measurements, most DSOs have been recording the electric power de- livered by their medium-voltage feeders (around 1,00010,000 customers) for several years.
These measurements are aggregated, but exhaus- tive, since all electricity customers' contributions are taken into account. This aggregated electric- ity demand data is considered as a nonlinear, non-stationary series, and is often made up by a superposition of several distinct frequencies [29]
with daily to monthly periods in global models [8]. Additionally, the demand series can be di- vided into dierent parts (e.g. working time, hol- idays) [9, 17].
The global approach produces accurate fore- casts. However, these are based on aggregated past measurements, which are not available when planning a new unmeasured zone. This type of planning is improved with specic information about customers, which DSOs possess thanks to the Customer Information System (CIS) [23].
The CIS stores information on all customers re- garding their electric connection to the grid, an- nual energy consumption, type of contract, and contracted power.
In all of the reviewed global methods [29]
for modeling demand dynamics, the explanatory variables used, such as expected temperature or sometimes economic changes [30], do not charac- terize the feeder-specic local features. In partic- ular, none of them employs CIS general statistics.
Finally, the drawback of these methods when
used for planning purposes is that they cannot
adapt to a change in the mix of customer cate-
gories. For example, in the case of the develop- ment of a commercial area in a residential feeder, such methods fail to take into account the corre- sponding information. If the prole dierences of the two sectors is not accounted for, this might result in an overestimation of the future peak and hence an over-sizing of the network.
1.3 Contributions
Our paper presents a novel method to estimate elementary proles. The proposed method re- lies on a statistical model that takes into ac- count the mix of customer categories. To do this, we assume that the demands aggregate dierent shares of elementary proles associated with dif- ferent customer categories. These proles are op- timally found by minimizing prediction errors in a new algorithm relying on the augmented La- grangian method.
Unlike bottom-up methods, our method only requires several feeder demand curves and a de- scription of customers. The advantages of aggre- gated measurements compared to a set of indi- vidual load curves are: the availability of long- term historical data, full representativeness, and continuous updates. We show that the method performs similarly or better than a bottom-up method in the literature when predicting new lo- cal areas.
We illustrate the proposed method through a case study comprising around 1,000 feeder de- mand curves operated by the main French DSO Enedis. The proles obtained are essential to size the distribution network. This is illustrated by an application that evaluates the contribu- tion of any set of new customers to a feeder peak load. We show that proles enable a simulation of new unmeasured areas with errors of around 20%. We also show how our method can be used
to evaluate the relevancy of dierent customer categorizations.
1.4 Description of the paper
In section 2, the methodology is described. A case study is presented in section 3 with the re- sulting proles by category. Section 4 describes two applications that use the obtained proles.
One is employed to estimate the contribution of set of new customers to a feeder peak load. The other evaluates forecasting errors for unmeasured areas, by testing dierent categories and compar- ing performances with a similar framework case study in the literature. Finally, some conclusions are presented and discussed in section 5.
2 Methodology
2.1 The problem of recovering load proles and the forecasting method
Our paper assumes that the sub-hourly demands d f (t) of a feeder f aggregate dierent proles d 1 (t), . . . , d K (t) associated with K categories of customers with weights p f 1 , . . . , p f K ,
d f (t) =
K
X
k=1
p f k d k (t) + ε f (t). (1)
We take the elementary proles d k (t) to be com-
mon to all feeders, while the weights vary from
one feeder to another. The corresponding resid-
ual term ε f (t) is meant to be small. The time t
can vary along any set. The aim is to recover un-
known elementary electricity proles d k (t) . For
each feeder f ∈ {1, . . . , F } , d f (t) is observed
and, thanks to the CIS, for each category k ∈
{1, . . . , K } , we also have access to the weight p f k .
The process of obtaining proportions from the CIS and dening categories is the categorization step, and is described in subsection 2.3. Once the K proles have been obtained on a set of feeders, it is possible to turn Equation (1) into a simulation algorithm. The process is described in Figure 1. In the signal processing community, the corresponding problem is called blind signal separation and is well-known (see e.g. [11]).
2.2 Optimization problem
The aim is to nd the elementary proles d k (t) from aggregated demand d f (t) according to Equation (1). We write and solve the following optimization problem.
To mathematically write this optimization problem, we dene a matrix A of size (F, K ) whose elements are proportions p f k for k ∈ {1, . . . , K } and f ∈ {1, . . . , F } . Aggregated demands d f (t) for all feeders and instants {1, . . . , T } are gathered in a matrix X of size (F, T ). We are trying to compute demand pro- le d k (t) for all categories and instants: these unknown values can be put in a matrix B of size (K, T ). It is useful to dene β (resp. x), the col- umn vector obtained by stacking rows of B (resp.
X ) on top of each other. Two constraints limit the values of matrix B:
1. Each component of β is an electricity de- mand. Since electricity producers are not considered in this paper, components should be positive.
2. For each class k , components should have an average unit, i.e. P
t d k (t) = T , to have comparable proles. To write this constraint in mathematical terms, we dene the col- umn of length K , u = (1, . . . , 1) | , and the column of length T , v = (T −1 , . . . , T −1 ) | in
order to write the average unit constraint, with a Kronecker product ⊗ , as (I K ⊗v | )β = u .
The optimization problem then writes
min β kx − (A ⊗ I T )β k 2 (2) s.t. β ≥ 0
(I K ⊗ v | )β = u
An alternating direction method of multipliers [10] is used to recursively solve problem (2):
1. minimize the function with the equality con- straint by employing the augmented La- grangian method,
2. retain only positive components to satisfy the positivity constraint,
3. adjust a penalty variable balancing positiv- ity and the minimization.
The algorithm is implemented with the R lan- guage [24]. Special care is taken on the rst step, since the minimization requires inverting a large matrix of size K(T + 1) . With common Kro- necker product rules, matrix to be invert is re- duced to size K divided the number of ops by approximately T 3 .
2.3 Categorization of electricity cus- tomers
The aggregated demand prole d f (t) of a feeder
f aggregates a large group of customers (a few
thousands). The CIS provides general features
on these customers, i.e. annual consumption,
type of contract, and contracted power, which
can be used to cluster them into K dierent
categories. Once the features are selected, the
DATASET
LOAD PROFILES RECOVERY
DECOMPOSITION
UNKNOWN DEMAND
SIMULATION ALGORITHM
Catw1 CatwK
0 24
0 24
Demands Proportions
Feederw1
FeederwF
10% 27%
47% 5%
hour
hour
hour
hour
Profiles Categoryw1
CategorywK
0 24
0 24
Proportions
Newwfeeder
79% 12%Newwfeeder
hour
Catw1 CatwK
Catw1 CatwK
0 24
Demand
Figure 1: Diagram detailing the method. A dataset of F feeder measurements is used to nd the K category proles. Once the load proles recovery is operated, a new feeder whose category distribution is known can be run through the simulation algorithm in order to obtain its expected demand.
total annual consumption c f k of a category k ∈ {1, . . . , K } in a feeder f ∈ {1, . . . , F } is com- puted from each annual individual consumption.
The corresponding weight p f k is a normalized ver- sion of this consumption
p f k = c f k P K
k=1 c f k
∈ [0, 1] (3) It is important that the size of the dataset F should be larger than the number of categories K . Empirically, it was observed that the condi- tion F > 5K is preferable in order to obtain a wide range in the set of category distributions, and thus a more precise result. Features should be general enough to keep a reasonably low K for three reasons: (i) to obtain a robust prole, (ii)
to avoid an excessively long computing time, and (iii) to ensure that user privacy is not violated.
Figure 2 sets out four dierent categorizations, based on information from the CIS. The rst categorization divides the total energy into two groups: residential and tertiary. The second splits the tertiary into 7 categories to make a total of 8 categories, i.e. residential, agriculture, commercial, public equipment, oce and hospi- tal, industry, restaurant and hotel, and medium- voltage (MV) customers (e.g. large buildings that have a specic contract with the operator).
A 9-group division results from splitting the res- idential share into two groups: base tari and special tari 1 . Finally, an even more precise cat-
1
Special tari charges less during xed o-peak peri-
egorization, i.e. 12 groups, is proposed. Com- mercial buildings are split into 2 categories re- ecting low and high annual consumption. Sim- ilarly, MV customers are divided into 3 groups:
low, medium and high.
On Figure 2, category heights for a category k represent the average demand shares for a given category m k = F 1 P F
f=1 p f k .
The share in category distribution is dierent for every feeder. For instance, there are more restaurants in a city center than in a rural area and so the two electricity shares are dierent.
This share has to vary between feeders to e- ciently compute the demand proles. We com- puted the coecients of variation
σ k
m k (4)
where σ k 2 is the empirical variance of p 1 k , . . . , p F k . The coecients are always higher than 40%, and thus the dierent categorizations are suciently spread from one feeder to another for our algo- rithm.
3 Case study
3.1 Data description
In this case study, we use electricity feeder de- mand measured every ten minutes in 3 geograph- ical regions in France. Data come from the main French DSO, Enedis. The three regions encom- pass a large French city and the surrounding countryside. The three cities are Blois, Lyon and Rennes. Each region is divided into around 500 feeders, and each of these feeders provides electricity for about 1,000 customers. For each feeder, we know the demand measured for 4 years
ods (i.e. during the night) but more during peak hours.
from 2010 to 2013. We discard some feeders be- cause the measures are too scarce and their over- all quality is not sucient. This can result from database errors or from network reconguration or physical injuries on the grid [17]. Ultimately, between 200 and 400 feeders are selected for each region.
3.2 Temperature eect and normal- ization
Aggregated demand measurements cannot be di- rectly compared since some feeders are connected to more customers than others, causing a large discrepancy in average consumption. In order to be used as inputs in the method, measurements therefore need to be pre-processed. The two steps of this pre-processing are: removal of the temperature eect, and normalization by weekly consumption.
Electricity demand is mostly inuenced by
outdoor air temperature, as residents turn on
electric devices to adjust their indoor tempera-
ture (heating and air conditioning). In France,
the air conditioning eect is low and not consid-
ered in this paper, but the heating eect is high
during cold weather. French electric demand
represents 40% of the European thermal sensi-
tivity [14]. Indeed, since most French heating
devices are electric, demand strongly increases
when temperature decreases. However, this ef-
fect is well understood and can be removed and
treated separately with a method used by the
French TSO [20, pp 1112]: one linear regres-
sion for each hour of week. Therefore, for each
feeder f , we can determine a temperature thresh-
old b f and a trend a f > 0 such as for each degree
colder than threshold b f , demand increases by
a f . A new demand series is dened from the
2 8 9 12
residential
tertiary
residential
base tariff base tariff
special
tariff special
tariff
Agriculture Commercial-low&
Commercial-high Public&equipment Office&&&hospital Industry
Restaurant&&&hotel MV&customer-low MV&customer-medium MV&customer-high
MV&customer MV&customer Commercial Commercial
Commercial
Mean share&
of&
demand
Category name Number&of&categories
Figure 2: Example of dierent categorizations (in 2, 8, 9 or 12 groups) for the region near Lyon.
There are F = 320 feeders in this dataset. The height of a division shows the mean share of the category in all feeders in the region.
initial d f 0 (t)
d f 1 (t) = (
d f 0 (t) if T f (t) > b f d f 0 (t) − a f b f − T f (t)
otherwise.
where T f (t) is the outside temperature of feeder (5) f at instant t . In fact, trends a f and threshold b f are calculated for each hour of the day but the hour index is omitted for clearer notation.
The new series is thus supposed to be indepen- dent from the temperature, and demand dynam- ics are supposed to be similar during cold and warm periods.
To obtain comparable measurements between
feeders, demand is normalized. Each measure-
ment within a given week is divided by the energy
it consumed during that week. This total energy
can be predicted using dierent models, such as
that employed in [6], and is thereafter supposed
to be known. After the normalization, data val-
ues uctuate around a dimensionless value equal
to 1.
Hour%of%the%day
0 6 12 18 24
average%weekly consumption
+50q +100q +200q +150q
-50q -100q
Commercial Public%eq.
Rest.%.%hotels Industry
Lyon%2011
Figure 3: Weekday proles of 4 dierent categories computed with the algorithm (9 overall cate- gories) using aggregated consumption data relating to Lyon in 2011. Plots represent the variations around the average weekly consumption and not absolute consumptions.
3.3 Proles
As previously described (see Figure 1), we dis- aggregated the electricity demand in order to recover a load prole d k (t) for each category k ∈ {1, . . . , K } . The number of overall categories depends on the customer categorization: 2, 8, 9 and 12 categories were tried out (see Figure 2).
A total of 12 datasets is formed (for each region:
Blois, Lyon and Rennes; and for each year: from 2010 to 2013) and separately used as input into matrix X in problem (2).
Figure 3 presents the proles obtained for K = 9 with only 4 categories shown: commer- cial, public equipment, restaurant and hotel, in- dustry. Proles are computed with the demand dataset of Lyon in 2011. Proles are presented for a typical weekday (144 values, once every 10 minute). Since we have normalized the data, the
variations around the average weekly consump-
tion are displayed. Dierent eects are note-
worthy, e.g. the electricity consumption of com-
mercial buildings increases by around 75% dur-
ing working hours, and decreases by 50% dur-
ing the night. Conversely, the consumption of
public equipment (mainly public lighting and
lifts) greatly increases at night. These proles
are a pertinent way to understand electricity de-
mand patterns. Proles can be plotted for other
datasets (another region or another year) in or-
der to analyze specic characteristics.
4 Applications of the method
4.1 Estimation of the contribution of new customer sets to a feeder peak load
To plan the expansion of a new area, the DSO has to estimate the evolution of peak demand. The proles obtained enable it to quantify and fore- cast the contribution of the new set of customers in the peak load demand. Indeed, for a feeder f at year y 0 with proportions p f 1,y
0
, . . . , p f K,y
0
we can determine the residuals ε f y
0(t) in Equation 1 and for new proportions p f 1,y
1, . . . , p f K,y
1
in a fu- ture year y 1 the forecast demand is obtained by
d f y
1(t) =
K
X
k=1
p f k,y
1