R EVUE DE STATISTIQUE APPLIQUÉE
R OBERT C. M ORGAN G. P. N ASON
Wavelet shrinkage of itch response data
Revue de statistique appliquée, tome 47, n
o3 (1999), p. 81-98
<http://www.numdam.org/item?id=RSA_1999__47_3_81_0>
© Société française de statistique, 1999, tous droits réservés.
L’accès aux archives de la revue « Revue de statistique appliquée » (http://www.sfds.asso.fr/publicat/rsa.htm) implique l’accord avec les condi- tions générales d’utilisation (http://www.numdam.org/conditions). Toute uti- lisation commerciale ou impression systématique est constitutive d’une in- fraction pénale. Toute copie ou impression de ce fichier doit contenir la présente mention de copyright.
Article numérisé dans le cadre du programme Numérisation de documents anciens mathématiques
http://www.numdam.org/
WAVELET SHRINKAGE OF ITCH RESPONSE DATA
Robert C.
Morgan*,
G.P. Nason*** Unilever Research US Limited, 45 River Road,
Edgewater,
NJ 07020, USA**
Department of Mathematics,
Universityof Bristol, University
Walk, BRISTOL BS81 TW,England
RÉSUMÉ
Cet article s’intéresse au
problème
dedébruitage
de fonctions constantes par morceauxen utilisant un rétrécissement
adéquat
obtenu par moyennes mobiles ou pardécomposition
enondelettes de Haar. Les fonctions constantes par morceaux sont bruitées par un bruit de type
poissonnien
et résultent de mesures deperception
dedémangeaisons
par dessujets
humainsau cours d’une
expérience
destinée à étudier l’association entre le sentiment dedémangeaison
et le flux
sanguin
lors d’uneiontophorèse
d’histamine. Nous montrons que la transformationen ondelettes invariante par translation avec rétrécissement universel fournit une méthode d’estimation
rapide
etautomatique produisant
de bonnes estimations du processus deperception
des
démangeaisons
etqu’elle
s’avèresupérieure
à la méthode des moyennes mobiles ou auxméthodes standards de rétrécissement par ondelettes. L’article examine ensuite
plus
en détailla structure de bruit des
signaux
et met en valeurquelques
nouvellestechniques
d’ondelettesqui pourraient
être utiles.Mots-clés : Rétrécissement par ondelettes, Bruit
poissonnien, Iontophorèse.
ABSTRACT
This article addresses the
problem
ofdenoising piecewise
constant functionsby using
both an
jump-tolerant moving
average and Haar waveletshrinkage.
Thepiecewise
constantfunctions are contaminated with Poisson-like noise and are measurements of
perceived
itchby
human
subjects
in anexperiment
to relateperceived
itch to bloodflow in response to histamineiontophoresis.
We show that the translation-invariant wavelet transform with universal waveletshrinkage provides
a fast and automatic method forproducing good
estimates of theunderlying perceived
itch and that the method issuperior
to thejump-tolerant moving
average and standard waveletshrinkage.
The article then looks in more detail at the noise structure of thesignals
andhighlights
some new wavelettechniques
thatmight
be useful.Keywords:
Waveletshrinkage,
Poisson noise,Iontophoresis.
1. Introduction
This article addresses the
problem
ofdenoising piecewise
constant functions.The
piecewise
constant functions in this article are recordedsignals
where thesignal
level
registers
asubject’s perception
of itch. Simultaneous measurements of bloodfloware taken and the eventual aim of the
experimentation
is to measure the extent towhich
perceived
itch level is related to bloodflow. Once the relation isquantified
future itch measurements can be more
reliably
andeasily
measuredusing
bloodflowmeasurements.
However,
thepiecewise
constant functions whichregister
theperceived
level ofitch
(controlled by
asubject)
are themselves measured in the presence of noise. The functions need to be denoised beforethey
can be related to bloodflow measurements and this article addressesonly
thisproblem.
Section 2 describes theexperimental
issues in more detail.
There are many ways in which
piecewise
constant functions can be denoised see forexample Spokoiny [14]
and references contained therein. We describe two here:one is based on
adapting
amoving
average; the other is based on waveletshrinkage using
Haar wavelets. Section 3 shows thatthe jump-tolerant moving
average methodperforms well,
butonly
if itsparameters
are chosenmanually
anddifferently
for eachsignal.
Section 4 describes waveletshrinkage using
the discrete wavelet transform and shows how itperforms
wellexcept
for a noticeable Gibb’s effectnear jumps. Using
Coifman and Donoho’s TI-transform
[2]
the over- andundershooting
effects almostcompletely disappear resulting
inextremely good
estimates. Section 5 looks in moredetail at the noise that is
present
with the itch response data and makessuggestions
about more advanced
shrinkage
methods that could be used.To
keep
this article self-contained we have included a briefdescription
ofwavelets and wavelet
shrinkage
in Section 4.However,
ours is a sparsedescription
andmore
comprehensive expositions
may be found in Donoho et al.[6]
or in Nason and Silverman[13].
For morecomprehensive
discussions of wavelets see Daubechies[3], Meyer [12]
or Burrus andGopinath [1].
2. Itch response data
2.1. Introduction and
background
to dataIn
attempting
todesign personal products
whichgive pleasing
sensations to theskin,
it isimportant
todevelop
a detailedunderstanding
of theunderlying physiological
andneurophysiological
mechanisms which takeplace.
One area ofresearch that aims to address this is the measurement of blood flow in the
skin,
in the presence of certain chemicals. The chemicals can be used toproduce
various commonsensations such as
itching, stinging
andbuming,
so that links may be drawn between the occurrence of these sensations and the behaviour of the bloodflow. The use of wavelets inhelping
toanalyse
the results of theseexperiments
wasinvestigated
forthe
particular
sensation ofitch,
but theprocedure
would be similar for any sensation.2.2.
Experimental
detailsThe sensation of itch may be stimulated
by
the presence of histamine. In order that this is not confused with other sensations such aspain,
it is necessary that the histamine isbrought
under the skin’non-invasively’.
In other words it should not beinjected.
The method used is known asiontophoresis.
A chamber filled with ahistamine solution is
placed
in contact with the skin. Inside the chamber are twoelectrodes,
which are also in contact with the skin. When avoltage
isapplied
acrossthe
electrodes,
an electric current will passthrough
theskin, drawing
histamine with it.Using
this method it ispossible
to control the exact dose of histamine which enters the skinby altering
thevoltage.
The chamber isapproximately
3cm in diameter. As it ispossible
that the action ofattaching
the chamber may itself causeitch,
aperiod
ofseveral minutes is allowed to
elapse
between attachment andapplication
ofvoltage,
in order for any such itch to settle down. It is
accepted
that after thisperiod
the staticpresence of the chamber on the skin will not cause itch. Further details about the
methodology
andapplications
ofiontophoresis
aregiven
in Harris[7].
These
experiments
test thehypothesis
that the level of itch which asubject
feelsis linked to the increase in blood flow around the skin area where the
iontophoresis
takes
place.
To test thishypothesis,
the blood flow andsubjective
itch are measuredsimultaneously.
The blood flow responsesignal or flux
is measuredusing
laserDoppler apparatus.
Subjective
itch is measuredby
thesubject moving
a slideralong
apotentiometer.
Two scales are used:
2022 Visual
Intensity
Scale(VIS)
- apurely quantitative
zero to onescale,
on whichzero
represents
noitch,
and onerepresents
the most intense itchimaginable by
thesubject.
Thesubject
is trained in the use of this scale andoperates
thepotentiometer continuously
themself.2022 Visual Affective Scale
(VAS)
- afive-point qualitative scale,
with the fivepoints
representing descriptors
such asannoying, unpleasant, painful.
Theinvestigator operates
thepotentiometer, updating
thereading
every minuteby asking
the pan-ellist which of the
descriptors
best describes theirpresent feeling.
Forpresentation
these
descriptors
are ordered and translated topoints
on a zero to one scale.2.3. An
example
data setFigure
1 shows an actualoutput
from aniontophoresis experiment
where theiontophoresis
isbegun
at t = 2000. Eachexperiment produces
a set of four curvesall measured over time.
Typically
the flux(top
line infigure)
andsubjective
curves(bottom line)
are very different. The fluxusually
increasessteadily
afteriontophoresis,
with a small amount of noise around the
signal.
Thesubjective
curves consist of almost verticaljumps, separated by
horizontalplateaux - they
areapproximately piecewise
constant. There is a lot of very
spiky
noise around thesignal
of thesubjective data, mostly
due to therecording apparatus.
One overall and eventualgoal
of ouranalyses
is to link the flux data to the
subjective
data: before this can be donesuccessfully
theVIS and VAS curves need to be denoised. The next section considers two methods that
might
beappropriate.
FIGURE 1
Complete output for subject
11.2.Top left: blood flow
ationtophoresis
site.Top right: bloodflow
at remote site. Bottomleft:
recorded itchintensity
VIS scale.Bottom
right:
recorded itchintensity
VAS scale.The scale is "Time in secondsl8" which means that t = 800
on the
plot corresponds
to 100seconds from
thebeginning of
theexperiment.
These units are chosen
by
therecording
apparatus.3. Methods for curve estimation
There are very many methods that can be used to estimate curves within the presence of noise.
Only
a smallproportion
of these are suitable for curves where it is known apriori
that the curves containdiscontinuities,
such as with our VAS and VIScurves above. To be of any use in our
practical
context our eventual chosen method has to becompletely
automatic and fast.Many
methods were considered but since the curves seemed to bepiecewise
constant we ended upconsidering
two methods:one based on
an jump-tolerant moving
average; the other based on waveletshrinkage
with Haar wavelets. Note there are still many other
competitors
that we could haveused such as local
polynomials.
3.1.
Adapted moving
averagesWe devised the
following moving
average method which wasdesigned
topreserve
large jumps
and smooth out small ones. Themoving
average routinedepends
on two user-defined
parameters: smoothing width,
m, and minimumjump size,
r,which are chosen
subjectively.
First defineThen
our jump-tolerant moving
average ai is defined asThe
jump-tolerant moving
average is best described with the aid ofdiagrams.
Suppose
m = 5. Then for agiven y2
there are threepossible
ways in which ai will be calculated. Ingeneral,
thepoint y2 itself,
theprevious
5points
and the next5
points
are all used tocompute
ai. If all of thesepoints
are inside therange yi ±
r(Case
1: seefigure 2)
thenthey
are all included in the average.If, however,
Yi+3 say, were to fall outside therange yi ±
r(Case
2: seefigure 3)
then it would not be included in the average, and neither would any of the laterpoints.
These laterpoints
are excluded even if some of them fall inside the
range yi ±
r(Case
3: see finalpoint
in
figure 3). (Note:
If Yi-3 was outside the range, then it would be excluded from the averagealong
with all earlierpoints).
FIGURE 2
Moving
average calculation. Case 1(All points
used to computeaverage).
The
smoothing width,
m, and theminimum jump size,
r, are chosensubjectively
to obtain the best fit. The value of m determines the
degree
ofsmoothing along
theplateaux,
while r determines the minimumjump
between two consecutivepoints
thatis considered as a
step change
rather than noise. Whenoptimising
m, theassumption
that the
underlying signal
isapproximately piecewise
constant means that within eachplateau
it isimpossible
to over-smooth thedata, suggesting
that m may be chosenarbitrarily large.
However in cases where the size of the noise is close to the size of discontinuities in thesignal,
a smaller value of m isadvantageous
as it reducesthe
smoothing
out of these discontinuities.Also,
thealgorithm
becomes slow iflarge
values of m are chosen.
Choosing
r can be difficult: if it is toolarge
then the routine smooths out discontinuities in thesignal,
whereas if it is too small then the routineinterprets
noise as real discontinuities. For agiven signal
it isusually possible
tochoose a suitable value of r
by hand, by manually assessing
the size of the smallestdiscontinuity
andestimating
theamplitude
of thenoise,
thenchoosing
r to be betweenthese values. When the size of the noise is close to the size of the discontinuities it is not
possible
to choose an ideal value of r, and the effect ofover-smoothing
needs tobe reduced
by choosing
a small value of m as mentioned above.FIGURE 3
Moving
average calculation. Cases 2 and 3.The
moving
average routineproved
successful insmoothing
the noise from the VIS curves whilstretaining
the discontinuities in thesignal.
Forexample,
seeFigure
4.By choosing separate
values of m and r for eachsubject
it would have beenpossible
to achieve a verygood
fit in each case.However,
thisapproach
seemed toocomplex
andsubjective,
so anattempt
was made to choose values of m and r which could be used for allsubjects.
It waspossible
to choose values of m and r which gavereasonably good
results for allsubjects, although
it took some work to find these. It would therefore be useful if the choice of theseparameters
could be made automatic which is not easy.Furthermore,
in cases where theamplitude
of the noise was aslarge
as thesmallest
discontinuity (determined by observing
where thesignal
exhibited astep change
rather than a shortspike)
the method suffered from theproblem
that there was no ideal value of r, and evenreducing
the value of m did not solve thisentirely.
Tosummarise,
there were twoproblems
with themoving
average routine:w not automatic - necessary to choose a suitable value of m and r for the
sample
of curves, and in order to achieve a very
good fit,
necessary to choose m and rseparately
for each curve. Would beadvantageous
to have an automaticmethod,
i.e. one which did notrequire
anysubjectively
chosenparameters.
2022 poor
performance
when the noiseamplitude
isgreater
than orequal
to the smallestdiscontinuity
insignal.
Raw data curve
Moving
average fitFIGURE 4
VIS
curve for subject
11.2 : Raw data andmoving
average.These two
problems
with themoving
average method indicated that it would be worthwhileseeking
an alternative"smoothing"
method. The mainrequirement
ofthis new method would be that it could
perform
as wellautomatically
as themoving
average method
performed
with manualparameter
selection. Ifpossible
the methodshould be able to
perform
as well as themoving
average routine withparameters
chosenseparately
for each curve, butcertainly
it should be able toperform
as wellas the
moving
average routine with one set ofparameters
chosen for all the curves in thesample.
It would be a bonus if the method was able to reconstructsignals
in thepresence of noise whose
amplitude
was of a similar size to thestep changes.
4. Wavelet
shrinkage
4.1. Introduction
Wavelet
shrinkage
isa beautifully simple
method: thenoisy signal
isrepresented
as a wavelet
expansion.
The wavelet coefficients are then shrunk and theresulting expansion
forms an estimate of the true,underlying signal.
Weonly give
the barestdetails here.
However,
there are many papersdescribing
thetechnique notably
theseminal
Donoho, Johnstone,
Picard andKerkyacharian [6]
but see also Nason and Silverman[13]
for asimple
introduction.In mathematical terms the wavelet
shrinkage procedure
goes as follows. Weassume that
noisy
VIS or VASsignals
aregenerated
on the interval[0,1]. Suppose
that the true
underlying
function isgiven by f (t)
and that thenoisy signal, {yi}ni=1
is
given by
thesignal plus
noise modelwhere
{~i}ni=1
is aindependent
andidentically
distributed sequence of Gaussian random variables with mean zero and common variance ofu 2.
The basic waveletshrinkage
method also assumes that n =2J
for someinteger J,
in other words the number of datapoints
has to bedyadic.
Note also that model(4) only
includes data that areequally-spaced
on a fixeddesign.
More recent research work has relaxed the restrictivedesign,
number of datapoints
and error distributionassumptions
and someof these are described in section 5.
Wavelets. As mentioned above the
underlying
functionf (t)
has a waveletexpansion.
This is similar to a Fourierexpansion
or anexpansion
in terms oforthogonal polynomials.
The main difference is that the basis functions are wavelets(small waves)
and not sine/cosine orpolynomials.
The two basicbuilding
blocks inour
expansion
for the VIS/VAS curves are the Haarscaling
functiongiven by
and the Haar mother wavelet
given by
The mother wavelet and
scaling
function aresubjected
to translation andscaling by
factors of 2
(also
calleddilation).
Forexample,
we cansimultaneously
dilate themother wavelet
by
a factor of2j
and translate itby 2-jk
forintegers j
and kby forming
Functions can be built out of scaled and translated mother wavelets and translated
scaling
functions at aparticular
scalejo.
Thefollowing expansion
is used:where
The
equation
is verysuggestive
and shows how the functionf (x)
can be constructed out of: shifted versions of thescaling
functions atjo - roughly speaking representing
the overall level of the function over x ; and scaled and shifted versions of the mother wavelets
- roughly speaking representing
the detail of the function at different scales.Expansion (5)
is not restricted to Haar wavelets it works for many different classes of wavelets which sometimes have veryspecial
and usefulproperties:
such asvanishing
moments,
symmetry, orthogonality. However,
for oursimple
VAS/VIS functions Haarwavelets are ideal because
they exactly
emulate theunderlying piecewise
constantnature of the curves.
In all that follows assume that
jo =
0. Thequantity jo
is often called theprimary
resolution as it determines the fineness on which theexpansion
is based. Inreal
problems
it is aparameter
that needs to be chosen.The discussion above describes the wavelet transform of a function
f (x).
However,
ourdata, given by
model(4)
isactually
discrete.Fortunately,
there is asimple
discretized version of(5).
Thenoisy
data can be transformed into discrete wavelet coefficients dby
where y =
(Yi,
Y2, ....Yn) T
are ourdata,
and d is a n x 1 vector of wavelet coefficients(discretized
versions of the ek anddjk given
in(5)).
The discrete wavelet coefficients in d have aspecial pyramidal
structure: there isonly
one c.coefficient,
onedo.
coefficient,
twodl.,
and so on. This means that the coefficients can be written as apyramid
asIndeed,
there is a fastalgorithm
due to Mallat[11] that computes
the discrete wavelet transformusing
a"pyramid algorithm".
Thepyramid algorithm only requires O(n) operations
in contrast to the matrixmultiplication
in(6)
whichrequires O(n2).
The Haar wavelets are a
family
of orthonormal wavelets andconsequently
thetransform matrix in
(6)
isorthogonal
and hence the inverse matrix ismerely W T . Further,
there is afast, O(n)
inversealgorithm
as well.Wavelet
shrinkage
The discrete wavelet transform of our y =f
+ E noisemodel in
(4)
iswhere 03B8 ==
W f.
The noise E in the wavelet domain is also iid Gaussian with the samevariance as in the data domain because of the
orthonormality
of the transform W.The main reason for
using
waveletshrinkage
arises from thefollowing
heuristics:1. the
signal, f, (in
this case the VIS and VAScurves)
can berepresented efficiently by
few waveletcoefficients, 03B8, (in
this case the VAS/VIS curves can besparsely represented by
Haarwavelets)
2. since the noise is iid it will be
statistically
the same for each coefficient. In otherwords,
the noise does not show anypreference
for anyparticular
coefficient or set of coefficients.Therefore if the noise level is small
enough
then the wavelet transform of our VIS/VAS curves will consist of a fewlarge
coefficients(which
come from thesignal
plus noise)
and many smaller coefficients(which correspond to just noise).
An obviousway
of recovering
the truesignal
is tokeep
thelarge
coefficients and remove any smallones. This
technique
is known as hardthresholding
but there are many variants. Theconcepts
oflarge
and small in this case are taken withrespect
to the noise level o- which is estimated. In theseexamples
we taken the finest scale wavelet coefficients and usethe mean absolute deviation
(MAD)
estimator of cr dividedby
0.6745(this
constantensures a consistent estimator for Gaussian
data). Many
methods have been created to choose a threshold value at which smaller coefficients are removed. In this paper weonly
consider two: universalthresholding (or VisuShrink)
and SUREshrink. Detaileddescriptions
of these twotechniques
can be found in Donoho and Johnstone[4]
and[5]
respectively.
After
shrinkage
the transform can be invertedproducing
an estimate of the true VIS or VAS curve.4.2. A
simple example
As an
example
of waveletshrinkage
weapply
thetechnique
to thedata presented
in
figure
5 which contains the VIS curve forsubject
09. VIS curve forsubject
09 issimple
as itclearly only
contains onehump.
The curve contains 8192points
andwas recorded over about 20 minutes. Curve 09’s
simplicity
is useful because if ourmethods work well on this
simple
curve thenthey
should also work well on morecomplicated examples.
FIGURE 5
Raw VIS
curve for subject
09.Figure
6 shows aplot
of the coefficients from a discrete wavelet transform ofsubject
09’s VIS curve. Theplot
shows thepyramidal
structure of the wavelet coefficients: each small vertical line in theplot corresponds
to a discrete wavelet coefficient. The horizontal axis in thisplot corresponds exactly
to the horizontal time axis infigure
5(although they
are labelleddifferently).
The coefficient at thetop (resolution
level0)
isdoo,
those at resolution level 1 ared10
andd11,
and soon just
like the
pyramid
in(7), except
we have omitted toplot
co. The coefficients at thetop
are coefficients oflarge-scale
wavelets and those at the bottom are coefficients of the small-scale wavelets. Notice how there are many more small-scale wavelets -basically
because you can fit more in at that resolution level and retainorthogonality
between wavelets.
Wavelet
Decomposition
CoefficientsFIGURE 6
Haar wavelet
coefficients for subject
09’s VIS curve.Each level in this
plot
has beenmagnified
to its maximum extent so that you can see the structure.If all coefficients
were to beplotted
to the same scale then you would not see the
higher
resolutioncoefficients they
would be too small.Our heuristics above are illustrated very
nicely by figure
6. Theunderlying signal (the large hump)
is exhibited in thelarge
coefficients at horizontalposition
1450 and2270 at resolution levels from 2
upwards.
The"hump"
issparsely represented by
these coefficients. The noise has
actually spread
out over all coefficientsfairly evenly.
However,
since thehigher
resolution levels have beenmagnified (so
that you can seethem)
the noise is clearest at resolutionlevel 12).
Now we
apply
hardthresholding estimating
the noiseby using
the robustMAD-based estimate.
Using
universalthresholding
we obtain a threshold value ofapproximately
0.136.Any
coefficient smaller than this value in absolutemagnitude
is removed - this results in a tableau of coefficients
depicted
infigure
7. Noticehow the
procedure
hassuccessfully
removed most of the noise.Finally,
to obtain ourestimate we invert the hard thresholded wavelet coefficients: the estimate is shown in
figure
8. Note that the estimate is verygood apart
from around thejump:
here theestimate overshoots both at the
jump
up andjump
down. The next section describes how we used a moresophisticated
waveletalgorithm
toproduce
a better estimate.Wavelet
Decomposition
CoefficientsFIGURE 7
Haar wavelet
coefficients after
hardthresholding using
a universal threshold.Each level in this
plot
has beenmagnified
to its maximum extent so that you can see the structure.
4.3.
Using
theTI-transform
The overshoot illustrated in
figure
8 is agood practical example
of Gibb’s effect at work. A number ofthings
were tried toimprove
the estimation:2022 we tried to
change
theunderlying
wavelet from a Haar wavelet.However,
none ofthese wavelets gave an estimate that was as
good
as with Haar thusconfirming
ourinitial guess that Haar
wavelets,
sincethey
match theunderlying
structure of thesignal,
areappropriate
for the itch data.FIGURE 8
Estimate
produced by
inversionof hard
thresholdedcoefficients
inFigure
7.2022 we tried
changing
the threshold valuemanually. Occasionally
we obtained a"perfect"
reconstruction withoutovershoot, although
most of the time we either obtained a bad estimate and/or overshoot.However,
this manualfiddling
is notautomatic and
thus,
forlarge
collections of itch data would be too timeconsuming.
. we tried
changing
the method ofthresholding
from hardthresholding.
Ifanything,
the results were worse.
However,
since theproblem
seemed to be a Gibb’s effect(or under/overshooting
on
discontinuities)
we tumed to the work of Coifman and Donoho[2]
whoexplained
that Gibbs effects were
probably
due to the discontinuitiesbeing
located between wavelets(remember
wavelets arealigned
to an nesteddyadic grid). They suggested
thesimple procedure
ofCycleSpinning
whichproduces
waveletshrinkage
estimatesby producing
an average estimate of several shifted versions of the above wavelet shrunk estimate. The idea behind this is that some of the shifted versions will beperfectly aligned
with the discontinuitiesproducing
a better overall estimate. An extension of this idea is toproduce
allpossible
shifted estimates toproduce
a translation-invariant estimate. Thegood
news is that the waveletalgorithm underlying
translation-invariantestimate,
called theTI-transform,
can beproduced rapidly
inO(nlogn) operations hardly
much more than theordinary
discrete wavelet transform.The curve estimation
procedure
with theTI-transform
is similar to that of the waveletshrinkage
method as above: transform thenoisy
datausing
theTI-transform,
shrink thecoefficients,
and thenperform
an "inverse" transform. There areactually
many different "inverse" transforms: one can invert any
particular
shifted set ofcoefficients,
or invert an average of some or all of them. Weadopt
the"average
all shifts" inversion here. The
thresholding
methods described above can still be used here.Figure
9 shows an estimateproduced by
the TI-transform method. It can be seenthat the
overshooting
effect isconsiderably
reduced.Figure
9 illustrates anotherinteresting
feature about this set of itch data: notice that the TI-estimate on the"plateau"
is still a bitnoisy
whereas the estimate at the "zero-level" is very smooth. All the wavelet methods above were somewhatinappropriately
used because the noise is nothomogeneous
and thethresholding
methods above assume
stationary
noise. The noise iscomposed
of two differenttypes
and wavelet methods that can beapplied
are described in the next section.FIGURE 9
Estimate
of
the true itchintensity of subject
09produced by
theTI-transform.
5. A closer look at the noise
The noise structure was examined in closer detail
by looking
at small sections of the VIS curve fromsubject
09. These are shown infigure
10 andsuggest
that the noise behaves in two distinctpatterns depending
on whether theunderlying signal
iszero or
positive.
First of all the data with zero
signal
wasinvestigated.
Aplot
of the first thousandpoints
is shown at thetop
offigure
10. The noise appears to exist at three distinctamplitudes:
. 0.01 itch units - these are very
frequent
e 0.02 - around 1 of these
spikes
for every 100 datapoints
e 0.03 -
only
1 in the whole data setThe data with
positive signal
was theninvestigated.
Onviewing
a sequence of 1000points
from this section of thesignal (bottom plot
inFigure 10)
itappeared
that the
signal
occured at a level of0.145,
and that there was noise at thefollowing amplitudes:
. 0.005 - very
frequently.
. 0.015
. 0.025
On closer
inspection
it was revealed that the situation wasslightly
morecomplex.
Thesignal
seemed to oscillate between the values 0.14 and 0.15. As wellas this oscillation there was noise of
amplitude
0.01 in both directions and also noise ofamplitude 0.02,
but this did not occurfrequently
andonly
occured in thepositive
direction.
One
possible explanation
for the variousamplitudes
was thatthey
were dueto human induced noise
(i. e.
handshaking
whileoperating
thepotentiometer)
andmachine induced noise.
However,
human noise isunlikely
to have a constantamplitude
so it could be due to different
types
of machine inducednoise,
forexample
differentamplitudes
of feedback in the electricalcircuitry.
Thesepossible explanations
wereinvestigated
furtherby recording
the VIS curve without asubject operating
thepotentiometer
and it was found that machine noise wasresponsible.
"Zero-signal"
section"Non-zero-signal"
sectionFIGURE 10
V09
Non-interpolated:
Twodifferent
sectionsof
1000points.
The added noise is
clearly
discrete in nature and it ispossible
that the Poisson distribution(or
some closevariant) might
be suitable formodelling
it.Indeed,
most wavelet
shrinkage
models(including
the ones we usedabove)
assume additivenormally
distributed errors which iscertainly
not the case here(and
the models do not assume a discrete process whoseintensity might
bepiecewise constant). Kolaczyk [9]
provides
a more detaileddescription
of what to do in certain cases of Poissonnoise, although
this extension isbeyond
the scope of thepresent
paper.Indeed, empirically
the variance of the noise does not increase with the mean so strict Poisson noise is
unlikely. However,
since we know theamplitude
and form of the noise it isquite
easy to choose a threshold value. Since we are
using
Haar wavelets we caneasily compute
the size of thelargest
Haar wavelet coefficient duepurely
to the noise(since
we know the size of the smallest noise
spike). Choosing
the threshold like this results in a threshold of 0.022. Thisproduces
anacceptable estimate, although
a few noisespikes
areerroneously
retained.Figure
11 shows a denoised version of the first 8192 observations of VASrecording
in the bottomright
offigure
1 - thedenoising
wascarried out
using
theTI-transform,
with universalthresholding
andusing
the MAD-based estimate of noise level estimation. This shows that
although
the noise structureis not
normally
distributed the TI-method for normal noise isreasonably
robust.FIGURE 11
Denoised version
of Subject
11’s VASrecording.
The
TI-transform
with universalthresholding using
a MAD-based estimateof
the noise level was used. Theresulting
threshold was 0.031.6. Conclusions and further work
In this paper we have demonstrated how itch
sensitivity
data may be denoised.We have used two different
methods: jump-tolerant moving
averages and Haar waveletshrinkage.
Our mainobjective
was to find a fast automatic method to denoise itchsensitivity
curves - use of the TI-transform with universalthresholding
workedextremely
welland just
as well asthe jump-tolerant moving
averages with a manual choice ofmoving
average parameters.Further
investigation
of the noise structure revealed that it was discrete in nature and that classical waveletshrinkage
methods are, intheory, inappropriate (although,
the one we
finally
selected appears to workwell).
It may be the case that modelsusing
Poisson distributed noise may
be just
as, if not more, successful.Our
analyses
above assume that the noise is i.i.d. It may beappropriate
to extendour
analyses
to toincorporate
Poisson-like error distributions and also more advancedthresholding techniques
such as:choosing
different thresholds for different resolution levels(for
correlateddata),
see Johnstone and Silverman[8]; applying
the methods of Kovac[10]
which extend the waveletshrinkage
method to datasets witharbitrary design
and number ofpoints
whilstretaining
thespeed
ofcomputation.
7.
Acknowledgments
We would like to thank Francis McGlone for
supplying
the data used in this article. We would like to thank three referees and the editorial team forhelpful
comments on this article.
Références
[1] BURRUS S.L.,
GOPINATH R.A.(1997).
Wavelets: introduction to wavelets and wavelettransforms.
PrenticeHall, Englewood Cliffs,
NJ.[2]
COIFMANR.R.,
DONOHO D.L.(1995).
Translation-invariantde-noising.
InA. Antoniadis and G.
Oppenheim, editors,
Wavelets and Statistics:Proceedings of the
XVth RencontresFranco-Belges
desStatisticiens.,
volume 103 of Lecture Notes in Statistics.Springer-Verlag.
[3]
DAUBECHIES I.(1992).
Ten Lectures on Wavelets.SIAM, Philadelphia.
[4]
DONOHOD.L.,
JOHNSTONE I.M.(1994).
Idealspatial adaptation by
waveletshrinkage. Biometrika, 81(3):
425-55.[5]
DONOHOD.L.,
JOHNSTONE I.M.(1995). Adapting
to unknown smoothness via waveletshrinkage.
J. Am. Statist.Ass.,
90: 1200-1224.[6]
DONOHOD.L.,
JOHNSTONEI.M.,
KERKYACHARIANG.,
PICARD D.(1995).
Waveletshrinkage: asymptopia? (with discussion).
J. R. Statist. Soc. B, 57: 301-337.[7]
HARRIS R.(1967). Iontophoresis.
In S.Licht, editor, Therapeutic Electricity
andUltraviolet
Radiation,
pages 156-178. ElizabethLicht,
New Haven.[8]
JOHNSTONEI.M.,
SILVERMAN B.W.(1997).
Wavelet threshold estimators for data with correlated noise. J. R. Statist. Soc.B,
59: 319-351.[9]
KOLACZYK E.(1998).
Waveletshrinkage
estimation of certain Poissonintensity
signals using
corrected thresholds.(submitted
forpublication).
[10]
KOVAC A.(1998).
Waveletthresholding for unequally spaced
data. PhDthesis, Department
ofMathematics, University
ofBristol, Bristol,
UK.[11]
MALLAT S.G.(1989).
Atheory
for multiresolutionsignal decomposition:
the wavelet
representation.
IEEE Transactions on PatternAnalysis
and MAchineIntelligence, 11(7):
674-693.[12]
MEYER Y.(1992).
Wavelets andOperators. Cambridge University Press, Cambridge.
[13]
NASONG.P.,
SILVERMAN B.W.(1994).
The discrete wavelet transform in s.Journal
of Computational
andGraphical Statistics,
3: 163-91.[14]
SPOKOINY V.G.(1998). Image denoising: pointwise adaptive approach.
Tech-nical