• Aucun résultat trouvé

Inferring geostatistical properties of hydraulic conductivity fields from saline tracer tests and equivalent electrical conductivity time-series

N/A
N/A
Protected

Academic year: 2021

Partager "Inferring geostatistical properties of hydraulic conductivity fields from saline tracer tests and equivalent electrical conductivity time-series"

Copied!
16
0
0

Texte intégral

(1)

HAL Id: insu-02957324

https://hal-insu.archives-ouvertes.fr/insu-02957324

Submitted on 10 Nov 2020

HAL is a multi-disciplinary open access

archive for the deposit and dissemination of

sci-entific research documents, whether they are

pub-lished or not. The documents may come from

teaching and research institutions in France or

abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est

destinée au dépôt et à la diffusion de documents

scientifiques de niveau recherche, publiés ou non,

émanant des établissements d’enseignement et de

recherche français ou étrangers, des laboratoires

publics ou privés.

Distributed under a Creative Commons Attribution - NonCommercial - NoDerivatives| 4.0

Inferring geostatistical properties of hydraulic

conductivity fields from saline tracer tests and

equivalent electrical conductivity time-series

Alejandro Fernandez Visentini, Niklas Linde, Tanguy Le Borgne, Marco Dentz

To cite this version:

Alejandro Fernandez Visentini, Niklas Linde, Tanguy Le Borgne, Marco Dentz.

Inferring

geo-statistical properties of hydraulic conductivity fields from saline tracer tests and equivalent

elec-trical conductivity time-series.

Advances in Water Resources, Elsevier, 2020, 146, pp.103758.

(2)

ContentslistsavailableatScienceDirect

Advances

in

Water

Resources

journalhomepage:www.elsevier.com/locate/advwatres

Inferring

geostatistical

properties

of

hydraulic

conductivity

fields

from

saline

tracer

tests

and

equivalent

electrical

conductivity

time-series

Alejandro

Fernandez

Visentini

a,∗

,

Niklas

Linde

a

,

Tanguy

Le

Borgne

b

,

Marco

Dentz

c

a Institute of Earth Sciences, University of Lausanne, Lausanne CH-1015, Switzerland b Université de Rennes 1, CNRS, Géosciences Rennes UMR 6118, Rennes 35042, France c IDAEA-CSIC, Jordi Girona 18-26, Barcelona 08034, Spain

a

r

t

i

c

l

e

i

n

f

o

Keywords:

Equivalent electrical conductivity Approximate Bayesian computation Geostatistics

Solute spreading and mixing Hydrogeophysics

a

b

s

t

r

a

c

t

WeuseApproximateBayesianComputationandtheKullback–Leiblerdivergencemeasuretoquantifytowhat extenthorizontalandverticalequivalentelectricalconductivitytime-seriesobservedduringtracertestsconstrain the2-DgeostatisticalparametersofmultivariateGaussianlog-hydraulicconductivityfields.Consideringaperfect andknownrelationshipbetweensalinityandelectricalconductivityatthepointscale,wefindthatthehorizontal equivalentelectricalconductivitytime-seriesbestconstrainthegeostatisticalproperties.Thevariance, control-lingthespreadingrateofthesolute,isthebestconstrainedgeostatisticalparameter,followedbytheintegral scalesintheverticaldirection.Wefindthathorizontallylayeredmodelswithmoderatetohighvariancehavethe bestresolvedparameters.Sincethesalinityfieldattheaveragingscale(e.g.,themodelresolutionintomograms) istypicallynon-ergodic,ourresultsserveasastartingpointforquantifyinguncertaintyduetosmall-scale het-erogeneityinlaboratory-experiments,tomographicresultsandhydrogeophysicalinversionsinvolvingDCdata.

1. Introduction

Time-lapse electrical geophysical methods are popular in hydro-geology (e.g.,Binley et al., 2015; Singha et al., 2015) as they pro-videnon-intrusivemeansforremote anddensespatio-temporal sam-plingrelatedtoflowandtransportprocesses.Amongthese,the direct-current(DC)methodiscost-effective,easytoemployandprobablythe mostcommonlyused(Binleyetal.,2015).Ithasbeenthoroughly as-sessedthroughnumericalinvestigations(e.g.,Vanderborghtetal.,2005; SinghaandGorelick,2005;FowlerandMoysey,2011),laboratoryand controlledtankexperiments (Slater etal., 2000;Koesteletal.,2008; Jougnotetal.,2018),andfieldinvestigations(e.g.,Dailyetal.,1992; Binleyetal.,2002;SinghaandGorelick,2005).

DCmeasurementsaregenerallybasedontwopairsof electrodes: onepairforestablishingaknownelectricalcurrentbetweentwopoints, andtheother formeasuringtheresultantelectrical voltagebetween twootherpoints(e.g.,KellerandFrischknecht,1966).Inthecontext of time-lapseDCtomographicexperiments,themeasurementprocess is repeatedusingmultiplecurrent andvoltageelectrodepairsat dif-ferentpositions,andthemeasurementprotocolisrepeatedovertime. Suchameasurementprocessisoftenreferredtoastime-lapse Electri-cal ResistivityTomography(ERT),anditoutputstime-seriesof elec-tricalresistances(voltageoverinjectedcurrent)thatinsaturated me-diacarryinformationaboutthetime-evolutionofthesalinity

distribu-∗Correspondingauthor.

E-mailaddress:[email protected](A.FernandezVisentini).

tion(e.g.,LesmesandFriedman,2005).Thetime-lapse ERTmethod hasbeenappliedduringconservativesalinetracerteststoextractboth flow and transport information. Retrieval of hydraulic conductivity fromsuchdataisdiscussed,forexample,inKemnaetal.(2002)and Vanderborght etal.(2005) andtherangeof applications span from thecalibration of meanhydraulic conductivity values (Binley et al., 2002) to retrieval of the full distribution of hydraulic conductivity (PollockandCirpka,2012).Extractionofsolutetransportparameters hasbeenstudiedindetailand(Kemnaetal.,2002),forinstance, pro-videda fielddemonstrationofretrievingequivalent1-Dstream-tube advective-dispersive transportparameters in thecontext of 3-D con-servative saline transport, results later corroborated numerically by Vanderborghtetal.(2005).Also(Koesteletal.,2008)inferredthe3-D distributionofsolutevelocitiesanddispersivitiesinasoilcolumnusing time-lapseERTdata.

Overtime,theuseofgeoelectrical-monitoredtracertestshasevolved fromqualitativeanalysessuchassalineplumemotiondetectionand geometry delineation (e.g., Slater et al., 2000) to obtain quantita-tiveandspatially-resolvedhydrologicalconstraints. Nevertheless, us-ingtime-lapse DCdataforquantitativehydrogeologicalpurposes re-mainsapersistentchallenge(Singhaetal.,2015).Thischallengeis inti-matelyrelatedtotheuse oftime-lapse inversionmethodologies that provide resolution-limitedtime-evolvingimagesofelectrical resistiv-ityorconductivitythroughtime(Singhaetal.,2015).Themost com-monapproachtotranslateresultinggeophysicaltime-lapsetomograms

https://doi.org/10.1016/j.advwatres.2020.103758

Received10July2020;Receivedinrevisedform18September2020;Accepted19September2020 Availableonline19September2020

0309-1708/© 2020TheAuthors.PublishedbyElsevierLtd.ThisisanopenaccessarticleundertheCCBY-NC-NDlicense (http://creativecommons.org/licenses/by-nc-nd/4.0/)

(3)

intosalinitydistributionsrestsontwostrongassumptions.Thefirstis thatthereexistsapetrophysicalrelationship,(e.g.,Archie,1942),with knownspatially-invariantparametersdefinedatthediscretizationscale ofthetomogram,implyingthatitcorrespondstotheRepresentative El-ementaryVolume(REV)scale(Hill,1963)ofbulkelectrical conductiv-ityand,consequently,thattheimpactofsalinityheterogeneityis neg-ligiblebelowthisscale.Thesecondassumptionisthattheresolution ofthegeophysicaltomogram isthesameasthemodeldiscretization, whichishardlytrueforanyelectricalsurvey.Inreality,thetomogram representsspatially-varyingweightedaveragesoveramuchlarger vol-ume(e.g.,Friedel,2003).Withthesetwoassumptions,temporal differ-encesintime-lapsetomogramscanreadilybetranslatedintoestimates of salinitydifferences.Unfortunately,thisapproachtypicallyleadsto anunderestimationofactualtracermasswitherrorsoftenapproaching oneorderofmagnitude(e.g.,Binleyetal.,2002;SinghaandGorelick, 2005;Laloyetal.,2012).Researchhasaddressedthesecond assump-tionbyupscalingthepetrophysicalrelationshiptothetomographic res-olutionusingeitherlinearizedinversetheory(Day-Lewisetal.,2005; Nussbaumeretal.,2019)orMonteCarlo-basedsimulationapproaches (e.g.,Moyseyetal.,2005).

Inthiswork,weareprimarilyconcernedwiththefirstassumption, namelythattheimpactofsalinityvariationsisnegligiblebelowagiven scale.Toavoidcomplicationsinherenttotomographicimaging,we fo-cushereonthecaseofatime-evolvingequivalentelectricalconductivity tensorofa2-Dsquaresampleofunitlengththatisinvadedbyasaline (i.e.,electricallyconductive)tracer.Inatomographicsetting,thisscale canbethoughtofasthemodelresolutionatagivenlocationofinterest. Forthiscase,theequivalentelectricalconductivityinagivendirection isreadilyobtained,basicallybydividingtheelectriccurrentwiththe im-posedvoltage.Thetotalcurrentisthemacroscopicfluxoftheinternal currentdensityfield(i.e.,thedistributionofsmall-scalecurrentswithin thesample)that,foragiveninternaldistributionoflocalconductivities, isestablishedsuchthatitsassociatedenergylossduetoJoule’s dissipa-tion, integratedoverthedomain,isminimized(e.g.,Feynmanetal., 2011;Bernabé andRevil,1995).Thisgoverningprincipleleadsto pat-ternsofcurrentchannellinganddeflectionthroughandfromhighand lowelectricalconductivityzones,respectively,anditgovernsthetime variationsofthecurrentdensityfieldasthesalinetracerinvadesthe sample(e.g.,LiandOldenburg,1991).Accuratepredictionofthe time-evolutionoftheequivalentelectricalconductivityofthemedium,thus, requiresaccountingforinteractionsoccurringthroughoutthedomain and,givenanarbitrarily-shapedtime-evolvingelectricalconductivity field,thisremainsanopenupscalingproblembelongingtothefamilyof conductivityupscalinginspatiallynon-stationaryfields(e.g., Sanchez-Vilaetal.,2006andreferencestherein).Thecurrentlackofphysically accurateupscaling proceduresimpedes reliablequantitativeanalyses ofasalineplume’sfatefromgeoelectricalmonitoring.Forinstance,in themostcommoncasewhereArchie’spetrophysicallaw(Archie,1942) isusedtoinferthemeansalineconcentrationwithinthesamplefrom itsequivalentelectricalconductivity,theunderlyingassumptionisthat theinternalelectricalconductivityfieldbehavesasanadditive prop-ertythatcanbeupscaledbytakingitsarithmeticaverage.Thisisonly trueiftheelectricalconductivityfieldisconstantorifitsdistribution is layeredandtheequivalentelectrical conductivityismeasured par-alleltothislayering,correspondingtotheupperWienerbound(e.g., MiltonandSawicki,2003).Ingeneral,sinceportionsofthe concentra-tionfieldareby-passedbytheestablishedcurrentpatterns,theupper Wiennerbounddoesnotapplyandthisleadstotheabove-mentioned apparentmasslossasdemonstrated,forexample,inarecentlaboratory study(Jougnotetal.,2018).Theseissuesalsoimpacttheperformanceof manyfully-coupledhydrogeophysicalinversionapproachesand model-ingstudiesthatinterpretequivalentelectricalconductivitytime-series using equivalenttransportparameterswithin anadvective-dispersive description(e.g.,Kemnaetal.,2002;Vanderborghtetal.,2005;Koestel etal.,2008).Onamorepositivenote,thediscussionabovealsosuggests

thatelectricalconductivitytime-seriesatagivenscalecarrystatistical informationontheconcentrationfieldanditstemporalevolution.

Hereweinvestigatetowhatextenttracertestsassociatedwith time-seriesofequivalentelectricalpropertiesapre-definedscalecanbeused toinfergeostatisticalpropertiesofhydraulicconductivityfieldsbelow thisscale.ThisisachievedbyconsideringinferencewithinaBayesian inferenceframework(e.g.,Gelmanetal.,2013;Tarantola,2005),more specificallythroughanApproximateBayesianComputationalapproach (e.g.,Beaumontetal.,2002;Sissonetal.,2018).Forcomparison pur-poses,themassbreakthroughcurveisalsoevaluatedanditsinformation contentiscomparedtoitselectricalpeers.UsingaBayesianapproach al-lowsassessingtheinformationgainedonthepropertiesofinterestwith respecttotheirassumedpriorstatistics.Weperformourstudyusinga databaseconsistingof105synthetically-generatedequivalentelectrical

conductivitytensorandmassbreakthroughtime-series collected dur-ingsalinetracertestswithina2-Ddomainwithhydraulic heterogene-ityprescribedbymultivariateGaussianfields.Weconsider advectively-dominatedsolutetransport(i.e.,highPécletnumbers),wherethe con-centrationfieldevolutionispredominantlydeterminedbythe under-lyingflow field,which inturn depends on theunderlying hydraulic conductivityfieldundertheconstantappliedpressuregradient.Inthis study,weconsideridealizedscenariosasitisassumedthatthereisno spatialvariationsinpetrophysicalpropertiesandthatthepetrophysical relationshipisknown.

InSection2 wereview thebasicgoverning equationsdescribing groundwaterflow,solutetransportandelectricalconductiontogether withtheirnumericalimplementations.InSection3weintroducethe in-ferenceproblemofinterestalongwiththeBayesianinferencetools.The mainresultsarepresentedanddiscussedinSections4and5, respec-tively.Section6concludesthepaper.

2. Governing equations and problem setup 2.1. Groundwaterflow

Forsteady-stateflowandintheabsenceofsourcesorsinks,mass conservationofanincompressiblefluidisexpressedbythecontinuity equationforthespecificdischarge q(x) :

.q (x )=0, (1)

where x =(𝑥,𝑦)𝑇 denotesthe2-Dpositionvectorandxandythe hor-izontalandverticalcoordinates,respectively.Darcy’slawrelates q (x ) withthehydraulicconductivityfield𝐾(x )andthehydraulichead(x ) via

q(x) =−𝐾(x )∇(x ). (2)

AdoptingDarcy’slaw,thegroundwaterflowequationreads: ∇𝐾(x )∇(x )+𝐾(x )∇2(x )=0. (3)

It is customary totreat thelog-hydraulic conductivity field 𝑌(x ) (≡ ln(𝐾(x ))withinageostatisticalframeworkwith𝑌(x )modelledasa second-orderspatially-stationaryergodicrandomfunction.Inthisstudy, weconsidermultivariate-Gaussianrandomfields withanexponential covariancestructure(e.g.,Rubin,2003)withamean𝜇Yandavariance 𝜎2

Y.Theintegralscalesofthefieldareexpressedbytheintegralscale 𝐼yintheverticaldirectionandananisotropyfactor𝜆 (=𝐼x𝐼y).After

specifying𝐾(x ),theflowfield q (x )isobtainedbysolvingEq.(3)with prescribedboundaryconditions(Section2.4.2).

2.2. Solutetransport

Theevolutionoftheconcentrationfield𝑐(x ,𝑡)ofapassivetracer be-ingtransportedwithinthesteady-stateflow-fieldq (x )canbedescribed withinanEulerianframeworkusingtheadvection-dispersionequation

(4)

where𝜃 istheporosityand D isthedispersiontensor.Inthisstudywe assumeaspatially-constantporosityanddispersiontensor,and further-moreweassumezerodispersivity.InthiscaseandconsideringEq.(1), Eq.(4)simplifiestotheadvection-diffusionequationwithconstant co-efficients:

𝜃 𝜕𝑐

𝜕𝑡 +q (x ).𝑐𝜃𝐷𝑚∇2𝑐=0, (5)

whereDmdenotesthemoleculardiffusioncoefficient.Aftersolvingfor

𝑐(x ,𝑡),theflux-weightedtracermass-breakthroughtime-seriesM(t)are definedby

𝑀(𝑡)=∫Γ𝑜𝑢𝑡𝑞𝑥

(x )𝑐(x ,𝑡)𝑑x

∫Γ𝑜𝑢𝑡𝑞𝑥(x )𝑑x

, (6)

with𝑞𝑥(x )beingtheflow-componentinthex-directionandΓoutthe

out-flowboundaryofthemodeldomain.

2.3. DCconduction

ElectricchargeconservationisintheDCproblemexpressedbythe continuityequationofthecurrentdensityfieldJ(x,t) attime-lapse ac-quisitiontimet.Intheabsenceofcurrentsourcesandnetaccumulation ofelectriccharge,ittakesthefollowingform:

.J (x ,𝑡)=0. (7)

Ohm’slawrelates J (x ,𝑡)withtheelectricalconductivity𝜎(x ,𝑡)and theelectricfield E (x ,𝑡)viathelinearrelationship J (x ,𝑡)=𝜎(x ,𝑡)E (x ,𝑡). Adopting thequasistaticapproximation, ∇×E (x ,𝑡)=0,allows to ex-pressE (x ,𝑡)=−∇𝜙(x ,𝑡),where𝜙(x ,𝑡)istheelectricalpotential.Writing

J (x ,𝑡)intermsof𝜙(x ,𝑡)asJ (x ,𝑡)=−𝜎(x ,𝑡)∇𝜙(x ,𝑡)andreplacingthis ex-pressionintoEq.(7)resultsinthegoverningLaplaceequationforthe electricalpotentials:

𝜎(x ,𝑡)∇𝜙(x ,𝑡)+𝜎(x ,𝑡)∇2𝜙(x ,𝑡)=0. (8) Weconsiderthehorizontalandverticalcomponentsofthe equiv-alent electrical conductivitytensor time-series of a2-D square sam-pleofunitlength.ThisimpliessolvingEq.(8)withalternativemixed Dirichlet–Neumann boundary conditions or “excitation modes”. For

𝜎H(t)(𝜎V(t)),aconstantelectricalpotentialdifferenceΔ𝜙

H𝜙V)along

thehorizontal(vertical)directionisimposed,withzeroelectrical po-tential gradient along the top and bottom (left and right) bound-aries.Theresultingelectricalpotentialfieldsare,respectively,𝜙𝐻(x ,𝑡) and𝜙𝑉(x ,𝑡).Thecorrespondingequivalentelectricalconductivity

time-seriesarecomputedas

𝜎𝐻(𝑡)= 1 Δ𝜙𝐻∫Γ𝑦𝜎(x ,𝑡)∇𝑥𝜙𝐻(x ,𝑡)𝑑x , (9) and 𝜎𝑉(𝑡)= 1 Δ𝜙𝑉 ∫Γ𝑥𝜎(x ,𝑡)∇𝑦𝜙𝑉(x ,𝑡)𝑑x , (10) wheretheintegrationpathsΓyandΓxareanytwogivencontours sepa-ratingtheleftandrightboundariesandthetopandbottomboundaries, respectively, andtheintegrandsineachequationisthehorizontalor verticalcomponentofthecurrentdensityfieldresultingfromeach ex-citationmode.

2.4. Numericalimplementationsandproblemsetup

Wecreateadatabaseof105time-seriesof𝜎H(t),𝜎V(t)andM(t)that

arecollectedduringtracertestssimulatedwithinmultivariateGaussian log-hydraulic conductivityrealizations ina square-shapeddomainof sidelength𝐿=1mdiscretizedinto250×250elements.

Fig. 1.Generated sample of size 𝑃=105 of geostatistical parameters m=

(𝜎2

𝑌,𝐼y,𝜆)drawnfromajointpdf𝜋(m).Eachrealizationisusedtogetherwith

anassociatedR-realizationtocreatealog-hydraulicconductivityfieldonwhich flowandtransportsimulationsareperformed.

2.4.1. Generationofhydraulicconductivityfields

The log-hydraulic conductivity field realizations 𝑌(x ) are gener-atedusing thefast circulantembeddingtechnique (seeDietrichand Newsam,1997fordetails).Agivenrealizationdependsonthe speci-fiedgeostatisticalmodelparametersandR ;a250×250arandomdraw fromastandardnormaldistribution.Thegeostatisticalmodel parame-tersdeterminethespatialregularity(smoothnessclass),while R deter-minesthelocationsofhighandlowlog-hydraulicconductivityvalues relativetothemeanvalue𝜇Yofthegeostatisticalmodel.Here𝜇Yis

fixedat-6whileremainingparametersaretreatedasrandomvariables

m =(𝜎2

Y,𝐼y,𝜆)describedbyajoint probabilitydensityfunction(PDF) 𝜋(m ).Thevariance𝜎2

𝑌 israndomlydrawnfromauniformPDFwith

sup-port[0,5.5],theintegralscale𝐼yisdrawnfromalog-uniformPDFwith

support[L/25,L/2]m,andtheanisotropyfactor𝜆 (=𝐼𝑥𝐼y)isdrawn

fromauniformPDFwithsupport[1,𝐿𝐼y](i.e.,conditionallyon𝐼y).

Thediscretizationimpliesthatheterogeneitiesobtainedwiththe small-estintegralscalesareresolvedwithatleast10cellsineachdirection. Thelog-uniformdistributionof𝐼y isherechosentofavorrealizations

withfinelystructuredfields.Thegeneratedsampleofthegeostatistical modelparametersofsize𝑃=105isrepresentedinFig.1.Notethateach

drawisassociatedwithauniqueR ,whichtogetherformalog-hydraulic conductivityfieldrealization.

2.4.2. Flowsimulations

The groundwater flow equation (Eq. (3)) is solved numeri-cally using the open-source finite-difference solver MODFLOW-2005 (Harbaugh,2005).Theprescribedboundaryconditionsareahorizontal headgradientof0.05inducingflowfromlefttorightandno-flow con-ditionsforthetopandbottomboundaries.Theheadgradientvaluewas chosensuchthatforahomogeneousfieldequaltoexp(𝜇Y)thetracer

ar-rivaltimeoccursapproximatelyathalfofthesimulatedtime-duration ofthetracerexperiment.Inthesimulations,thehydraulicconductivity betweentwoadjacentcellsistakenastheirharmonicmean.Thechosen

(5)

numericalschemeusedtosolvethesystemoflinearequationsis the preconditionedconjugategradientmethod(Hill,1990).

2.4.3. Transportsimulations

The advection-diffusion equation (Eq. (5)) is solved using the groundwater solute transport simulator package MT3D-USGS (Bedekar etal., 2016). Theinitial condition is a homogeneous con-centration field of 0.01g l−1 and the boundary conditions are: (i)

constantconcentrationof1gl−1along theleftboundary(ii)no-flux

alongthetopandbottomboundariesand(iii)free-fluxalongtheright boundary.Theporosityisassumedconstantandequalto0.3.Forthe advectionterminEq.(5),thethird-orderTotalVariationDiminishing (TVD) approach(CoxandNishikawa,1991) isused.TheTVD solver wasfoundtobeveryrobustandshowedminimalnumericaldispersion when benchmarked against planar fronts. Nevertheless, in order to mask the small numerical dispersion, the diffusion coefficient was slightly increased from 𝐷𝑚=1.6× 10−9 m2 s−1 (the standard value

forthediffusioncoefficientofsaltinwater)to𝐷𝑚=2× 10−8m2 s−1. Thelatter(larger)valueisobtainedbyfitting theanalyticalsolution foraconcentrationprofileforastepinjectionin1-D(e.g.,Ogataand Banks,1961)toaTVD-calculatedconcentrationprofileobtainedfora homogeneoushydraulicconductivityfieldequalto𝜇Ywhenthe

diffu-sioncoefficientisimposedtobetheoneofsaltinwater.Eachsimulated tracerexperimentlastsfor4×103sandduringthistimeperiod,400

equidistantsamples𝑐𝑖(x )(𝑖=1,,400)ofthesimulatedconcentration fieldsarerecordedattimes𝑡=(𝑖−1)Δ𝑡,withΔ𝑡=4× 103s∕400=10s.

The injected tracer typicallydoes not fully replace the initial back-groundtracerattheendofthesimulationperiod.Thisisaconsequence oftheshortsimulationtimeimposedbycomputationalconstraintsand large low-velocity regions.The meanPéclet numberis ~ 6 ×103,

defined as 𝑃𝑒= 𝐷̄𝑢

𝑚,where ̄𝑢 is the tracervelocity for the constant

hydraulicconductivityfield.

2.4.4. Electricalsimulations

Foreachsampledconcentrationfield𝑐𝑖(x ),the2-Dsquaredomain isalternativelyexcitedbyimposinganelectricalpotentialdifferenceof 1Vwithapairoflineelectrodesalongeithertheverticalor horizon-talboundariesofthesample.Theremainingboundariesareprescribed zeroelectricalpotentialgradientnormaltotheboundaries.The result-ingelectricalpotentialfields𝜙𝐻

𝑖 (x )and𝜙𝑉𝑖(x )associatedtothe

horizon-talandverticalmodes,respectively,arecomputedbynumerically solv-ingtheLaplaceequation(Eq.(8))withthefinite-elementsolver mod-uleofthePythonlibrarypyGIMLi(Rückeretal.,2017).Forsimplicity, theinputelectricalconductivitydistribution𝜎𝑖(x ),usedforsolvingthe boundary-valueproblemsateachtimestepisassumedtobeperfectly andlinearlyrelatedtothetransportsimulationoutput𝑐𝑖(x ).The

result-ingnormalizeddimensionlesstime-seriesdenotedas𝜎H,𝜎VandMvary

within[0.01,1].Thedatagenerationissummarizedbythepseudo-code inAlgorithm1.

3. Inference problem

Weareinterestedinassessingtowhatextentthetime-series𝜎H,𝜎V

andMmayconstrainthegeostatisticalparameters m =(𝜎2

Y,𝐼y,𝜆).We

considerthefollowingfivecombinationsoftime-series:

d 𝐻∶={𝜎𝐻},

d 𝑉 ∶={𝜎𝑉},

d 𝑀∶={𝑀},

d 𝐻𝑉 ∶={𝜎𝐻,𝜎𝑉},

d 𝐻𝑉𝑀∶={𝜎𝐻,𝜎𝑉,𝑀}.

Thedatavectors d 𝐻,d 𝑉 and d 𝑀 areusedtoassesstheindividual performanceofeachtypeof time-series; d 𝐻𝑉 is usedtoevaluatethe

Algorithm 1: Datagenerationprocedure.

for 𝑗=1to105do

Draw geostatistical model realization m =(𝜎2

𝑌,𝐼y,𝜆)

andR

Generate hydraulic conductivity field𝐾(x ) Simulate steady-state Eulerian flow fieldq (x )

for 𝑖=1to400 do

Specify sampling time 𝑡 as 𝑡=(𝑖−1)Δ𝑡 Simulate concentration field 𝑐𝑖(x ) Compute𝑀𝑖,𝜎𝐻𝑖 and𝜎𝑖𝑉

end

Save 𝐾(x )

Save concentration field time-series

C =[𝑐1(x ),,𝑐400(x )]

Save time-series of mass breakthrough

𝑀=[𝑀1,,𝑀400] and electrical conductivity

𝜎𝐻=[𝜎𝐻

1 ,,𝜎400𝐻] and𝜎𝑉 =[𝜎𝑉1,,𝜎𝑉400]

end

performanceofelectricaldataaloneand d 𝐻𝑉𝑀isusedtoevaluatethe valueofusingallthedataatthesametime.Wecasttheproblemasa Bayesianinferenceframeworkasoutlinedbelow.

3.1. Bayesianinferenceframework

In a finite-dimensional Bayesian inference framework, a model is described in termsof M random variables with realizations m = (𝑚1,,𝑚𝑀)thatcan beused asinputtoa physicalforward

simula-torproducingNsimulateddata d 𝑠𝑖𝑚=(m )(e.g.,Gelmanetal.,2013; Tarantola, 2005).Theprior probability density function𝜋(m ) is up-dated using Bayes’ theorem to a posterior probability density func-tion𝜋(m |d𝑜𝑏𝑠)afterconsidering theobserved data d 𝑜𝑏𝑠=(𝑑

1,,𝑑𝑁)

usingalikelihoodfunction𝜋(d 𝑜𝑏𝑠|m).Thisfunctionevaluatesthe like-lihoodof any modelrealization given andthe residual error vector

e =d 𝑜𝑏𝑠d 𝑠𝑖𝑚 andanassumedunderlyingobservational noisemodel (e.g.,Tarantola,2005).Bayes’theoreminitsunnormalizedformreads:

𝜋(m |d𝑜𝑏𝑠)∝𝜋(d 𝑜𝑏𝑠|m)𝜋(m ). (11)

In our context, the prior is given by the PDF described in Section2.4.1and d 𝑜𝑏𝑠isthenoise-contaminatedoutputoftheforward simulator,(m 𝑡),when evaluatedusing oneof thetestcases m 𝑡 de-scribedinSection4.2.Fortheelectricaltime-series,(m )isformedby thesequentialapplicationofthefollowingforwardmappings:(i)the re-alizationofthehydraulicconductivityfieldK(x ),(ii)solvingthe ground-waterflowEq.(3),(iii)theadvection-diffusionEq.(5),(iv)theLaplace Eq.(8)and(v)evaluatingtheequationsdefining𝜎H(9)and𝜎V(10). 3.2. Posteriordensityapproximation

InBayesianinference,MonteCarlo(MC)samplingcanbeusedto approximate𝜋(m |d𝑜𝑏𝑠)byaMCintegrationoverafinitesampleofthe

soughtdistribution(e.g.,MosegaardandTarantola,1995;Gelmanetal., 2013).ThesimplestapproachisAcceptance-RejectionSampling(ARS), whichconsistsofdrawingsamplesm proportionallytothepriordensity andacceptingthemassamplesoftheposteriordensityproportionally totheirlikelihood𝜋(d 𝑜𝑏𝑠|m).Thisisanexactsamplingmethod(e.g., Mosegaard andTarantola, 1995)andit canbe used off-lineusing a largeensembleofpriormodelrealizationsgiventhat,unlikeinaMarkov ChainMC(MCMC)samplingmethod,thereisnodependencebetween themodelproposals.Itsmaindisadvantageisthat,sincethe parame-tersearchis unguided(unlikeMCMC),theprobabilityof acceptance

(6)

decreasesexponentiallywiththedimensionalityMofthemodel param-eterspace.Asmoredimensionsareaddedtotheproblem,theratioof the(hyper)volumeof highlikelihoodvalues (regionsof large accep-tance probability),tothetotalvolumeof themodelspace,decreases exponentiallytozero(e.g.,Scales,1996;CurtisandLomax,2001).This so-calledcurseofdimensionalitymayresultinunrealistically-largeprior modelsamples,evenwhenaddressingonlyahandfulofparameters.In thecontextofthisstudy,weareinterestedinonlythreegeostatistical parameters(Section2.1)possiblysuggestingthatARScouldbeagood choice.

However,whenthescaleofthemodellingdomainisinsufficiently largecomparedtotheintegralscalesofthefieldY(x )under consider-ation, ergodicconditionsarenotfulfilledimplyingapotentiallyhigh dependenceonR (Section2.4.1).Thishigh-dimensionalvariableis dif-ferentforeachrealizationofY(x )anditultimatelycontrolsthe loca-tionsofhigh-andlowhydraulicconductivityregions.Evenifweare uninterestedin R assuch,itformspartofourdatagenerationprocess and,thus,itenterstheinferenceproblemasanuisancevariable(e.g., Gelmanetal.,2013)thatneedstobeaccountedfor.Consequently,our definitionoftheforwardsimulatorgivenabovehastobeexpandedto (m ,R ).Assumingindependenceof m and R examples,theactual in-ferenceproblemtosolvereads

𝜋(m ,R |d𝑜𝑏𝑠)∝𝜋(d 𝑜𝑏𝑠|m,R )𝜋(m )𝜋(R ). (12) Toobtainthesoughtdensity,weneedtomarginalize𝜋(m ,R |d)with respectto R :

𝜋(m |d)=

∫ 𝜋(m ,R |d)𝑑R . (13) Duetoitshigherdimensionality(morethan62,500variablesinour examples),theproblemexpressedbyEq.(12)ispracticallyimpossible tohandlewiththeformalBayesianARSalgorithm.Forthisreason,we resorttoanapproximateversionoftheARSthatisoutlinedinthe fol-lowingsubsection.

3.2.1. ABCacceptance-rejectionsamplingalgorithm

TheARSalgorithmimplemented withinanApproximateBayesian Computational (ABC) framework, labelled Approximate Acceptance-RejectionSampling(AARS)algorithmfromnowon,isanapproximate samplingmethodthatproducesasmoothapproximationof𝜋(m |d).The readerisreferredto(Sissonetal.,2018)foranoverviewonABC meth-ods.TheAARSalgorithmrequirestwoadditionalinputs:(i)adistance metric𝜌(d 𝑠𝑖𝑚,d 𝑜𝑏𝑠)forcomparingthecalculateddatawiththeobserved dataand(ii)akerneldensityfunction K (𝜌)forweightingthedistance metricanddefininganacceptanceprobability.Together,theyreplace thelikelihoodfunction.

Inourwork,thedistancemetric𝜌(d 𝑠𝑖𝑚,d 𝑜𝑏𝑠)istakenastheL1-norm:

𝜌(d 𝑠𝑖𝑚,d 𝑜𝑏𝑠)= 1 𝑁 𝑁 ∑ 1 |d𝑠𝑖𝑚d 𝑜𝑏𝑠| (14)

andtheKerneldensityischosentobeauniformfunction:

K (𝜌)= {

1 0≤𝜌∕ℎ≤1

0 1<𝜌∕ℎ, (15)

wheretheacceptancebandwidthh ischosensuchthatthe0.5th per-centileofthedistributionof𝜌 orderedfromthelowesttothehighest distance areaccepted.Inourcase,thismeansthat K (𝜌)acceptsthe modelsproducingtheS=500lowestdistancesoutoftheK=105

sam-pledpriorsamples.

TheAARSalgorithm,describedinpseudo-codeinAlgorithm2, pro-ceedssimilarlytotheformalARSalgorithm.

ConsideringAlgorithm2,itcanbenoticedthattheAARSalgorithm drawssamplesfromthejointdistribution

𝜋𝐴𝐴𝑅𝑆(m ,R ,d |d𝑜𝑏𝑠)=K

(𝜌)𝜋(d 𝑜𝑏𝑠|d,m ,R )𝜋(m )𝜋(R ), (16)

Algorithm 2: Approximate Acceptance-Rejection Sampling (AARS)algorithm.

for 𝑘=1,,𝑃 do

Draw m (𝑘) from 𝜋(m ) and R (𝑘) from 𝜋(R )

Generate a data instance d = d 𝑠𝑖𝑚 from the underlying unobserved likelihood𝜋(d 𝑜𝑏𝑠|d,m (𝑘),R (𝑘))

Accept m (𝑘) with an acceptance probability

𝐴𝑃=K (𝜌)

end

which,whenintegratedoverallgenerateddatainstancesgivestheAARS approximationofthe(R -marginalized)posteriordensity:

𝜋𝐴𝐴𝑅𝑆(m |d𝑜𝑏𝑠)=

∫ 𝜋𝐴𝐴𝑅𝑆(m ,R ,d |d𝑜𝑏𝑠)𝑑d ; (17) or,

𝜋𝐴𝐴𝑅𝑆(m |d𝑜𝑏𝑠)=𝜋(m )

K (𝜌)𝜋(d 𝑜𝑏𝑠|d,m ,R )𝑑d . (18)

AspointedoutbySissonetal.(2018),fromEq.(18)theAARScanbe interpretedasaformalBayesianARSalgorithmusinganapproximated likelihoodfunctionthatisaKernelDensityEstimation(KDE)ofthetrue likelihood:

𝜋𝐴𝐴𝑅𝑆(d 𝑜𝑏𝑠|m,R )=

K (𝜌)𝜋(d 𝑜𝑏𝑠|d,m ,R )𝑑d . (19)

Forbuildingtheempiricalposteriorprobabilitydensities, we per-formKDEoverthesamplesobtainedfrom𝜋𝐴𝐴𝑅𝑆(m |d𝑜𝑏𝑠).For

consis-tency,thepriorPDF(Section2.4.1)is computedbyperformingKDE overthegenerated sampleofsize𝑃 =105. TheKDEapproachis

de-scribedinthefollowingsubsection.

3.2.2. Multivariatekerneldensityestimation(KDE)

GivenasampleX ={x 1,,x S}ofsizeSofM-variaterandomvectors

belongingtoacommondistributiondescribedbythedensityg,theKDE estimator̂𝑔ofgisgivenby(e.g.,WandandJones,1994)

̂𝑔H(x )= 1 S S𝑖=1 K H(x x 𝑖), (20)

withtheestimatorfunction K Hdefinedas: 𝐊𝐇(𝐱)=|𝐇|−

𝟏

𝟐𝐾(𝐇𝟏𝟐𝐱), (21)

wherethekernelfunctionKisasymmetricmultivariatedensity. Fur-thermore,|H| isthedeterminantof theM×Mbandwidth matrix H ,

whichissymmetricandpositivedefiniteingeneraland,iftheM vari-ablesareassumedindependent,itisdiagonalwithentries H 𝑖givenas √

H 𝑖=ℎ𝜎𝑖, whereh isthebandwidthparameterand𝜎i thestandard

deviationoftheithcomponentoftherandomvariable.

TheestimatorofEq.(20)isanaverageofkerneldensitiesthatare centeredatthesamplepointsandwhosedecayiscontrolledby H .The particularchoiceofKdoesnotsubstantiallyinfluencetheperformance oftheKDEapproach,butthechoiceofthebandwidthh,defining H

(i.e.,thetailsofK),isamostcrucialaspect,giventhatunder-or over-smoothedestimatorswillbeproducedifitistakentoosmallorlarge, respectively(e.g.,WandandJones,1994).Inthepresentwork, Kis chosenasthestandardmultivariatenormalfunction

K H(x )=(2𝜋)𝑁 2|H|− 1 2𝑒𝑥𝑝 { −1 2x TH −1x } . (22)

Forh,acommonchoicewhendealingwithunimodaldistributions, astheonesexpectedinthisstudy,isbasedonSilverman’sruleofthumb (e.g.,Silverman,1986): ℎ𝑠=𝑀4+2 1 𝑀+4 𝑑𝑀+41 . (23)

(7)

Fig.2.(Coloronline)(a)Weaklyheterogeneoushydraulicconductivityfieldwithgeostatisticalparameters(𝜎2

𝑌,𝐼y,𝜆)=(0.005,0.130m,3.179).(b)Corresponding

steady-stateflowfieldand(c)normalizedconcentrationfieldattime103s.(d)Time-seriesofthehorizontalandverticalequivalentelectricalconductivity,

mass-flux,andmeantracerconcentration,denoted𝜎H,𝜎V,Mand𝜇

c,respectively.Thelight-blueverticalline,alsopresentin(e),marksthetime103softheconcentration

fieldshownin(c).(e)Time-derivativesof𝜎H,𝜎V,Mand𝜇 c.

The reliabilityof theused information measure (Subsection 3.3) largelydependsonthequalityoftheinputdensityestimationsprovided bytheKDEapproach(e.g.,Budkaetal.,2011).Consideringthe trade-offspertainingtothechoiceofh,amanualtuningprocesswas neces-sary,whichresultedinthechoiceof=0.75𝑠fortheresultspresented herein.

3.3. Informationmeasure:Kullback–Leiblerdivergence

Thedegreeofknowledgebroughtbytheobserveddatad 𝑜𝑏𝑠 pertain-ingtothegeostatisticalmodelparametersisevaluatedbycomparingour approximationof𝜋(m |d𝑜𝑏𝑠)with𝜋(m ).TheKullback–Leiblerdivergence (KLD)(KullbackandLeibler,1951),alsotermedRelativeInformation Content(Tarantola,2005),isprobablythemostwidelyused quantita-tivemeasureforcomparingPDFs:

𝐾𝐿𝐷(𝜋(m |d𝑜𝑏𝑠);𝜋(m ))= ∫ 𝜋(m |d𝑜𝑏𝑠)𝑙𝑛 ( 𝜋(m |d𝑜𝑏𝑠) 𝜋(m ) ) 𝑑m , (24)

wherethebaseofthelogarithmistakenase,givingtheinformation in unitsof nats (e.g.,Cover andThomas, 2012).The integration of Eq.(10) is performedoverthesupport ofthedensitiesandtheKLD isfiniteasthesupportof𝜋(m |d𝑜𝑏𝑠)iscontainedinthesupportof𝜋(m ) (e.g.,CoverandThomas,2012).TheKLDiszerowhen𝜋(m |d𝑜𝑏𝑠)≡ 𝜋(m )

(i.e.,thedatacarrynoinformationaboutthemodelparameters)andit increasesastheposterior becomesmorecompactwithrespecttothe priorasaconsequenceofconditioningtothedata.Notethatwhenthe priorandposteriordensitiesareGaussianwiththesamemean,butthe standarddeviationoftheposteriorishalfthestandarddeviationofthe prior,thentheKLDis0.27nats.

Sinceoursamplesaredrawnfromapproximateposteriordensities

𝜋𝐴𝐴𝑅𝑆(m |d𝑜𝑏𝑠)thatareKDE(i.e.,smoothed)versionsofthetarget

den-sities𝜋(m |d𝑜𝑏𝑠)(Eq.(18)),thechosenAARSapproachprovidesa con-servativeframeworkforassessingtheinformationcontentintermsof

theKLDmeasure,sinceitisalwaystruethat

𝐾𝐿𝐷(𝜋𝐴𝐴𝑅𝑆(m |d𝑜𝑏𝑠);𝜋(m ))<𝐾𝐿𝐷(𝜋(m |d𝑜𝑏𝑠);𝜋(m )), (25)

whichimpliesthattheinformationcontentintheconsideredtime-series isatleastaslargeastheestimatesobtainedbyouranalysis.

4. Results

Wefirstshowtwoexamplesofgenerateddataforend-membercases ofweakandstronghydraulicheterogeneity.Then,wedescribethe re-sultsobtainedfordifferentgeostatisticalparametervaluecombinations intermsoftheKLDandbiasmeasures.Indoingso,wediscussresults obtainedforone R -realization,aswellasensemblestatisticsdeduced from50 R -realizations.

4.1. Twoexamplesofgenerateddata

Fig.2showsanexampleofdataobtainedforaweaklyheterogeneous hydraulic conductivity field with (𝜎𝑌2,𝐼y,𝜆)=(0.005,0.130m,3.179)

(Fig.2a),resultinginanapproximatelyconstantflowfield(Fig.2b).The correspondingconcentrationfield,shownatthesamplingtime103s,

whenthetraceroccupiesapproximately50%ofthemodeldomain, dis-playsanoverallplanarfront(Fig.2c).

Thetime-seriesof𝜎Hand𝜎V(Fig.2d)evolveaccordingtothelower

andupperWienerbounds.Theseupscalingformulasforlaminated ma-terials(e.g.,MiltonandSawicki,2003)correspondtotheharmonicand arithmeticmeansofthelocalelectricalconductivities,respectively.The arithmeticaveraginggoverning𝜎Vismanifestedbylinearscalingwith

time.Inthiscase,𝜎Vformsanalmostperfectpredictorofthemean

salin-ity(𝜇c)withinthesample.𝜎H,onthecontrary,stronglyunderestimates 𝜇c.Forthiscase,themeanvelocityofthetracerfrontisgivenbythe

time-derivativeof𝜎V(Fig.2e),informationthatisavailablebeforethe

(8)

Fig.3. (Coloronline)(a)Stronglyheterogeneoushydraulicconductivityfield,definedwithgeostatisticalparameters(𝜎2

𝑌,𝐼y,𝜆)=(5.111,0.085m,1.028).(b)

Corre-spondingsteady-stateflowfieldand(c)normalizedconcentrationfieldattime103s.(d)Time-seriesofthehorizontalandverticalequivalentelectricalconductivity,

mass-flux,andmeantracerconcentration,denoted𝜎H,𝜎V,Mand𝜇

c,respectively.Thelight-blueverticalline,alsopresentin(e),marksthetime103s,ofthe

concentrationfieldin(c).(e)Time-derivativesof𝜎H,𝜎V,Mand𝜇

c.Thelargepeaksexhibitedby𝑑𝜎

𝐻

𝑑𝑡 and𝑑𝑀𝑑𝑡 approximatelycoincidewiththefirstarrivalofthe

tracerattheoutlet.

Theseeasilyinterpretableresultsarenowcontrastedwiththose ob-tainedforastronglyheterogeneoushydraulicconductivityfield,defined with (𝜎2𝑌,𝐼y,𝜆)=(5.111,0.085m,1.028). The resulting fieldhas

small-scale structuresandis close toisotropic(Fig.3a). Yet itsassociated flowfieldexhibitspronouncedchanneling(Fig.3b)resultinginahighly heterogeneousconcentrationfield(Fig.3c).Neither𝜎Hnor𝜎Vfollow

anyknownupscalinglaw.TheybothstarttovarymuchearlierthanM

(Fig.2d),whichonlyreactswhenthetracerarrivesattheoutlet.These earlyvariationsareclearlyseeninthetime-derivativesofthe electri-calresponses(Fig.2e),whicharenon-zerofromthemomentthetracer injectionstartsandexhibitsmallpeaksthatarerelatedtointernal con-nectioneventsofthesolutethatareinvisibletoM.Both𝜎HandMshow

asteepincreasearound103s,andalargepeakintheirtime-derivatives,

correspondingtoearlybreakthrougharrival.Forthiscase,𝜇cisatearly

timesmuchlargerthanalldataandisasymptoticallyapproximatedby

M,followedinorderofmagnitudeby𝜎Hand𝜎V. 4.2. Testcases

WenowapplytheBayesianinferenceapproachusingthreedifferent combinationsofthegeostatisticalmodelparametervalues:

(i) m 1∶=(4.70,0.06m,1.50).Thisleadstoastronglyheterogeneous

hydraulicconductivityfieldthatisapproximatelyisotropicand exhibitssmallstructures(Fig.4a).

(ii) 𝐦2∶=(0.80,0.06m,10.00). This leadstoa mildly-to-moderately

heterogeneous field that exhibits a high degree of layering (Fig.4d).

(iii) 𝐦3∶=(4.70,0.38m,1.50). This leadsto a highlyheterogeneous

fieldexhibitinglarge-scalestructures(Fig.4g).

InFig.4,examplerealizationsofgeneratedlog-hydraulic conductiv-ityfieldsforthethreetestcasesareshowntogetherwiththeir corre-spondingflowandconcentrationfields.

4.3. Informationassessmentofdatatypes

Foreachtestcaseofthemodelvectorm ,50datasetsd 𝑜𝑏𝑠𝑗 (Section3) aresimulatedusing hydraulicconductivityfieldscreatedwith differ-entR -realizations.Theforwardresponsesarecontaminatedwithnoise havingzeromeanandameandeviationof0.005representing50%of thebaselineelectricalconductivity.Theevaluationofthedifferentdata typesandgeostatisticalparametervaluesisconsideredbothinterms oftheensembleofrealizations(ensembleperformance)andintermsof randomly-pickedsinglerealizations(i.e.,thefieldsshowninFig.4).In additiontotheestimatedjointposteriorPDF,wealsoconsiderthe cor-respondingmarginaldistributionstoevaluatetheabilityofthedatato constrainindividualgeostatisticalparameters.Forthemarginal analy-sis,wealsoconsiderarelativebiasmeasure,computedastheratioof themeanbiasofthemarginalposteriors,withrespecttothetruevalues of m ,tothemeanbiasofthemarginalpriorswithrespecttothetrue values.Fromnowon,wedropthesuperscript“obs” whenreferringto theobservedconditioningdata.

4.3.1. Testcasem1

Table1summarizestheresultsobtainedfortestcase m 1.

WhenconsideringthejointKLDsobtainedfortheensembleof real-izations,wefindthat d 𝐻𝑉 hasthelargestmeanKLD,closelyfollowed by d 𝐻𝑉𝑀.Theleastinformativedatatype d 𝑀 hasameanKLDthatis ~ 75%oftheonefor d 𝐻𝑉,while d 𝐻and d 𝑉 havevaluesin-between. TheKLDstandarddeviations havesimilarvalues amongallthedata typesandrepresent ~ 20%ofthemeanvalues.

WenowturntotheresultsobtainedforthefieldsinFig.4a–candthe correspondingtime-serieshighlightedinFig.5a–c.Forthisspecific real-ization,theKLDsspanasmallrangeofonly ~ 13%.Also,theordering isdifferentandthemostandleastinformativedatasetsforthiscaseare

(9)

de-Fig.4. (a,d,f)Realizationsoflog-hydraulic conductivityfieldsand(b,e,h)associatedflow and(c,f,i)concentrationfieldsattime103s

forthethreeevaluatedtestcases(a–c)m1,(d–

f)m2and(g–i)m3.Notethatthelocationsof

high-andlowhydraulicconductivityregions aregovernedbyrandomR-realizations.

Table1

KLDsandmeanrelativebiasesofm1forthedifferentconditioningdatatypes.Columns1and2show

themean𝜇KLDandstandarddeviation𝜎KLDoftheKLDsusingtheensembleofhydraulicconductivity

realizations.Column3showstheKLDvaluesforthejointposteriorPDFsusingonerealizationof theconditioningdataobtainedfromFig.4a–candhighlightedinFig.5a–c.Thesubsequentpairsof columnsshowthemarginalKLDvaluesandrelativemeanbiasesforthemarginalposteriorsofeach componentofm1usingthisspecificrealization.

Ensemble m 𝜎2

Y 𝐼 y ( m ) 𝜆

𝜇KLD 𝜎KLD KLD KLD Bias KLD Bias KLD Bias d𝐻 0.8171 0.1445 0.7351 0.3754 0.5018 0.1418 0.7560 0.0904 0.8525 d𝑉 0.8016 0.1131 0.6418 0.2617 0.6703 0.1021 0.6760 0.0675 1.2163 d𝑀 0.6625 0.1580 0.6973 0.2153 0.8223 0.1771 0.5325 0.0980 1.1271 d𝐻𝑉 0.8830 0.1352 0.6937 0.2501 0.6716 0.1536 0.7058 0.0899 0.7971 d𝐻𝑉𝑀 0.8712 0.1318 0.6985 0.2386 0.7232 0.1537 0.7189 0.1050 0.7672

viationsoftheKLDsdiscussedabove)thestochasticvariationsthatare inherentundernon-ergodicconditions.Thevariabilityinthegenerated datadue tovariationsin the R -realizations,foragivengeostatistical model,isindicatedbytheinsetsinFig.5a–c.

TheposteriormodelsamplesobtainedbytheAARSalgorithmand usedforbuildingtheempiricalposteriorPDFsforeachtypeofdataare showninFig.5.Thedensitydistributionofthese3-Dcloudsofpoints convey aqualitativeviewoftheabilityofthedifferentdatatypesto constrainthegeostatisticalparameters.Noeye-catchingdifferences dis-tinguishthedifferentpointclouds,reflectingtherathersimilarvalues oftheassociatedKLDs.

The KLDs computed for the marginal posterior PDFs, labelled marginalKLDsfromnowon,arethelargestfor𝜎2

Y,followedbyIyand𝜆,

thatonaverage,represent ~ 50%and~ 25%oftheKLDsof𝜎2

Y,

respec-tively.Wefindthat𝜎2

Yisbestconstrainedby d 𝐻,producingthelargest

marginalKLDandthesmallestbias.Forthisparameter,thepoorest per-formanceisachievedby d 𝑀 thathasboththesmallestmarginalKLD andthelargestbias.Thiscanbeseenintheestimatedmarginal poste-riorprobabilitydensity(Fig.6a)displayingamassdistributionwhichis thefurthestawayfromthetruevalue𝜎2

Y=4.70.ForIy,onthecontrary, d 𝑀 featuresthehighestmarginalKLDandthesmallestbias(Fig.6b). Theabilityofthedatatoconstrain𝜆 islow(Fig.6c)withd 𝐻𝑉𝑀 featur-ingthehighestmarginalKLD.Therelativemeanbiasesarenegatively correlatedwiththeassociatedKLDmeasure,showingconsistency be-tweenthetwomeasures.

(10)

Fig.5.PosteriormodelparametervectorsamplesofsizeS=500obtainedbytheAARSalgorithmfortestcasem1=(4.7,0.06m,1.5)usingdifferentdatasetsas

conditioningdata.Thecoloredcloudsofpointsrepresentthesamplesfordatasets(a)d𝐻;(b)d𝑉;(c)d𝑀;(d)d𝐻𝑉;(e)d𝑉𝑀;(f)d𝐻𝑉𝑀.Thecolormapencodesthe

L1distance𝜌 betweensimulatedandobserveddata,normalizedbytheminimumandmaximumvaluesof𝜌 ofthetestcase.Theinsetplotsof(a),(b)and(c)show,

respectively,the50realizationsoftime-series𝜎H,𝜎VandMgeneratedform

1usingdifferentR-realizations.Thedataconsideredhereforinferenceareshownby

thick-coloredcurves.TheresultingKLDvaluesaregivenforeachdataset.

Fig.6. (Coloronline)MarginalposteriorPDFsassociatedtoeachtypeofconditioningdatad𝐻,d𝑉,d𝑀,d𝐻𝑉,d𝑉𝑀andd𝐻𝑉𝑀fortestcasem1=(4.7,0.06m,1.5).

MarginalpriorandposteriorPDFscorrespondingto(a)𝜎2

Y(b)I𝑦and(c)𝜆.

4.3.2. Testcasem2

Table2summarizestheresultsobtainedfortestcasem 2.

WhenconsideringthejointposteriorKLDsfortheensemble,wefind that d HVMisthemostinformativedatasetfollowedbyd Hand d HV.Far behind,featuringmeanKLDsthatare ~ 60%ofd HVM,are d Vandd M.

Oftheindividualdatasets,wefindthat d Hismuchmoreinformative

than d Vand d M.Thestandarddeviationshavesimilarmagnitudesand represent∼ 20−35%ofthemeanvalues.

Wenowconsidertheresultsobtainedusingthetime-series(Fig.7a– c)obtainedfromthefieldsinFig.4d–f.TherankingforthejointKLDs aresimilartotheensemblemeanKLDs,exceptthat d H performsthe

best.Thepointcloudsoftheposteriorsamples(Fig.7)clearlyshows that d H(Fig.7a)constrainthegeostatisticalmodelparametersmuch

betterthan d V(Fig.7b)and d M(Fig.7c).

ThemarginalKLDsareagainthelargestfor𝜎2

Y,followedbythose

of 𝐼y and𝜆.Wefindthat𝜎Y2 is themost constrainedby d Handthe

leastconstrained by d M asreflectedbytheir marginalKLDsandthe

compactnessoftheirposterior PDFs(Fig.8a).AllthemarginalPDFs for𝜎2

Yexhibitasmallbiastowardslargervariances,withthesmallest

andlargestbiasesexhibitedfor d HVMand d M,respectively.For𝐼y,the

marginalKLDassociatedwithd Hiswell-abovetheothers(Fig.8b).The

secondmostandthirdmostbestperformingdatasetforthisparameter ared HVand d HVM,while d Mperformsthepoorest.ThemarginalKLDs

(11)

Table2

KLDsandmeanrelativebiasesofmforthedifferentconditioningdatatypes.Columns1and2show themean𝜇KLDandstandarddeviation𝜎KLDoftheKLDsusingtheensembleofhydraulicconductivity

realizations.Column3showstheKLDvaluesforthejointposteriorPDFsusingonerealizationof theconditioningdataobtainedfromFig.4d–fandhighlightedinFig.7a–c.Thesubsequentpairsof columnsshowthemarginalKLDvaluesandrelativemeanbiasesforthemarginalposteriorsofeach componentofm2usingthisspecificrealization.

Ensemble m 𝜎2

Y 𝐼 y ( m ) 𝜆

𝜇KLD 𝜎KLD KLD KLD Bias KLD Bias KLD Bias dH 2.1341 0.5027 2.3829 1.3179 0.3687 0.8065 0.0573 0.5979 0.0971 dV 1.4448 0.4425 1.1625 0.7427 0.4359 0.1115 0.6425 0.1003 0.7639 dM 1.3897 0.4768 1.0892 0.4866 0.4889 0.0875 0.6883 0.0952 0.8219 dHV 2.1114 0.4465 2.0030 0.9984 0.3019 0.6399 0.1306 0.5412 0.1657 dHVM 2.2256 0.4648 2.0741 1.0955 0.2452 0.6158 0.1417 0.4410 0.2640

Fig.7. PosteriormodelparametervectorsamplesofsizeS=500obtainedbytheAARSalgorithmfortestcase𝐦2=(0.80,0.06m,10.00)usingdifferentdatasetsas

conditioningdata.Thecoloredcloudsofpointsrepresentthesamplesfordatasets(a)dH;(b)dV;(c)dM;(d)dHV;(e)dVM;(f)dHVM.ThecolormapencodestheL1

distance𝜌 betweensimulatedandobserveddata,normalizedbytheminimumandmaximumvaluesof𝜌 ofthetestcase.Theinsetplotsof(a),(b)and(c)show, respectively,the50realizationsoftime-series𝜎H,𝜎VandMgeneratedform

1usingdifferentR-realizations.Thedataconsideredhereforinferenceareshownby

thick-coloredcurves.TheresultingKLDvaluesaregivenforeachdataset.

andbiasesfor𝜆 (Fig.8b)followtherankingof𝐼y.Forthistestcase m 2,

thedatabetterconstrainthegeostatisticalparametersthanfortestcase

m 1asreflectedbygenerallymuchlargerKLDvalues.

4.3.3. Testcasem3

Table3summarizestheperformanceofthedifferentdatasetsfortest casem 3.

ConsideringtheensemblestatisticsofthejointposteriorKLDs,we findthat d HVhasthelargestmeanKLD,closelyfollowedby d HVMand

d H. Again, d M featuresthe smallestmeanKLDwith avalues thatis

~ 63% of that for d HV. The standarddeviations arevarying within

~ 15%andrepresent ~ 25%ofthemeanvalues.

Wenowconsidertheresultsfromthedatatime-series(Fig.9a–c) obtainedfromthefieldsinFigs.4g–i.ThejointKLDford Histhelargest

closelyfollowedbyd HVand d HVM.TheirKLDsare ~ 30%higherthan

theothers.Thepointcloudsofposteriormodelrealizations(Fig.9)are

rathersimilar,buttheresultsobtainedfromd H(Fig.9a)aremore

com-pactcomparedto d Vand d M.Forinstancethereisminimalscatterin

the𝜆-direction(c.f.,Fig.9b)andthehigh𝜎2

Yisbetterconstrained(c.f.

Fig.9c).

ThemarginalKLDsareagainthehighestfor𝜎2

YfollowedbyIyand 𝜆.Therelativemeanbiasesshowasimilartrend,beingsmallestfor𝜎2

Y.

Themarginalprobabilitydensitiesfor𝜎2

Y(Fig.10a)showthat d Hbest

constrainthisparameter,followedbyd HVand d HVM.ThemarginalKLD

for d Hareonly ~ 10%largerthanfor d HV and d HVM,butitsbiasis

30%lower.Notealsothatd Misstronglybiasedtowardstoolow𝜎2

Y.For Iyand𝜆,bothKLDsandbiasesindicatethat d H,d HVand d HVMarethe

(12)

Fig.8. (Coloronline)MarginalposteriorPDFsassociatedtoeachtypeofconditioningdatadH,dV,dM,dHV,dVManddHVMfortestcase𝐦2=(0.80,0.06m,10.00).

MarginalpriorandposteriorPDFscorrespondingto(a)𝜎2

Y(b)I𝑦and(c)𝜆.

Table3

KLDsandmeanrelativebiasesof𝐦⨚forthedifferentconditioningdatatypes.Columns1and2show

themean𝜇KLDandstandarddeviation𝜎KLDoftheKLDsusingtheensembleofhydraulicconductivity

realizations.Column3showstheKLDvaluesforthejointposteriorPDFsusingonerealizationof theconditioningdataobtainedfromFigs.4g–iandhighlightedinFig.9a–c.Thesubsequentpairsof columnsshowthemarginalKLDvaluesandrelativemeanbiasesforthemarginalposteriorsofeach componentofm3usingthisspecificrealization.

Ensemble m 𝜎2

Y 𝐼 y ( m ) 𝜆

𝜇KLD 𝜎KLD KLD KLD Bias KLD Bias KLD Bias dH 1.1845 0.3195 1.0166 0.4818 0.3098 0.2681 0.6388 0.2031 0.3962 dV 1.0565 0.3313 0.7459 0.3046 0.6171 0.0488 0.9635 0.0356 1.0166 dM 0.8413 0.2798 0.6205 0.2019 0.7347 0.1949 0.6983 0.1296 0.5169 dHV 1.3462 0.3491 1.0123 0.4491 0.4202 0.2680 0.6577 0.2054 0.4002 dHVM 1.2932 0.3100 1.0068 0.4200 0.4429 0.2915 0.6164 0.2245 0.3647

Fig.9. PosteriormodelparametervectorsamplesofsizeS=500obtainedbytheAARSalgorithmfortestcase𝐦3=(4.70,0.38m,1.50)usingdifferentdatasetsas

conditioningdata.Thecoloredcloudsofpointsrepresentthesamplesfordatasets(a)dH;(b)dV;(c)dM;(d)dHV;(e)dVM;(f)dHVM.ThecolormapencodestheL1

distance𝜌 betweensimulatedandobserveddata,normalizedbytheminimumandmaximumvaluesof𝜌 ofthetestcase.Theinsetplotsof(a),(b)and(c)show, respectively,the50realizationsoftime-series𝜎H,𝜎VandMgeneratedform

1usingdifferentR-realizations.Thedataconsideredhereforinferenceareshownby

(13)

Fig.10. (Coloronline)MarginalposteriorPDFsassociatedtoeachtypeofconditioningdatadH,dV,dM,dHV,dVManddHVMfortestcase𝐦3=(4.7,0.38m,1.5).

MarginalpriorandposteriorPDFscorrespondingto(a)𝜎2

Y(b)I𝑦and(c)𝜆.

5. Discussion 5.1. Generalfindings

TheabsolutevaluesofthecomputedKLDsandbiasesaredependent onthechoicesmadewhenapproximatingtheposteriorprobability den-sities(Section2.2),suchasthewidthoftheacceptancekernelofthe AARSalgorithm(Algorithm2)andthebandwidthof thekernel den-sityfunctionusedtorepresenttheprobabilitydensities.Forthisreason, wefocusourdiscussionbelowonrelativedifferencesbetweendatasets andtestcases.Wefirstsummarizethemainresultsthatapplytoalltest casesbeforediscussingthetestcasesone-by-one.Afterthis,wediscuss broaderimplicationsofthisresearch.

Consideringtheensemblestatisticsof50hydraulicconductivity re-alizationsforeachtestcase,wefindthattheinformationcontentofd H

measuredbytheKLDishigherthan d V,whichinturnishigherthan d Mforthethreetestcasesconsidered:m 1(Table1), m 2(Table2)and

m 3(Table3).Theaddedvalueofcombiningdifferentdatatypes(d HV

and d HVM)isgenerallyfoundtobecomparativelylow.When

consider-ingindividualhydraulicconductivityrealizationsandassociatedfields (Fig.4),wegenerallyobtainrelativerankingsofthedifferentdatatypes thatareconsistentwiththoseof theensemble means.Giventhatwe considernon-ergodicmodeldomains,theactuallocationsofhigh-and lowhydraulicconductivitiesgovernedbythenuisancevariableR plays animportantroleinthedata-generatingprocess.Itsimpactis mani-festedbythecomparativelyhighstandarddeviationsoftheKLD esti-mates(Tables1–3)and(inthevariabilityofthegeneratedtime-series (Figs.5a–c,7a–cand9a–c).Despitethisinherentstochasticvariability, weconsistentlyfindthatthebestconstrainedparameteris𝜎2

Y,followed

byIyand𝜆.Theindividualtestcasesarediscussedindetailbelow.

5.2. Lessonslearnedfromthethreetestcases

Testcase m 1featuresahighlyheterogeneousfield𝐾(x )with rela-tivelysmallstructures(Fig.4a),forwhichonecouldpossiblyassume thatergodicconditionsarefullfilledandconsequentlythatthe geosta-tisticalparametersarewell-representedwithinthemodellingdomain, yet itcorresponds tothe least-constrained testcase. Indeed, the R -realization playshere averyimportantrole,implyingarather weak mappingfromthetime-seriestothegeostatisticalparametersof inter-est.Tounderstandthis,notefirstthat{𝜎H}and{𝜎V}areonlysensitive

totheunderlyinggeostatisticalparametersthroughthesolutespreading patternsthattheseparametersinduce.Indeed,theelectricalresponses resultfromoptimalcurrentpatternsestablishedthroughoutthehighly

non-ergodicandtime-evolvingdistributionoflocalconcentrations(i.e., conductivities) thatare,in turn,drivenbytheflow field𝐪(x ). Asin theexampleinFig.2,thehydraulicconductivityfield𝐾(x )has small-scalestructuresandisclosetoisotropic(Fig.4a)butitsassociatedflow field𝐪(x )exhibitspronouncedchanneling(Fig.4b). Thistendencyof theflowfieldtoconcentrateinpreferentialflowchannelsforhigh𝜎2

𝑌

iswell-known(e.g.Cvetkovicetal.,1996).Hence,anergodic𝐾(x )is noguaranteeofwell-sensedgeostatisticalparameterswhenusing geo-electricallymonitoredsalinetracertests.Nevertheless,comparedtothe prior,theestimatedmarginalposteriordensitiessuggestthatthe geo-statisticalmodelthatneedsacomparativelyhigh𝜎2

Y(Fig.6a)andvery

smallorhighIy(Fig.6b)areunlikely.

Testcase m 2correspondstoalayereddistributionofhydraulic con-ductivitywithamoderate𝜎2

Y.TheKLDs(Table2),andconsequently

theconstrainingnatureofthetime-series,aremuchhigherthanfortest cases m 1 (Table1)and m 3(Table3).For m 2,thesmallestvariations

between the R -realizationsareobserved (Fig.7a–c)sincethe actual locationoftheflowchannelsisofsecondaryimportanceinthe data-generatingprocess.Thehydraulicconductivityfield(Fig.4d)andits as-sociatedflowfield(Fig.4e)arevisuallymoresimilartoeachotherthan for m 1.Thisisaconsequenceofthelargeanisotropyfactorimposing

horizontallycontinuousstructureswithinwhichtheflow-fieldchannels arenaturallydeveloped.Both{𝜎H}and{M}arehighlysensitivetothe

arrivalofhorizontalconnectionsthatareestablishedbythesolutewhen itarrivestotheoutlet.ConsideringthemarginalKLDs,wefindthathigh andlow𝜎2

Y-valuesareincompatiblewiththedata(Fig.8a),asislarge Iy.Forthistestcase m 2,𝜆 isparticularlyinterestingasitstruevalueis

highand,therefore,oflowpriorprobability(Fig.8c).Weseeastrong abilityofalltime-seriesincluding{𝜎H}toconstrainthisparameter.

Testcase m 3isahighlyheterogeneoustestcasethatdistinguishes itselffrom m 1 byitslarger Iy.Aconsequenceof theresulting larger

structuresisthatthegenerateddatavarywidelybetweenthedifferent hydraulicconductivityrealizations(seeinsetsinFig.9a–c).YettheKLDs (Table 3) arehigherthanfortestcase m 1.Consideringthemarginal

posteriorPDFs,alldatasetsindicate thattheunderlyinggeostatistical modelhasahigh𝜎2

Y(Fig.10a),atleastamoderatelyhighIy(Fig.10b)

and(thatthefieldisclosetoisotropic(Fig.10c).

5.3. Physicalinsightsandopenquestions

Inouridealizednumericalinvestigation,wefoundconsistentlythat geoelectricaldataperformedbetterthanmassbreakthroughdatain con-strainingthegeostatisticalparameters.Thisisaconsequenceofthefact that,foragivengeostatisticalmodel,theactualpositioningofhigh-and

(14)

Fig.11. Naturallogarithmoftheabsolutevalueofthecurrentdensityfields(andtheirstreamlines)resultingfromexcitingthesamplebothinthe(a–c)horizontal and(d–f)verticalmodes.TheelectricalconductivitydistributionisgivenbythesalineconcentrationfieldsshowninFig.4,thatis,attime103sforthethree

evaluatedtestcases(a,d)m1(column1),(b,e)m2and(c,f)m3.

lowhydraulicconductivityfields,governedbythenuisancevariable R , hasalargerimpactonthemassbreakthroughdatathanonthe geoelec-tricaldata(e.g.,comparetheinsetsinFig.9a–c).Weunderstandthis asaconsequenceofthelocalflux-averagednatureofthetracer break-through,comparedtothemoreintegrativenon-linearvolume-averaging of theelectrical responsesover theconcentrationfield.Additionally, since{M}isonlysensitivetothetime-evolutionofthesolute concen-trationfieldattheoutlet,itcannotdeterminethecausalityofthearrival times,thatis,iftheyoriginatefromlargehorizontalcorrelationscales orfromhighvariance,forinstance.

Wealsofoundthat{𝜎H}always hasa higherconstrainingpower

than{𝜎V}.Thiscanbeunderstoodbynotingthat{𝜎H}issensitiveto

electricalconductionpathscreatedbytheconcentrationfieldintheflow direction,leadingtoaverystrongsensitivitytotracerarrivalsatthe outlet(e.g.,Fig.2e,orthegenerallysteepslopesinthegenerated time-seriesin theinsetsof Figs. 5a, 7aand9a). InFig.11we plotthe generatedcurrentdensitydistributionsdetermining{𝜎H}and{𝜎V}for

theconcentrationfieldsshowninFigs.4c,fandj.Weseethatfor{𝜎H}

(Figs.11a–c)thesupportofthecurrentdensityfield(i.e.theregions of highcurrent flow)is almostcoincidentwiththeareaoccupiedby theinvading tracerdrivenbytheflow-field. Thisdoesnot occurfor {𝜎V}(Figs.11d-f),indicatingwhy{𝜎H}ismoreinformativethan{𝜎V}.

Clearly,{𝜎V}resultsfromcurrentpatternsthataremainlyconstrained

byverticalconnectionbottlenecksthatbecomemorecommonfurther awayfromtheinletregion.Thiscanbeappreciatedbythehighdensity ofcurrentfieldstreamlinesobservedattheinletregionsinFigs.11d, 1111eand11f.Thissuggeststhatthemainabilityof{𝜎V}tosensethe

geostatisticalparametersisthroughitssensitivitytothetrailingendof thetracerfront.Again,itistheconnectivity-aspectoftheelectricaldata thatisatplay.

Ourresultsalsosuggestastrongdependenceontheinjectiontype. Forapulseinjection,weexpect{𝜎H}tobemuchlessinformative,

com-paredtothepresentcontinuousinjectioncase,astherewillbeno hor-izontalconnectionsofsalinitytosense.Thatis,theconnectivity

cre-atedbyestablishingacontinuousconcentrationfieldacrossthedomain isveryhelpfulforelectrical-basedinferenceofgeostatisticalproperties fromtracertests.

Foralltestcases,wefindthat𝜎2

Yisthebestconstrainedparameter.

Thisisexplainedbythefactthat𝜎2

Ycontrolsthespreadingrateofthe

solute(e.g.,GelharandAxness,1983)andis,thus,afirst-orderfeatureof thetime-series.Itwilldeterminethetime-spacingorpaceofoccurrence ofthehorizontalconnectioneventsassensedparticularlywellby{𝜎H}.

However,alsothetrailingpartofthetracerfieldassensedby{𝜎V}is

affectedby𝜎2

Y.

Oneopenquestionistowhatextenttheelectricaldatacanconstrain mixingandspreading.Intuitively,thereshouldbeastrongsensitivity tothespreadingwidthas𝜎Hishighlysensitivetothefrontofthetracer

plumeand𝜎Vtoitsend.Sincesolutespreadingultimatelycontrols

so-lutemixing(e.g.,Villermaux,2019),thehighsensitivityoftheelectrical datatotheformerindicatesthatthesedataareabletoatleastquantify themixingpotentialofthesolute(e.g.,deDreuzyetal.,2012).This willbethetopicoffutureresearch.Furthermore,theequivalent electri-calconductivitytensortime-seriesisdeterminedbythetime-evolution oftheconcentrationfield,whichinturnisdrivenbytheflow-field.This suggeststhatthattheelectricaldatamightbemorestronglyrelatedwith theflow-fieldthanthegeostatisticalmodeloflog-hydraulic conductiv-ity.Inthefuture,weplantostudythegeoelectricalsensitivityto flow-fielddescriptors(e.g.,Koponenetal.,1996;Englertetal.,2006). Sim-ilarly,wewouldliketorelatetheelectricaldatatoconcentrationfield descriptors.However,astheconcentrationfieldistime-variant,thisis morechallengingtosummarizethanthesteady-stateflowfield.One possibilityistorelateittothespatialdistributionoflocalizedtemporal momentsofthesoluteconcentrationfield(CirpkaandKitanidis,2000).

5.4. Implicationsforfield-basedstudies

Ourworkhasseveralimplicationsforfield-basedand laboratory-basedelectricaltime-lapsemonitoringof tracertests.Thefirstisthat

Figure

Fig. 1. Generated sample of size
Fig. 2. (Color online) (a) Weakly heterogeneous hydraulic conductivity field with geostatistical parameters (
Fig. 3. (Color online) (a) Strongly heterogeneous hydraulic conductivity field, defined with geostatistical parameters (
Fig. 5. Posterior model parameter vector samples of size S = 500 obtained by the AARS algorithm for test case m 1 = (4
+2

Références

Documents relatifs