• Aucun résultat trouvé

Aphis gossypii/Aphis frangulae collected worldwide: Microsatellite markers data and genetic cluster assignment

N/A
N/A
Protected

Academic year: 2021

Partager "Aphis gossypii/Aphis frangulae collected worldwide: Microsatellite markers data and genetic cluster assignment"

Copied!
9
0
0

Texte intégral

(1)

HAL Id: hal-03229073

https://hal.inrae.fr/hal-03229073

Submitted on 25 May 2021

HAL is a multi-disciplinary open access

archive for the deposit and dissemination of

sci-entific research documents, whether they are

pub-lished or not. The documents may come from

teaching and research institutions in France or

abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est

destinée au dépôt et à la diffusion de documents

scientifiques de niveau recherche, publiés ou non,

émanant des établissements d’enseignement et de

recherche français ou étrangers, des laboratoires

publics ou privés.

Distributed under a Creative Commons Attribution - NonCommercial - NoDerivatives| 4.0

International License

Aphis gossypii/Aphis frangulae collected worldwide:

Microsatellite markers data and genetic cluster

assignment

Pascale Mistral, Flavie Vanlerberghe-Masutti, Sonia Elbelt, Nathalie Boissot

To cite this version:

Pascale Mistral, Flavie Vanlerberghe-Masutti, Sonia Elbelt, Nathalie Boissot. Aphis gossypii/Aphis

frangulae collected worldwide: Microsatellite markers data and genetic cluster assignment. Data in

Brief, Elsevier, 2021, 36, pp.106967. �10.1016/j.dib.2021.106967�. �hal-03229073�

(2)

Contents lists available at ScienceDirect

Data

in

Brief

journal homepage: www.elsevier.com/locate/dib

Data

Article

Aphis

gossypii

/

Aphis

frangulae

collected

worldwide:

Microsatellite

markers

data

and

genetic

cluster

assignment

Pascale

Mistral

a

,

Flavie

Vanlerberghe-Masutti

b

,

Sonia

Elbelt

a

,

Nathalie

Boissot

a,∗ a INRAE, GAFL, 84143 Montfavet, France

b CBGP, INRAE, CIRAD, IRD, Montpellier SupAgro, Univ Montpellier, Montpellier, France

a

r

t

i

c

l

e

i

n

f

o

Article history:

Received 2 February 2021 Revised 11 March 2021 Accepted 12 March 2021 Available online 18 March 2021

Keywords: Aphis gossypii Aphis frangulae Microsatellite Population structure Diversity Host plant Aphid

a

b

s

t

r

a

c

t

Aphisgossypii is a cosmopolitan aphid species able to colo- nize hundreds of plant species from various families [1]. It causes serious damage to a wide range of crops and it is considered a major pest of cucurbits and cotton [2]. It re- produces clonally, by obligate parthenogenesis, on secondary hosts present throughout the year in the intertropical area. At higher latitude, some lineages clonally overwinter but part of the population may have a sexual reproduction in autumn on primary host such as Hibiscussyriacus, to generate cold resistant overwintering eggs [3]. It is highly challenging to distinguish A.gossypii from its sister species Aphisfrangulae as both are colonizing solanaceous plants as secondary hosts but the primary host of A.frangulaeis Frangulaalnus[4]. This paper describes a worldwide collection of both species from December 1989 to September 2019. Aphids were collected in- dividually on plants (19 families) or in traps. The location, the morph type and the botanical family of the host plant were registered. DNA was extracted from each aphid and amplified at 8 microsatellite loci [5]. Amplicons were analysed with ABI technology and their size was defined with Genemap- per software. We named each unique combination of alleles, called a multilocus genotype (MLG), and then each individual was given its MLG. The matrix of alleles of all MLGs was run

Corresponding author.

E-mail address: nathalie.boissot@inrae.fr (N. Boissot). https://doi.org/10.1016/j.dib.2021.106967

2352-3409/© 2021 Institut National de recherche pour l’agriculture, l’alimentation et environnement. Published by Elsevier Inc. This is an open access article under the CC BY-NC-ND license

(3)

2 P. Mistral, F. Vanlerberghe-Masutti and S. Elbelt et al. / Data in Brief 36 (2021) 106967

for a Bayesian analysis to describe the genetic structure of the diversity collected and then each MLG had a probability to belong to a genetic group [6, 7]. Probability of assignation to each genetic group revealed by the analysis was reported to each individual according to its MLG.

This dataset can be used to analyze host plant specificities in A.gossypii, genetic diversity in A.gossypii and relative in- cidence of variants in diverse geographical regions, admix- ture between two sister species ( Aphis gossypii and Aphis frangulae).

© 2021 Institut National de recherche pour l’agriculture, l’alimentation et environnement. Published by Elsevier Inc. This is an open access article under the CC BY-NC-ND license ( http://creativecommons.org/licenses/by-nc-nd/4.0/)

SpecificationsTable

Subject Genetics

Specific subject area Genetic diversity of a pest of crops on which it clonally reproduces, but has potentially one sexual generation per year. Some clones are observed on several continents.

Type of data Table

How data were acquired Microsatellites amplification by multiplex PCR

Amplicon separation with ABI 3100 Genetic Analyser 3730XL, Size of amplicons determined with Genemapper software Check data quality with R-script

Assignment of each individual to a genetic cluster via a Bayesian analysis with Structure software

Data format Raw

Analysed Parameters for data collection Date, host plant,

Serial sampling in melon/cotton fields grown in areas with high density of cultivated host plant,

Serial sampling over year in a specific geographical region, Sampling on a same host plant in very remote areas. Description of data collection The dataset is composed of:

• Aphid collection information

Sampling characteristics: date, country, GPS point, Host on which aphids were collected: Botanical Family, genus, species Sample characteristics: Morph, Sex

• 16 microsatellite length (both alleles at 8 microsatellite markers (Ago126, Ago24, Ago53, Ago59, Ago66, Ago69, Ago84, Ago89)

• The corresponding MultiLocus Genotype (MLG) name (given for an allelic combination)

• The probabilities to belong to each of the 9 genetic clusters according to a Bayesian clustering (Structure software results)

Data source location There were 129 sampling localities, distributed in 17 countries, see Supplemental 4 for details

Data accessibility https://doi.org/10.15454/HNGGMX

ValueoftheData

• Microsatellites data obtainedfrom different teams cannot be pooled for analyses because data maybe lab-dependent. We gathereda large set ofmicrosatellitesdata obtainedby a unique teamforavery importantpest,Aphisgossypii,attackingcropsworldwide[1,2].The samplingwasdonebydifferentcollaborativepartners.

(4)

• The data can benefitto any researchersworking on aphids, andto anyprofessor/students lookingforalargesetofdataforpopulationgeneticsanalysisinanorganismwithacomplex reproductionsystemsuchasA.gossypii[3].

• Thedatasetmight beusedtoinvestigatei/thesoundofAphisgossypii/Aphisfrangulae dif-ferentiation[4],ii/ therole ofworldwideclonesascrop pest, iii/theimportance ofsexual reproductionin Aphis gossypiipopulation in a givenarea, iv/ therelationship between ge-neticgroupandspecializationonhostplantfamilies.

• Moreover, we propose to sendreference DNAs toany researcherwho runs a genetic pop-ulationanalysisinA. gossypiiwithmicrosatellitesandwouldlike tointegrate/use thelarge databasegivenhere.ThereferenceMLGsareNM1,C6,C9,CUC1,GWD,Pot1,PsP4andBurk1.

1. DataDescription

Thedatasetgathersinformationon16,016individualsAphisgossypii/Aphisfrangulaecollected in 17countriesfrom 1989to 2019 from all continents. Aphids collectedwere eitherapterous (10,177), winged(5654)oreggs(15).Theywere collectedon 19plantfamiliesorin traps de-positedinmelonfields.

Foreachindividual,primary datacollectedarecountrywithGPSposition(aphidswere col-lected in the10 kmaround thisposition, exceptin Australiawhere aphidswere collectedup to70kmfromthisposition),date,morph,hostplantortrap.Foreachaphidcollectedthe sec-ond type ofdata consistsinthe alleliccomposition at8 microsatellitesandits corresponding MultiLocusGenotypename,andlasttheprobabilityofassignationtoageneticgroupgivenbya Bayesiananalysis.

Table1summarizesthenumberofindividualscollectedaccordingtotheirhostplantspecies andtheirmorph.

Table2givesthecommonMLGsbetweenthreegeographicalareas.

Fig.1picturesthesamplelocationsonaworldmapandthenumberofindividualscollected ineachcountry.

Table 1

Number of aphids collected on 19 botanical families of host plant and their morph. Morph

Botanical family / trap Apterous Winged Egg Unknown Total

Asteraceae 186 186 Bignoniaceae 5 5 Brassicaceae 4 4 Chenopodiaceae 4 4 Convolvulaceae 15 15 Cucurbitaceae 8641 3668 70 12,403 Euphorbiaceae 14 14 Fabaceae 18 18 Lamiaceae 19 1 20 Liliaceae 5 5 Lythraceae 5 5 Malvaceae 843 843 Polygonaceae 5 5 Portulacaceae 5 5 Rhamnaceae 139 36 15 100 290 Rosaceae 10 10 Rutaceae 60 60 Solanaceae 182 182 Zygophyllaceae 9 9 Trap 8 1876 1884 Unknown 73 73 Total 10,177 5654 15 170 16,016

(5)

4 P. Mistral, F. Vanlerberghe-Masutti and S. Elbelt et al. / Data in Brief 36 (2021) 106967

Table 2

MLGs shared by aphids collected in several geographic areas. 1/ Europe, Tunisia, Turkey, 2/ Benin, Burkina, Cameroon, Senegal, 3/ Brazil, California, West Indies and 4/ Vietnam, Thailand, Australia.

Benin, Burkina, Cameroon, Senegal

Brazil, California, West Indies

Vietnam, Thailand, Australia

26 MLGs 13 MLGs 24 MLGs

Europe, Tunisia, Turkey

2222 MLGs Aub1, Aub2 Burk1, Burk2, Burk3, Burk4, Burk5, Burk7 C1, C4, C5, C8, C9, C14 Hib3, Hib5, Hib6, Hib7, Hib8

Ivo PsP1, PsP4

Al11, Al11–27, Al12–12 Aub3, Aub4, Aub5 Burk1 C5, C9 GWD Hib1 M12–42 PsP2, PsP3 C4, C12, C13 Hib4 NM1

Fig. 1. Distribution of aphids sampling (made from https://fr.batchgeo.com ) and number of aphids collected within countries. Aphids were collected within a 10 km radius of the point (70 km for Australia) for the red points, locality unknown for Madagascar (yellow point). (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)

(6)

Fig. 2. A) Number of MLGs identified in aphids according to their host plant families (representing 88% of the collected aphids). B) Number of MLGs identified in aphids trapped in melon fields and aphids collected on host plants belonging to Cucurbitaceae, Malvaceae and Solanaceae families (representing 95,4% of the collected aphids). Aphids trapped had only common MLGs with aphids collected on plants belonging to these three families. Venn diagrams are made from http://www.interactivenn.net .

Fig. 3. Percentage of MLG assignment to nine clusters resulting from a Bayesian clustering analysis of the assignment of 2360 MLGs to 1 to 10 clusters (K), with 10 simulations per K. 260 out of the 261 expected MLGs for individuals belonging to Aphis frangulae , i.e. identified in aphids collected on Frangula alnus , were assigned to the first cluster with a probability over 0.77. These MLGs are characterized by null alleles at Ago24 and Ago84 and are homozygous for the 110 allele at Ago53.

Fig.2 containstwo Venndiagramsillustrating A) thenumber ofMLGobservedin individ-uals collectedon fivebotanical families(98,7% ofthecollected aphids)andB) thenumberof MLGobservedinindividualscollectedintrapsandonCucurbitaceae,MalvaceaeandSolanaceae plants.Thesethreehostfamilieswere chosenbecausetheyhostedindividualswithMLGs com-monwiththoseintraps.

Fig. 3 givesfor each MLGthe percentage assignation to thenine clusters. It is theresults of aBayesian clusteringafter analyzingthe assignment of2358MLGs to 1to 10 clusters(K), with10simulationsper K.260out ofthe261expectedMLGsforAphisfrangulae,i.e.identified

(7)

6 P. Mistral, F. Vanlerberghe-Masutti and S. Elbelt et al. / Data in Brief 36 (2021) 106967

inindividualscollectedonFrangula alnus,were assignedtothe firstcluster withaprobability over0.77.TheseMLGsarecharacterizedbynullallelesforAgo24andAgo84andanhomozygous 110–110allelesforAgo53.

Supplemental1Primersequencesformicrosatelliteamplification

Supplemental2RscriptforcheckingMLGconsistency

Supplemental3Resultsofcomputationtodeterminethemostlikelynumberofgenetic clus-tersinthedatasetaccordingtoaBayesiananalysisandcorrespondinggraphics

Supplemental4Listofsamplinglocations(GPScoordinatesandcountries)

2. ExperimentalDesign,MaterialsandMethods

2.1. Aphidssampling

Aphids, winged or apterous, visually expected as belonging to the Aphis gossypii species group, were removed with a brush from their host plant and individually sunk in 70–90% ethanolin a numbered tube. On Frangula alnus,the primary hostof Aphis frangulae, we also collectedaphid eggs. Winged individualscollected onFrangula alnus were examined fortheir genitaliaunder abinocular microscopeto separate males fromfemales. Aphidswere also col-lectedin non-biasedsuctiontrapsdesignedto samplewingedinsectsdailyatthe cropheight

[8].The trapsweredepositedinmelonfieldsduringspringandsummer. Sampleswerestored fewdaystoseveralweeksbeforeDNAextraction.

2.2.DNAextraction

Ethanolwasremovedand50μl of5% chelex100 (Chelex100,Bio-rad)wereaddedtoeach tube.Aphidswerecoarsegrindedandsubmittedtoathermalshockat56° for30minand95– 100° for 5min.After ashortcentrifugation(2000× rpmfor30s)to pelletdebrisandChelex beads,DNAremains inthe supernatantwhichcan be storedafew daysat−20° CbeforePCR amplification.

2.3.Microsatelliteamplification

Theprimersequences, amplifyingeightmicrosatellitelocispecificoftheA. gossypiigenome

[5], are given in the supplemental 1. The forward primer of each microsatellite locus was labelled with a fluorescent dye (FAM, NED, PET, VIC) chosen to analyze simultaneously the eight microsatellite loci (Ago24-FAM, Ago53-VIC, Ago59-NED, Ago66-VIC, Ago69-NED, Ago84-PET, Ago89-PET and Ago126-FAM). The primers labeled by NED, PET and VIC fluorochromes weresuppliedby Applied BiosystemsTM (https://www.fishersci.fr), all otherprimers were

sup-pliedbyEurofins Genomics (https://eurofinsgenomics.eu/). DNAamplificationswere performed intwo polymerasechainreactions (PCR)in afinal volume of5μL inathermocycler (Master-cycler,Eppendorf).The firstPCR is amultiplex of Ago53,Ago59,Ago66,Ago69, Ago84,Ago89 andAgo126,containing2.5μLofQIAGENMultiplexPCRMasterdNTPmix,5units/μlof HotStart-TaqDNApolymeraseand3mM MgCl2,0.2μmofeach primer,1μL ofChelexDNAextraction

dilutedat 1/10and RNAasefree watertosupplement at5μL. Amplificationswere performed accordingtoaprogrammingwith15minat95° Cfollowedby25cyclesof30sat95° C,90sat 56° Cand30sat72° Candafinal extensionduring30minat60° C.ThesecondPCR,usingthe primerspecificofthelocusAgo24,wasperformedinthesameconditionsexceptfortheprimer concentration(0.1μm)andthethermocyclerprogramming:5minat95° C,35cyclesof30sat 95° C,45sat62° Cand30sat72° C,andafinalelongationof7minat72° C.

(8)

2.4. Microsatelliteanalyses

Amixcontaining10μLofHi-DiFormamide(Applied Biosystems)and0.15μLof GeneScan-500LIZ Size Standard was deposited in each well of a specific plate for automatic sequencer (ABI 3100 Genetic Analyser, 3730XL), and then 1 μL of each of the two PCRs was added in the well andsubmitted to denaturation ina thermocycler (95° C 3min – 4° C 10 min). Sepa-ration anddetection of PCR products were carried out by a capillaryelectrophoresis with an automaticsequencer(ABI3100GeneticAnalyser,3730XL).Wedeterminedthesizeoftheallele ateach locusby comparisonwithGeneScan-500LIZ SizeStandardwithGeneMapperv3.7 soft-ware (AppliedBiosystems, Foster City,California, USA).When onlyone peakwasobserved for alocus,thelocuswasconsideredhomozygous.Moreover,aphidswithaknowncombinationof alleles (collectedinrearings available inthe lab),were used too,helpinginthe calibrationof the readingon Genemapper.These controlswere reinforcedwhen we changedthe device, its capillars orpolymersformigration.Cross readingwasdone anytimea changinginthe reader occurredovertime:currentreadersharedexpertise forallelesize determinationwiththenew reader.

All individualscollected onFrangula alnus did not amplifiedthe microsatellitesAgo24 and Ago84 whilethey were expectedbelonging toAphis frangulae species.Because theset of mi-crosatellites were definedforAphis gossypiispecies, weassumed that individualscollectedon

Frangula alnus carry two null alleles for both microsatellites Ago24 and Ago84; these alleles werecoded0.

Onlyindividualsforwhichatleastsixout oftheeightmicrosatelliteswere amplifiedwere keptinthedataset.Tominimizetheriskofmiss-readinginGenemapper,wecheckedsamples forwhichcombinationofalleleswasobservedonlyonce.Forthosecombinations,weassumeda miss-readingwhenthecombinationdiffersfromanyotheroneinsizeallelesforonlyoneDNA baseatoneofthe16alleles.Asfaraspossiblewecheckedthisassumptionbyasecondreading inGenemapper.Then,wecorrectedtheallelesizeandweassignedaMultiLocusGenotypename (MLG)toeachuniquecombinationofalleles.

AlltheprocesswascheckedbytheRscriptgiveninthesupplemental2.

2.5. Populationstructureanalysis

The 2358 different MLGs were subjected to a Bayesian clustering [6],using an admixture model withaburn-in of250,000 iterationsandasubsequent MarkovChain of500,000 itera-tions. Foreach putative numberofclusters (K,ranging from1 to 10),we compared10 repli-cate runs, to assess the consistency ofthe estimated values.We used the Evanno method to determine the mostlikelynumber ofgenetic clusters[7]. Foreach K,the L(K)i, i.e.estimated Ln Prob of Data in the results file given by the structure software for the run i, was col-lected, andthe L”(K)i = L(K)i − 2L(K-1)i +L(K-2)i wascalculated.The meanofL(K)andL(K)”

were plotted (see supplemental 3). The likeliest numberof K(for K>2) is givenby the peak ofL(K)”.

For thepresent dataset, thelikeliest number ofK wasequal to nine, we then performed one run ofthe admixture modelwitha burn-in of500,000iterationsand aMarkov Chain of 1000,000iterations.ProbabilitiesofassignationofeachMLGtothenineclusterswereobtained andplottedinFig.3.

EthicsStatement

(9)

8 P. Mistral, F. Vanlerberghe-Masutti and S. Elbelt et al. / Data in Brief 36 (2021) 106967

CRediTAuthorStatement

Pascale Mistral:hasorganized andparticipated toall samplingsince 2005.She organized thedataset forsampling characteristics.She produced mostofthe genetic dataand their cu-ration;Flavie Vanlerberghe-Masutti:coordinatedor participatedto all successiveprojects for whichaphidsweresampled,thenshewashighlyinvolvedinfundingacquisition.Shereviewed andeditedthedatapaper;SoniaElbelt:workedoncuration/validationofthedataset.Shebuilt thefiguresandtables,annotatedtheR-scriptandwrotetheoriginaldraft;NathalieBoissot: co-ordinatedorparticipatedtosuccessiveprojectsforwhichaphidsweresampledsince2004,then shewashighlyinvolvedinfundingacquisition.Shesupervisedthedatacuration/validationand carriedontheBayesiananalysis.Shereviewedandeditedthedatapaper.

DeclarationofCompetingInterest

Theauthorsdeclarethattheyhavenoknowncompetingfinancialinterestsorpersonal rela-tionshipswhichhave,orcouldbeperceivedtohave,influencedtheworkreportedinthisarticle.

Acknowledgments

WethankJoel Chadoeuf(INRAE-GAFL) forhelpinginthe Rscript writing.Aphids sampling andgenotypingdatapresentedherewerepartofstudiesmainlysupportedbytheDepartments BAPandSPEofINRAE,bytheFrenchMinistèredesAffairesEtrangères,theFrenchMinistèrede laRechercheandtheFrenchMinistèreofAgriculture.

SupplementaryMaterials

Supplementary material associated withthis article can be found in the online version at doi:10.1016/j.dib.2021.106967.

References

[1] T.A. Ebert , B. Cartwright , Biology and ecology of Aphis gossypii Glover (Homoptera: aphididae), Southwest. Entomol. 22 (1997) 116–153 .

[2] H.F. van Emden , R. Harrigton , Aphids as Crop Pests, CABI, Oxfordshire, UK, 2007 .

[3] S. Thomas, N. Boissot, F. Vanlerberghe-Masutti, What do spring migrants reveal about sex and host selection in the melon aphid? BMC Evol. Biol. 12 (2012), doi: 10.1186/1471-2148- 12- 47 .

[4] R.L. Blackman , V.F. Eastop , Taxonomic issues, in: H.F. van Emden, R. Harrigton (Eds.), Aphids as Crop Pests, CABI, Oxfordshire, UK, 2007, pp. 1–29 .

[5] F. Vanlerberghe-Masutti, P. Chavigny, S.J. Fuller, Characterization of microsatellite loci in the aphid species Aphis

gossypii Glover, Mol. Ecol. 8 (1999) 693–695, doi: 10.1046/j.1365-294x.1999.00876.x .

[6] J.K. Pritchard , M. Stephens , P. Donnelly , Inference of population structure using multilocus genotype data, Genetics 155 (20 0 0) 945–959 .

[7] G. Evanno , S. Regnaut , J. Goudet , Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study, Mol. Ecol. 14 (2005) 2611–2620 .

[8] A. Schoeny, P. Gognalons, Data on winged insect dynamics in melon crops in southeastern France, Data Br. 29 (2020), doi: 10.1016/j.dib.2020.105132 .

Figure

Table 1 summarizes the number of individuals collected according to their host plant species and their morph.
Fig. 1. Distribution of aphids sampling (made from https://fr.batchgeo.com  ) and number of aphids collected within  countries
Fig. 2. A) Number  of  MLGs identified in  aphids according  to their host  plant families  (representing 88% of the collected  aphids)

Références

Documents relatifs

Given a set of interacting agents, the capabilities of whom are known, given a set of goals, and given an assignment function mapping each goal to a coalition (i.e. a set) of agents,

With all the preparation in flood behaviour and impact study, it is time to further investigate into the strategies for evacuation management and address the need for evacuation

If both agents have the same type, the social planner always prefers to assign the object to the old agent, as she retains the option value of assigning the object to the young

The paper also characterizes optimal assignment rules when monetary transfers are allowed and agents face a recovery cost, when agents’ types are private information and when agents

The proposed model has the following sets: SP is the set of P people who looking for an orientation; SN is the set of N standard needs; SS is the set of S

In this question, we consider graphs embedded in R 2 in such a way that edges may inter- sect, but the number of intersection points is finite, and each edge only contains the

The first one should be a socket client application that takes a server address (IP address + port) and an input file and sends its content to a (possibly) remote server.. The

The Server Message Block (SMB) and Network File System (NFS) protocols both can be used to implement file sharing among client and server machines.. Let us assume a