• Aucun résultat trouvé

Its all fun guys: a comparison of bioinformatic pipelines for metabarcoding plant and soil fungal communities

N/A
N/A
Protected

Academic year: 2022

Partager "Its all fun guys: a comparison of bioinformatic pipelines for metabarcoding plant and soil fungal communities"

Copied!
2
0
0

Texte intégral

(1)

HAL Id: hal-01938707

https://hal.archives-ouvertes.fr/hal-01938707

Submitted on 5 Jun 2020

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

Distributed under a Creative Commons Attribution| 4.0 International License

Its all fun guys: a comparison of bioinformatic pipelines for metabarcoding plant and soil fungal communities

Charlie Pauvert, Marc Buee, Valerie Laval, Veronique Edel-Hermann, Laure Fauchery, Angelique Gautier, Isabelle Lesur, Jessica Vallance, Corinne Vacher

To cite this version:

Charlie Pauvert, Marc Buee, Valerie Laval, Veronique Edel-Hermann, Laure Fauchery, et al.. Its all fun guys: a comparison of bioinformatic pipelines for metabarcoding plant and soil fungal communities.

International Phytobiomes Conference 2018, Dec 2018, Montpellier, France. �hal-01938707�

(2)

ITS ALL FUN GUYS

A COMPARISON OF BIOINFORMATIC PIPELINES FOR METABARCODING PLANT AND SOIL FUNGAL COMMUNITIES

CHARLIE PAUVERT

1

, MARC BUÉE

2

, VALÉRIE LAVAL

3

, VÉRONIQUE EDEL-HERMANN

4

, LAURE FAUCHERY

2

, ANGÉLIQUE GAUTIER

3

, ISABELLE LESUR

1,5

, JESSICA VALLANCE

6

, CORINNE VACHER

1

1. BIOGECO, INRA, Univ. Bordeaux, 33615 Pessac, France

2. INRA, Université de Lorraine, UMR IAM 1136, Laboratoire d’Excellence ARBRE, Centre INRA-Lorraine, 54280 Champenoux, France

3. BIOGER, INRA, 78850 Thiverval Grignon, France

4. Agroécologie, AgroSup Dijon, CNRS, INRA, Univ. Bourgogne Franche-Comté, 21000 Dijon, France 5. HelixVenture, 33700 Mérignac, France

6. SAVE, Bordeaux Sciences Agro, INRA, ISVV, Univ. Bordeaux, 33882 Villenave d’Ornon, France

Acknowledgments Attendance at this congress was possible thanks to the award for the best presentation of the Pathobiome 2018 congress. We thank Matthieu Barret, Martial Briand, Lucas Auer, Gregory Gambetta, Guilherme Martins, Frédéric Barraquand and Tania Fort for useful comments and discussions. We also thank the INRA MEM metaprogramme (Meta-Omics of Microbial Ecosystems) for financial and scientific support. The mock community sequencing was funded by the INRA MEM MetaBAR project (PI: MB) and the bioinformatic analyses were performed as part of the INRA MEM Learn-biocontrol project (PI: CV). Additional funding was received from the LABEX COTE (ANR-10-LABX-45) and the LABEX CEBA (ANR-10-LABX-25-01). CP’s PhD grant was funded by the INRA and Bordeaux Sciences Agro (BSA). We thank the Genotoul sequencing facility (Get-PlaGe) for sequencing the mock community and the Genotoul bioinformatics facility (Bioinfo Genotoul) for providing computing and storage resources.

Plant and soil fungal communities influence plant fitness and ecosystem functioning.

Fungi A Fungi B Fungi C

….

Composition

Bioinformatics Metabarcoding

DNA were pooled

amplified and sequenced with Illumina MiSeq

DNA were

independently amplified and sequenced with

Sanger

Introduction

Methods

Results

Conclusion

Objectives

Fig 1 Taxonomic composition of the artificial fungal community. The nodes of the tree are the ITS1 sequences (n = 175) of the fungal strains (n = 189) that constitute the artificial or

« mock » community.

The MiSeq sequences were analyzed with 288 unique combinations of bioinformatic softwares and parameters (referred to hereafter as pipelines)

1. Make and sequence a mock community of 189 Dikarya strains commonly found in agricultural and forest soils and in plant tissues.

2. Compare the ability of 288 combinations of bioinformatic softwares and parameters to recover the fungal strains present in this mock community, in the expected proportions.

Fig 2 Overview of the bioinformatic pipelines compared in this study →

Sensitivity

Percentage of recovered mock strains

Precision

Percentage of OTU/ASV matching mock strains

Compositional similarity Bray-Curtis similarity between the community recovered and

the mock community

Pipeline comparison

Highest Sensitivity

Highest Precision

Highest Similarity

Best Trade-of 1. Assembly QUALITY_R1 FASTQJOIN_150 QUALITY_R1 QUALITY_R1

2. Extraction No No No No

3. Variation VSEARCH DADA2 DADA2 DADA2

4. Chimeras Retained Removed Retained Removed

5. Filtering All 10 All All

Si5

Si1

P1

Se1 Fig 4 Values of precision and

compositional similarity to the mock fungal community, for all 288 bioinformatic pipelines. Only the top four pipelines are labeled

These fungi are classically studied by metabarcoding

approaches targeting the ribosomal internal

transcribed spacer (ITS)

But there is no clear consensus concerning the most appropriate bioinformatic pipeline.

Table 1 Pipelines with the best performances for each three criteria and the pipeline with the best trade-off. Pipelines are labeled below the table and the steps are numbered as in Fig 2.

None of the pipelines scored highest in all three criteria (Table 1).

Observed fungal richness depended strongly on the bioinformatic pipeline (Fig 3).

Community richness and composition were well recovered by DADA2 (Fig 3 & 4).

Fig 3 Total number of OTUs (or ASVs) retrieved by the top four pipelines (Table 1). The black horizontal line indicates the expected richness.

No overestimation of the number of fungal strains.

One of the best to recover the composition of the mock community.

Good performance on precision and sensitivity as well.

The Si5 pipeline used single forward (R1)

sequences with DADA2 and no filter other than the removal of low-quality and chimeric sequences.

One of the

best trade-offs

Références

Documents relatifs

In Section 2 we recall the conventional parametrization of the neutral-kaon decay am- plitudes, and the definition of the CP-violating and CP-conserving parameters. In Section 3,

In the area of construction research, we have established three new initiatives in collaboration with industry partners: a national guide for municipal infrastructure, a centre

Dans ce chapitre, on définit ce système dans des coordonnées héliocentriques, On démontre l’analyticité du hamiltonien séculaire, au voisinage du point circulaire non incliné

Bastien Pellegrin, Romain Prulho, Agnès Rivaton, Sandrine Thérias, Jean-Luc Gardette, Emmanuelle Gaudichet-Maurin, Christel Causserand.. To cite

En 1987, le Conservatoire du Littoral, organisme gestionnaire de la presqu'île de Sainte Lucie (Aude), décidait d'effectuer un lâcher de Chevreuils sur ce site, à

2: Speed at which the inoculant is displaced by resident fungal species (cases (B) and (Da)) and speed of invasion of the inoculant (cases (A) and (C)) as a function of the

Soil factor effects on plant community structure are addressed by highlighting plant trait selection by degraded habitats in ultramafic soils for mine reclamation [3], by specifying

confidence intervals for the position of SNPs retained as cofactors in the whole population for flowering and ripening periods were investigated for in silico candidate gene