Récemment recherché

Aucun résultat trouvé

Étiquettes

Aucun résultat trouvé

Document

Aucun résultat trouvé

Accueil Écoles Thèmes

Connexion

A pipeline for automatic taxonomic community inventories on data from NGS : an example with freshwater diatoms

Partager "A pipeline for automatic taxonomic community inventories on data from NGS : an example with freshwater diatoms"

N/A

N/A

Protected

Année scolaire: 2021

Info

Protected

Academic year: 2021

Partager "A pipeline for automatic taxonomic community inventories on data from NGS : an example with freshwater diatoms"

Copied!

2

0

0

2

0

0

Chargement.... (Voir le texte intégral maintenant)

Télécharger maintenant ( 2 Page )

Texte intégral

(1)

HAL Id: hal-00999998

https://hal.archives-ouvertes.fr/hal-00999998

Submitted on 5 Jun 2020

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

A pipeline for automatic taxonomic community inventories on data from NGS : an example with

freshwater diatoms

Lenaïg Kermarrec, Philippe Chaumeil, Frédéric Rimet, Jean-Marc Frigerio, Valerie Laval, Marc-Henri Lebrun, Agnes Bouchez, Alain Franc

To cite this version:

Lenaïg Kermarrec, Philippe Chaumeil, Frédéric Rimet, Jean-Marc Frigerio, Valerie Laval, et al.. A pipeline for automatic taxonomic community inventories on data from NGS : an example with freshwater diatoms. 4th International Barcode of Life Conference, University of Adelaide. Adélaide, AUS., Nov 2011, Adélaide, Australia. 1p. �hal-00999998�

(2)

Kermarrec L. (1), Chaumeil P. (2), Rimet F. (1), Frigerio J.M. (2), Laval V. (3), Lebrun M.H.

(3), Bouchez A. (1), FRANC A. (2)

1: INRA CARRTEL, Thonon, France 2: INRA BIOGECO, Cestas, France

3: INRA BIOGER, Thiverval-Grignon, France

Abstract Title:

A PIPELINE FOR AUTOMATIC TAXONOMIC COMMUNITY INVENTORIES ON DATA FROM NGS : AN EXAMPLE WITH FRESHWATER DIATOMS

Abstract Text:

Most of the work devoted to molecular taxonomy in the context of barcoding has been devoted to get the name of a query from a match of a barcode or a sequence on a high quality database. Those high quality databases (quality of sequences and quality of taxonomic assignation) have been obtained on Sanger. Currently, NGS (Next Generation Sequencing) are exponentially developing, providing large data sets of sequences. Barcode may take an interest in making taxonomic inventories from these data on environmental sequences. As the quality of data produced by NGS is still lower than for those produced by Sanger, the strategy we propose is (i) keep expanding high quality databases with taxonomic expertise and Sanger sequencing and (ii) develop tools for automatically matching reads from NGS on these databases for retrieving taxonomic inventories of communities. Two questions will be distinguished. First, to exhibit perfect matches between reads and a word on a reference sequence. We present a pipeline which implements this task without any heuristic, and retrieving with high consistency a known artificially built community. The accuracy is better than BLAST due to the absence of heuristic, and to the fact that any step is implemented in a rigorous way. Second, most of reads get imperfect matches, as they may differ from a

reference sequence by a couple of snp’s or length differences in homopolymers. We will present a program with still ongoing research which implements these imperfect matches, with a time longer than perfect matches, but which can take into account within species genetic variability between the reference and the queries, and some technical discrepancies to sequencing technology. This tool, called metaMatch, allows an automatic taxonomic

identification just after the output of NGS. Work is still going on to improve the speed and accuracy of this program.

Abstract considered for:

Data Analysis Working Group and the Data Portal

Références

Télécharger maintenant ( PDF - 2 Page - 180.40 KB )

Documents relatifs

Phoneme learning is influenced by the taxonomic organization of the semantic referents

Pre- vious work has shown that human learners rely on referential information to differentiate similar sounds, but largely ignored the problem of taxonomic ambiguity at the

De Novo NGS Data Compression

Abbreviations: Huf - Huffman coding, Markov - Context model, CM - Context mixing, AC - Arithmetic coding, Generic - Generic compression tools such as GZIP and BZIP 2, PPM -

Repositories for Taxonomic Data: Where We Are and What is Missing

In the specialized taxonomic journals (4113 studies), molecular data were widely used in mycology, but much less so in botany and zoology (Supplementary Appendix S7 available on

Identification of candidate polymorphism in a QTL region by combining eQTL mapping with NGS data

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des

Commentary: Cornfield on cigarette smoking and lung cancer and how to assess causality

Here, Cornfield and colleagues had to combine arguments about the strength of the associ- ation (a relative risk of about nine when comparing smokers with non-smokers and one of

Mercury and cadmium trigger expression of the copper importer Ctr1B, which enables Drosophila to thrive on heavy metal-loaded food

MTF-1 is activated by heavy metal load and, somewhat counter-intuitively, also has a role during copper starvation, whereby it induces transcrip- tion of the Ctr1B copper importer

Quality Control of Environmental Measurement Data with Quality Flagging

is tailored for weather measurement data, we propose as part of the main contribution a uniform interpretation for the quality flag values to be used for the flagging of any kind of

Structure of nanocrystalline phyllomanganates produced by freshwater fungi

The crystal structures of biogenic Mn oxides produced by three fungal strains isolated from stream pebbles were determined using chemical analyses, XANES and EXAFS spectroscopy,

Téléchargez tous les documents en téléchargeant vos documents d'étude.

Votre document sera enrichi, partagé sur 123dok FR pour vous aider à étudier.

Documents relatifs

Cinétiques de dissolution des cristaux dans les silicates fondus : contexte des verres nucléaires

Cinétiques de dissolution des cristaux dans les silicates fondus : contexte des verres nucléaires

233

0

0

en
fr

91

0

0

Biodiversité des digènes et autres helminthes intestinaux des Rougets : synthèse pour Mullus surmuletus (Linné, 1758) et M. barbatus (L., 1758) dans le bassin méditerranéen

Biodiversité des digènes et autres helminthes intestinaux des Rougets : synthèse pour Mullus surmuletus (Linné, 1758) et M. barbatus (L., 1758) dans le bassin méditerranéen

18

0

0

Bioaccumulation of heavy metals in Nha Trang bay, Khanh Hoa, Viet Nam

Bioaccumulation of heavy metals in Nha Trang bay, Khanh Hoa, Viet Nam

265

0

0

Switchable two-dimensional electron gas based on ferroelectric Ca:SrTiO 3

Switchable two-dimensional electron gas based on ferroelectric Ca:SrTiO 3

7

0

0

Les résections de l'intestin grêle et du colon au centre hospitalier national yalgado ouedraogo aspects épidémio-cliniques, thérapeutiques et évolutifs

Les résections de l'intestin grêle et du colon au centre hospitalier national yalgado ouedraogo aspects épidémio-cliniques, thérapeutiques et évolutifs

124

0

0

L'odeur de l'encre : exploration heuristique du sens donné à une pratique de calligraphie japonaise

L'odeur de l'encre : exploration heuristique du sens donné à une pratique de calligraphie japonaise

300

0

0

9. Nutrition azotée de l’épeautre en Ardenne et en région limoneuse

9. Nutrition azotée de l’épeautre en Ardenne et en région limoneuse

16

0

0