Récemment recherché

Aucun résultat trouvé

Étiquettes

Aucun résultat trouvé

Document

Aucun résultat trouvé

Accueil Écoles Thèmes

Connexion

SVJedi : Structural variation genotyping using long reads

Partager "SVJedi : Structural variation genotyping using long reads"

N/A

N/A

Protected

Année scolaire: 2021

Info

Protected

Academic year: 2021

Partager "SVJedi : Structural variation genotyping using long reads"

Copied!

2

0

0

2

0

0

Chargement.... (Voir le texte intégral maintenant)

Télécharger maintenant ( 2 Page )

Texte intégral

(1)

HAL Id: hal-02290884

https://hal.inria.fr/hal-02290884

Submitted on 18 Sep 2019

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

SVJedi : Structural variation genotyping using long reads

Lolita Lecompte, Pierre Peterlongo, Dominique Lavenier, Claire Lemaitre

To cite this version:

Lolita Lecompte, Pierre Peterlongo, Dominique Lavenier, Claire Lemaitre. SVJedi : Structural variation genotyping using long reads. HiTSeq 2019 - Conference on High Throughput Sequencing, Jul 2019, Basel, Switzerland. �hal-02290884�

(2)

SVJedi : Structural variation genotyping using long reads

Lolita Lecompte ¹ , Pierre Peterlongo ¹ , Dominique Lavenier ¹ , Claire Lemaitre ¹

1

Univ Rennes, Inria, CNRS, IRISA, F-35000 Rennes, France

Introduction

Structural variations (SVs) are DNA seg- ments that have been moved regarding another reference genome. The number of known SVs is increasing, especially with the development of third generation sequencing technologies, which produce long reads data.

Discovery vs. Genotyping

SV discovery

Given a reference genome, discover SVs be- tween a sequencing sample and a reference genome.

SVIM (2019), Sniﬄes (2018), pbsv (2018), NanoSV (2017)

SV genotyping

Given a set of already known variants for a species, assess the presence or absence of each variant in a given sequencing sample.

There is no known method for SV genotyping with long reads.

Method - SVJedi

Results on a simulated dataset

Simulated dataset

■

1,000 deletions selected on the human chromosome 1 from dbVar

(50 bp - 10k bp)

■

Equally distributed : 0/0, 0/1, 1/1

■

Long reads simulated with SimLoRD [1]

16 % error rate, 20X sequence depth

SVJedi

0/0 0/1 1/1 ./.

T ruth

0/0 330 3 0 0

0/1 19 305 6 4

1/1 3 10 311 9

Precision : 95.85 %

Does SV discovery tools correctly estimate genotypes?

Sniﬄes [2]

0/0 0/1 1/1

T ruth

0/1 3 89 0

1/1 0 254 60

Recall : 60.87 % Precision : 36.70 %

80.9 % of 1/1 deletions are wrongly genotyped as 0/1

Results on a real dataset

Dataset

■

Sample: NA12878

■

Data: Nanopore WGS consortium (37X)

■

svclassify [3]: 2,677 high confidence dele- tions

sv classify

SVJedi

0/0 0/1 1/1 ./.

0/1 194 1,289 202 28

1/1 8 75 877 4

Identity : 81.89 % Predicted genotypes = 98.81 %

Runtime : 2h35 (40 CPU)

Conclusion

▶ New method to genotype SVs using long reads data.

▶ Importance of dedicated genotyping methods.

▶ Currently only applied to deletions.

▶ Implementation: SVJedi

References

[1] Bianca K Stöcker, Johannes Köster, and Sven Rahmann. Simlord: simulation of long read data. Bioinformatics, 32(17):2704–2706, 2016.

[2] Fritz J Sedlazeck, Philipp Rescheneder, Moritz Smolka, Han Fang, Maria Nattestad, Arndt von Haeseler, and Michael C Schatz. Accurate detection of complex structural variations using single-molecule sequencing. Nature methods, 15(6):461, 2018.

[3] Hemang Parikh, Marghoob Mohiyuddin, Hugo YK Lam, Hariharan Iyer, Desu Chen, Mark Pratt, Gabor Bartha, Noah Spies, Wolfgang Losert, Justin M Zook, et al. svclassify: a method to establish benchmark structural variant calls. BMC genomics, 17(1):64, 2016.

https://github.com/llecompte/SVJedi

Contact: lolita.lecompte@inria.fr

Références

Télécharger maintenant ( PDF - 2 Page - 1.08 MB )

Documents relatifs

Política sectorial para el sector financiero de la COSUDE

Hay que establecer políticas para cada uno de los sectores importantes de la COSUDE (Agencia Suiza para el Desarrollo y la Cooperación); por tanto también para el sector

Accurate sequence variant genotyping in cattle using variation-aware genome graphs

We compared the accuracy and sensitivity of graph-based sequence variant genotyping using the Graphtyper software to two widely- used methods, i.e., GATK and SAMtools, which rely

MaskAl: Privacy Preserving Masked Reads Alignment using Intel SGX

MaskAl combines a fast preprocessing step on raw genomic data — filtering and masking — with established algorithms to align sanitized reads, from which sensitive parts have been

ELECTOR: evaluator for long reads correction methods

ELECTOR then compares three different versions of each read: the ‘uncorrected’ version, as provided by the sequencing experiment or by the read simulator, the ‘corrected’

Using genotyping-by-sequencing to understand Musa diversity : [P449]

To assess genetic diversity with an improved resolution, we have selected 65 accessions with diploid and triploid combinations of the A and/or B genomes including AAB plantains and

Identification of QTLs controlling yield traits in sweet sorghum using genotyping by sequencing

A lattice experiment with three replicates was established in two years for evaluating four important yield traits: plant height (in m), plot weight (in kg), sucrose content (or

Numerical study of transition to turbulence in plane Poiseuille flow in physical space and state space

Trajectories that are restricted to the edge via this algorithm eventually converge to the edge state, which can be a fixed point like in spatially periodic plane Couette flow [50],

CONSENT: Scalable self-correction of long reads with multiple sequence alignment

This edit script can easily be retrieved by Daccord, as it relies on DALIGNER (Myers, 2014) to compute actual alignments between the long reads. However, as CONSENT relies on a

Téléchargez tous les documents en téléchargeant vos documents d'étude.

Votre document sera enrichi, partagé sur 123dok FR pour vous aider à étudier.

Documents relatifs

Quel fourrage pour quelle autonomie ?

Quel fourrage pour quelle autonomie ?

63

0

0

Genetics of nodulation in Aeschynomene evenia uncovers new mechanisms of the rhizobium-legume symbiosis

Genetics of nodulation in Aeschynomene evenia uncovers new mechanisms of the rhizobium-legume symbiosis

28

0

0

Against preservation

Against preservation

17

0

0

Projection des dépenses publiques en médicaments au Québec : regards sur les récentes modifications au cadre législatif, la viabilité du régime et l'accessibilité aux traitements

Projection des dépenses publiques en médicaments au Québec : regards sur les récentes modifications au cadre législatif, la viabilité du régime et l'accessibilité aux traitements

101

0

0

فعالية التسويق المصرفي الحديث في ترقية تنافسية البنوك التجارية

فعالية التسويق المصرفي الحديث في ترقية تنافسية البنوك التجارية

186

0

0

A Hybrid Tissue-Level Model of the Left Ventricle: Application to the Analysis of the Regional Cardiac Function in Heart Failure

A Hybrid Tissue-Level Model of the Left Ventricle: Application to the Analysis of the Regional Cardiac Function in Heart Failure

11

0

0

CHEMICAL DIVERSITY IN THE ULTRA-FAINT DWARF GALAXY TUCANA II

CHEMICAL DIVERSITY IN THE ULTRA-FAINT DWARF GALAXY TUCANA II

8

0

0

L'impact spatial et énergétique des data centers sur les territoires

L'impact spatial et énergétique des data centers sur les territoires

141

0

0