• Aucun résultat trouvé

BIC-seq: a fast algorithm for detection of copy number alterations based on high-throughput sequencing data

N/A
N/A
Protected

Academic year: 2021

Partager "BIC-seq: a fast algorithm for detection of copy number alterations based on high-throughput sequencing data"

Copied!
2
0
0

Texte intégral

(1)

BIC-seq: a fast algorithm for detection of copy number

alterations based on high-throughput sequencing data

The MIT Faculty has made this article openly available.

Please share

how this access benefits you. Your story matters.

Citation

Xi et al. "BIC-seq: a fast algorithm for detection of copy number

alterations based on high-throughput sequencing data". Genome

Biology 2010 11(Suppl 1):O10.

As Published

http://dx.doi.org/10.1186/gb-2010-11-S1-O10

Publisher

BioMed Central Ltd.

Version

Final published version

Citable link

http://hdl.handle.net/1721.1/69934

Terms of Use

Creative Commons Attribution

(2)

SELECTED ORAL PRESENTATION

Open Access

BIC-seq: a fast algorithm for detection of copy

number alterations based on high-throughput

sequencing data

Ruibin Xi

1

, Joe Luquette

1

, Angela Hadjipanayis

2

, Tae-Min Kim

1

, Peter J Park

1,3,4*

From Beyond the Genome: The true gene count, human evolution and disease genomics

Boston, MA, USA. 11-13 October 2010

DNA copy number alterations (CNA), which are amplifications and deletions of certain regions in the genome, play an important role in the pathogenesis of cancer and have been shown to be associated with other diseases such as autism, schizophrenia and obe-sity. Next-generation sequencing technologies provide an opportunity to identify CNA regions with unprece-dented accuracy. We developed a CNA detection algo-rithm based on single-end whole-genome sequencing data for samples with matched controls. This algo-rithm, called BIC-seq, can accurately and efficiently identify the CNAs via minimizing the Bayesian infor-mation criterion (BIC). We applied BIC-seq on a glio-blastoma multiforme (GBM) tumor genome from the Cancer Genome Atlas (TCGA) project and identified hundreds of CNVs, some were as small as 10 bp. We compared these CNAs with those detected using the array Comparative Genomic Hybridization (CGH) plat-forms and found that about one third were ‘missed’ by the array-CGH platforms, most of which were CNAs less than 10 kb. We selected 17 of the CNAs not detected by the array-based platforms for validation, ranging from 110 bp to 14 kb, and found that 15 of them are true CNAs. We further extended BIC-seq to the multi-sample case to identify recurrent CNAs in across multiple tumor genomes.

Author details

1Center for Biomedical Informatics, Harvard Medical School, 10 Shattuck St,

Boston, Massachusetts 02115, USA.2Harvard Medical School, 77 Louis

Pasteur Avenue, Boston, Massachusetts 02115, USA.3Department of

Medicine, Brigham and Women’s Hospital, 77 Louis Pasteur Avenue, Boston,

Massachusetts 02115, USA.4Harvard-MIT Health Sciences and Technology

Informatics Program at Children’s Hospital, 300 Longwood Ave., Boston, Massachusetts 02115, USA.

Published: 11 October 2010

doi:10.1186/gb-2010-11-S1-O10

Cite this article as: Xi et al.: BIC-seq: a fast algorithm for detection of copy number alterations based on high-throughput sequencing data. Genome Biology 2010 11(Suppl 1):O10.

Submit your next manuscript to BioMed Central and take full advantage of:

• Convenient online submission

• Thorough peer review

• No space constraints or color figure charges

• Immediate publication on acceptance

• Inclusion in PubMed, CAS, Scopus and Google Scholar

• Research which is freely available for redistribution

Submit your manuscript at www.biomedcentral.com/submit 1

Center for Biomedical Informatics, Harvard Medical School, 10 Shattuck St, Boston, Massachusetts 02115, USA

Full list of author information is available at the end of the article Xi et al. Genome Biology 2010,11(Suppl 1):O10

http://genomebiology.com/2010/11/S1/O10

Références

Documents relatifs

Detection of unknown genetically modified organisms (GMO) by statistical analysis of high throughput sequencing data... Detection of unknown genetically modified organisms (GMO)

We provide a short introduction to the laboratory work required and then describe the possible steps in data processing for the detection of plant viruses, including quality control

ont propos´e une exp´erience intracavit´e de fluorescence dispers´ee de la mol´ecule NiH [Hill 90] dans laquelle une mini source `a pulv´erisation cathodique ´etait ins´er´ee dans

Our aim is to control the transition graph of the PWA system to obtain an oscilla- tory behaviour, which is indeed of primary functional importance in numerous biological networks;

Then, during the second step an adaptive genetic algorithm will explore the matrix to discover combina- tions of k-mers that can accurately classify input samples.. The adaptive

Keywords: Computational Statistics, High-dimensional data, Dimension reduction, Compression, Variable selection, Logistic regression, Sparse Partial Least Squares, Probabilistic

For instance, the defender-first sequential game with perfect information (by the attacker) does not combine naturally with hidden choice ⨊, since that would mean that the

Dans ce chapitre, on définit ce système dans des coordonnées héliocentriques, On démontre l’analyticité du hamiltonien séculaire, au voisinage du point circulaire non incliné