• Aucun résultat trouvé

HaploPhyle : graphical haplotype network in the light of external data

N/A
N/A
Protected

Academic year: 2021

Partager "HaploPhyle : graphical haplotype network in the light of external data"

Copied!
1
0
0

Texte intégral

(1)

Rationale

Altogether analyzing haplotype networks and relating them to geographic patterns allows development of phylogeographic analyses and has a great power for resolving crop domestication and understanding further crop adaptation [1]. This project intends to provide the community with a pipeline of analysis of genotyping data (sequences or SNP) which will include haplotype definition, haplotype grouping, haplotype network analysis and connexion with external data such as geographic origin or genetic group asessment.

HaploPhyle: Graphical Haplotype Network in the Light of External Data

Gautier SARAH, Manuel RUIZ, Xavier PERRIER, Claire BILLOT CIRAD, UMR DAP, Avenue Agropolis, 34 398 Montpellier, Cedex 5, France, Email: gautier.sarah@cirad.fr

References :

[1] Saisho D and Purugganan MD. 2007. Molecular phylogeography of domesticated Barley traces expansion of agriculture in the Old World. Genetics, 177:1765-1776

[2] Bandelt H-J, Forster P, Sykes BC and Richards MB. 1995. Mitochondrial portraits of human populations using median networks. Genetics 141: 743-753

[3] Bandelt H-J, Forster P and Röhl A. 1999. Median-Joining networks for inferring intraspecific phylogenies. Mol Biol Evol 17: 37-48

Objective 1: Haplotyping

We use existing softwares: Gevat, PHASE (In progress), haplotyper (In progress). Users have the possibility to select which software they want to use and set the different parameters the selected software allows.

Objective 3: Network Building

We created a Jar file implementing the minimum spanning network algorithm. We plan to add soon the Reduced Median Network and Median Joining Network developed by Bandelt et al. [2,3]. In the future, more algorithms will be added enabling more choices to the users. Nodes sizes will be proportional to the numbers of germplasms sharing an haplotype and edges length will be proportional to the difference between haplotypes.

Objective 2: Haplotype Visualization

Users may wish to visualize at once the haplotypes, and enhance differences between haplotypes, by choosing some accessions as references. We created an R function which draw the haplotypes according to a tree. Users can set one color for each referent haplotypes, color for common alleles and color for new alleles. Missing data always appear in white. For a better visualization, haplotypes order is fixed by a tree. This representation allows users to visualize at once gene integration between two populations.

Objective 4: Correlation with External Data

Nodes of the network will be represented by pie-charts taking into consideration the repartition of external information (set by the user) among germplasms owning this haplotype. We can also add the graphical representation of the haplotype as explained in objective 2. Haplophyle Workflow Haplotyping Network Building Correlation with External Data Clustering

Allelic Data External Data

Input Output

Objective 5: Creation of a Web Interface

The pipeline is avalaible through a user friendly Web interface allowing users to set parameters for each step. The output image can be downloaded or visualized directly in the Web browser. User can zoom out and zoom in the image to have a general view of the network or to see more precisely a part of it.

Références

Documents relatifs

In the present study, we extended the analysis of HFE by using internal HFE SNPs, and derived haplotypes to determine whether particular alleles are associated with iron

To cite this article: Nouran Adel Youssef , Manon Curaudeau , Soheir Mohamed El Nahas , Amal Ahmed Mohamed Hassan & Alexandre Hassanin (2021) Haplotype diversity in

In theoretical investigations it was shown that the number of alleles per seg- ment influences the power for the distinction between populations with the marker estimated kinship

A: hsphase; B: AlphaPhase with pedigree; C: pedigree free AlphaPhase; D: BEAGLE; E: PedPhase; red and blue colours indicate the sire ’s strands inherited by each offspring, obtained

5 Additive and residual variances for the three methods used to build relationship matrices for three traits (SC, AGECL and WTCL) and different window sizes.. The black

In this paper, we present a procedure combining variable selection (i.e. the selection of predictive SNPs) and linear discriminant analysis for the identification of carriers of

WhatsHap is a sequential algorithm, based on dynamic programming, that takes as input the fragment matrix F (one row per read, one column per SNP position, and values in {0, 1, −})

Both fixed-effect and random-effect models (for those SNPs with heterogeneity I 2 > 25) of meta-analysis were performed in order to control cross-study heterogeneity. A total of