• Aucun résultat trouvé

Co2Vis: A Visual Analytics Tool for Mining Co-Expressed and Co-Regulated Genes Implied in HIV Infections

N/A
N/A
Protected

Academic year: 2021

Partager "Co2Vis: A Visual Analytics Tool for Mining Co-Expressed and Co-Regulated Genes Implied in HIV Infections"

Copied!
2
0
0

Texte intégral

(1)

HAL Id: lirmm-01275395

https://hal-lirmm.ccsd.cnrs.fr/lirmm-01275395

Submitted on 17 Feb 2016

HAL is a multi-disciplinary open access

archive for the deposit and dissemination of

sci-entific research documents, whether they are

pub-lished or not. The documents may come from

teaching and research institutions in France or

abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est

destinée au dépôt et à la diffusion de documents

scientifiques de niveau recherche, publiés ou non,

émanant des établissements d’enseignement et de

recherche français ou étrangers, des laboratoires

publics ou privés.

Co2Vis: A Visual Analytics Tool for Mining

Co-Expressed and Co-Regulated Genes Implied in HIV

Infections

Amal Zine El Aabidine, Arnaud Sallaberry, Sandra Bringay, Mickaël

Fabrègue, Charles-Henri Lecellier, Nhat Hai Phan, Pascal Poncelet

To cite this version:

Amal Zine El Aabidine, Arnaud Sallaberry, Sandra Bringay, Mickaël Fabrègue, Charles-Henri Lecellier,

et al.. Co2Vis: A Visual Analytics Tool for Mining Co-Expressed and Co-Regulated Genes Implied in

HIV Infections. BioVis: Biological Data Visualization, 2013, Atlanta, United States. 2013.

�lirmm-01275395�

(2)

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

                   

           [1]  E.R.  Dougherty.  Small  sample  issues  for  microarray-­‐based  classifica>on.  Compara>ve  and  Func>onal  Genomics,  2:28-­‐34,  2001.  

           [2]  M.  Fabregue,  S.  Bringay,  P.  Poncelet,  M.  Teisseire  and  B.  OrseP.  Mining  microarray  data  to  predict  the  histological  grade  of  a  breast  cancer.  JBI,  44:S12-­‐S16,  2011.              [3]  H.  Phan  Nhat,  D.  Ienco,  P.  Poncelet  and  M.  Teisseire.  Mining  Representa>ve  Movement  PaVerns  through  Compression.  PAKDD,  pp.  314-­‐326,  2013.  

           [4]  H.  Phan  Nhat,  P.  Poncelet  and  M.  Teisseire.  GET_MOVE:  An  Efficient  and  Unifying  Spa>o-­‐Temporal  PaVern  Mining  Algorithm  for  Moving  Objects.  IDA,  pp.  276-­‐288,  2012.              [5]  E.  R.  Gansner,  E.  Koutsofios,  S.  C.  North,  and  K.-­‐P.  Vo.  A  Technique  for  Drawing  Directed  Graphs.  IEEE  TSE,  19(3):214–230,  1993.    

           [6]  F.  Bendix  and  R.  Kosara  and  H.  Hauser.  Parallel  sets:  A  visual  analysis  of  categorical  data.  InfoVis,  pp  133-­‐140,  2005.  

                                     Co

2

Vis:  A  Visual  Analy/cs  Tool  for  Mining  Co-­‐Expressed    

                                               and  Co-­‐Regulated  Genes  Implied  in  HIV  Infec/ons  

Amal  Zine  El  Aabidine

1,2

,  Arnaud  Sallaberry

1,3

,  Sandra  Bringay

1,3

,  Mickael  Fabregue

1,4

,    

Charles  Lecellier

5

,  Nhat  Hai  Phan

1,4

,  Pascal  Poncelet

1,2  

 

 

1  –  Laboratoire  d’Informa>que,  de  Robo>que  et  de  Microélectronique  de  Montpellier  LIRMM,  Montpellier,  France   2  –  Université  Montpellier  2,  Montpellier,  France  

3  –  Université  Montpellier  3,  Montpellier,  France   4  –  IRSTEA  Montpellier,  UMR  TETIS,  Montpellier,  France  

5  –  Ins>tut  de  Géné>que  Moléculaire  de  Montpellier  IGMM,  Montpellier,  France    

hQp://www.lirmm.fr/genes_visualiza/on/                                                                                                                                                                                  Email:  [email protected]

               

One   of   the   key   challenges   in   human   health   is   the   iden>fica>on   of  

disease-­‐causing   genes   like   AIDS   (Acquired   ImmunoDeficiency  

Syndrome).   Numerous   studies   have   addressed   this   challenge   through   gene   expression   analysis.   Due   to   the   amount   of   data   available,   processing   DNA   microarrays   in   a   way   that   makes   biomedical  sense  is  s>ll  a  major  issue.    

 

Sta>s>cal   methods   and   data   mining   techniques   play   a   key   role   in   discovering  previously  unknown  knowledge.  However,  applying  such   techniques   in   this   context   is   difficult   because   the   number   of   measurement   points   (i.e.,   gene   expression   levels)   is   much   higher   than   the   number   of   samples   resul>ng   in   the   well-­‐known   curse   of   dimensionality  problem  also  called  the  high  feature-­‐to-­‐sample  ra>o   [1].    

 

We   propose   a   combina>on   of   data   mining   and   visual   analy/cs   methods   to   iden>fy   and   render   groups   of   genes   implied   in   HIV   infec>ons  and  sharing  common  behaviors.  

Temporal  changes  of  the  expression  levels  of  about  19,000  genes  on   three  HIV  strains  (HIV-­‐2,  HIV-­‐1  :  X4  and  R5),  were  evaluated  at  04h,   08h,  24h,  48h  and  72h  amer  infec>ons.  

Introduc>on  

Data  

This   step   consists   in   iden>fying   co-­‐expressed   genes,   i.e.   sequen>al   paVerns  of  genes  extracted  according  to  their  level  of  expression  [2]:               1-­‐  Sequence              processing                         2-­‐  Closed  Sequen>al              paVerns                 3-­‐  Par>ally  ordered              paVerns  

Par>ally  ordered  paVerns  

Par>ally  ordered  paVerns  (on  the  lem):  DAG  drawing  algorithm  [5]    

Temporal  paVerns  (on  the  right):    

-­‐  Interac>ve  graphic  chart.  The  list  (on  the  lem)  shows  the  set  of   func>ons.   The   diagram   (on   the   center)   shows   the   number   of   trajectories   grouped   by   their   main   func>ons   along   >me.   The   histogram  (on  the  right)  shows  the  distribu>ons  of  the  func>ons   for  each  trajectory  of  a  selected  rectangle  in  the  diagram.     -­‐  Parallel  sets  visualiza>on  [6]  (on  the  boVom).  Clusters  of  genes  

for   each   >me   step   (computed   for   the   temporal   paVern   extrac>on  process).  

BioVis

2013

Visualiza>on  

Temporal  paVerns  

This   step   consists   in   iden>fying   co-­‐regulated   genes,   i.e.   groups/ trajectories  of  genes  sharing  frequently  a  similar  level  of  expression   (incremental  approach  GET_MOVE  [3,4]):  

            ({o1,o2}{t1,t3,t4})    

Annota>on:   each   trajectory   (e.g.   the   group   C   in   the   previous   example)   is   labeled   by   a   set   of   func>ons   extracted   from   the   gene   ontology  website  (hVp://www.geneontology.org/).    

Références

Documents relatifs

Alexis Delaforge, Rohan Goel, Samiha Fadloun, Sarah Valentin, Arnaud Sallaberry, Mathieu Roche, Pascal Poncelet.. To cite

Résumé : Pour mieux comprendre les fondements écophysiologiques du dépérissement du Cèdre de l’Atlas (Cedrus atlantica M.) au Moyen Atlas Tabulaire au Maroc, un

1 IFIP, breeding techniques, France, 2 INRA, UMR Pegase, France, 3 ANSES, Animal Health and Welfare, France, 4 ISA Lille, Grecat, France;

Alignment of the upstream region of the pep1, pep2 and pep3 genes identified in B.cereus ATCC 14579 showed that the three promoter regions were very similar to the - 35 and -10

Figure 4 illustrates a topic based analysis of the nDCG where relative difference between the 64 runs are calculated on partial indexes and the final runs are calculated on the

To study the effect of mastitis infection on the gene expression pattern in the mammary gland, mRNA abundance of selected differentially displayed sequences characterizing the

Conclusion: Human Gene Correlation Analysis (HGCA) is a tool to classify human genes according to their coexpression levels and to identify overrepresented annotation terms

This information includes attack type, attacking host and targeted host, user criticality value, vulnerability specific infor- mation (metric, CVE code [3] and description), and