Le Valais en marche vers la modernité [1849-1945]

A expressão gênica é um dos principais pilares na transmissão da informação genética, pois os genes são sequências importantes na regulação celular em

organismos de todos os domínios da vida. A criação de enciclopédias de expressão gênica possibilita a compreensão de grupos de genes que são co-expressos em diferentes tecidos, facilitando o entendimento de suas funções e de seus processos regulatórios.

Devido à grande quantidade de dados biológicos gerados em grande escala em projetos de genômica e transcriptômica, foi criada uma demanda intensa na utilização de técnicas fornecidas pela inteligência artificial, o uso de ferramentas baseadas em métodos de aprendizagem de máquinas tornou-se amplamente utilizado na bioinformática. Assim, com o propósito de ajudar pesquisadores em análises de expressão gênica, foi construída a ferramenta CORAZON.

Com base nos resultados apresentados, foi possível observar que o servidor web CORAZON, além de ter sido validado em dados simulados, gerou grupos de tecidos e genes com base na classificação dos processos biológicos e na origem evolutiva. Além disso, foi detectado que as normalizações do CORAZON suprem alguns problemas biológicos, como por exemplo, problemas relacionados com tamanhos de genes, e melhoram os resultados dos agrupamentos.

Em relação aos tecidos, foram observados grupos relacionados a glândulas, tecidos cardíacos, tecidos musculares, tecidos relacionados ao sistema reprodutivo e em todas as três bases, foram vistos grupos com um único tecido, como testículos, cérebro e medula óssea. Tendo em vista os grupos de genes, foi notado que vários grupos possuíam especificidades em suas funções: detecção de estímulos envolvidos na percepção sensorial, reprodução, sinalização sináptica, sistema nervoso, sistema imunológico, desenvolvimento de sistemas, metabólicos e outros.

Além disto, foi visto que, normalmente, os grupos com mais genes codificantes estão associados aos processos celulares, metabólicos, de transportes e de desenvolvimento de sistemas, enquanto que grupos com mais genes não codificantes geralmente estão envolvidos com a detecção de estímulo, percepção sensorial, sistema imunológico e digestão. Foi observado também que os grupos compostos com mais de 85% de genes não codificantes, mais de 40% de seus genes codificantes são recentes, originados no clado Mammalia e a minoria originado no clado Eukaryota. Caso contrário, grupos constituídos com mais de 90% de genes codificantes, mais de 40% deles apareceram em Eukaryota e a minoria em Mammalia e grupos sem estes critérios, tem a maioria dos seus genes codificantes surgidos em Eumetazoa.

genes originados em Eukaryota, Bilateria e Euteleostomi e que na base analisada (Txn Factor ChIP), existiam mais 16 FTs com este comportamento, mas que ainda não foram analisados. Também foi percebido que os FTs que fazem parte do complexo da RNA polimerase III (BRF1, POLR3G, BDP1) regulam 8 vezes mais genes do grupo com mais de 86% de genes não codificantes, que estão relacionados com detecção de estímulos e percepção sensorial e cujos genes codificantes, pertencentes a estes grupos, são originados em Mammalia, no qual cerca de 10% foram originados em Eutheria. E juntamente com eles, o fator ZNF274, que é um repressor da transcrição e tem enriquecimento de sítios de HighD em categorias específicas de ncRNA.

Portanto, a ferramenta CORAZON pode ajudar em análises de grande quantidade de dados genômicos, facilitando o entendimento das relações entre as instâncias, neste caso, genes, e possibilita estudos da história evolutiva dos genes, do mapa regulatório do sistema biológico e dos fatores de transcrição associados.

Assim, algumas propostas de trabalhos futuros são: testar mais bases de dados; analisar biologicamente os fatores de transcrição, de forma mais aprofundada; estudar mais a fundo a evolução dos fatores de transcrição; investigar os genes regulados por fatores de transcrição originados em Mammalia; entender de forma mais detalhada a relação dos grupos com maior porcentagem de genes originado em Mammalia, e os fatores de transcrição que fazem parte do complexo da RNA polimerase III.

REFERÊNCIAS

AGGARWAL, Charu C.; REDDY, Chandan K. (Ed.). Data clustering: algorithms and

applications. CRC Press, 2013.

ALBERTS Bruce, JOHNSON Alexander, LEWIS Julian, et al. Molecular Biology of the

Cell. 4th edition. New York: Garland Science, 2002.

ALBERTS Bruce, JOHNSON Alexander, LEWIS Julian, et al. Molecular Biology of the

ALOISIO, Giovanni; CAFARO, Massimo; FIORE, Sandro; MIRTO, Maria. Progengrid: A Grid Framework for Bioinformatics. Biological and Artificial Intelligence

Environments. Springer, Dordrecht, 2005.

ANDEBERG, Michael R. Cluster analysis for applications. New York: Academic Press, 1973.

ASHBURNER, Michael et al. Gene Ontology: tool for the unification of biology. Nature

genetics. v. 25, n. 1, p.p. 25-29, 2000.

AZUAJE, Francisco. Making genome expression data meaningful: prediction and discovery of classes of cancer through a connectionist learning approach. Proceedings

IEEE International Symposium on Bio-Informatics and Biomedical Engineering. p.p.

208-213, 2000.

BENJAMINI, Yoav. Discovering the false discovery rate. Journal of the Royal Statistical Society, Series B. v, 72, n. 4, p.p. 405–416, 2010.

BENJAMINI, Yoav; HOCHBERG, Yosef. Controlling the false discovery rate: a practical and powerful approach to multiple testing. Journal of the Royal Statistical Society, Series B. v. 57, n. 1, p.p. 289–300, 1995.

BINNS, D. et al. QuickGO: a web-based tool for Gene Ontology searching.

Bioinformatics (Oxford, England). v. 25, n. 22, p.p. 3045–3046, novembro 2009.

CARLIN, B.P.; LOUIS, T.A. Bayes and Empirical Bayes methods for date analysis. 2

ed. Boco Raton: Chapman and Hall/ CRC Press. 2000.

CHAKRABORTY, Indrajeet; CHOUDHURY, Amarendranath. Artificial Intelligence in Biological Data. Journal of Information Technology and Software Engineering. v. 7, n. 4, 2017.

CHEN, Scott; GOPALAKRISHNAN, P.S.. Clustering via the Bayesian information criterion with applications in speech recognition. Proceedings of the 1998 IEEE

International Conference on Acoustics, Speech and Signal Processing. v. 2, p.p. 645-

648, 1998.

Pattern Analysis and Machine Intelligence. v. 17, n. 8, p.p. 790–799, 1995.

CHUNG, Fang; HUANG, Chuen-Der; SHEN, Ya-Hsin; LIN, Chin-Teng. Recognition of Structure Classification of Protein Folding by NN and SVM Hierarchical Learning Architecture. Artificial Neural Networks and Neural Information Processing —

ICANN/ICONIP 2003. v. 2714, p.p. 1159-1167, 2003.

CLANCY, Suzanne. DNA transcription. Nature Education v. 1, n. 1, p. 41, 2008.

CLARK, Tyson A.; SUGNET Charles W.; ARES, Manuel. Genomewide analysis of mRNA processing in yeast using splicing-specific microarrays. Science. v. 296, p.p. 907– 910, 2002.

CLINE, Melissa et al. Integration of biological networks and gene expression data using Cytoscape. Nature protocols. v. 2, n. 10, p.p. 2366-2382, 2007.

COMANICIU, Dorin; MEER, Peter. Mean shift: A robust approach toward feature space analysis. Pattern Analysis and Machine Intelligence, IEEE Transactions on. v. 24, n. 5, p. 603-619, 2002.

CONDUTA, Bruno; MAGRIN, Diego. Aprendizagem de Máquina. 2010. Dissertação (ou Tese) – Faculdade de Tecnologia - FT, Universidade Estadual de Campinas – UNICAMP, Limeira-SP, 2010.

CONESA, A. et al. Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics (Oxford, England). v. 21, n. 18, p.p. 3674–3676, setembro 2005.

CZECH, Daniel et al. The human testis determining factor SRY localizes in midbrain dopamine neurons and regulates multiple components of catecholamine synthesis and metabolism. Journal of neurochemistry. v. 122, n. 2, p.p. 260-271, 2012.

De BRITO, Daniel; MARACAJA-COUTINHO, Vinícius; de FARIAS, Sávio; BATISTA, Leonardo; do RÊGO, Thaís. A Novel Method to Predict Genomic Islands Based on Mean Shift Clustering Algorithm. PLOS ONE. v. 11, n. 1, 2016.

comparison of 13 endogenous control genes. Lab Invest. v. 85, p.p. 154-159, 2005.

de SOUTO, Marcilio et al. Clustering cancer gene expression data: a comparative study. BMC Bioinformatics. v.9, n. 497, 2008.

DERPANIS, Konstantinos G.. Mean shift clustering. Notas de aula. http://www.cse.yorku.ca/~kosta/CompVis_Notes/mean_shift.pdf, 2005.

DEWING, Phoebe et al. Direct regulation of adult brain function by the male-specific factor SRY. Current Biology. v. 16, n. 4, p.p. 415-420, fevereiro 2006.

DI CROCE Luciano, HELIN Kristian. Transcriptional regulation by Polycomb group proteins. Nature Structural & Molecular Biology. v. 20, n. 10 p.p. 1147-1155, outubro 2013.

DIAS, Vera; JUNN, Eunsung; MOURADIAN, Maral. The Role of Oxidative Stress in Parkinson’s Disease. Journal of Parkinson’s disease. v. 3, n. 4, p.p. 461-491, 2013.

DING, Chris; DUBCHAK, Inna. Multi-class protein fold recognition using support vector machines and neural networks. Bioinformatics. v. 17, n.4, p.p. 349-358, abril 2001.

do RÊGO, Thaís. Predição da Expressão Gênica a Partir de Modificações de

Histonas e Afinidades de Ligação de Fatores de Transcrição Utilizando Modelo de Mistura de Regressão Linear. Pernambuco: Centro de Informática, Universidade

Federal de Pernambuco, 2012.

DONI, Marcelo; Análise de Cluster: Métodos Hierárquicos e de Particionamento. 2004. Dissertação (ou Tese) – Sistemas de Informação, Universidade Presbiteriana Mackenzie, São Paulo, 2004.

DUDA, Richard; Hart, Peter; Stork, David. Pattern Classification. Wiley, 2001.

DUDOIT, Sandrine; FRIDLYAND, Jane. A prediction-based resampling method for estimating the number of clusters in a dataset. Genome Biology. v. 3, n. 7, 2002.

D’HAESELEER, P.. How does gene expression clustering work?. Nature Biotechnology. v. 23, p.p. 1499-1501, 2005.

EZZIANE, Zoheir. Applications of artificial intelligence in bioinformatics: A review.

Expert Systems with Applications. v. 30, n. 1, p.p. 2-10, 2006.

FACELI, Katti. Inteligência artificial: Uma abordagem de aprendizado de máquina. Rio de Janeiro: LTC, 2011.

FACHEL, Angela et al. Expression analysis and in silico characterization of intronic long noncoding RNAs in renal cell carcinoma: emerging functional associations. Molecular

Cancer. v. 12, n. 140, 2013.

FEDERHEN, Scotty. The NCBI Taxonomy database. Nucleic acids research, v. 40, n. 102 Database issue, p.p. 136–143, janeiro 2012.

FERAUCHE, Thais. Teoria Geral de Sistemas de Informação. São Paulo: Centro Paula Souza, Faculdade de Tecnologia da Praia Grande, 2006. Disponível em: <https://xa.yimg.com/kq/groups/68387000/1716660132/name/APOSTILA+INTRODUA O+A+TEORIA+DE+SISTEMAS.pdf.>.

FRAYLEY, Chris; RAFTERY, Adrian. How many clusters? Which clustering method? Answers via model-based cluster analysis. The computer Journal. v. 41, n. 8, p.p. 578- 588, 1998.

FRIEDMAN, Nir et al. Using Bayesian networks to analyze expression data. Journal of

Computational Biology. v. 7 n. 3-4, p.p. 601-620, 2000.

FUKUNAGA, Keinosuke; HOSTETLER, Larry. The estimation of the gradient of a density function, with applications in pattern recognition. Information Theory, IEEE

Transactions on. v. 21, n. 1, p.p. 32-40, 1975.

GAN, Guojun; MA, Chaoqun; WU, Jianhong. Data Clustering: Theory, Algorithms,

and Applications. Philadelphia, Pa.: SIAM, Society for Industrial and Applied

Mathematics, 2007.

GEERTZ, Marcel; MAERKL, Sebastian J. Experimental strategies for studying transcription factor- DNA binding specificities. Briefings in Functional Genomics. v. 9, n. 5, p.p. 362-373. 2010.

Gene Ontology Consortium. The Gene Ontology (GO) database and informatics resource.

Nucleic Acids Research. v. 32(Database issue), p.p. 258-261, 2004.

GEORGESCU, Bogdan; SHIMSHONI, Ilan; MEER, Peter. Mean shift based clustering in high dimensions: A texture classification example. Computer Vision, 2003.

Proceedings. Ninth IEEE International Conference on. IEEE, p.p. 456-463, 2003.

GERSTEIN, Mark B. et al. Architecture of the human regulatory network derived from ENCODE data. Nature. v. 489, n. 7414, p.p. 91-100, setembro 2012.

GOLBABAPOUR, Shahram et al. Gene Silencing and Polycomb Group Proteins: An

Overview of their Structure, Mechanisms and Phylogenetics. OMICS : a Journal of

Integrative Biology. v. 17, n. 6, p.p. 283-296, 2013.

GOMES-FILHO, J.V. et al. Sense overlapping transcripts in IS1341-type transposase genes are functional non-coding RNAs in archaea. RNA Biology. v. 12, n. 5, p.p. 490- 500, 2015.

GONÇALVES, Carlos. Ferramentas e métodos para estudo da evolução de processos

biológicos e funções moleculares do Homo sapiens. Minas Gerais: Instituto de Ciências

Biológicas, Universidade Federal de Minas Gerais, 2016. Disponível em: <http://www.pgbioinfo.icb.ufmg.br/defesas/90M.PDF>.

GRIFFITHS, Anthony et al. An Introduction to Genetic Analysis, 7th edition. New York: W. H. Freeman, 2000.

GUAN, Xiuyang; MURAL, R. J.; UBERBACHER, E. C.. Protein structure prediction using hybrid AI methods. Proceedings of the Tenth Conference on Artificial

Intelligence for Applications. p.p. 471-473, 1994.

HAIRI, Li et al. Determination of tag density required for digital transcriptome analysis: Application to an androgen-sensitive prostate cancer model. Proceedings of the National

Academy of Sciences. v. 105, n. 51, p.p. 20179–20184, 2008.

HAN, Jiawei; KAMBER, Micheline; PEI, Jian. Data Mining Concepts and Techniques

HAO, Yibin et al. Prediction of long noncoding RNA functions with co-expression network in esophageal squamous cell carcinoma. BMC Cancer. v. 15, n. 168, 2015.

HUSS, Mikael. Introduction into the analysis of high-throughput-sequencing based epigenome data. Briefings in Bioinformatics. v. 5, n. 2, p.p. 512-523, 2010.

JIANG, Daxin; TANG, Chunand; ZHANG, Aidong. Cluster analysis for gene expression data: a survey. IEEE Transactions on Knowledge and Data Engineering. v. 16, n. 11, p.p. 1370-1386, novembro 2004.

JIN Jinpu et al. PlantTFDB 3.0: a portal for the functional and evolutionary study of plant transcription factors. Nucleic Acids Research. v. 42 (Database issue), p.p. 1182–1187, janeiro 2014.

KARIN M. Too many transcription factors: positive and negative interactions. The New

Biologist. v. 2, n. 2, p.p. 126–31, fevereiro 1990.

KASS, R.E.; WASSERMAN, L.. A reference Bayesian test for nested Hypotheses and its relationship to the Schwarz Criterion. Journal of the American Statistical Association. v. 90, n. 431, p.p. 928-934, 1995.

KAUFMAN, L.; ROUSSEEUW, P.J.. Finding Groups in Data: An Introduction to

Cluster Analysis. New York: Wiley, 1990.

KHAN, Javed et al. Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks. Nature medicine. v. 7, n. 6, p.p. 673- 679, 2001.

KHURANA, Ekta et al. Integrative annotation of variants from 1092 humans: application to cancer genomics. Science. v. 342, n. 6154, outubro 2013.

KINGSFORD, Carl; PATRO, Rob. Reference-based compression of short-read sequences using path encoding. Bioinformatics. v. 31, n. 12, p.p. 1920–1928, 2015.

KOCH, Frederic et al. Transcription initiation platforms and GTF recruitment at tissue- specific enhancers and promoters. Nature Structural Molecular Biology. v. 18, p.p. 956–963, 2011.

KOERICH, Alessandro. Aprendizagem de Máquina. Algoritmo k–Means. Paraná: Pontifícia Universidade Católica do Paraná, 2008. Disponível em: <http://www.ppgia.pucpr.br/~alekoe/AM/2008/6a-AlgoritmokMeans-ApreMaq-

2008.pdf>.

KOO, Jana et al. Induction of proto-oncogene BRF2 in breast cancer cells by the dietary soybean isoflavone daidzein. BMC Cancer. v. 15, n. 905, novembro 2015.

KRISTOPHER, L. et al. Chapter 17 - Analysis of Genome-Wide Gene Expression Data from Microarrays and Sequencing. Human Stem Cell Manual (Second Edition). Boston: Jeanne F. Loring e Suzanne E. Peterson, Academic Press, 2012, p.p. 271-291.

KUKURBA, Kimberly R.; MONTGOMERY, Stephen B.. RNA Sequencing and Analysis. Cold Spring Harbor Protocols. v. 11, p.p. 951–969, 2015.

LATCHMAN David S. Transcription factors: an overview. The International Journal

of Biochemistry & Cell Biology. v. 29, n. 12, p.p. 1305–1312, dezembro 1997.

LEE, Kyung; LEE, Keon; LEE, Chan. Statistical cluster validity indexes to consider cohesion and separation. International conference on Fuzzy Theory and Its

Applications (iFUZZY2012), p.p. 228-232, 2012.

LEGENDRE, P.; LEGENDRE, L.. Numerical Ecology 3th edition. Amsterdam: Elsevier, 2012.

LEGG, Shane; HUTTER, Marcus. A Collection of Definitions of Intelligence. Frontiers

in Artificial Intelligence and Applications, Vol.157, p.17-24, 2007.

LEWIS EB. A gene complex controlling segmentation in Drosophila. Nature. v. 276, n. 5688, p.p. 565-570, dezembro 1978.

LEWIN, Benjamin. GENES IX. Sudbury: Jones and Bartlett Publishers, 2008.

LI, Suqing et al. Exploring functions of long noncoding RNAs across multiple cancers through co-expression network. Scientific Reports. v. 7, n. 1, 2017.

LIAO, Qi et al. Large-scale prediction of long non-coding RNA functions in a coding– non-coding gene co-expression network. Nucleic Acids Research. v. 39, n. 9, p.p. 3864- 3878, 2011.

LIEBESKIND Benjamin J.; MCWHITE Claire D.; MARCOTTE Edward M.. Towards Consensus Gene Ages. Genome Biology Evolution. v. 8, n. 6, p.p. 1812-1823, junho 2016.

LORENA, Ana C.; CARVALHO, André. Utilização de Técnicas Inteligentes em

Bioinformática. São Carlos: Instituto de Ciências Matemáticas e de Computação, 2003.

Disponível em: <http://conteudo.icmc.usp.br/CMS/Arquivos/arquivos_enviados/BIBLIOTECA_113_RT

_219.pdf>.

LUND, Riikka et al. RNA Polymerase III Subunit POLR3G Regulates Specific Subsets of PolyA(+) and SmallRNA Transcriptomes and Splicing in Human Pluripotent Stem Cells. Stem Cell Reports. v. 8, n. 5, p.p. 1442-1454, maio 2017.

MANNING, Christopher; RAGHAVAN, Prabhakar; SCHÜTZE, Hinrich. Introduction

to information retrieval. New York: Cambridge University Press, 2008.

MATTICK, John S. The Central Role of RNA in the Genetic Programming of Complex Organisms. Anais da Academia Brasileira de Ciências. v. 82 , n. 4, p.p. 933–939, 2010. MATYS V et al. TRANSFAC and its module TRANSCompel: transcriptional gene regulation in eukaryotes. Nucleic Acids Research. v. 34 (Database issue), p.p. 108–110, janeiro 2006.

MCCARTHY, D.J. et al. Scater: pre-processing, quality control, normalization and visualization of single-cell RNA-seq data in R. Bioinformatics. v. 33, p.p. 1179-1186, 2017.

MI, Huaiyu et al. Large-scale gene function analysis with the PANTHER classification system. Nature Protocols. v. 8, n. 8, p.p. 1551-1566, agosto 2013.

MITCHELL PJ; TJIAN R. Transcriptional regulation in mammalian cells by sequence- specific DNA binding proteins. Science. v. 245, n.4916, p.p. 371–378, julho 1989.

MITCHELL, Tom. Machine Learning. New York: McGraw Hill, 1997.

MOHRI, Mehryar; ROSTAMIZADEH, Afshin; TALWALKAR, Ameet. Foundations of

Machine Learning. Cambridge, MA: MIT Press, 2012.

MORIN Ryan D. et al. Profiling the HeLa S3 transcriptome using randomly primed cDNA and massively parallel short-read sequencing. BioTechniques. v. 45, n. 1, p.p.81– 94, julho 2008.

MURTAGH F.. Multidimensional Clustering Algorithms. Physica-Verlag, 1985.

NACU, Serban et al. Gene expression network analysis and applications to immunology.

Bioinformatics. v. 23, n. 7, p.p. 850-358, 2007.

NAGALAKSHMI, Ugrappa et al. The transcriptional landscape of the yeast genome defined by RNA sequencing. Science. v. 320, p.p. 1344–1349, 2008.

NAIK, Azad. k-means clustering algorithm. 2010. Disponível em:

https://sites.google.com/site/dataclusteringalgorithms/k-means-clustering-algorithm.

NECSULEA, Anamaria et al. The evolution of lncRNA repertoires and expression patterns in tetrapods. Nature. v. 505, n. 7485, p.p. 635-640, janeiro 2014.

NIH working definition of bioinformatics and computational biology. Biomedical Information Science and Technology Initiative. Julho, 2000.

NILSSON, Nils. Artificial Intelligence: A New Synthesis. Morgan Kaufmann, 1998. O’BRIEN Kevin P.; REMM Maido; SONNHAMMER Erik L.L. Inparanoid: a comprehensive database of eukaryotic orthologs. Nucleic Acids Research. v. 33(Database Issue), p.p. D476-D480, 2005.

OCHI, Luiz; DIAS, Carlos; SOARES, Stênio. Clusterização em Mineração de Dados. 2004.

OLIVEIRA, KC; CARVALHO, MLP; MARACAJA-COUTINHO, V; KITAJIMA, JP; VERJOVSKI-ALMEIDA, S. 2011. Non-coding RNAs in schistosomes: an unexplored world. Anais da Academia Brasileira de Ciências. v. 83, n. 2, p.p. 673-694.

OSHLACK, Alicia; ROBINSON, Mark D.; YOUNG, Matthew D.. From RNA-seq reads to differential expression results. Genome Biology. v. 11, n. 220, 2010.

PELLEG, Dau; MOORE, Andrew. X-means: Extending K-means with efficient estimation of the number of clusters. Proceeding of the 17th International Conference

on Machine Learning, p.p.727-734, 2000.

PENNACCHIO, Len et al. Enhancers: five essential questions. Nature reviews Genetics. v. 14, n. 4, p.p. 288-295, 2013.

PETOUKHOV, Sergey; HE, Matthew. Symmetrical analysis techniques for genetic

systems and bioinformatics: advanced patterns and applications. New York: Medical

Information Science Reference, 2010.

POOLE, Davis; MACKWORTH, Alan and GOEBEL, Randy. Computational

Intelligence: A Logical Approach. New York: Oxford University Press, 1998.

PORTOSO, Manuela, CAVALLI Giacomo. The Role of RNAi and Noncoding RNAs in Polycomb Mediated Control of Gene Expression and Genomic Programming. Caister

Academic Press. p. p. 29–44, 2008.

QUINLAN, Aaron; HALL, Ira. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. v. 26, n. 6, p.p. 841-842, março 2010.

RIIS, Søren. Combining neural networks for protein secondary structure prediction.

IEEE International Conference on Neural Networks. v.4, p.p. 1744-1748, 1995.

ROUSSEEUW, Peter J.. Silhouettes: A graphical aid to the interpretation and validation of cluster analysis. Journal of Computational and Applied Mathematics. v. 20, p.p. 53-65, 1987.

RUSSELL, Stuart J.; NORVING, Peter. Artificial Intelligence: A Modern Approach

(2nd ed.), Upper Saddle River, New Jersey: Prentice Hall, 2003.

RUSSELL, Stuart J.; NORVING, Peter. Artificial Intelligence: A Modern Approach

SAINSBURY, Sarah. Structural basis of transcription initiation by RNA polymerase II.

Nature Reviews Molecular Cell Biology. v. 16, n. 3, 2015.

SCHENA, M. et al. Quantitative monitoring of gene expression patterns with a complementary DNA microarray. Science. v. 270, p.p. 467–470, 1995.

SCHENA, M. Microarray biochip technology. Eaton Publishing: Sunnyvale, CA, USA, 2000.

SCHRAMM, Laura; HERNANDEZ, Nouria. Recruitment of RNA polymerase III to its target promoters. GENES & DEVELOPMENT. v. 16, p.p. 2593–2620, 2002.

SCHWARZ, Gideon. Estimating the dimension of a model. Annals of Statistics. v. 6, n. 2, p.p. 461–464. 1978.

SCOTT, Gilbert. Commentary: ‘The Epigenotype’ by C.H. Waddington. International

Journal of Epidemiology. v. 41, p.p. 20–23, 2012.

SEGUNDO-VAL, Ignacio S.; SANZ-LOZANO, Catalina S. Introduction to the Gene Expression Analysis. Methods in Molecular Biology. v. 1434, p.p. 29-43, 2016.

SERIN, Elise et al. Learning from Co-expression Networks: Possibilities and Challenges.

Frontiers in Plant Science. v. 7, n. 444, 2016.

SHAFFER, Juliet P. Multiple hypothesis testing. Annual Review of Psychology. v. 46, p.p. 561-584, 1995.

SHAO, Zhaohui et al. Stabilization of chromatin structure by PRC1, a Polycomb complex.

Cell. v. 98, n. 1, p.p. 37-46, julho 1999.

SIMUNOVIC, Filip et al. Gene expression profiling of substantia nigra dopamine neurons: further insights into Parkinson's disease pathology. Brain. v. 132, n. 7, p.p. 1795–1809, julho 2009.

SINOUQUET Christine; MOURAD, Raphael. Probabilistic Graphical Models for

SMITH, Martin et al. DotAligner: identification and clustering of RNA structure motifs.

Genome Biology. 2017 Dec 28;v. 18, n. 1, dezembro 2017.

SONNENBURG, Sören et al. Accurate splice site prediction using support vector machines. BMC Bioinformatics. v, 8, suppl. 10, S7, 2007.

STEGLE, Oliver et al. Using probabilistic estimation of expression residuals (PEER) to obtain increased power and interpretability of gene expression analyses. Nature

Protocols. v. 7, n. 3, p.p. 500–507, fevereiro 2012.

STORZ, G; VOGEL, J; WASSARMAN, KM. Regulation by small RNAs in bacteria: expanding frontiers. Molecular Cell. v. 43, p.p. 880-891, 2011.

SWEATT, David. Experience-dependent epigenetic modifications in the central nervous system. Biological Psychiatry. v. 65, p.p. 191-197, 2009.

TAN, Aik; GILBERT, David. Ensemble machine learning on gene expression data for cancer classification. Applied Bioinformatics. v. 2, 3 Suppl, S75-83, 2003.

TAN, Pang-Ning; STEINBACH, Michael; KUMAR, Vipin. Cluster Analysis: Basic Concepts and Algorithms. Introduction to Data Mining. p.p. 487-568, 2005.

TARCA, Adi L. et al. Analysis of microarray experiments of gene expression profiling. American journal of obstetrics and gynecology. v. 195, p.p. 373-388, 2006.

TIBSHIRANI, Robert; WALTHER, Guenther; HASTIE, Trevor. Estimating the Number of Clusters in a Dataset via the Gap Statistic. Journal of the Royal Statistical Society,

Series B. n. 63, p.p. 411-423, 2001.

TOURE, Amadou; BASU, Mitra. Application of neural network to gene expression data for cancer classification. International Joint Conference on Neural Networks, IJCNN

2001. v.1, p.p. 583-587, 2001.

TUTEJA, Renu. Gene Expression Profiling by SAGE. 2007.

v. 29, n. 6 p.p. 567-584, 2015.

VARUZZA, Leonardo. Introdução à análise de dados de sequenciadores de nova geração

Dans le document Le Valais par les dates (Page 92-118)