• Aucun résultat trouvé

Introduction to applied bioinformatics

N/A
N/A
Protected

Academic year: 2022

Partager "Introduction to applied bioinformatics"

Copied!
27
0
0

Texte intégral

(1)

Introduction to applied bioinformatics

PETRA MATOUŠKOVÁ

2021/2022 3/10

(2)

Schedule update ?

X

exam

1 2 3 4

5 6 7 8 9 (10)

? (after Easter) X

(3)

„Protein bioinformatics II“

Retrieving protein sequences from databases (Uniprot: FASTA formate)

Computing amino-acids compositions, molecular weight, isoelectric point, and other parameters (SMS) Prediction of proteases cutting (PeptideCutter)

Predicting elements of protein secondary structure, signal peptide, transmembrane helix Finding 3-D structure and the domain organization of proteins

Finding all proteins that share a similar sequence and Classifying proteins into families Finding evolutionary relationships between proteins, drawing proteins’ family trees Computing the optimal alignment between two or more protein sequences

(4)

Proteins

SEQUENCE➪STRUCTURE➪FUNCTION

(Frederick Sanger-1958 Nobel prize for insulin sequencing)

20 Aminoacids – primary structure:

Secondary structure Tertiary structure Quaternary structure

(5)

Conserved domain search

◦ Conserved domain databases:

NCBI/CDD

SMART

EMBL/InterPro Pfam

SEKVENCE➪STRUKTURA➪FUNKCE

(6)

Conserved domain search - CD (NCBI)

(7)

Conserved domain search - CD (NCBI)

Identifikátor sekvence Délka sekvence Menu pro omezení úrovně zobrazovaných detailů

(8)

Conserved domain search / NQO1

(9)

Try CD search

Find domains in your sequnce

Practical part

(10)

Prediction of transmembrane helices

Transmembrane proteins:

Amino acid Hydrofobicity

◦ various programs – different alghoritms – different results

◦ Topological predictions (estimation of in and out topology)

(11)

Prediction of transmembrane helices

Profile of amino acids hydrofobicity

Window size: 19 (21) Cut off: 1.5 (1.6)

(12)

TMHMM

no TM helix

five predicted TM helices

(13)

TOPCONS

(14)

Phobius

(15)

Prediction of transmembrane helices

ProtScale TMHMM TOPCONS

Always try more programs!

(16)

ER signal peptide prediction

Endoplasmic reticulum signal peptide: 15-60 amino acids on protein N-terminus

https://www.khanacademy.org/science/biology/gene-expression-central-dogma#central-dogma-transcription

(17)

ER signal peptide prediction

SignalP

Yes-cleavage site

(18)

Try more programs.

Does your sequence have any TMHs?

and/or signal peptide?

Practical part

(19)

„Protein bioinformatics I“

Retrieving protein sequences from databases (Uniprot: FASTA formate)

Computing amino-acids compositions, molecular weight, isoelectric point, and other parameters (SMS) Prediction of proteases cutting (PeptideCutter)

Predicting elements of protein secondary structure, signal peptide, transmembrane helix Finding 3-D structure and the domain organization of proteins

Finding all proteins that share a similar sequence and Classifying proteins into families

Finding evolutionary relationships between proteins, drawing proteins’ family trees Computing the optimal alignment between two or more protein sequences

(20)

Searching for similar sequences

Similarity x Homology

BLAST

: Basic Local Alignment and Search Tool

Finds regions of similarity between biological sequences. The program compares nucleotide or protein sequences to sequence databases and calculates the statistical significance.

Similarity matrix:

„Leucine is more similar to Isoleucine than Histidine“

(21)

NCBI/BLAST

http://blast.ncbi.nlm.nih.gov/Blast.cgi

(22)

NCBI/BLAST

http://www.youtube.com/watchBLAST

Enter accession number(s), gi(s) or FASTA sequence(s)

(23)

NCBI/BLAST

E-value (expectancy) Odkazy

(24)

(25)

Try BLAST.

Download five similar sequences from different organisms.

Practical part

(26)

1) How many times will be the whole sequence cut by trypsin?

2) Does your sequence have a typical domain?

3) Does your sequence have transmembrane helix?

4) Does your sequence have ER retention signal?

5) Find and download five similar sequnces.

E.g use „výstřižky“

➢Compile in „one note“ (or word, or pdf)

➢Submit via Moodle

Homework 3

„snipping tool“

(27)

Homework 3 - example

. . .

5)

4)

Références

Documents relatifs

Section 7 demonstrates the existence of sound operation-order specifications for collection types by restriction to unambiguous histories, extension of the history signature

We introduced a principled method to detect functional sectors from sequence data, based on the primary signature of these sectors in the small-eigenvalue modes of the

Nous avons vécu des entretiens qui ont duré entre 30 et 60 minutes. Nous avons annoncé une durée de 45 minutes pour permettre aux enseignants de prendre le temps de répondre

During the last 5 years, accumulating data revealed that the nutrient- sensing O-GlcNAc glycosylation (O-GlcNAcylation) may be pivotal in the modulation of chromatin remodeling and

Previously identified matrix pro- tein components can be divided into three characteristic groups as follows: (i) “egg white” proteins, which are also pres- ent in the eggshell,

We introduce a new heuristic approach learning non-deterministic automata based on selection and ordering of significantly similar fragments to be merged and on

Unité de recherche INRIA Rhône-Alpes : 655, avenue de l’Europe - 38334 Montbonnot Saint-Ismier (France) Unité de recherche INRIA Rocquencourt : Domaine de Voluceau - Rocquencourt -

The two monomers in the MS2 coat protein dimer that binds to RNA are in anti-parallel orientation (see above) and hence the fused proteins will be positioned on either side of