Article
Reference
PROSITE: a dictionary of sites and patterns in proteins
BAIROCH, Amos Marc
Abstract
PROSITE is a compilation of sites and patterns found in protein sequences. The use of protein sequence patterns (or motifs) to determine the function of proteins is becoming very rapidly one of the essential tools of sequence analysis. This reality has been recognized by many authors. While there have been a number of recent reports that review published patterns, no attempt had been made until very recently [5,6] to systematically collect biologically significant patterns or to discover new ones. It is for these reasons that we have developed, since 1988, a dictionary of sites and patterns which we call PROSITE.
BAIROCH, Amos Marc. PROSITE: a dictionary of sites and patterns in proteins. Nucleic Acids Research , 1991, vol. 19 Suppl, p. 2241-5
DOI : 10.1093/nar/19.suppl.2241 PMID : 2041810
Available at:
http://archive-ouverte.unige.ch/unige:36276
Disclaimer: layout of this document may differ from the published version.
1 / 1
Nucleic Acids Research, Vol. 19, Supplement 2241
PROSITE: a dictionary of sites and patterns in proteins
Amos Bairoch
Department of Medical Biochemistry, University of Geneva, 1 rue Michel Servet, 1211 Geneva 4, Switzerland
Background
PROSITE is a compilation of sites and patterns found in protein sequences. The use of protein sequence patterns (or motifs) to determine the function of proteins is becoming very rapidly one of the essential tools of sequence analysis. This reality has been recognized by many authors [1,2]. While there have been a number of recent reports [3,4] that review published patterns, no attempt had been made until very recently [5,6] to systematically collect biologically significant patterns or to discover new ones. It is for these reasons that we have developed, since 1988, a dictionary of sites and patterns which we call PROSITE.
Some of the patterns compiled in PROSITE have been published in the literature, but the majority have been developed, in the last two years, by the author.
Format
The PROSITE database is composed of two ASCII (text) files.
The first file (PROSITE.DAT) is a computer-readable file that contains all the information necessary to programs that make use of PROSITE to scan sequence(s) with pattern(s). This file also includes, for each of the patterns described, statistics on the number of hits obtained while scanning for that pattern in the SWISS-PROT protein sequence data bank [7]. Cross-references to the corresponding SWISS-PROT entries are also present in that file. The second file (PROSITE.DOC), which we call the textbook, contains textual information that documents each pattern. A user manual (PROSUSER.TXT) is distributed with the database, it fully describes the format of both files. A sample textbook entry is shown in Figure 1 with the corresponding data from the pattern file.
Leading concepts
The design of PROSITE follows four leading concepts:
Completeness. For such a compilation
tobe helpful in the determination of protein function, it is important that it contains a significant number of biologically meaningful patterns.
High specificity of the patterns. In the majority of
cases wehave chosen patterns that
arespecific enough
not todetect
toomany unrelated sequences, yet that detect
mostif
notall sequences that clearly belong
tothe
setin consideration.
Documentation. Each of the patterns is fully documented; the documentation includes
aconcise description of the family of protein that it is supposed
todetect
aswell
as anexplanation
onthe reasons that led
tothe selection of the particular pattern.
Periodic reviewing. It is important that each pattern be periodically reviewed,
so as toinsure that it is still relevant.
Content of the
currentrelease
Release 6.10 of PROSITE (February 1991) contains 375
documentation entries describing 433 different patterns. The list of these entries is provided in Appendix 1.
Distribution
PROSITE is distributed on magnetic tape and on CD-ROM by the EMBL Data Library. For all enquiries regarding the subscription and distribution of PROSITE one should contact:
EMBL Data Library
European Molecular Biology Laboratory Postfach 10.2209, Meyerhofstrasse 1 6900 Heidelberg, Germany
Telephone: (+49 6221) 387 258
Telefax : (+49 6221) 387 519 or 387 306
Electronic network address: DATALIB@EMBL.BITNET PROSITE can be obtained from the EMBL File Server [8].
Detailed instructions on how to make best use of this service, and in particular on how to obtain PROSITE, can be obtained by sending to the network address NETSERV@EMBL.BITNET the following message:
HELP
HELP PROSITE
If you have access to a computer system linked to the Internet you can obtain PROSITE using FTP (File Transfer Protocol), from the following file servers:
GenBank On-line Service [9]
Internet address: genbank.bio.net (134.172.1.160) NCBI
Internet address: ncbi.nlm.nih.gov (130.14.20.1)
The present distribution frequency is four releases per year. No restrictions are placed on use or redistribution of the data.
REFERENCES
1. DoolittleR.F.(In)OfURFs and ORFs: a primer on how to analyze derived amino acid sequences.,UniversityScienceBooks,MillValley,California, (1986).
2. Lesk A.M. (In) Computational Molecular Biology, Lesk A.M., Ed., ppl7-26, OxfordUniversity Press, Oxford (1988).
3. Barker W.C., Hunt T.L., George D.G. Protein Seq. Data Anal.
1:363-373(1988).
4. Hodgman T.C. CABIOS5:1-13(1989).
5. BorkP. FEBSLett. 257:191-195(1989).
6. SmithH.O.,AnnauT.M.,ChandrasegaranS. Proc. Natl. Acad.Sci. USA 87:826-830(1990).
7. BairochA., BoeckmannB. Nucleic Acids Res. 19:2247-2249(1991).
8. StoehrP.J., Omond R.A. Nucleic AcidsRes. 17:6763-6764(1989).
9. Benton D. NucleicAcids Res. 18:1517-1520(1990).
k./ 1991 Oxford University Press
2242 Nucleic Acids Research, Vol. 19, Supplement la) A documentation (textbook) entry
(PDOC00107)
(PS00116; DNA POLYMERASE B) (BEGIN)
*********************************
DNA
polymerase family
B signature *RepLicative DNA polymerases (EC 2.7.7.7) are the key enzymes catalyzing the accurate repLication of DNA. They require either a snaIL RNA molecule or a protein as a primer for the de novo synthesis of a DNA chain. On the basis of sequence simi arities a number of DNA poLymerases have been gred together E1 to 5] under the designation of DNA polymerase family B. The polymerases that belong to this famiLy are:
- Human
poLymerase alpha.
- Yeast
poLymerase
I,poLymerase III,
andpolymerase
REV3.-
PoLymerases
of viruses from the herpesviridaefamily
(Herpes type I and II Epstein-Barr, Cytomegalovirus, and VariceLla-zoster).-
Po(ymerases
fromAdenoviruses.
-
PoLymerases
fromBacuLoviruses.
- Vaccinia virus poLymerase.
- Bacteriophage
T4 polymerase.
Podoviridae
bacteriophages
Phi-29, M2, andPZA polymerase.
- Tectiviridae
bacteriophage
PRD1polymerase.
- Putative polymerases from Yeast K. lactis Linear plasmids
pGKL1
andpGKL2.
- Putative
polymerase
from the maizemitochondrial plasmid-Like Si
DNA.Six regions of similarity (numbered from I to VI) are found in all or a subset of the above polymerases. The most conserved region (I) includes a perfectLy conserved
tetrapeptide
which contains two aspartate residues. The function of this conserved region is not yet known, however it has been suggestedE3]
that it may be involved in binding a magnesium ion. We use this conserved region as a signature for this family of DNA polymerases.- Consensus pattern: CYA]-x-D-T-D-S-ELIVMTJ
- Sequences known to belong to this class detected by the pattern: ALL.
- Other sequence(s) detected in SWISS-PROT: chicken vitellogenin 2.
- Note: the residue in position 1 is Tyr in every family B polymerases, except in phage T4, where it is Ala.
- Last update: February 1991 / Text revised.
t 13 Jung G., Leavitt
M.C.,
HsiehJ.-C,,
Ito J.Proc. Natl. Acad. Sci. U.S.A. 84:8287-8291(1987).
[ 21 Bernad A, Zabaltos A. Salas M., Blanco L.
EMBO J.
6:4219-4225(19A7).
[ 3] Argos P.
Nucleic Acids Res. 16:9909-9916(1988).
E 4] Wang T.S.-F. Wong S.W., Korn D.
FASEB J.
3:14-21(1989).
t 5] Delarue M., Poch O., Todro N., Moras D., Argos P.
Protein Engineering 3:461-467(1990).
(END)
lb) The corresponding entry in the pattern file
ID DNA POLYMERASE
B;
PATTERN.AC PSOU116
DT
APR-1996 (CREATED);
NOV-1990 (DATA UPDATE); NOV-1990 (INFO UPDATE).DE DNA polymerase family B signature.
PA
tYAl-x-D-T-D-S-[LIVMTl.
NR /RELEASE=16
18364;
NR /TOTAL=26(26);
/POSITIVEZ25(25)-
/UNKNOWN=0(0); /FALSE POS=(1); /FALSE_NEG=0(0);CC /TAXO-RANGE=?BEPV;
/MAX-REPEAT=:i;
DR P09884, DPOASHUMAN, T; P13382, DPOASYEAST, T; P15436 DPODSYEAST, T;
DR P14284, DPOXSYEAST, T- P03261,
DPOL$ADE02,
T; P04495 DPOLSADE05, T;DR P05664, DPOLSADE07, T; P06538, DPOLSADE12, T; P04415, DPOLSBPT4 , T;
DR
P08546,
DPOLSHCMVA, T;P03198,
DPOLSEBV , T; P04293,DPOLSHSV1I,
T;DR P07917,
DPOLSHSVlA, T;
P04292DPOLSHSVlK,
T; P09854, DPOLSHSV1S, T;DR P07918, DPOLSHSV21
T; P09252,
DPOLSVZVD , T;P09804,
DPO1SKLULA, T;DR P05468, DPO2SKLULA, T; P10582 DPOLSMAIZE, T;
P18131,
DPOLSNPVAC, T;DR P06856, DPOLSVACCV, T;
P03680,
DPOLSBPPH2, T;P06950, DPOLSBPPZA,
T;DR
P10479,
DPOLSBPPRD, T;DR P02845 VIT2SCHICK, F;
DO
PDOCOOiO7;
//
Figure 1. SampledatafromPROSITE
Appendix 1. List of patterns documentation entries in release 6.10 of PROSITE
Post-translational modifications N-glycosylation site
Glycosaminoglycan attachment site Tyrosine sulfatation site
cAMP- andcGMP-dependentprotein kinasephosphorylation site Protein kinase Cphosphorylation site
Caseinkinase II phosphorylation site Tyrosine kinase phosphorylation site N-myristoylation site
Amidationsite
Aspartic acid and asparagine hydroxylation site Vitamin K-dependent carboxylation domain Phosphopantetheine attachment site
Prokaryotic membrane lipoprotein lipid attachment site Farnesylgroupbinding site (CAAX box)
Domains
Endoplasmic reticulum targeting sequence Peroxisomaltargetingsequence
Gram-positive cocci surface proteins 'anchoring' hexapeptide Nucleartargetingsequence
Cell attachmentsequence
ATP/GTP-binding site motifA(P-loop) EF-hand calcium-binding domain
Actinin-type actin-binding domainsignatures Cofilin/tropomyosin-type actin-binding domain Kringle domainsignature
EGF-likedomain cysteinepatternsignature Type IIfibronectincollagen-binding domain Hemopexindomainsignature
'Trefoil' domain signature
Chitinrecognition orbindingdomain signature WAP-type 'four-disulfide core' domain signature DNAorRNAassociatedproteins
'Homeobox' domain signature
'Homeobox'antennapedia-type protein signature 'Homeobox' engrailed-type protein signature 'Paired box' domainsignature
'POU'domainsignature Zincfinger, C2H2type,domain
Nuclear hormonesreceptorsDNA-bindingregion signature Eryfl-typezincfingerdomain
Poly(ADP-ribose) polymerasezincfingerdomain Leucinezipperpattern
Fos/junDNA-binding basicdomainsignature Myb DNA-binding domainrepeatsignatures
Myc-type, 'helix-loop-helix' putativeDNA-bindingdomainsignature p53tumorantigen signature
'Cold-shock' DNA-binding domain signature CTF/NF-I signature
ETS-domain signatures
SRF-typetranscriptionfactorsDNA-bindinganddimerization domain Transcriptionfactor TFIID repeatsignature
eIF-4Afamily ATP-dependenthelicasessignatures Eukaryotic putativeRNA-binding regionRNP-1 signature Bacterial activatorproteins,araCfamily signature Bacterial activator proteins,crpfamily signature Bacterialactivatorproteins, gntRfamily signature Bacterialactivatorproteins, lacd family signature Bacterial activatorproteins, lysRfamily signature Bacterialhistone-likeDNA-binding proteins signature Histone H2A signature
Histone H2Bsignature Histone H3signature Histone H4signature HMG1/2signature
HMG-Iand HMG-YDNA-bindingdomain(A+T-hook) HMG14 and HMG17 signature
ProtamineP1 signature RibosomalproteinL5signature Ribosomalprotein LI1 signature
Ribosomal proteinL14signature Ribosomal protein L23 signature Ribosomalprotein L39/L46signature RibosomalproteinS7 signature Ribosomal protein S8 signature Ribosomal protein S9 signature Ribosomal protein S10 signature RibosomalproteinSll signature RibosomalproteinS12 signature RibosomalproteinS15 signature RibosomalproteinS17 signature RibosomalproteinS18 signature RibosomalproteinS19 signature
DNAmismatchrepair proteinsmutL / hexB / PMS1 signature Enzymes
Oxidoreductases
Zinc-containing alcohol dehydrogenases signature Iron-containing alcohol dehydrogenases signature
Insect-type alcoholdehydrogenase / ribitol dehydrogenase family signature Aldo/keto reductasefamily signatures
L-lactate dehydrogenase active site
Glycerate-type 2-hydroxyaciddehydrogenases signature Hydroxymethylglutaryl-coenzyme A reductases signatures 3-hydroxyacyl-CoA dehydrogenase signature
Malate dehydrogenase active site signature Malic enzymessignature
Glucose-6-phosphate dehydrogenase activesite Bacterial quinoprotein dehydrogenases signatures Aldehydedehydrogenases active site
Glyceraldehyde 3-phosphate dehydrogenase activesite Acyl-CoA dehydrogenases signatures
Glutamatedehydrogenasesactivesite Dihydrofolatereductase signature
Pyridine nucleotide-disulphideoxidoreductases active site
Nitrite reductases and sulfite reductase putative siroheme-binding sites Uricasesignature
Cytochromecoxidase subunitI,copper Bbinding region signature CytochromecoxidasesubunitII, copper Abindingregion signature Multicopper oxidases signatures
Lipoxygenases, putative iron-bindingregion signature Extradiol ring-cleavage dioxygenasessignature Intradiolring-cleavage dioxygenases signature
Biopterin-dependentaromatic amino acidhydroxylases signature CoppertypeII, ascorbate-dependent monooxygenasessignatures Cytochrome P450cysteine heme-iron ligand signature Copper/Zinc superoxidedismutasesignature Manganese and ironsuperoxide dismutases signature Ribonucleotidereductaselarge subunit signature Ribonucleotidereductase small subunit signature
Nitrogenases component 1 alphaandbetasubunitssignature Transferases
Thymidylate synthaseactive site
Methylated-DNA--protein-cysteine methyltransferaseactive site N-6Adenine-specific DNAmethylasessignature
N4cytosine-specific DNAmethylases signature C-S cytosine-specificDNAmethylases signatures
Serinehydroxymethyltransferase pyridoxal-phosphateattachment site Phosphoribosylglycinamide formyltransferaseactive site
Aspartate andornithinecarbamoyltransferases signature Thiolases signatures
Chloramphenicolacetyltransferase active site cysE / lacA / nodLacetyltransferases signature Phosphorylasepyridoxal-phosphate attachmentsite UDP-glucoronosylandUDP-glucosyltransferases signature Purine/pyrimidine phosphoribosyl transferases signature S-Adenosylmethionine synthetase signatures
EPSPsynthaseactive site
Aspartate aminotransferasespyridoxal-phosphateattachmentsite Hexokinasessignature
Galactokinase signature
2244 Nucleic Acids Research, Vol. 19, Supplement
Phosphofructokinase signature Protein kinases signatures
Pyruvate kinaseactive sitesignature Phosphoglycerate kinase signature Aspartokinase signature
ATP:guanidophosphotransferasesactive site PTSproteinsphosphorylationsitessignatures Adenylatekinasesignature
Phosphoribosyl pyrophosphatesynthetase signature EukaryoticRNApolymeraseHIheptapeptiderepeat DNApolymerase familyBsignature
Galactose-l-phosphate uridyltransferase active sitesignature CDP-alcoholphosphatidyltransferases signature
Rhodanese active site Hydrolases
Phospholipase A2 active sitessignatures Lipases, serine activesite
Colipase signature
Carboxylesterases type-B active site Alkaline phosphataseactivesite Fructose-i -6-bisphosphatase active site
Serine/threonine specific proteinphosphatasessignature Tyrosine specific proteinphosphatases active site Prokaryotic zinc-dependent phospholipaseCsignature 3'5'-cyclic nucleotidephosphodiesterasessignature Sulfatasessignature
Pancreaticribonucleasefamily signature Alpha-lactalbumin/lysozyme Csignature
Lysosomalalpha-glucosidase /sucrase-isomaltaseactive site Alpha-L-fucosidaseputative active site
Uracil-DNAglycosylasesignature Serinecarboxypeptidases, serineactive site
Zinccarboxypeptidases, zinc-binding regionssignatures Serineproteases, trypsin family, active sites
Serine proteases, subtilisinfamily,active sites ClpPproteasesactive sites
Eukaryotic thiol(cysteine)proteasesactivesite
Ubiquitincarboxyl-terminal hydrolase, putativeactive-sitesignature Eukaryoticaspartyl proteasesactive site
Neutralzincmetallopeptidases, zinc-bindingregion signature Insulin-degradingenzyme /E.coliproteaseImsignature recAsignature
Proteasome subunitssignature Asparaginase/ glutaminase signature Urease active site
Beta-lactamases classes-A, -C,and -Dactive site Arginaseandagmatinase signatures
Inorganic pyrophosphatase signatures Acylphosphatasesignatures
ATPsynthase alphaand betasubunits signature ATPsynthasegammasubunitsignature ATPsynthase delta(OSCP) subunitsignature El-E2ATPasesphosphorylation site
SodiumandpotassiumATPasesbeta subunitssignatures Cutinase, serine active site
Lyases
DDC / GAD / HDCpyridoxal-phosphateattachment site Orotidine5'-phosphate decarboxylase signature Phosphoenolpyruvate carboxylaseactive site
Ribulosebisphosphate carboxylase large chain active site Fructose-bisphosphate aldolase active site
KDPG andKHG aldolases active sitesignatures Isocitratelyasesignature
DNAphotolyasessignature Carbonicanhydrases signature Fumaratelyases signature Enolasesignature
Serine/threoninedehydratasespyridoxal-phosphateattachment site Enoyl-CoAhydratase signature
Tryptophan synthase alpha chain signature
Tryptophan synthasebetachain pyridoxal-phosphateattachment site Delta-aminolevulinic acid dehydratase active site
Isomerases
Alanineracemasepyridoxal-phosphate attachment site Peptidyl-prolyl cis-trans isomerase signature Triosephosphate isomerase active site Xylose isomerase signatures Phosphoglucose isomerase signature
Phosphoglycerate mutasefamily phosphohistidine signature Eukaryotic DNA topoisomerase I active site
ProkaryoticDNAtopoisomerase I active site DNAtopoisomerase II signature
Ligases
Aminoacyl-transfer RNA synthetases class-I signature Aminoacyl-transfer RNA synthetasesclass-Il signatures ATP-citrate lyase and succinyl-CoA ligases active site Glutamine synthetase signatures
Ubiquitin-conjugating enzymesactive site Phosphoribosylglycinamide synthetasesignature ATP-dependent DNAligase putativeactive site Othersenzymes
IsopenicillinN synthetasesignatures Site-specific recombinasessignatures Thiamine pyrophosphate enzymessignature Biotin-requiringenzymesattachment site
2-oxo aciddehydrogenasesacyltransferasecomponentlipoylbinding site Electrontransportproteins
Cytochromecfamily heme-binding sitesignature Cytochrome b5 family, heme-binding domain signature Cytochromeb/b6signatures
Thioredoxin family active site Glutaredoxin activesite
Type-I copper(blue) proteins signature
2Fe-2S ferredoxins,iron-sulfur binding regionsignature 4Fe-4S ferredoxins, iron-sulfurbinding region signature Rieskeiron-sulfurprotein signatures
Flavodoxin signature Rubredoxin signature Othertransportproteins ClassImetallothioneins signature Ferritin iron-bindingregionsignature Transferrins signatures
Planthemoglobins signature
Arthropod hemocyanins/ insectLSPssignatures ATP-binding proteins 'active transport' family signature
Binding-protein dependent transportsystemsinnermembranecomponentsignature Serumalbuminfamily signature
Lipocalins signature
Cytosolicfatty-acid binding proteins signature LBP/ BPI / CETPfamily signature Uteroglobin family signatures
Mitochondrial energy transferproteins signature Sugartransportproteins signatures
Prokaryoticsulfate-binding proteins signature Amino acidpermeases signature
Anionexchangersfamily signatures MIP /Nodulin-26 /glpFfamily signature Insulin-like growth factorbinding proteins signature
Structura
proteins43 Kdpostsynaptic protein signature Actinssignatures
Annexinsphospholipid/calcium-bindingdomain signature Clathrinlightchainsignature
Connexinssignatures
Crystallinsbeta andgamma 'Greekkey' motifsignature Dynamin family signature
Intermediatefilamentssignature Kinesinmotordomainsignature Neuromodulin (GAP-43) signatures Profilinsignature
Surfactant associatedpolypeptide SP-C palmytoylationsites Synapsinssignatures
Synaptobrevinsignature Tropomyosinssignature
Tubulinsubunitsalpha and betasignature Tubulin-beta mRNAautoregulation signal Tau and MAP2proteins repeated region signature Neuraxinand MAPIB proteinsrepeated regionsignature F-actincapping proteinbeta subunitsignature
Amyloidogenicglycoprotein signatures
Cadherinsextracellularrepeated domain signature Insectlarval cuticleproteins signature
Gasvesiclesprotein GVPasignature
Gas vesiclesprotein GVPcrepeateddomainsignature NMePhepilimethylationsite
Potexviruses and carlavirusescoatproteinsignature Receptors
Neurotransmitter-gated ion-channels signature G-protein coupledreceptors signature Visualpigments (opsins) retinal binding site Bacterialrhodopsins retinal binding site Receptor tyrosine kinase classIIsignature Receptor tyrosine kinase class IH signature
Growth factorandcytokinesreceptorsfamily signature Integrins alpha chain signature
Integrins beta chain cysteine-rich domain signature Photosynthetic reactioncenterproteins signature Photosystem I psaA and psaB proteins signature Phytochrome chromophore attachment site Speract receptorrepeateddomain signature TonB-dependentreceptorproteinssignatures Type-II membrane antigensfamily signature Cytokinesandgrowthfactors
Int-l family signature HBGF/FGFfamily signature Nervegrowthfactorfamily signature
Platelet-derivedgrowthfactor(PDGF)family signature TGF-betafamily signature
TNFfamily signature
Interferonalpha and beta family signature Interleukin-1 signature
Interleukin-2signature
Interleukin-6 / G-CSF / MGFsignature Interleukin-7signature
Hormones and activepeptides Adipokinetichormonefamilysignature Bombesin-likepeptidesfamily signature Calcitonin / CGRP /LAPP family signature Chromogranins/secretogranins signatures Gastrin /cholecystokinin family signature Glucagon/GIP / secretin/ VIPfamily signature Glycoproteinhormones betachainsignature Insulinfamily signature
Natriureticpeptides signature Neurohypophysial hormones signature Pancreatichormonefamily signature Parathyroidhormonefamily signature
Somatotropin, prolactinandrelated hormonessignatures
Tachykinin family signature Cecropinfamily signature Mammalian defensins signature Insectdefensins signature Endothelins/sarafotoxinssignature Toxins
Plantthionins signature Snaketoxins signature
Heat-stable enterotoxins signature Aerolysin type toxins signature
Shiga/ricin ribosomal inactivating toxins activesite signature Channelforming colicins signature
Staphyloccocalenterotoxins / Streptococcal pyrogenicexotoxins signatures Membrane attackcomplex components / perforin signature
Inhibitors
Pancreatictrypsininhibitor (Kunitz) family signature Bowman-Birk serine protease inhibitors family signature Kazal serine proteaseinhibitorsfamilysignature
Soybean trypsin inhibitor (Kunitz) protease inhibitors family signature Serpins signature
Potatoinhibitor I family signature
Squashfamily ofserine protease inhibitors signature Cysteineproteasesinhibitors signature
Tissueinhibitors of metalloproteinases signature Cereal trypsin/alpha-amylaseinhibitorsfamily signature Disintegrinssignature
Others
Pentraxin familysignature
Immunoglobulinsandmajor histocompatibilitycomplexproteins signature Prion protein signature
Cyclinsignature
Proliferatingcellnuclearantigen signature Arrestinssignature
Chaperonins signature
Heat shockhsp7Oproteins family signatures Heatshockhsp9Oproteins family signature Ubiquitinsignature
SRP54-type proteins GTP-bindingdomainsignature GTP-binding elongation factorssignature
Eukaryotic initiation factor4Dhypusinesignature S-100/ICaBPtypecalciumbinding protein signature Hemolysin-type putative calcium-binding region signature Small, acid-soluble sporeproteins, alpha/betatype, signature Caseinsalpha/beta signature
Legume lectinssignatures
Vertebrategalactoside-binding lectin signature Lysosome-associated membrane glycoproteins signatures Glycophorin Asignature
Seminal vesicleproteinIrepeatssignature HCP repeatssignature
Bacterialice-nucleation proteins octamerrepeat Cellcycleproteins ftsW/rodA /spoVEsignature Staphylocoagulaserepeatsignature
11-Splant seed storageproteinssignature Dehydrins signature
Small hydrophilic plantseedproteins signature Thaumatinfamily signature