• No results found

Sequence Information. Sequence information. Good web sites. Sequence information. Sequence. Sequence

N/A
N/A
Protected

Academic year: 2021

Share "Sequence Information. Sequence information. Good web sites. Sequence information. Sequence. Sequence"

Copied!
14
0
0

Loading.... (view fulltext now)

Full text

(1)

Bengt Persson

Sequence Information

Linköping University & Karolinska Institutet 2

Sequence information

Sequence

Comparisons

Pair-wise

Multiple

Database searches

SRS

Entrez

Protein families

Patterns

Post-translational

modifications

Organell

localisation

Orthologue

clusters

InterPro

Pfam

Prosite

Membrane

attachment

Secondary

structure

Linköping University & Karolinska Institutet 3

Sequence information

Sequence

Comparisons

Pair-wise

Multiple

Database searches

SRS

Entrez

Protein families

Patterns

Post-translational

modifications

Organell

localisation

Orthologue

clusters

InterPro

Pfam

Prosite

Membrane

attachment

Secondary

structure

Linköping University & Karolinska Institutet 4

Good web sites

‘

www.expasy.org

‘

www.ebi.ac.uk

‘

www.ncbi.nlm.nih.gov

(2)

Protein

Protein

family

family

databases

databases

Linköping University & Karolinska Institutet 6

Protein families, nomenclature

‘

Super-family

– Family

• Sub-family

Linköping University & Karolinska Institutet 7

InterPro

‘

Prosite

– Amos Bairoch, Genève

‘

Pfam

– Erik Sonnhammer, KI and Sanger Institute, UK

‘

PRINTS

– Terri Attwood, UCL, London, UK

‘

ProDom

– Daniel Kahn, INRA, Toulouse, France

‘

SMART

– Peer Bork, EMBL

‘

Swissprot+TrEMBL

Linköping University & Karolinska Institutet 8

InterPro entry

(3)

Linköping University & Karolinska Institutet 9

InterPro entry, cont.

Linköping University & Karolinska Institutet 10

InterPro entry, cont.

Linköping University & Karolinska Institutet 11

InterPro -- protein matches

Linköping University & Karolinska Institutet 12

InterPro -- protein matches, graphical

(4)

Linköping University & Karolinska Institutet 13

Prosite

‘

Database of protein families and domains

‘

Release 16, September 1999

‘

1035 documentation entries

‘

1375 different patterns

http://www.expasy.ch/prosite/

Amos Bairoch, University of Geneva

Linköping University & Karolinska Institutet 14

Prosite

Linköping University & Karolinska Institutet 15

Prosite

Linköping University & Karolinska Institutet 16

ScanProsite

(5)

Linköping University & Karolinska Institutet 17

Prosite, documentation entry

Linköping University & Karolinska Institutet 18

Example of Prosite patterns

‘

Post-translational modifications

‘

Domains

‘

DNA or RNA associated proteins

‘

Enzymes

‘

Electron transport proteins

‘

Other transport proteins

‘

Structural proteins

‘

Receptors

‘

Hormones and active peptides

‘

Toxins

‘

Inhibitors

‘

Protein secretion and chaperones

‘

Cytokines and growth factors

‘

Others

Linköping University & Karolinska Institutet 19

Pfam

‘

A collection of protein families and domains.

‘

Pfam contains multiple protein alignments and

profile-HMMs of these families.

‘

Pfam is a semi-automatic protein family database,

which aims to be comprehensive as well as accurate.

http://www.sanger.ac.uk/Software/Pfam/index.shtml

http://www.cgr.ki.se/Pfam

Linköping University & Karolinska Institutet 20

Hidden Markov Models (HMMs)

‘

Statistical profile method

‘

Enables database searches

‘

Enables multiple alignment creation

(6)

Linköping University & Karolinska Institutet 21

Pfam

Linköping University & Karolinska Institutet 22

Pfam

Linköping University & Karolinska Institutet 23

Pfam

Linköping University & Karolinska Institutet 24

COG--Clusters of Orthologous Groups

(7)

Linköping University & Karolinska Institutet 25

Functional groups of protein families

Linköping University & Karolinska Institutet 26

COG

Linköping University & Karolinska Institutet 27

COG

Predictions

Predictions

of

of

structure

structure

and

and

post

post

-

-translational

(8)

Linköping University & Karolinska Institutet 29

Structure predictions

‘

Secondary structure

‘

Hydrophilicity

‘

Membrane-spanning regions

‘

Antigenicity

‘

Glycosylation

‘

Acetylation

… and much more ...

Linköping University & Karolinska Institutet 30

Secondary structure predictions

‘

Chou & Fasman (CF)

‘

Garnier, Osguthorpe & Robson (GOR)

http://pbil.ibcp.fr/cgi-bin/npsa_automat.pl?page=npsa_gor4.html

‘

neural networks (e.g. PHD)

http://dodo.cpmc.columbia.edu/predictprotein/

Linköping University & Karolinska Institutet 31

Artificial Neural Networks (ANNs)

‘

Statistical method

‘

Pattern recognition, e. g. secondary structure

predictions

Output layer

Hidden layer

Input layer

Output layer

Hidden layer

Input layer

modified from Yvonne Kallberg

Linköping University & Karolinska Institutet 32

The PredictProtein server

(9)

Linköping University & Karolinska Institutet 33

Default submission form

Linköping University & Karolinska Institutet 34

Hydrophilicity

‘

Kyte & Doolittle

‘

Hopp & Woods

Linköping University & Karolinska Institutet 35

Example of hydrophilicity and secondary

structure plots

Linköping University & Karolinska Institutet 36

ProtScale

‘

A general tool for plotting sequence properties,

e.g. hydrophilicity

(10)

Linköping University & Karolinska Institutet 37

ProtScale, selection of property to plot

Linköping University & Karolinska Institutet 38

ProtScale, results

Linköping University & Karolinska Institutet 39

ProtScale, Graphic view

Linköping University & Karolinska Institutet 40

Membrane protein prediction, TMAP

(11)

Linköping University & Karolinska Institutet 41

Membrane protein prediction, TMAP

Linköping University & Karolinska Institutet 42

TMAP, graphics output

Linköping University & Karolinska Institutet 43

Prediction servers at CBS

www.cbs.dtu.dk/services/

Linköping University & Karolinska Institutet 44

SignalP

(12)

Linköping University & Karolinska Institutet 45

SignalP -- Results

Linköping University & Karolinska Institutet 46

SignalP -- Results, cont.

Linköping University & Karolinska Institutet 47

TargetP

Linköping University & Karolinska Institutet 48

TargetP -- Results

(13)

Linköping University & Karolinska Institutet 49

Phobius

Linköping University & Karolinska Institutet 50

Phobius, results

Linköping University & Karolinska Institutet 51

ExPASy site map

Linköping University & Karolinska Institutet 52

(14)

Linköping University & Karolinska Institutet 53

Post-translational modifications

Linköping University & Karolinska Institutet 54

Primary structure analysis

Linköping University & Karolinska Institutet 55

Secondary structure prediction

Linköping University & Karolinska Institutet 56

Transmembrane regions & Sequence

alignments

References

Related documents

A comparison between broadband aerosol optical depth (BAOD), derived from broadband pyrheliometer ob- servations at the Camagüey site and three other meteorolog- ical stations in

We aimed to compare the mean global score for com- pleteness of reporting scores (scale 0 – 10) for all domains of the manuscript written by the participants with and without the

18 Another reason for the high level of knowledge regarding condoms could be due to the fact that condoms are the most appropriate contraceptive methods used among

information are accessible for most vehicles, and US06 information are accessible for most autos and for a. predetermined number of light

With that being known, our PICOT question is presented as: In MedSurg/ICU nurses caring for adult patients showing AWS, what effect on decreasing hospital stay on patients who

According to Barnes (1998; 2001), the manufacturing performance of the South African automotive component industry can be measured by means of the following criteria: 1) Cost,

At the first postoperative follow up, 13 of the 31 cases (41.9%) had facial nerve dysfunction, which included all 8 total parotidectomies performed.. However, these 3 cases had

Table A.8: Regressions of maximum likelihood estimated smoothing coefficient ˆ θ 0 and smoothing index ˆ ξ on indicator variables for 908 hedge funds in the TASS Hedge Fund