Signal Peptide Website
Signal Peptide Database - Viruses
Entry Details
ID: 3830
Source Database: UniProtKB/Swiss-Prot
UniProtKB/Swiss-Prot Accession Number: P18450 (Created: 1990-11-01 Updated: 2008-11-25)
UniProtKB/Swiss-Prot Entry Name: SPIKE_CVPFS
Protein Name: Spike glycoprotein
Gene: S
Organism Scientific: Porcine transmissible gastroenteritis coronavirus (strain FS772/70)
Organism Common: TGEV
Lineage: Viruses; ssRNA positive-strand viruses, no DNA stage; Nidovirales; Coronaviridae; Coronavirus; Coronavirus group 1; Coronavirus group 1a
Protein Length [aa]: 1449
Protein Mass [Da]: 159958
Features
Type                           Description           Status         Start  End
signal peptide                 -                     -              1      16
chain                          Spike glycoprotein    -              17     1449
transmembrane region           -                     potential      1391   1410
topological domain             Extracellular         potential      17     1390
topological domain             Cytoplasmic           potential      1411   1449
region of interest             S1                    -              17     776
region of interest             S2                    -              777    1449
glycosylation site             N-linked (GlcNAc...)  potential      26     26
glycosylation site             N-linked (GlcNAc...)  potential      42     42
glycosylation site             N-linked (GlcNAc...)  potential      71     71
glycosylation site             N-linked (GlcNAc...)  potential      94     94
glycosylation site             N-linked (GlcNAc...)  potential      243    243
glycosylation site             N-linked (GlcNAc...)  potential      250    250
glycosylation site             N-linked (GlcNAc...)  potential      285    285
glycosylation site             N-linked (GlcNAc...)  potential      334    334
glycosylation site             N-linked (GlcNAc...)  potential      345    345
glycosylation site             N-linked (GlcNAc...)  potential      362    362
glycosylation site             N-linked (GlcNAc...)  potential      375    375
glycosylation site             N-linked (GlcNAc...)  potential      405    405
glycosylation site             N-linked (GlcNAc...)  potential      449    449
glycosylation site             N-linked (GlcNAc...)  potential      516    516
glycosylation site             N-linked (GlcNAc...)  potential      532    532
glycosylation site             N-linked (GlcNAc...)  potential      554    554
glycosylation site             N-linked (GlcNAc...)  potential      594    594
glycosylation site             N-linked (GlcNAc...)  potential      704    704
glycosylation site             N-linked (GlcNAc...)  potential      725    725
glycosylation site             N-linked (GlcNAc...)  potential      780    780
glycosylation site             N-linked (GlcNAc...)  potential      819    819
glycosylation site             N-linked (GlcNAc...)  potential      834    834
glycosylation site             N-linked (GlcNAc...)  potential      840    840
glycosylation site             N-linked (GlcNAc...)  potential      921    921
glycosylation site             N-linked (GlcNAc...)  potential      1074   1074
glycosylation site             N-linked (GlcNAc...)  potential      1200   1200
glycosylation site             N-linked (GlcNAc...)  potential      1294   1294
glycosylation site             N-linked (GlcNAc...)  potential      1311   1311
glycosylation site             N-linked (GlcNAc...)  potential      1324   1324
glycosylation site             N-linked (GlcNAc...)  potential      1336   1336
glycosylation site             N-linked (GlcNAc...)  potential      1341   1341
glycosylation site             N-linked (GlcNAc...)  potential      1358   1358
glycosylation site             N-linked (GlcNAc...)  potential      1371   1371
compositionally biased region  Cys-rich              -              1411   1432
short sequence motif           KxHxx                 by similarity  1445   1449
coiled-coil region             -                     potential      1104   1148
coiled-coil region             -                     potential      1338   1380
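The feature coordinates can be sanity-checked for internal consistency. The following is a minimal sketch, assuming the Start/End values are 1-based and inclusive (as the signal peptide 1-16 and chain 17-1449 suggest); the function name `tiles` and the tuple layout are illustrative choices, not part of the database. It checks that the signal peptide plus the three topological segments cover residues 1 to 1449 without gaps or overlaps, and that S1 and S2 together cover the mature chain.

```python
# Feature coordinates copied from the entry (assumed 1-based, inclusive).
PROTEIN_LENGTH = 1449

TOPOLOGY = [
    ("signal peptide", 1, 16),
    ("extracellular domain", 17, 1390),
    ("transmembrane region", 1391, 1410),
    ("cytoplasmic domain", 1411, 1449),
]

SUBUNIT_REGIONS = [
    ("S1", 17, 776),
    ("S2", 777, 1449),
]


def tiles(segments, first, last):
    """Return True if the segments cover first..last with no gaps or overlaps."""
    expected_start = first
    for _name, start, end in sorted(segments, key=lambda seg: seg[1]):
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == last + 1


topology_ok = tiles(TOPOLOGY, 1, PROTEIN_LENGTH)
subunits_ok = tiles(SUBUNIT_REGIONS, 17, PROTEIN_LENGTH)
print(topology_ok, subunits_ok)
```

A check like this only confirms that the annotated ranges are mutually consistent; it says nothing about whether the "potential" annotations are experimentally confirmed.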
SP Length: 16
----+----1----+----2----+----3----+----4----+----5
Signal Peptide
MKKLFVVLVVMPLIYG
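As an illustration of how the signal peptide relates to the precursor, here is a minimal sketch that splits a sequence at the annotated cleavage position. It assumes 1-based inclusive coordinates (signal peptide 1-16, chain starting at 17) and uses only the first 50 residues of the entry; the function name `split_at_signal_peptide` is hypothetical.

```python
def split_at_signal_peptide(precursor, sp_end):
    """Split a precursor into (signal peptide, remainder).

    sp_end is the 1-based, inclusive end position of the signal peptide,
    so the remainder starts at position sp_end + 1.
    """
    return precursor[:sp_end], precursor[sp_end:]


# First 50 residues of the entry's sequence (one ruler line).
N_TERMINUS = "MKKLFVVLVVMPLIYGDNFPCSKLTNRTIGNHWNLIETFLLNYSSRLSPN"

signal, mature_start = split_at_signal_peptide(N_TERMINUS, 16)
print(signal)        # the 16-residue signal peptide
print(mature_start)  # beginning of the mature chain (residue 17 onward)
```

Applied to the full 1449-residue sequence, the same split would leave a mature chain of 1449 - 16 = 1433 residues.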
Sequence
MKKLFVVLVVMPLIYGDNFPCSKLTNRTIGNHWNLIETFLLNYSSRLSPN
SDVVLGDYFPTVQPWFNCIHNNSNDLYVTLENLKALYWDYATENSTWNHK
QRLNVVVNGYPYSITVTTTRNFNSAEGAIICICKGSPPTTTTESSLTCNW
GSECRLNHKFPICPSNSEANCGNMLYGLQWFADAVVAYLHGASYRISFEN
QWSGTVTLGDMRATTLETAGTLVDLWWFNPVYDVSYYRVNNKNGTTVVSN
CTDQCASYVANVFTTQPGGFIPSDFSFNNWFLLTNSSTLVSGKLVTKQPL
LVNCLWPVPSFEEAASTFCFEGAGFDQCNGAVLNNTVDVIRFNLNFTTNV
QSGKGATVFSLNTTGGVTLEISCYNDTVSDSSFSSYGEIPFGVTDGPRYC
YVLYNGTALKYLGTLPPSVKEIAISKWGHFYINGYNFFSTFPIDCISFNL
TTGDSDVFWTIAYTSYTEALVQVENTAITKVTYCNSYVNNIKCSQLTANL
NNGFYPVSSSEVGFVNKSVVLLPTFYTHTIVNITIGLGMKRSGYGQPIAS
TLSNITLPMQDNNIDVYCIRSDQFSVYVHSTCKSALWDNVFKRNCTDVLD
ATAVIKTGTCPFSFDKLNNYLTFNKFCLSLSPVGANCKFDVAARTRANDQ
VVRSLYVIYEEGDNIVGVPSDNSGLHDLSVLHLDSCTDYNIYGRSGVGII
RQTNRTLLSGLYYTSLSGDLLGFKNVSDGVIYSVTPCDVSAQAAVIDGTI
VGAITSINSELLGLTHWTTTPNFYYYSIYNYTNDMTRGTAIDSNDVDCEP
VITYSNIGVCKNGALVFINVTHSDGDVQPISTGNVTIPTNFTISVQVEYI
QVYTTPVSIDCSRYVCNGNPRCNKLLTQYVSACQTIEQALAVGARLENME
VDSMLFVSENALKLASVEAFNSSETLDPIYKEWPNIGGSWLEGLKYILPS
DNSKRKYRSAIEDLLFSKVVTSGLGTVDEDYKRCTGGYDIADLVCAQYYN
GIMVLPGVANADKMTMYTASLAGGITLGALGGGAVAIPFAVAVQARLNYV
ALQTDVLNKNQQILASAFNQAIGNITQSFGKVNDAIHQTSRGLATVAKAL
AKVQDVVNTQGQALSHLTVQLQNNFQAISSSISDIYNRLDELSADAHVDR
LITGRLTALNAFVSQTLTRQAEVRASRQLAKDKVNECVRSQSQRFGFCGN
GTHLFSLANAAPNGMIFFHAVLLPTAYETVTAWAGICALDGDRTFGLVVK
DVQLTLFRNLDDKFYLTPRTMYQPRVATSSDFVQIEGCDVLFVNATLSDL
PSIIPDYIDINQTVQDILENFRPNWTVPELTFDIFNATYLNLTGEIDDLE
FRSEKLHNTTVELAILIDNINNTLVNLEWLNRIETYVKWPWYVWLLIGLV
VIFCIPLLLFCCCSTGCCGCIGCLGSCCHSICSRRQFENYEPIEKVHIH
----+----1----+----2----+----3----+----4----+----5
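The potential N-linked glycosylation sites listed in the features can be cross-checked against the sequence. The sketch below assumes the commonly used sequon pattern N-x-[S/T] with x not proline; this is an assumption about how such sites are usually predicted, not a statement of how this database derived them. The function name `find_sequons` is hypothetical, and only the first 50 residues are used to keep the example short.

```python
import re

# N followed by any residue except P, then S or T. The lookahead lets
# overlapping sequons be reported individually.
SEQUON = re.compile(r"N(?=[^P][ST])")


def find_sequons(sequence):
    """Return 1-based positions of Asn residues in an N-x-[ST] sequon (x != P)."""
    return [match.start() + 1 for match in SEQUON.finditer(sequence)]


# First 50 residues of the entry's sequence (one ruler line).
N_TERMINUS = "MKKLFVVLVVMPLIYGDNFPCSKLTNRTIGNHWNLIETFLLNYSSRLSPN"

positions = find_sequons(N_TERMINUS)
print(positions)
```

For this 50-residue fragment the result is positions 26 and 42, which agrees with the first two glycosylation sites in the feature list. The Asn at position 50 cannot be evaluated from the fragment alone because the two following residues lie on the next line.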
© 2007-2017 Dr. Katja Kapp, Kassel & thpr.net e. K., Dresden, Germany, last update 2010-06-11