Signal Peptide Database - Viruses

 Entry Details
ID   3830
Source Database   UniProtKB/Swiss-Prot
UniProtKB/Swiss-Prot Accession Number   P18450    (Created: 1990-11-01 Updated: 2008-11-25)
UniProtKB/Swiss-Prot Entry Name   SPIKE_CVPFS
Protein Name   Spike glycoprotein
Gene   S
Organism Scientific   Porcine transmissible gastroenteritis coronavirus (strain FS772/70)
Organism Common   TGEV
Lineage   Viruses
  ssRNA positive-strand viruses, no DNA stage
    Nidovirales
      Coronaviridae
        Coronavirus
          Coronavirus group 1
            Coronavirus group 1a
Protein Length [aa]   1449
Protein Mass [Da]   159958
Features  
Type   Description   Status   Start   End
signal peptide         1   16
chain   Spike glycoprotein      17   1449
transmembrane region      potential   1391   1410
topological domain   Extracellular   potential   17   1390
topological domain   Cytoplasmic   potential   1411   1449
region of interest   S1      17   776
region of interest   S2      777   1449
glycosylation site   N-linked (GlcNAc...)   potential   26   26
glycosylation site   N-linked (GlcNAc...)   potential   42   42
glycosylation site   N-linked (GlcNAc...)   potential   71   71
glycosylation site   N-linked (GlcNAc...)   potential   94   94
glycosylation site   N-linked (GlcNAc...)   potential   243   243
glycosylation site   N-linked (GlcNAc...)   potential   250   250
glycosylation site   N-linked (GlcNAc...)   potential   285   285
glycosylation site   N-linked (GlcNAc...)   potential   334   334
glycosylation site   N-linked (GlcNAc...)   potential   345   345
glycosylation site   N-linked (GlcNAc...)   potential   362   362
glycosylation site   N-linked (GlcNAc...)   potential   375   375
glycosylation site   N-linked (GlcNAc...)   potential   405   405
glycosylation site   N-linked (GlcNAc...)   potential   449   449
glycosylation site   N-linked (GlcNAc...)   potential   516   516
glycosylation site   N-linked (GlcNAc...)   potential   532   532
glycosylation site   N-linked (GlcNAc...)   potential   554   554
glycosylation site   N-linked (GlcNAc...)   potential   594   594
glycosylation site   N-linked (GlcNAc...)   potential   704   704
glycosylation site   N-linked (GlcNAc...)   potential   725   725
glycosylation site   N-linked (GlcNAc...)   potential   780   780
glycosylation site   N-linked (GlcNAc...)   potential   819   819
glycosylation site   N-linked (GlcNAc...)   potential   834   834
glycosylation site   N-linked (GlcNAc...)   potential   840   840
glycosylation site   N-linked (GlcNAc...)   potential   921   921
glycosylation site   N-linked (GlcNAc...)   potential   1074   1074
glycosylation site   N-linked (GlcNAc...)   potential   1200   1200
glycosylation site   N-linked (GlcNAc...)   potential   1294   1294
glycosylation site   N-linked (GlcNAc...)   potential   1311   1311
glycosylation site   N-linked (GlcNAc...)   potential   1324   1324
glycosylation site   N-linked (GlcNAc...)   potential   1336   1336
glycosylation site   N-linked (GlcNAc...)   potential   1341   1341
glycosylation site   N-linked (GlcNAc...)   potential   1358   1358
glycosylation site   N-linked (GlcNAc...)   potential   1371   1371
compositionally biased region   Cys-rich      1411   1432
short sequence motif   KxHxx   by similarity   1445   1449
coiled-coil region      potential   1104   1148
coiled-coil region      potential   1338   1380
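The Start and End columns above are 1-based and inclusive, so a feature spanning Start..End covers End - Start + 1 residues. As a minimal sketch of that arithmetic (the names `FEATURES`, `span_length` and `tiles_protein` are hypothetical and not part of the database), the following checks that the signal peptide, S1 and S2 regions together cover the full 1449-residue protein without gaps or overlaps:

```python
# Coordinates copied from the feature table (1-based, inclusive).
# The helper names below are illustrative assumptions, not a database API.
FEATURES = [
    ("signal peptide", 1, 16),
    ("S1", 17, 776),
    ("S2", 777, 1449),
]
PROTEIN_LENGTH = 1449


def span_length(start, end):
    """Number of residues in an inclusive 1-based range."""
    return end - start + 1


def tiles_protein(features, length):
    """True if the features cover 1..length with no gaps or overlaps."""
    expected_start = 1
    for _, start, end in sorted(features, key=lambda f: f[1]):
        if start != expected_start:
            return False
        expected_start = end + 1
    return expected_start == length + 1


if __name__ == "__main__":
    for name, start, end in FEATURES:
        print(name, span_length(start, end))
    print("covers protein:", tiles_protein(FEATURES, PROTEIN_LENGTH))
```

The same check should apply to the topology rows (signal peptide, extracellular domain, transmembrane region, cytoplasmic domain) if their coordinates are substituted.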
SP Length   16
 ----+----1----+----2----+----3----+----4----+----5
Signal Peptide MKKLFVVLVVMPLIYG
Sequence MKKLFVVLVVMPLIYGDNFPCSKLTNRTIGNHWNLIETFLLNYSSRLSPN
SDVVLGDYFPTVQPWFNCIHNNSNDLYVTLENLKALYWDYATENSTWNHK
QRLNVVVNGYPYSITVTTTRNFNSAEGAIICICKGSPPTTTTESSLTCNW
GSECRLNHKFPICPSNSEANCGNMLYGLQWFADAVVAYLHGASYRISFEN
QWSGTVTLGDMRATTLETAGTLVDLWWFNPVYDVSYYRVNNKNGTTVVSN
CTDQCASYVANVFTTQPGGFIPSDFSFNNWFLLTNSSTLVSGKLVTKQPL
LVNCLWPVPSFEEAASTFCFEGAGFDQCNGAVLNNTVDVIRFNLNFTTNV
QSGKGATVFSLNTTGGVTLEISCYNDTVSDSSFSSYGEIPFGVTDGPRYC
YVLYNGTALKYLGTLPPSVKEIAISKWGHFYINGYNFFSTFPIDCISFNL
TTGDSDVFWTIAYTSYTEALVQVENTAITKVTYCNSYVNNIKCSQLTANL
NNGFYPVSSSEVGFVNKSVVLLPTFYTHTIVNITIGLGMKRSGYGQPIAS
TLSNITLPMQDNNIDVYCIRSDQFSVYVHSTCKSALWDNVFKRNCTDVLD
ATAVIKTGTCPFSFDKLNNYLTFNKFCLSLSPVGANCKFDVAARTRANDQ
VVRSLYVIYEEGDNIVGVPSDNSGLHDLSVLHLDSCTDYNIYGRSGVGII
RQTNRTLLSGLYYTSLSGDLLGFKNVSDGVIYSVTPCDVSAQAAVIDGTI
VGAITSINSELLGLTHWTTTPNFYYYSIYNYTNDMTRGTAIDSNDVDCEP
VITYSNIGVCKNGALVFINVTHSDGDVQPISTGNVTIPTNFTISVQVEYI
QVYTTPVSIDCSRYVCNGNPRCNKLLTQYVSACQTIEQALAVGARLENME
VDSMLFVSENALKLASVEAFNSSETLDPIYKEWPNIGGSWLEGLKYILPS
DNSKRKYRSAIEDLLFSKVVTSGLGTVDEDYKRCTGGYDIADLVCAQYYN
GIMVLPGVANADKMTMYTASLAGGITLGALGGGAVAIPFAVAVQARLNYV
ALQTDVLNKNQQILASAFNQAIGNITQSFGKVNDAIHQTSRGLATVAKAL
AKVQDVVNTQGQALSHLTVQLQNNFQAISSSISDIYNRLDELSADAHVDR
LITGRLTALNAFVSQTLTRQAEVRASRQLAKDKVNECVRSQSQRFGFCGN
GTHLFSLANAAPNGMIFFHAVLLPTAYETVTAWAGICALDGDRTFGLVVK
DVQLTLFRNLDDKFYLTPRTMYQPRVATSSDFVQIEGCDVLFVNATLSDL
PSIIPDYIDINQTVQDILENFRPNWTVPELTFDIFNATYLNLTGEIDDLE
FRSEKLHNTTVELAILIDNINNTLVNLEWLNRIETYVKWPWYVWLLIGLV
VIFCIPLLLFCCCSTGCCGCIGCLGSCCHSICSRRQFENYEPIEKVHIH
Original MKKLFVVLVVMPLIYGDNFPCSKLTNRTIGNHWNLIETFLLNYSSRLSPN
SDVVLGDYFPTVQPWFNCIHNNSNDLYVTLENLKALYWDYATENSTWNHK
QRLNVVVNGYPYSITVTTTRNFNSAEGAIICICKGSPPTTTTESSLTCNW
GSECRLNHKFPICPSNSEANCGNMLYGLQWFADAVVAYLHGASYRISFEN
QWSGTVTLGDMRATTLETAGTLVDLWWFNPVYDVSYYRVNNKNGTTVVSN
CTDQCASYVANVFTTQPGGFIPSDFSFNNWFLLTNSSTLVSGKLVTKQPL
LVNCLWPVPSFEEAASTFCFEGAGFDQCNGAVLNNTVDVIRFNLNFTTNV
QSGKGATVFSLNTTGGVTLEISCYNDTVSDSSFSSYGEIPFGVTDGPRYC
YVLYNGTALKYLGTLPPSVKEIAISKWGHFYINGYNFFSTFPIDCISFNL
TTGDSDVFWTIAYTSYTEALVQVENTAITKVTYCNSYVNNIKCSQLTANL
NNGFYPVSSSEVGFVNKSVVLLPTFYTHTIVNITIGLGMKRSGYGQPIAS
TLSNITLPMQDNNIDVYCIRSDQFSVYVHSTCKSALWDNVFKRNCTDVLD
ATAVIKTGTCPFSFDKLNNYLTFNKFCLSLSPVGANCKFDVAARTRANDQ
VVRSLYVIYEEGDNIVGVPSDNSGLHDLSVLHLDSCTDYNIYGRSGVGII
RQTNRTLLSGLYYTSLSGDLLGFKNVSDGVIYSVTPCDVSAQAAVIDGTI
VGAITSINSELLGLTHWTTTPNFYYYSIYNYTNDMTRGTAIDSNDVDCEP
VITYSNIGVCKNGALVFINVTHSDGDVQPISTGNVTIPTNFTISVQVEYI
QVYTTPVSIDCSRYVCNGNPRCNKLLTQYVSACQTIEQALAVGARLENME
VDSMLFVSENALKLASVEAFNSSETLDPIYKEWPNIGGSWLEGLKYILPS
DNSKRKYRSAIEDLLFSKVVTSGLGTVDEDYKRCTGGYDIADLVCAQYYN
GIMVLPGVANADKMTMYTASLAGGITLGALGGGAVAIPFAVAVQARLNYV
ALQTDVLNKNQQILASAFNQAIGNITQSFGKVNDAIHQTSRGLATVAKAL
AKVQDVVNTQGQALSHLTVQLQNNFQAISSSISDIYNRLDELSADAHVDR
LITGRLTALNAFVSQTLTRQAEVRASRQLAKDKVNECVRSQSQRFGFCGN
GTHLFSLANAAPNGMIFFHAVLLPTAYETVTAWAGICALDGDRTFGLVVK
DVQLTLFRNLDDKFYLTPRTMYQPRVATSSDFVQIEGCDVLFVNATLSDL
PSIIPDYIDINQTVQDILENFRPNWTVPELTFDIFNATYLNLTGEIDDLE
FRSEKLHNTTVELAILIDNINNTLVNLEWLNRIETYVKWPWYVWLLIGLV
VIFCIPLLLFCCCSTGCCGCIGCLGSCCHSICSRRQFENYEPIEKVHIH
 ----+----1----+----2----+----3----+----4----+----5
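The feature coordinates can be mapped back onto the sequence with a simple 1-based slice. The sketch below is illustrative only: it uses just the first 50 residues of the entry, the helper names `feature_slice` and `has_sequon` are hypothetical, and the N-x-S/T pattern (x not proline) is the commonly used consensus for N-linked glycosylation, assumed here rather than stated by the entry. It recovers the 16-residue signal peptide and checks that annotated positions 26 and 42 sit on such a motif.

```python
import re

# First 50 residues of the entry (positions 1-50), copied from the sequence above.
N_TERMINUS = "MKKLFVVLVVMPLIYGDNFPCSKLTNRTIGNHWNLIETFLLNYSSRLSPN"

# Commonly used N-glycosylation consensus: Asn, any residue except Pro, then Ser/Thr.
# This pattern is an assumption for illustration; the entry does not state how
# the "potential" sites were predicted.
SEQUON = re.compile(r"N[^P][ST]")


def feature_slice(seq, start, end):
    """Return residues start..end (1-based, inclusive), as in the feature table."""
    return seq[start - 1:end]


def has_sequon(seq, position):
    """True if an N-x-S/T motif begins at the given 1-based position."""
    return SEQUON.match(seq, position - 1) is not None


if __name__ == "__main__":
    print("signal peptide:", feature_slice(N_TERMINUS, 1, 16))
    print("first residue of mature chain:", feature_slice(N_TERMINUS, 17, 17))
    for pos in (26, 42):
        print(pos, feature_slice(N_TERMINUS, pos, pos + 2), has_sequon(N_TERMINUS, pos))
```

Applying the same helpers to the full 1449-residue sequence would allow the remaining annotated sites to be checked in the same way.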

© 2007-2017 Dr. Katja Kapp, Kassel & thpr.net e. K., Dresden, Germany, last update 2010-06-11