Signal Peptide Website
Search my Protein
Advanced Search
Database Search
References
Hints
Links
Imprint
Signal Peptide Database - Viruses
Entry Details
ID
3829
Source Database
UniProtKB/Swiss-Prot
UniProtKB/Swiss-Prot Accession Number
A3EXG6 (Created: 2007-06-12 Updated: 2008-11-25)
UniProtKB/Swiss-Prot Entry Name
SPIKE_BCHK9
Protein Name
Spike glycoprotein
Gene
S
Organism Scientific
Bat coronavirus HKU9
Organism Common
BtCoV
Lineage
Viruses
ssRNA positive-strand viruses, no DNA stage
Nidovirales
Coronaviridae
Coronavirus
Coronavirus group 2
Coronavirus group 2d
Protein Length [aa]
1274
Protein Mass [Da]
139717
Features
Type
Description
Status
Start
End
signal peptide
potential
1
14
chain
Spike glycoprotein
potential
15
1274
chain
Spike protein S1
potential
15
676
chain
Spike protein S2
potential
677
1274
transmembrane region
potential
1213
1233
topological domain
Extracellular
potential
15
1212
topological domain
Cytoplasmic
potential
1234
1274
glycosylation site
N-linked (GlcNAc...)
potential
30
30
glycosylation site
N-linked (GlcNAc...)
potential
34
34
glycosylation site
N-linked (GlcNAc...)
potential
90
90
glycosylation site
N-linked (GlcNAc...)
potential
154
154
glycosylation site
N-linked (GlcNAc...)
potential
165
165
glycosylation site
N-linked (GlcNAc...)
potential
199
199
glycosylation site
N-linked (GlcNAc...)
potential
205
205
glycosylation site
N-linked (GlcNAc...)
potential
304
304
glycosylation site
N-linked (GlcNAc...)
potential
423
423
glycosylation site
N-linked (GlcNAc...)
potential
459
459
glycosylation site
N-linked (GlcNAc...)
potential
521
521
glycosylation site
N-linked (GlcNAc...)
potential
547
547
glycosylation site
N-linked (GlcNAc...)
potential
572
572
glycosylation site
N-linked (GlcNAc...)
potential
644
644
glycosylation site
N-linked (GlcNAc...)
potential
663
663
glycosylation site
N-linked (GlcNAc...)
potential
688
688
glycosylation site
N-linked (GlcNAc...)
potential
705
705
glycosylation site
N-linked (GlcNAc...)
potential
791
791
glycosylation site
N-linked (GlcNAc...)
potential
1042
1042
glycosylation site
N-linked (GlcNAc...)
potential
1081
1081
glycosylation site
N-linked (GlcNAc...)
potential
1096
1096
glycosylation site
N-linked (GlcNAc...)
potential
1113
1113
glycosylation site
N-linked (GlcNAc...)
potential
1128
1128
glycosylation site
N-linked (GlcNAc...)
potential
1133
1133
glycosylation site
N-linked (GlcNAc...)
potential
1157
1157
glycosylation site
N-linked (GlcNAc...)
potential
1163
1163
glycosylation site
N-linked (GlcNAc...)
potential
1172
1172
glycosylation site
N-linked (GlcNAc...)
potential
1193
1193
site
Cleavage
potential
676
677
compositionally biased region
Cys-rich
1235
1253
short sequence motif
Di-lysine motif
by similarity
1270
1274
SP Length
14
----+----1----+----2----+----3----+----4----+----5
Signal Peptide
MLLILVLGVSLAAA
Sequence
MLLILVLGVSLAAA
SRPECFNPRFTLTPL
N
HTL
N
YTSIKAKVSNVLLPDP
YIAYSGQTLRQNLFMADMSNTILYPVTPPANGANGGFIY
N
TSIIPVSAGL
FVNTWMYRQPASSRAYCQEPFGVAFGDTFENDRIAILIMAPDNLGSWSAV
APR
N
QTNIYLLVCS
N
ATLCINPGFNRWGPAGSFIAPDALVDHSNSCFV
N
N
TFSV
N
ISTSRISLAFLFKDGDLLIYHSGWLPTSNFEHGFSRGSHPMTYFM
SLPVGGNLPRAQFFQSIVRSNAIDKGDGMCTNFDVNLHVAHLINRDLLVS
YFN
N
GSVANAADCADSAAEELYCVTGSFDPPTGVYPLSRYRAQVAGFVRV
TQRGSYCTPPYSVLQDPPQPVVWRRYMLYDCVFDFTVVVDSLPTHQLQCY
GVSPRRLASMCYGSVTLDVMRI
N
ETHLNNLFNRVPDTFSLYNYALPDNFY
GCLHAFYL
N
STAPYAVANRFPIKPGGRQSNSAFIDTVINAAHYSPFSYVY
GLAVITLKPAAGSKLVCPVA
N
DTVVITDRCVQYNLYGYTGTGVLSK
N
TSL
VIPDGKVFTASSTGTIIGVSI
N
STTYSIMPCVTVPVSVGYHPNFERALLF
NGLSCSQRSRAVTEPVSVLWSASATAQDAFDTPSGCVVNVELR
N
TTIVNT
CAMPIGNSLCFI
N
GSIATANADSLP
RL
QLVNYDPLYD
N
STATPMTPVYWV
KVPT
N
FTLSATEEYIQTTAPKITIDCARYLCGDSSRCLNVLLHYGTFCND
INKALSRVSTILDSALLSLVKELSINTRDEVTTFSFDGDY
N
FTGLMGCLG
PNCGATTYRSAFSDLLYDKVRITDPGFMQSYQKCIDSQWGGSIRDLLCTQ
TYNGIAVLPPIVSPAMQALYTSLLVGAVASSGYTFGITSAGVIPFATQLQ
FRLNGIGVTTQVLVENQKLIASSFNNALVNIQKGFTETSIALSKMQDVIN
QHAAQLHTLVVQLGNSFGAISSSINEIFSRLEGLAANAEVDRLINGRMMV
LNTYVTQLLIQASEAKAQNALAAQKISECVKAQSLRNDFCG
N
GTHVLSIP
QLAPNGVLFIHYAYTPTEYAFVQTSAGLCH
N
GTGYAPRQGMFVLP
N
NTNM
WHFTTMQFYNPV
N
ISASNTQVLTSCSV
N
YTSV
N
YTVLEPSVPGDYDFQKE
FDKFYK
N
LSTIF
N
NTFNPNDF
N
FSTVDVTAQIKSLHDVVNQL
N
QSFIDLK
KLNVYEKTIKWP
WYVWLAMIAGIVGLVLAVIML
M
CMTNCCSCFKGMCDCR
RCC
GSYDSYDDVYPAVRVN
KKRTV
Original
MLLILVLGVSLAAASRPECFNPRFTLTPLNHTLNYTSIKAKVSNVLLPDP
YIAYSGQTLRQNLFMADMSNTILYPVTPPANGANGGFIYNTSIIPVSAGL
FVNTWMYRQPASSRAYCQEPFGVAFGDTFENDRIAILIMAPDNLGSWSAV
APRNQTNIYLLVCSNATLCINPGFNRWGPAGSFIAPDALVDHSNSCFVNN
TFSVNISTSRISLAFLFKDGDLLIYHSGWLPTSNFEHGFSRGSHPMTYFM
SLPVGGNLPRAQFFQSIVRSNAIDKGDGMCTNFDVNLHVAHLINRDLLVS
YFNNGSVANAADCADSAAEELYCVTGSFDPPTGVYPLSRYRAQVAGFVRV
TQRGSYCTPPYSVLQDPPQPVVWRRYMLYDCVFDFTVVVDSLPTHQLQCY
GVSPRRLASMCYGSVTLDVMRINETHLNNLFNRVPDTFSLYNYALPDNFY
GCLHAFYLNSTAPYAVANRFPIKPGGRQSNSAFIDTVINAAHYSPFSYVY
GLAVITLKPAAGSKLVCPVANDTVVITDRCVQYNLYGYTGTGVLSKNTSL
VIPDGKVFTASSTGTIIGVSINSTTYSIMPCVTVPVSVGYHPNFERALLF
NGLSCSQRSRAVTEPVSVLWSASATAQDAFDTPSGCVVNVELRNTTIVNT
CAMPIGNSLCFINGSIATANADSLPRLQLVNYDPLYDNSTATPMTPVYWV
KVPTNFTLSATEEYIQTTAPKITIDCARYLCGDSSRCLNVLLHYGTFCND
INKALSRVSTILDSALLSLVKELSINTRDEVTTFSFDGDYNFTGLMGCLG
PNCGATTYRSAFSDLLYDKVRITDPGFMQSYQKCIDSQWGGSIRDLLCTQ
TYNGIAVLPPIVSPAMQALYTSLLVGAVASSGYTFGITSAGVIPFATQLQ
FRLNGIGVTTQVLVENQKLIASSFNNALVNIQKGFTETSIALSKMQDVIN
QHAAQLHTLVVQLGNSFGAISSSINEIFSRLEGLAANAEVDRLINGRMMV
LNTYVTQLLIQASEAKAQNALAAQKISECVKAQSLRNDFCGNGTHVLSIP
QLAPNGVLFIHYAYTPTEYAFVQTSAGLCHNGTGYAPRQGMFVLPNNTNM
WHFTTMQFYNPVNISASNTQVLTSCSVNYTSVNYTVLEPSVPGDYDFQKE
FDKFYKNLSTIFNNTFNPNDFNFSTVDVTAQIKSLHDVVNQLNQSFIDLK
KLNVYEKTIKWPWYVWLAMIAGIVGLVLAVIMLMCMTNCCSCFKGMCDCR
RCCGSYDSYDDVYPAVRVNKKRTV
----+----1----+----2----+----3----+----4----+----5
Hydropathies
Home
Imprint
© 2007-2017
Dr. Katja Kapp
, Kassel &
thpr.net e. K.
, Dresden, Germany, last update 2010-06-11