Signal Peptide Database - Viruses

 Entry Details
ID   804
Source Database   UniProtKB/Swiss-Prot
UniProtKB/Swiss-Prot Accession Number   Q3I5J5    (Created: 2005-11-22 Updated: 2008-11-25)
UniProtKB/Swiss-Prot Entry Name   SPIKE_BCRP3
Protein Name   Spike glycoprotein
Gene   S
Organism Scientific   Bat coronavirus Rp3/2004
Organism Common   BtCoV/Rp3/2004
Lineage   Viruses
  ssRNA positive-strand viruses, no DNA stage
    Nidovirales
      Coronaviridae
        Coronavirus
          Coronavirus group 2
            Coronavirus group 2b
              unclassified Coronavirus group 2b
Protein Length [aa]   1241
Protein Mass [Da]   137560
Features  
TypeDescriptionStatusStartEnd
signal peptide      potential   1   15
chain   Spike glycoprotein      16   1241
chain   Spike protein S1   potential   16   653
chain   Spike protein S2   potential   654   1241
transmembrane region      potential   1182   1202
topological domain   Extracellular   potential   16   1181
topological domain   Cytoplasmic   potential   1203   1241
region of interest   Fusion peptide   potential   756   774
region of interest   Heptad repeat 1      888   938
region of interest   Heptad repeat 2      1131   1170
glycosylation site   N-linked (GlcNAc...)   potential   65   65
glycosylation site   N-linked (GlcNAc...)   potential   115   115
glycosylation site   N-linked (GlcNAc...)   potential   124   124
glycosylation site   N-linked (GlcNAc...)   potential   125   125
glycosylation site   N-linked (GlcNAc...)   potential   162   162
glycosylation site   N-linked (GlcNAc...)   potential   231   231
glycosylation site   N-linked (GlcNAc...)   potential   273   273
glycosylation site   N-linked (GlcNAc...)   potential   298   298
glycosylation site   N-linked (GlcNAc...)   potential   322   322
glycosylation site   N-linked (GlcNAc...)   potential   334   334
glycosylation site   N-linked (GlcNAc...)   potential   361   361
glycosylation site   N-linked (GlcNAc...)   potential   575   575
glycosylation site   N-linked (GlcNAc...)   potential   588   588
glycosylation site   N-linked (GlcNAc...)   potential   629   629
glycosylation site   N-linked (GlcNAc...)   potential   677   677
glycosylation site   N-linked (GlcNAc...)   potential   685   685
glycosylation site   N-linked (GlcNAc...)   potential   769   769
glycosylation site   N-linked (GlcNAc...)   potential   1042   1042
glycosylation site   N-linked (GlcNAc...)   potential   1066   1066
glycosylation site   N-linked (GlcNAc...)   potential   1102   1102
glycosylation site   N-linked (GlcNAc...)   potential   1126   1126
glycosylation site   N-linked (GlcNAc...)   potential   1141   1141
glycosylation site   N-linked (GlcNAc...)   potential   1162   1162
helix         913   949
helix         959   976
helix         1134   1137
helix         1141   1171
site   Cleavage   potential   653   654
compositionally biased region   Cys-rich      1203   1222
short sequence motif   KxHxx   by similarity   1237   1241
turn         1172   1176
coiled-coil region      potential   917   961
coiled-coil region      potential   1143   1171
SP Length   15
 ----+----1----+----2----+----3----+----4----+----5
Signal Peptide MKILILAFLASLAKA
Sequence MKILILAFLASLAKAQEGCGIISRKPQPKMAQVSSSRRGVYYNDDIFRSN
VLHLTQDYFLPFDS
NLTQYFSLNVDSDRFTYFDNPILDFGDGVYFAATEK
SNVIRGWIFGSTFD
NTTQSAVIVNNSTHIIIRVCNFNLCKEPMYTVSRGA
QQSSWVYQSAF
NCTYDRVEKSFQLDTAPKTGNFKDLREYVFKNRDGFLSV
YQTYTAVNLPRGLPIGFSVLRPILKLPFGI
NITSYRVVMAMFSQTTSNFL
PESAAYYVGNLKYTTFMLSFNE
NGTITNAIDCAQNPLAELKCTIKNFNVS
KGIYQTSNFRVSPTQEVIRFP
NITNRCPFDKVFNATRFPNVYAWERTKIS
DCVADYTVLY
NSTSFSTFKCYGVSPSKLIDLCFTSVYADTFLIRSSEVRQ
VAPGETGVIADYNYKLPDDFTGCVIAWNTAKQDQGQYYYRSHRKTKLKPF
ERDLSSDENGVRTLSTYDFYPSVPVAYQATRVVVLSFELLNAPATVCGPK
LSTQLVKNQCVNFNFNGLKGTGVLTESSKRFQSFQQFGRDTSDFTDSVRD
PQTLEILDISPCSFGGVSVITPGT
NASSEVAVLYQDVNCTDVPAAIHADQ
LTPAWRVYSTGTNVFQTQAGCLIGAEHV
NASYECDIPIGAGICASYHTAS
TL
RSVGQKSIVAYTMSLGAENSIAYANNSIAIPTNFSISVTTEVMPVSMA
KTSVDCTMYICGDSLECSNLLLQYGSFCTQLNRALSGIAIEQDKNTQEVF
AQVKQ
MYKTPAIKDFGGFNFSQILPDPSKPTKRSFIEDLLFNKVTLADAG
FMKQYGECLGDISARDLICAQKFNGLTVLPPLLTDEMIAAYTAALVSGTA
TAGWTFGAGSALQIPFAMQMAYRFNGIGVTQNVLYEN
QKQIANQFNKAIS
QIQESLTTTSTA
LGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILS
RLDKVEAEVQI
DRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSEC
VLGQSKRVDFCGKGYHLMSFPQAAPHGVVFLHVTYVPSQER
NFTTAPAIC
HEGKAYFPREGVFVS
NGTSWFITQRNFYSPQIITTDNTFVAGSCDVVIGI
I
NNTVYDPLQPELDSFKEELDKYFKNHTSPDVDLGDISGINASVVNIQKE
IDRLNEVAKNLNESLIDLQEL
GKYEQYIKWPWYVWLGFIAGLIAIVMVTI
LL
CCMTSCCSCLKGACSCGSCCKFDEDDSEPVLKGVKLHYT
Original MKILILAFLASLAKAQEGCGIISRKPQPKMAQVSSSRRGVYYNDDIFRSN
VLHLTQDYFLPFDSNLTQYFSLNVDSDRFTYFDNPILDFGDGVYFAATEK
SNVIRGWIFGSTFDNTTQSAVIVNNSTHIIIRVCNFNLCKEPMYTVSRGA
QQSSWVYQSAFNCTYDRVEKSFQLDTAPKTGNFKDLREYVFKNRDGFLSV
YQTYTAVNLPRGLPIGFSVLRPILKLPFGINITSYRVVMAMFSQTTSNFL
PESAAYYVGNLKYTTFMLSFNENGTITNAIDCAQNPLAELKCTIKNFNVS
KGIYQTSNFRVSPTQEVIRFPNITNRCPFDKVFNATRFPNVYAWERTKIS
DCVADYTVLYNSTSFSTFKCYGVSPSKLIDLCFTSVYADTFLIRSSEVRQ
VAPGETGVIADYNYKLPDDFTGCVIAWNTAKQDQGQYYYRSHRKTKLKPF
ERDLSSDENGVRTLSTYDFYPSVPVAYQATRVVVLSFELLNAPATVCGPK
LSTQLVKNQCVNFNFNGLKGTGVLTESSKRFQSFQQFGRDTSDFTDSVRD
PQTLEILDISPCSFGGVSVITPGTNASSEVAVLYQDVNCTDVPAAIHADQ
LTPAWRVYSTGTNVFQTQAGCLIGAEHVNASYECDIPIGAGICASYHTAS
TLRSVGQKSIVAYTMSLGAENSIAYANNSIAIPTNFSISVTTEVMPVSMA
KTSVDCTMYICGDSLECSNLLLQYGSFCTQLNRALSGIAIEQDKNTQEVF
AQVKQMYKTPAIKDFGGFNFSQILPDPSKPTKRSFIEDLLFNKVTLADAG
FMKQYGECLGDISARDLICAQKFNGLTVLPPLLTDEMIAAYTAALVSGTA
TAGWTFGAGSALQIPFAMQMAYRFNGIGVTQNVLYENQKQIANQFNKAIS
QIQESLTTTSTALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILS
RLDKVEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSEC
VLGQSKRVDFCGKGYHLMSFPQAAPHGVVFLHVTYVPSQERNFTTAPAIC
HEGKAYFPREGVFVSNGTSWFITQRNFYSPQIITTDNTFVAGSCDVVIGI
INNTVYDPLQPELDSFKEELDKYFKNHTSPDVDLGDISGINASVVNIQKE
IDRLNEVAKNLNESLIDLQELGKYEQYIKWPWYVWLGFIAGLIAIVMVTI
LLCCMTSCCSCLKGACSCGSCCKFDEDDSEPVLKGVKLHYT
 ----+----1----+----2----+----3----+----4----+----5
Hydropathies  
 

© 2007-2017 Katja Kapp, Dresden & thpr.net e. K., Dresden, Germany, last update 2010-06-11