Signal Peptide Database - Viruses

 Entry Details
ID   4424
Source Database   UniProtKB/Swiss-Prot
UniProtKB/Swiss-Prot Accession Number   P07974    (Created: 1988-08-01 Updated: 2008-11-25)
UniProtKB/Swiss-Prot Entry Name   HEMA_INCTA
Protein Name   Hemagglutinin-esterase-fusion glycoprotein
Gene   HE
Organism Scientific   Influenza C virus (strain C/Taylor/1233/1947)
Organism Common  
Lineage   Viruses
  ssRNA negative-strand viruses
    Orthomyxoviridae
      Influenzavirus C
Protein Length [aa]   642
Protein Mass [Da]   70762
Features  
Type   Description   Status   Start   End
signal peptide         1   1
chain   Hemagglutinin-esterase-fusion glycoprotein chain 1      2   433
chain   Hemagglutinin-esterase-fusion glycoprotein chain 2      434   642
disulfide bond   Interchain (between HEF1 and HEF2 chains)   by similarity   7   570
disulfide bond      by similarity   107   152
disulfide bond      by similarity   127   175
disulfide bond      by similarity   197   239
disulfide bond      by similarity   216   303
disulfide bond      by similarity   224   276
transmembrane region      potential   618   638
topological domain   Extracellular   potential   1   617
topological domain   Cytoplasmic   potential   639   642
region of interest   Fusion domain-1   by similarity   2   27
region of interest   Esterase domain-1   by similarity   28   138
region of interest   N-acetyl-9-O-acetylneuraminic acid binding   by similarity   138   297
region of interest   Esterase domain-2   by similarity   298   352
region of interest   Fusion domain-2   by similarity   353   638
glycosylation site   N-linked (GlcNAc...)   potential   13   13
glycosylation site   N-linked (GlcNAc...)   potential   48   48
glycosylation site   N-linked (GlcNAc...)   potential   131   131
glycosylation site   N-linked (GlcNAc...)   potential   382   382
active site   Nucleophile   by similarity   58   58
active site   Charge relay system   by similarity   353   353
active site   Charge relay system   by similarity   356   356
non-terminal residue         1   1
SP Length   1
 ----+----1----+----2----+----3----+----4----+----5
Signal Peptide A
Sequence AEKIKICLQKQVNSSFSLHNGFGGNLYATEEKRMFELVKPKAGASVLNQS
TWIGFGDSRTDKSNSAFPRSADVSEKTADKFRSLSGGSLMLSMFGPPGKV
DYLYQGCGKHKVFYEGVNWSPHAAINCYRKNWTDIKLNFQKNIYELASQS
HCMSLVNALDKTIPLQVTAGVAKNCNNSFLKNPALYTQEVNPSKEICGKE
NLAFFTLPTQFGTYECKLHLVASCYFIYDSKEVYNKRGCDNYFQVIYDSS
GKVVGGLDNRVSPYTGNTGDTPTMQCDMLQLKPGRYSVRSSPRFLLMPER
SYCFDMKEKGLVTAVQSVWGKGRESDHAVDQAYLSTPGCMLIQKQKPYIG
EADDHHGDQEMRELLSGLDYEARCISQSGWVNETSPFTEEYLLPPKFGRC
PLAAKEESIPKIPDGLLIPTSGTDTIVTKPKSRIFGIDDLIIGLLFVAIV
EAGIGGYLLGSRKESGGGVTKESAEKGFEKIGNDIQILRSSTNIAIEKLN
DRITHDEQAIRDLTLEIENARSEALLGELGIIRALLVGNISIGLQESLWE
LASEITNRAGDLAVEVSPGCWIIDNNICDQSCQNFIFKFNETAPVPTIPP
LDTKIDLQSDPFYWGSSLGLAITTPISLAALVISGIAICRTK
Original AEKIKICLQKQVNSSFSLHNGFGGNLYATEEKRMFELVKPKAGASVLNQS
TWIGFGDSRTDKSNSAFPRSADVSEKTADKFRSLSGGSLMLSMFGPPGKV
DYLYQGCGKHKVFYEGVNWSPHAAINCYRKNWTDIKLNFQKNIYELASQS
HCMSLVNALDKTIPLQVTAGVAKNCNNSFLKNPALYTQEVNPSKEICGKE
NLAFFTLPTQFGTYECKLHLVASCYFIYDSKEVYNKRGCDNYFQVIYDSS
GKVVGGLDNRVSPYTGNTGDTPTMQCDMLQLKPGRYSVRSSPRFLLMPER
SYCFDMKEKGLVTAVQSVWGKGRESDHAVDQAYLSTPGCMLIQKQKPYIG
EADDHHGDQEMRELLSGLDYEARCISQSGWVNETSPFTEEYLLPPKFGRC
PLAAKEESIPKIPDGLLIPTSGTDTIVTKPKSRIFGIDDLIIGLLFVAIV
EAGIGGYLLGSRKESGGGVTKESAEKGFEKIGNDIQILRSSTNIAIEKLN
DRITHDEQAIRDLTLEIENARSEALLGELGIIRALLVGNISIGLQESLWE
LASEITNRAGDLAVEVSPGCWIIDNNICDQSCQNFIFKFNETAPVPTIPP
LDTKIDLQSDPFYWGSSLGLAITTPISLAALVISGIAICRTK
 ----+----1----+----2----+----3----+----4----+----5
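
The Start and End columns of the Features table appear to be 1-based, inclusive residue positions into the sequence above. The following minimal Python sketch, assuming that convention, shows how a feature can be cut out of the sequence. The names `SEQUENCE` and `feature` are illustrative and not part of the database; the sequence string is copied from the "Original" block of this entry.

```python
# Sequence copied from the "Original" block of this entry (13 rows).
SEQUENCE = (
    "AEKIKICLQKQVNSSFSLHNGFGGNLYATEEKRMFELVKPKAGASVLNQS"
    "TWIGFGDSRTDKSNSAFPRSADVSEKTADKFRSLSGGSLMLSMFGPPGKV"
    "DYLYQGCGKHKVFYEGVNWSPHAAINCYRKNWTDIKLNFQKNIYELASQS"
    "HCMSLVNALDKTIPLQVTAGVAKNCNNSFLKNPALYTQEVNPSKEICGKE"
    "NLAFFTLPTQFGTYECKLHLVASCYFIYDSKEVYNKRGCDNYFQVIYDSS"
    "GKVVGGLDNRVSPYTGNTGDTPTMQCDMLQLKPGRYSVRSSPRFLLMPER"
    "SYCFDMKEKGLVTAVQSVWGKGRESDHAVDQAYLSTPGCMLIQKQKPYIG"
    "EADDHHGDQEMRELLSGLDYEARCISQSGWVNETSPFTEEYLLPPKFGRC"
    "PLAAKEESIPKIPDGLLIPTSGTDTIVTKPKSRIFGIDDLIIGLLFVAIV"
    "EAGIGGYLLGSRKESGGGVTKESAEKGFEKIGNDIQILRSSTNIAIEKLN"
    "DRITHDEQAIRDLTLEIENARSEALLGELGIIRALLVGNISIGLQESLWE"
    "LASEITNRAGDLAVEVSPGCWIIDNNICDQSCQNFIFKFNETAPVPTIPP"
    "LDTKIDLQSDPFYWGSSLGLAITTPISLAALVISGIAICRTK"
)


def feature(seq: str, start: int, end: int) -> str:
    """Return residues start..end of seq (1-based, inclusive).

    Assumption: this matches the coordinate convention of the
    Start/End columns in the Features table.
    """
    if not 1 <= start <= end <= len(seq):
        raise ValueError(f"feature {start}..{end} lies outside 1..{len(seq)}")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return seq[start - 1:end]


# Coordinates taken from the Features table of this entry.
signal_peptide = feature(SEQUENCE, 1, 1)        # "signal peptide 1 1"
chain_1 = feature(SEQUENCE, 2, 433)             # "chain 1"
chain_2 = feature(SEQUENCE, 434, 642)           # "chain 2"
cytoplasmic_tail = feature(SEQUENCE, 639, 642)  # "topological domain Cytoplasmic"

if __name__ == "__main__":
    print("total length:", len(SEQUENCE))
    print("signal peptide:", signal_peptide)
    print("chain 1 length:", len(chain_1))
    print("chain 2 length:", len(chain_2))
    print("cytoplasmic tail:", cytoplasmic_tail)
```

Under this reading, the one-residue signal peptide plus the two chains account for the full Protein Length of 642 aa. The annotated nucleophile at position 58 falls on a serine, and both ends of the interchain disulfide bond (7 and 570) fall on cysteines.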
Hydropathies  
 

© 2007-2017 Dr. Katja Kapp, Kassel & thpr.net e. K., Dresden, Germany, last update 2010-06-11