Signal Peptide Database - Viruses

 Entry Details
ID   3286
Source Database   UniProtKB/Swiss-Prot
UniProtKB/Swiss-Prot Accession Number   Q9QSQ7    (Created: 2006-06-27 Updated: 2008-11-25)
UniProtKB/Swiss-Prot Entry Name   ENV_HV1VI
Protein Name   Envelope glycoprotein gp160
Gene   env
Organism Scientific   Human immunodeficiency virus type 1 (isolate VI850 group M subtype F1)
Organism Common   HIV-1
Lineage   Viruses
  Retro-transcribing viruses
    Retroviridae
      Orthoretrovirinae
        Lentivirus
          Primate lentivirus group
Protein Length [aa]   832
Protein Mass [Da]   93810
Features  
Type   Description   Status   Start   End
signal peptide      by similarity   1   31
chain   Envelope glycoprotein gp160   by similarity   32   832
chain   Surface protein   by similarity   32   487
chain   Transmembrane protein   by similarity   488   832
disulfide bond      by similarity   53   73
disulfide bond      by similarity   118   196
disulfide bond      by similarity   125   187
disulfide bond      by similarity   130   147
disulfide bond      by similarity   209   238
disulfide bond      by similarity   219   230
disulfide bond      by similarity   287   321
disulfide bond      by similarity   368   422
disulfide bond      by similarity   375   395
transmembrane region      potential   661   681
topological domain   Extracellular   potential   32   660
topological domain   Cytoplasmic   potential   682   832
region of interest   V1      130   146
region of interest   V2      147   187
region of interest   V3      287   320
region of interest   V4      375   395
region of interest   V5      438   447
region of interest   Fusion peptide   potential   488   508
region of interest   Immunosuppression   by similarity   552   568
region of interest   Involved in GalCer binding   by similarity   638   643
glycosylation site   N-linked (GlcNAc...)   potential   87   87
glycosylation site   N-linked (GlcNAc...)   potential   129   129
glycosylation site   N-linked (GlcNAc...)   potential   132   132
glycosylation site   N-linked (GlcNAc...)   potential   135   135
glycosylation site   N-linked (GlcNAc...)   potential   146   146
glycosylation site   N-linked (GlcNAc...)   potential   150   150
glycosylation site   N-linked (GlcNAc...)   potential   177   177
glycosylation site   N-linked (GlcNAc...)   potential   178   178
glycosylation site   N-linked (GlcNAc...)   potential   188   188
glycosylation site   N-linked (GlcNAc...)   potential   225   225
glycosylation site   N-linked (GlcNAc...)   potential   232   232
glycosylation site   N-linked (GlcNAc...)   potential   253   253
glycosylation site   N-linked (GlcNAc...)   potential   267   267
glycosylation site   N-linked (GlcNAc...)   potential   280   280
glycosylation site   N-linked (GlcNAc...)   potential   286   286
glycosylation site   N-linked (GlcNAc...)   potential   292   292
glycosylation site   N-linked (GlcNAc...)   potential   322   322
glycosylation site   N-linked (GlcNAc...)   potential   329   329
glycosylation site   N-linked (GlcNAc...)   potential   345   345
glycosylation site   N-linked (GlcNAc...)   potential   352   352
glycosylation site   N-linked (GlcNAc...)   potential   382   382
glycosylation site   N-linked (GlcNAc...)   potential   388   388
glycosylation site   N-linked (GlcNAc...)   potential   419   419
glycosylation site   N-linked (GlcNAc...)   potential   425   425
glycosylation site   N-linked (GlcNAc...)   potential   437   437
glycosylation site   N-linked (GlcNAc...)   potential   587   587
glycosylation site   N-linked (GlcNAc...)   potential   592   592
glycosylation site   N-linked (GlcNAc...)   potential   601   601
glycosylation site   N-linked (GlcNAc...)   potential   613   613
site   Cleavage; by host furin   by similarity   487   488
short sequence motif   YXXL motif; contains endocytosis signal   by similarity   688   691
lipid moiety-binding region   S-palmitoyl cysteine; by host   by similarity   740   740
coiled-coil region      potential   518   568
coiled-coil region      potential   609   643
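The Start and End columns above read as 1-based, inclusive residue positions. The sketch below is a minimal illustration, not part of the database, of how such coordinates map onto the sequence. It uses only the first 100 residues of this entry. The helper names `feature_slice` and `is_sequon` are hypothetical. The N-X-S/T rule (X not proline) is the usual consensus for potential N-linked sites and is an assumption about how the "potential" sites were called.

```python
# First 100 residues of the entry's sequence, copied from this record.
SEQ_PREFIX = (
    "MRVRGMQRNWQHLGKWGLLFLGILIICNAADNLWVTVYYGVPVWKEATTT"
    "LFCASDAKAYEREAHNVWATHACVPTDPNPQEVFLKNVTENFDMWKNNMV"
)


def feature_slice(seq, start, end):
    """Return residues start..end, treating both as 1-based and inclusive."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError(f"coordinates {start}..{end} fall outside the sequence")
    return seq[start - 1:end]


def is_sequon(seq, pos):
    """True if the residue at 1-based pos is N in an N-X-[S/T] motif, X != P."""
    tri = seq[pos - 1:pos + 2]
    return len(tri) == 3 and tri[0] == "N" and tri[1] != "P" and tri[2] in "ST"


# Signal peptide feature: Start 1, End 31.
signal_peptide = feature_slice(SEQ_PREFIX, 1, 31)

# Disulfide bond feature 53..73: both endpoints should be cysteines.
disulfide_pair = feature_slice(SEQ_PREFIX, 53, 53) + feature_slice(SEQ_PREFIX, 73, 73)

print(signal_peptide)
print(disulfide_pair, is_sequon(SEQ_PREFIX, 87))
```

With the full 832-residue sequence in place of `SEQ_PREFIX`, the same helpers could be applied to the remaining rows of the table.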
SP Length   31
 ----+----1----+----2----+----3----+----4----+----5
Signal Peptide MRVRGMQRNWQHLGKWGLLFLGILIICNAAD
Sequence MRVRGMQRNWQHLGKWGLLFLGILIICNAADNLWVTVYYGVPVWKEATTT
LFCASDAKAYEREAHNVWATHACVPTDPNPQEVFLKNVTENFDMWKNNMV
EQMHTDIISLWDQSLKPCVKLTPLCVTLNCTNATNNSQEKPGAMQNCSFN
MTTEVRDKKLKLSALFYRLDIVPIGNNNSSEYRLINCNTSTITQACPKVS
WDPIPIHYCAPAGYAILKCNDKRFNGTGPCKNVSTVQCTHGIKPVVSTQL
LLNGSLAEEGIVIRSQNISNNAKTIIVHLNESVQINCTRPNNNTRKGIHL
GPGQTFYATGAIIGDIRKAHCNISGTQWNNTLEYVKAELKSHFPNNTAIK
FNQSSGGDLEITMHSFNCRGEFFYCDTSGLFNDTGSNNGTITLPCRIKQI
VNMWQGVGRAMYTSPIAGNITCNSNITGLLLTRDGGNESNIETFRPEGGN
MKDNWRSELYKYKVVEIEPLGVAPTKAKRQVVQREKRAAGLGALFLGFLG
DSREHMGAASITLTVQARQLLSGIVQQQNNLLRAIEAQQHLLQLTVWGIK
QLQARVLAVERYLKDQQLLGIWGCSGKLICTTNVPWNSSWSNKSQEEIWN
NMTWMEWEKEISNYSNIIYKLIEESQNQQEKNEQELLALDKWASLWNWFD
ISNWLWYIKIFIMIVGGLIGLRIVFAVLSIVNRVRKGYSPLSLQTLIPSP
RGPDRPEGIEEGGGEQGKDRSVRLVTGFLALAWDDLRNLCLFSYRHLRDF
ILIAARIVDRGLRRGWEALKYLGNLTRYWSQELKNSAISLFNTTAIVVAE
GTDRIIEVLQRAGRAVLNIPRRIRQGAERALL
Original MRVRGMQRNWQHLGKWGLLFLGILIICNAADNLWVTVYYGVPVWKEATTT
LFCASDAKAYEREAHNVWATHACVPTDPNPQEVFLKNVTENFDMWKNNMV
EQMHTDIISLWDQSLKPCVKLTPLCVTLNCTNATNNSQEKPGAMQNCSFN
MTTEVRDKKLKLSALFYRLDIVPIGNNNSSEYRLINCNTSTITQACPKVS
WDPIPIHYCAPAGYAILKCNDKRFNGTGPCKNVSTVQCTHGIKPVVSTQL
LLNGSLAEEGIVIRSQNISNNAKTIIVHLNESVQINCTRPNNNTRKGIHL
GPGQTFYATGAIIGDIRKAHCNISGTQWNNTLEYVKAELKSHFPNNTAIK
FNQSSGGDLEITMHSFNCRGEFFYCDTSGLFNDTGSNNGTITLPCRIKQI
VNMWQGVGRAMYTSPIAGNITCNSNITGLLLTRDGGNESNIETFRPEGGN
MKDNWRSELYKYKVVEIEPLGVAPTKAKRQVVQREKRAAGLGALFLGFLG
DSREHMGAASITLTVQARQLLSGIVQQQNNLLRAIEAQQHLLQLTVWGIK
QLQARVLAVERYLKDQQLLGIWGCSGKLICTTNVPWNSSWSNKSQEEIWN
NMTWMEWEKEISNYSNIIYKLIEESQNQQEKNEQELLALDKWASLWNWFD
ISNWLWYIKIFIMIVGGLIGLRIVFAVLSIVNRVRKGYSPLSLQTLIPSP
RGPDRPEGIEEGGGEQGKDRSVRLVTGFLALAWDDLRNLCLFSYRHLRDF
ILIAARIVDRGLRRGWEALKYLGNLTRYWSQELKNSAISLFNTTAIVVAE
GTDRIIEVLQRAGRAVLNIPRRIRQGAERALL
 ----+----1----+----2----+----3----+----4----+----5
Hydropathies  
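The hydropathy plot itself is not reproduced in this text. As a rough sketch of how such a profile could be computed for the signal peptide, the example below averages Kyte-Doolittle hydropathy values over a sliding window. Both the Kyte-Doolittle scale and the 9-residue window are assumptions chosen for illustration; the record does not state which scale or window the database used. The function name `hydropathy_profile` is hypothetical.

```python
# Kyte-Doolittle hydropathy values per residue (assumed scale, see lead-in).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Signal peptide of this entry (residues 1..31), copied from the record.
SIGNAL_PEPTIDE = "MRVRGMQRNWQHLGKWGLLFLGILIICNAAD"


def hydropathy_profile(seq, window=9):
    """Return the mean hydropathy of each full-length window along seq."""
    if window < 1 or window > len(seq):
        raise ValueError("window must be between 1 and the sequence length")
    values = [KYTE_DOOLITTLE[residue] for residue in seq]
    return [
        sum(values[i:i + window]) / window
        for i in range(len(values) - window + 1)
    ]


profile = hydropathy_profile(SIGNAL_PEPTIDE)

# Report the most hydrophobic window with its 1-based start position.
best_start = max(range(len(profile)), key=profile.__getitem__)
print(best_start + 1, SIGNAL_PEPTIDE[best_start:best_start + 9])
```

Under these assumptions the charged N-terminal windows average below zero, while the windows over the leucine- and isoleucine-rich stretch average well above zero. That pattern fits the usual picture of a signal peptide with a positively charged start followed by a hydrophobic core.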
 

© 2007-2017 Dr. Katja Kapp, Kassel & thpr.net e. K., Dresden, Germany, last update 2010-06-11