Signal Peptide Database - Drosophila melanogaster

 Entry Details
ID   25844
Source Database   UniProtKB/Swiss-Prot
UniProtKB/Swiss-Prot Accession Number   P98159    (Created: 1996-10-01 Updated: 2008-12-16)
UniProtKB/Swiss-Prot Entry Name   NUDEL_DROME
Protein Name   Serine protease nudel
Gene   ndl
Organism Scientific   Drosophila melanogaster
Organism Common   Fruit fly
Lineage   Eukaryota
  Metazoa
    Arthropoda
      Hexapoda
        Insecta
          Pterygota
            Neoptera
              Endopterygota
                Diptera
                  Brachycera
                    Muscomorpha
                      Ephydroidea
                        Drosophilidae
                          Drosophila
                            Sophophora
Protein Length [aa]   2616
Protein Mass [Da]   292492
Features  
Type   Description   Status   Start   End
signal peptide      potential   1   43
chain   Serine protease nudel      44   2616
disulfide bond      by similarity   891   905
disulfide bond      by similarity   899   918
disulfide bond      by similarity   912   927
disulfide bond      by similarity   957   982
disulfide bond      by similarity   964   995
disulfide bond      by similarity   989   1004
disulfide bond      by similarity   1170   1186
disulfide bond      potential   1276   1338
disulfide bond      by similarity   1305   1317
disulfide bond      by similarity   1328   1359
disulfide bond      by similarity   1396   1408
disulfide bond      by similarity   1401   1421
disulfide bond      by similarity   1415   1430
disulfide bond      by similarity   1728   1745
disulfide bond      by similarity   1734   1764
disulfide bond      by similarity   1758   1773
disulfide bond      by similarity   1776   1789
disulfide bond      by similarity   1783   1802
disulfide bond      by similarity   1796   1811
disulfide bond      by similarity   2055   2071
disulfide bond      by similarity   2177   2230
disulfide bond      by similarity   2310   2320
disulfide bond      by similarity   2315   2333
disulfide bond      by similarity   2327   2344
disulfide bond      by similarity   2351   2364
disulfide bond      by similarity   2358   2377
disulfide bond      by similarity   2371   2387
disulfide bond      by similarity   2421   2435
disulfide bond      by similarity   2428   2448
disulfide bond      by similarity   2442   2457
domain   LDL-receptor class A 1      889   929
domain   LDL-receptor class A 2; truncated      929   956
domain   LDL-receptor class A 3      955   1006
domain   Peptidase S1 1      1145   1383
domain   LDL-receptor class A 4      1394   1432
domain   LDL-receptor class A 5; truncated      1713   1743
domain   LDL-receptor class A 6; truncated      1745   1775
domain   LDL-receptor class A 7      1774   1813
domain   Peptidase S1 2      2027   2301
domain   LDL-receptor class A 8      2308   2346
domain   LDL-receptor class A 9      2349   2389
domain   LDL-receptor class A 10; truncated      2387   2419
domain   LDL-receptor class A 11      2419   2459
modified residue   Phosphoserine      215   215
modified residue   Phosphoserine      220   220
modified residue   Phosphoserine      574   574
modified residue   Phosphoserine      581   581
modified residue   Phosphoserine      1134   1134
modified residue   Phosphoserine      1136   1136
glycosylation site   N-linked (GlcNAc...)   potential   291   291
glycosylation site   N-linked (GlcNAc...)   potential   347   347
glycosylation site   N-linked (GlcNAc...)   potential   379   379
glycosylation site   N-linked (GlcNAc...)   potential   417   417
glycosylation site   N-linked (GlcNAc...)   potential   492   492
glycosylation site   N-linked (GlcNAc...)   potential   515   515
glycosylation site   N-linked (GlcNAc...)   potential   598   598
glycosylation site   O-linked (Xyl...) (glycosaminoglycan)   potential   794   794
glycosylation site   N-linked (GlcNAc...)   potential   827   827
glycosylation site   O-linked (Xyl...) (glycosaminoglycan)   potential   829   829
glycosylation site   N-linked (GlcNAc...)   potential   861   861
glycosylation site   N-linked (GlcNAc...)   potential   975   975
glycosylation site   N-linked (GlcNAc...)   potential   1064   1064
glycosylation site   N-linked (GlcNAc...)   potential   1445   1445
glycosylation site   N-linked (GlcNAc...)   potential   1878   1878
glycosylation site   N-linked (GlcNAc...)   potential   1956   1956
glycosylation site   N-linked (GlcNAc...)   potential   2023   2023
glycosylation site   N-linked (GlcNAc...)   potential   2144   2144
glycosylation site   N-linked (GlcNAc...)   potential   2173   2173
glycosylation site   N-linked (GlcNAc...)   potential   2197   2197
glycosylation site   N-linked (GlcNAc...)   potential   2237   2237
glycosylation site   N-linked (GlcNAc...)   potential   2269   2269
glycosylation site   N-linked (GlcNAc...)   potential   2420   2420
glycosylation site   N-linked (GlcNAc...)   potential   2556   2556
glycosylation site   N-linked (GlcNAc...)   potential   2601   2601
repeat   WIID 1      261   269
repeat   WIID 2      320   328
repeat   WIID 3      399   407
repeat   WIID 4      446   454
repeat   WIID 5      477   485
repeat   WIID 6      528   536
active site   Charge relay system   by similarity   1185   1185
active site   Charge relay system   by similarity   1233   1233
active site   Charge relay system   by similarity   1332   1332
compositionally biased region   Ser/Thr-rich      1489   1702
short sequence motif   Cell attachment site   potential   1031   1033
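
The Start and End columns above are 1-based, inclusive residue positions, so a feature's length is End - Start + 1. The following is a minimal sketch, not part of the database entry, that applies this convention to a few rows copied from the table and checks which Peptidase S1 domain contains the three charge relay system residues. The helper names `feature_length` and `containing_domain` are hypothetical.

```python
# Coordinates copied from the feature table (1-based, inclusive).
domains = {
    "Peptidase S1 1": (1145, 1383),
    "Peptidase S1 2": (2027, 2301),
}
active_sites = [1185, 1233, 1332]  # "Charge relay system" rows


def feature_length(start, end):
    """Number of residues in a 1-based, inclusive range."""
    return end - start + 1


def containing_domain(pos):
    """Return the name of the domain that covers pos, or None."""
    for name, (start, end) in domains.items():
        if start <= pos <= end:
            return name
    return None


# Map each active-site residue to the domain that contains it.
site_domains = {pos: containing_domain(pos) for pos in active_sites}

print(feature_length(1, 43))      # signal peptide
print(feature_length(44, 2616))   # mature chain
print(site_domains)
```

With the values in this entry, all three active-site positions fall inside the first Peptidase S1 domain (1145 to 1383).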
SP Length   43
 ----+----1----+----2----+----3----+----4----+----5
Signal Peptide MNYNMDEMEATRLLRHPRRWWSIGFGKRIVAISILVIIVLLFS
Original MNYNMDEMEATRLLRHPRRWWSIGFGKRIVAISILVIIVLLFSLVYHGLV
VEKIDQVQQIAALNARHQVLFNQPFEEDQSALIVSPQTLHFKLLDEDMNK
DMEDSKNRRRKHMRQMLVKFRLNKKHRMRRDLHGLDLLDPVRMEANMQHL
YTKLRSKRAREALSQLEHEFVRCKKHTPQDCMSAFLRMYKMAKEVTEKME
KMKAIMREQQPKLESSSMESHEQKGTFSPADLIQVTTAEATTVAVHATEK
PARTKIKPSRISWIIDGHDHDESPVYTDGAPKKETTKAPWNTTQLVEITT
TKIDATATERTTVESTTEKISWILDHFDKPQEILRTTEGPGQRIIRNVTT
TSASSEPIVDTENTNSDHVPTTENGLVFNITTDGPVETTKSTAQRKLSFD
WILDGEENVEPEVKSTNTTTTTAATTTTGATSETIIVTTELPKITFDWII
DGREVVEPQETTTEVTGTTERLRKMPFDWIIDGEEVVEPQENVTTTTIAT
TVAVSTTEINERIHNSTAYPTKPKPVKFDWIIDGGESSGEVSTSSTSQPK
LTTREAISNPESPRSSHPLDNPTSIENMLESFEQHEEQKPILRVLNANES
SSETVTDGYERQLWLKKFEDQARPNQNELIDTFGTALDAKALDKMGPKIN
PLNGHTWNAADAQILSLCERVALRMRNKVATMSDGETKEKGETFTASPSV
QFTSRAPGGFPVSGETMKASAQFMFNPNFGMPSIPVCFYMTPANFRMPMW
SNTPTFMGMQGAHFGGSSNPGAGIFFVPQQFGPSGNFFGGSGGSGAGGQG
ANIFSKNASPQKPTNGQQQVYCSYMQNQSGQGAGQSQTSSQQQQGGQSAF
SNANFKMRHANQTNTANQQGQIIYASYAGLPQQPIQERSRCPEPDQFSCF
GQQECIPAARWCDNVVDCSDGSDESACTCADRVDEERLCDGYEDCPMGED
ELGCFGCESLAYSCYENPQDFAKRNRSTISMCYSRLERCDGFMNCLNGRD
EEQCSMLVTDVADHMSHGASASEGYLYHNYRGDWHPVCNNGEKWAALACQ
MDENSRMDHSASLNVSFQTLTLPGPFIEPSLHAGVHFAQACHGRNSHDSL
VDHVAYVKCPPMQCGLPSKSSMLEHSKRVRRAVSDSKEIVGDGRIVGGSY
TSALQWPFVVAIYRNGKFHCGGTIYSDRWIISAAHCVINYGKYFYEVRAG
LLRRSSYSPATQIQPVSHVVVHQAYERRSMRNDLSLLRLLNPLQFNRWVK
PICLPDKGRTTVGDDWIWGPVEHTLCTVVGWGAIREKGPSSDPMRQVIVP
IRKKCTDPEDQASEDICAGDPDGGRDACQGDSGGPLFCRSVSNPDEFYLA
GVVSHGNGCARPQEFGVYTRVTLYLDWLEMATTPRLLPKLQPLQLCPGFI
CVWGGKRCIAKRQRCDRNVDCLGGEDEVGCTYNFLPDMVGGVRQNISTTT
ESDYHPVKESEEKSKMREVIPIDDEDLKAEQDEEDLLKSTTSLGQTETTQ
GPMDLSFAEQITSTTSDDLSITDETTSTDFTVSDSATSPSTLLPTTTNPS
TWLPSTNIETSTFSFTTTESEASTKQETLPTTVAQTTTIPTSTEDLKKLT
DLVTEFIESTTFETTMEVETTTLSLTSTDAPKLVTTEGVKETTTTEDTTT
ISSIVTLTTTPLATISTTILTTEKHVAVTTLAPTTTTESAKTTTTHSSST
HSEKDQIQIPNKFVCKKMSQIVDIMMRCDRKVDCEDGTDELDCTCKDYLK
GSLKGLICDGKADCEDLTDEQNCVECQSNEFRCPLSKTCLPLSSRCDNKV
DCKFKEDEKDCFALTNGHDVHFDVHQQPKFSSTGIFSRNGHGVWRVVCAH
ETGYHEHQAKTADAVCALLGFNGAHYFNSSEFVTQHEMQPITPELKGGRN
RMSAQIHSMVGDNVQFTENEVIIPELGHPSASRPEKDRLLPRKCVGIYVE
CNPYSNKTTPLKTFSAGQVVKEKPIEQVPVLSPTIETHNTPNVHFKPQIP
AMVVNKKDEILDRLDKLIKSKKNKTILVNEQLHEAIEELHWPWLADVYMN
GDLWCIGVLIDKHWVMVHESCLSGIDLETHYVSVLLGGGKTKRSAHRSNH
EQIRRVDCFEGVPKSNVLLLHLERPVRFTHHVLPTFLPDSSHQNQSHARQ
CISVLHDDATGRIKTVAITRIHNATNCDSCYKLQEKQPPANLMRLLNVSA
EDMASISEEVELINGVAPTELPAITKFTTCNQFGLKNVSDAHHNPSDQGV
LVCRDSHTGWFPTALFNYNNSDCQSFKQPFGIRTLELVYKSLQDIIDKPS
CKMLLPAPDCSTHRCPLGTCLPQAAMCNGRSDCHDGSDEEETKCRQQKQQ
CAPGEMKCRTSFKCVPKSKFCDHVPDCEDMTDEPTICSCFTYLQATDPSK
ICDGKRNCWDKSDESSVLCNCTADHFQCSSSPEDCIPRDFVCDKEKDCPN
GEDERYCFGIEHPLHLQKKDFWTNSQHTQPEIAPQYGQVIEQTYGIWHTK
CFPKSKPPQVDEVREICKKLGYNPYRQPSYRLIDDEENKPVHTYELADRQ
GRSFSNESLMGKYRDSTKALIISKFSPLQLNERLTLFLKSSRPIAELVRW
NATDSSMCYRLEIRCA
 ----+----1----+----2----+----3----+----4----+----5
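
As an illustration of how the signal peptide coordinates (1 to 43) relate to the sequence listing above, here is a small sketch that slices the first 50-residue line of the sequence. It assumes the 1-based, inclusive convention of the feature table. The function name `slice_feature` is hypothetical and not part of any database tooling.

```python
# First 50 residues of the entry, copied from the sequence listing.
first_line = "MNYNMDEMEATRLLRHPRRWWSIGFGKRIVAISILVIIVLLFSLVYHGLV"

SP_START, SP_END = 1, 43  # signal peptide row of the feature table


def slice_feature(seq, start, end):
    """Extract a 1-based, inclusive range from a 0-based Python string."""
    return seq[start - 1:end]


signal_peptide = slice_feature(first_line, SP_START, SP_END)

# Residues after the signal peptide; the chain feature starts at 44.
mature_prefix = first_line[SP_END:]

print(signal_peptide, len(signal_peptide))
print(mature_prefix)
```

The extracted peptide matches the "Signal Peptide" line of the entry, and its length agrees with the stated SP Length of 43.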

© 2007-2017 Dr. Katja Kapp, Kassel & thpr.net e. K., Dresden, Germany, last update 2010-06-11