Signal Peptide Database - Drosophila melanogaster

 Entry Details
ID   9271
Source Database   UniProtKB/Swiss-Prot
UniProtKB/Swiss-Prot Accession Number   Q24292    (Created: 2002-11-15 Updated: 2008-12-16)
UniProtKB/Swiss-Prot Entry Name   DS_DROME
Protein Name   Protein dachsous
Gene   ds
Organism Scientific   Drosophila melanogaster
Organism Common   Fruit fly
Lineage   Eukaryota
  Metazoa
    Arthropoda
      Hexapoda
        Insecta
          Pterygota
            Neoptera
              Endopterygota
                Diptera
                  Brachycera
                    Muscomorpha
                      Ephydroidea
                        Drosophilidae
                          Drosophila
                            Sophophora
Protein Length [aa]   3503
Protein Mass [Da]   379780
Features  
TypeDescriptionStatusStartEnd
signal peptide      potential   1   20
chain   Protein dachsous      21   3503
transmembrane region      potential   3046   3066
topological domain   Extracellular   potential   21   3045
topological domain   Cytoplasmic   potential   3067   3503
domain   Cadherin 1      22   121
domain   Cadherin 2      122   233
domain   Cadherin 3      234   340
domain   Cadherin 4      345   451
domain   Cadherin 5      452   558
domain   Cadherin 6      559   662
domain   Cadherin 7      663   774
domain   Cadherin 8      775   878
domain   Cadherin 9      879   983
domain   Cadherin 10      984   1100
domain   Cadherin 11      1101   1203
domain   Cadherin 12      1205   1312
domain   Cadherin 13      1313   1432
domain   Cadherin 14      1433   1549
domain   Cadherin 15      1556   1666
domain   Cadherin 16      1667   1794
domain   Cadherin 17      1796   1899
domain   Cadherin 18      1900   2004
domain   Cadherin 19      2005   2111
domain   Cadherin 20      2114   2269
domain   Cadherin 21      2270   2375
domain   Cadherin 22      2375   2479
domain   Cadherin 23      2489   2595
domain   Cadherin 24      2596   2699
domain   Cadherin 25      2701   2809
domain   Cadherin 26      2810   2916
domain   Cadherin 27      2919   3028
modified residue   Phosphoserine      3465   3465
modified residue   Phosphoserine      3469   3469
glycosylation site   N-linked (GlcNAc...)   potential   220   220
glycosylation site   N-linked (GlcNAc...)   potential   234   234
glycosylation site   N-linked (GlcNAc...)   potential   245   245
glycosylation site   N-linked (GlcNAc...)   potential   381   381
glycosylation site   N-linked (GlcNAc...)   potential   416   416
glycosylation site   N-linked (GlcNAc...)   potential   564   564
glycosylation site   N-linked (GlcNAc...)   potential   594   594
glycosylation site   N-linked (GlcNAc...)   potential   743   743
glycosylation site   N-linked (GlcNAc...)   potential   966   966
glycosylation site   N-linked (GlcNAc...)   potential   991   991
glycosylation site   N-linked (GlcNAc...)   potential   1006   1006
glycosylation site   N-linked (GlcNAc...)   potential   1029   1029
glycosylation site   N-linked (GlcNAc...)   potential   1143   1143
glycosylation site   N-linked (GlcNAc...)   potential   1236   1236
glycosylation site   N-linked (GlcNAc...)   potential   1453   1453
glycosylation site   N-linked (GlcNAc...)   potential   1479   1479
glycosylation site   N-linked (GlcNAc...)   potential   1524   1524
glycosylation site   N-linked (GlcNAc...)   potential   1553   1553
glycosylation site   N-linked (GlcNAc...)   potential   1700   1700
glycosylation site   N-linked (GlcNAc...)   potential   1884   1884
glycosylation site   N-linked (GlcNAc...)   potential   1940   1940
glycosylation site   N-linked (GlcNAc...)   potential   2115   2115
glycosylation site   N-linked (GlcNAc...)   potential   2211   2211
glycosylation site   N-linked (GlcNAc...)   potential   2212   2212
glycosylation site   N-linked (GlcNAc...)   potential   2421   2421
glycosylation site   N-linked (GlcNAc...)   potential   2511   2511
glycosylation site   N-linked (GlcNAc...)   potential   2520   2520
glycosylation site   N-linked (GlcNAc...)   potential   2547   2547
glycosylation site   N-linked (GlcNAc...)   potential   2588   2588
glycosylation site   N-linked (GlcNAc...)   potential   2678   2678
glycosylation site   N-linked (GlcNAc...)   potential   2845   2845
glycosylation site   N-linked (GlcNAc...)   potential   2967   2967
SP Length   20
 ----+----1----+----2----+----3----+----4----+----5
Signal Peptide MLRSSLLILLAIVLLGSSQA
Sequence MLRSSLLILLAIVLLGSSQAASHDQERERKLEVFEGVAVDYQIGYIGDFG
GIDSGPPYIIVAEAGVETDLAIDRATGEIRTKVKLDRETRASYSLVAIPL
SGRNIRVLVTVKDENDNAPTF
PQTSMHIEFPENTPREVKRTLLPARDLDL
EPYNTQRYNIVSGNVNDAFRLSSHRERDGVLYLDLQISGFLDRETTPGYS
LLIEALDGGTPPLRGFMTV
NITIQDVNDNQPIFNQSRYFATVPENATVGT
SVLQVYASDTDADENGLVEYAINRRQSDKEQMFRIDPRTGAIYINKALDF
ETKELHELVVVAKDHGEQPLETTAFVSIRVTDVNDNQPTI
NVIFLSDDAS
PKISESAQPGEFVARISVHDPDSKTEYANV
NVTLNGGDGHFALTTRDNSI
YLVIVHLPLDREIVS
NYTLSVVATDKGTPPLHASKSIFLRITDVNDNPPE
F
EQDLYHANVMEVADPGTSVLQVLAHDRDEGLNSALTYSLAETPETHAQW
FQIDPQTGLITTRSHIDCETEPVPQLTVVARDGGVPPLSSTATVLVTIHD
VNDNEPIF
DQSFYNVSVAENEPVGRCILKVSASDPDCGVNAMVNYTIGEG
FKHLTEFEVRSASGEICIAGELDFERRSSYEFPVLATDRGGLSTTAMIKM
QLTDVNDNRPVF
YPREYKVSLRESPKASSQASSTPIVAVVATDPDYGNFG
QVSYRIVAGNEAGIFRIDRSTGEIFVVRPDMLSVRTQPMHML
NISATDGG
NLRSNADAVVFLSIIDAMQRPPIF
EKARYNYYVKEDIPRGTVVGSVIAAS
GDVAHRSPVRYSIYSGDPDGYFSIETNSGNIRIAKPLDHEAKSQVLLNIQ
ATLGEPPVYGHTQVNIEVEDVNDNAPEF
EASMVRISVPESAELGAPLYAA
HAHDKDSGSSGQVTYSLVKESGKGLFAIDARSGHLILSQHLDYESSQRHT
LIVTATDGGVPSLST
NLTILVDVQDVNDNPPVFEKDEYSVNVSESRSINA
QIIQV
NASDLDTGNNARITYRIVDAGVDNVTNSISSSDVSQHFGIFPNSG
WIYLRAPLDRETRDRYQLTVLATDNGTPAAHAKTRVIVRVLDANDNDPKF
QKSKYEFRIEENLRRGSVVGVVTASDLDLGENAAIRYSLLPINSSFQVHP
VTGEISTREPLDRELRELYDLVVEARDQGTPVRSARVPVRIHVSDVNDNA
PEI
ADPQEDVVSVREEQPPGTEVVRVRAVDRDHGQNASITYSIVKGRDSD
GHGLFSIDPTSGVIRTRVVLDHEERSIYRLGVAASDGGNPPRETVRMLRV
EVLDLNDNRPTF
TSSSLVFRVREDAALGHVVGSISPIERPADVVRNSVEE
SFEDLRVTYTLNPLTKDLIEAAFDIDRHSGNLVVARLLDREVQSEFRLEI
RALDTTASNNPQSSAITVKIEVADVNDNAPEW
PQDPIDLQVSEATPVGTI
IH
NFTATDADTGTNGDLQYRLIRYFPQLNESQEQAMSLFRMDSLTGALSL
QAPLDFEAVQEYLLIVQALDQSS
NVTERLQTSVTVRLRILDANDHAPHFV
SP
NSSGGKTASLFISDATRIGEVVAHIVAVDEDSGDNGQLTYEITGGNGE
GRFRINSQTGIIELVKSLPPATEDVEKGGRFNLIIGAKDHGQPEPKKSSL
NLHLIVQGSHNNPPRF
LQAVYRATILENVPSGSFVLQVTAKSLHGAENAN
LSYEIPAGVANDLFHVDWQRGIITTRGQFDRESQASYVLPVYVRDANRQS
TLSSSAVRKQRSSDSIGDTSNGQHFDVATIYITVGDVNDNSPEF
RPGSCY
GLSVPENSEPGVIHTVVASDLDEGPNADLIYSITGGNLGNKFSIDSSSGE
LSARPLDREQHSRYTLQIQASDRGQPKSRQGHC
NITIFVEDQNDNAPRFK
LSKYTGSVQEDAPLGTSVVQISAVDADLGVNARLVYSLA
NETQWQFAIDG
QSGLITTVGKLDRELQASYNFMVLATDGGRYEVRSATVPVQINVLDINDN
RPIF
ERYPYIGQVPALIQPGQTLLKVQALDADLGANAEIVYSLNAENSAV
SAKFRINPSTGALSASQSLASESGKLLHLEVVARDKGNPPQSSLGLIELL
IGEAPQGTPVL
RFQNETYRVMLKENSPSGTRLLQVVALRSDGRRQKVQFS
FGAGNEDGILSLDSLSGEIRVNKPHLLDYDRFSTPSMSALSRGRALHYEE
EIDESSEEDP
NNSTRSQRALTSSSFALTNSQPNEIRVVLVARTADAPFLA
SYAELVIELEDENDNSPKF
SQKQFVATVSEGNNKGTFVAQVHAFDSDAGS
NARLRYHIVDGNHDNAFVIEPAFSGIVRTNIVLDREIRDIYKLKIIATDE
GVPQMTGTATIRVQIVDVNDNQPT
FPPNNLVTVSEATELGAVITSISAND
VDTYPALTYRLGAESTVDIE
NMSIFALDRYSGKLVLKRRLDYELQQEYEL
DVIASDAAHEARTVLTVRVNDENDNAPVF
LAQQPPAYFAILPAISEISES
LSVDFDLLTV
NATDADSEGNNSKVIYIIEPAQEGFSVHPSNGVVSVNMSR
LQPAVSSSGDYFVRIIAKDAGKPALKSSTLLRVQAND
NGSGRSQFLQNQY
RAQISEAAPLGSVVLQLGQDALDQSLAIIAGNEESAFELLQSKAIVLVKP
LDRERNDLYKLRLVLSHPHGPPLISSL
NSSSGISVIITILDANDNFPIFD
RSAKYEAEISELAPLRYSIAQLQAIDADQENTPNSEVVYDITSGNDEHMF
TIDLVTGVLFVNNRLDYDSGAKSYELIIRACDSHHQRPLCSLQPFRLELH
DENDNEPKF
PLTEYVHFLAENEPVGSSVFRAHASDLDKGPFGQLNYSIGP
APSDESSWKMFRVDSESGLVTSAFVFDYEQRQRYDMELLASDMGGKKASV
AVRVEIESRDEFTPQF
TERTYRFVLPAAVALPQGYVVGQVTATDSDSGPD
GRVVYQLSAPHSHFKV
NRSSGAVLIKRKLKLDGDGDGNLYMDGRDISLVI
SASSGRHNSLSSMAVVEIALDPLAHPGT
NLASAGGSSSGSIGDWAIGLLV
AFLLVLCAAAGIFLFI
HMRSRKPRNAVKPHLATDNAGVGNTNSYVDPSAF
DTIPIRGSISGGAAGAASGQFAPPKYDEIPPFGAHAGSSGAATTSELSGS
EQSGSSGRGSAEDDGEDEEIRMINEGPLHHRNGGAGAGSDDGRISDISVQ
NTQEYLARLGIVDHDPSGAGGGASSMAGSSHPMHLYHDDDATARSDITNL
IYAKLNDVTGAGSEIGSSADDAGTTAGSIGTIGTAITHGHGVMSSYGEVP
VPVPVVVGGSNVGGSLSSIVHSEEELTGSYNWDYLLDWGPQYQPLAHVFS
EIARLKDDTLSEHSGSGASSSAKSKHSSSHSSAGAGSVVLKPPPSAPPTH
IPPPLLTNVAPRAINLPMRLPPHLSLAPAHLPRSPIGHEASGSFSTSSAM
SPSFSPSLSPLATR
SPSISPLGAGPPTHLPHVSLPRHGHAPQPSQRGNVG
TRM
Original MLRSSLLILLAIVLLGSSQAASHDQERERKLEVFEGVAVDYQIGYIGDFG
GIDSGPPYIIVAEAGVETDLAIDRATGEIRTKVKLDRETRASYSLVAIPL
SGRNIRVLVTVKDENDNAPTFPQTSMHIEFPENTPREVKRTLLPARDLDL
EPYNTQRYNIVSGNVNDAFRLSSHRERDGVLYLDLQISGFLDRETTPGYS
LLIEALDGGTPPLRGFMTVNITIQDVNDNQPIFNQSRYFATVPENATVGT
SVLQVYASDTDADENGLVEYAINRRQSDKEQMFRIDPRTGAIYINKALDF
ETKELHELVVVAKDHGEQPLETTAFVSIRVTDVNDNQPTINVIFLSDDAS
PKISESAQPGEFVARISVHDPDSKTEYANVNVTLNGGDGHFALTTRDNSI
YLVIVHLPLDREIVSNYTLSVVATDKGTPPLHASKSIFLRITDVNDNPPE
FEQDLYHANVMEVADPGTSVLQVLAHDRDEGLNSALTYSLAETPETHAQW
FQIDPQTGLITTRSHIDCETEPVPQLTVVARDGGVPPLSSTATVLVTIHD
VNDNEPIFDQSFYNVSVAENEPVGRCILKVSASDPDCGVNAMVNYTIGEG
FKHLTEFEVRSASGEICIAGELDFERRSSYEFPVLATDRGGLSTTAMIKM
QLTDVNDNRPVFYPREYKVSLRESPKASSQASSTPIVAVVATDPDYGNFG
QVSYRIVAGNEAGIFRIDRSTGEIFVVRPDMLSVRTQPMHMLNISATDGG
NLRSNADAVVFLSIIDAMQRPPIFEKARYNYYVKEDIPRGTVVGSVIAAS
GDVAHRSPVRYSIYSGDPDGYFSIETNSGNIRIAKPLDHEAKSQVLLNIQ
ATLGEPPVYGHTQVNIEVEDVNDNAPEFEASMVRISVPESAELGAPLYAA
HAHDKDSGSSGQVTYSLVKESGKGLFAIDARSGHLILSQHLDYESSQRHT
LIVTATDGGVPSLSTNLTILVDVQDVNDNPPVFEKDEYSVNVSESRSINA
QIIQVNASDLDTGNNARITYRIVDAGVDNVTNSISSSDVSQHFGIFPNSG
WIYLRAPLDRETRDRYQLTVLATDNGTPAAHAKTRVIVRVLDANDNDPKF
QKSKYEFRIEENLRRGSVVGVVTASDLDLGENAAIRYSLLPINSSFQVHP
VTGEISTREPLDRELRELYDLVVEARDQGTPVRSARVPVRIHVSDVNDNA
PEIADPQEDVVSVREEQPPGTEVVRVRAVDRDHGQNASITYSIVKGRDSD
GHGLFSIDPTSGVIRTRVVLDHEERSIYRLGVAASDGGNPPRETVRMLRV
EVLDLNDNRPTFTSSSLVFRVREDAALGHVVGSISPIERPADVVRNSVEE
SFEDLRVTYTLNPLTKDLIEAAFDIDRHSGNLVVARLLDREVQSEFRLEI
RALDTTASNNPQSSAITVKIEVADVNDNAPEWPQDPIDLQVSEATPVGTI
IHNFTATDADTGTNGDLQYRLIRYFPQLNESQEQAMSLFRMDSLTGALSL
QAPLDFEAVQEYLLIVQALDQSSNVTERLQTSVTVRLRILDANDHAPHFV
SPNSSGGKTASLFISDATRIGEVVAHIVAVDEDSGDNGQLTYEITGGNGE
GRFRINSQTGIIELVKSLPPATEDVEKGGRFNLIIGAKDHGQPEPKKSSL
NLHLIVQGSHNNPPRFLQAVYRATILENVPSGSFVLQVTAKSLHGAENAN
LSYEIPAGVANDLFHVDWQRGIITTRGQFDRESQASYVLPVYVRDANRQS
TLSSSAVRKQRSSDSIGDTSNGQHFDVATIYITVGDVNDNSPEFRPGSCY
GLSVPENSEPGVIHTVVASDLDEGPNADLIYSITGGNLGNKFSIDSSSGE
LSARPLDREQHSRYTLQIQASDRGQPKSRQGHCNITIFVEDQNDNAPRFK
LSKYTGSVQEDAPLGTSVVQISAVDADLGVNARLVYSLANETQWQFAIDG
QSGLITTVGKLDRELQASYNFMVLATDGGRYEVRSATVPVQINVLDINDN
RPIFERYPYIGQVPALIQPGQTLLKVQALDADLGANAEIVYSLNAENSAV
SAKFRINPSTGALSASQSLASESGKLLHLEVVARDKGNPPQSSLGLIELL
IGEAPQGTPVLRFQNETYRVMLKENSPSGTRLLQVVALRSDGRRQKVQFS
FGAGNEDGILSLDSLSGEIRVNKPHLLDYDRFSTPSMSALSRGRALHYEE
EIDESSEEDPNNSTRSQRALTSSSFALTNSQPNEIRVVLVARTADAPFLA
SYAELVIELEDENDNSPKFSQKQFVATVSEGNNKGTFVAQVHAFDSDAGS
NARLRYHIVDGNHDNAFVIEPAFSGIVRTNIVLDREIRDIYKLKIIATDE
GVPQMTGTATIRVQIVDVNDNQPTFPPNNLVTVSEATELGAVITSISAND
VDTYPALTYRLGAESTVDIENMSIFALDRYSGKLVLKRRLDYELQQEYEL
DVIASDAAHEARTVLTVRVNDENDNAPVFLAQQPPAYFAILPAISEISES
LSVDFDLLTVNATDADSEGNNSKVIYIIEPAQEGFSVHPSNGVVSVNMSR
LQPAVSSSGDYFVRIIAKDAGKPALKSSTLLRVQANDNGSGRSQFLQNQY
RAQISEAAPLGSVVLQLGQDALDQSLAIIAGNEESAFELLQSKAIVLVKP
LDRERNDLYKLRLVLSHPHGPPLISSLNSSSGISVIITILDANDNFPIFD
RSAKYEAEISELAPLRYSIAQLQAIDADQENTPNSEVVYDITSGNDEHMF
TIDLVTGVLFVNNRLDYDSGAKSYELIIRACDSHHQRPLCSLQPFRLELH
DENDNEPKFPLTEYVHFLAENEPVGSSVFRAHASDLDKGPFGQLNYSIGP
APSDESSWKMFRVDSESGLVTSAFVFDYEQRQRYDMELLASDMGGKKASV
AVRVEIESRDEFTPQFTERTYRFVLPAAVALPQGYVVGQVTATDSDSGPD
GRVVYQLSAPHSHFKVNRSSGAVLIKRKLKLDGDGDGNLYMDGRDISLVI
SASSGRHNSLSSMAVVEIALDPLAHPGTNLASAGGSSSGSIGDWAIGLLV
AFLLVLCAAAGIFLFIHMRSRKPRNAVKPHLATDNAGVGNTNSYVDPSAF
DTIPIRGSISGGAAGAASGQFAPPKYDEIPPFGAHAGSSGAATTSELSGS
EQSGSSGRGSAEDDGEDEEIRMINEGPLHHRNGGAGAGSDDGRISDISVQ
NTQEYLARLGIVDHDPSGAGGGASSMAGSSHPMHLYHDDDATARSDITNL
IYAKLNDVTGAGSEIGSSADDAGTTAGSIGTIGTAITHGHGVMSSYGEVP
VPVPVVVGGSNVGGSLSSIVHSEEELTGSYNWDYLLDWGPQYQPLAHVFS
EIARLKDDTLSEHSGSGASSSAKSKHSSSHSSAGAGSVVLKPPPSAPPTH
IPPPLLTNVAPRAINLPMRLPPHLSLAPAHLPRSPIGHEASGSFSTSSAM
SPSFSPSLSPLATRSPSISPLGAGPPTHLPHVSLPRHGHAPQPSQRGNVG
TRM
 ----+----1----+----2----+----3----+----4----+----5
Hydropathies  
 

© 2007-2017 Dr. Katja Kapp, Kassel & thpr.net e. K., Dresden, Germany, last update 2010-06-11