SGID Silkworm Genome Informatics Database
Gene
KWMTBOMO05444  Validated by peptides from experiments
Pre Gene Modal
BGIBMGA006694
Annotation
Hemocytin_[Papilio_machaon]
Full name
Hemocytin      
Alternative Name
Humoral lectin
Location in the cell
Extracellular   Reliability : 3.284
 

Sequence

CDS
ATGGCGGCCGAGTGCCTCGCGGCAGACGCCGGCGTCGACCTCGCCTCCTGGAGGCTCATGATGGACTGTCCGGCGGACTGTCCCCCGCCGCTGGTGCACTACGACTGCTACCGCAAGCGATGCGAGGAGACGTGCGCGCCCTACCCGAACGCCGCGCGCGCCTGCCCCGCGCAGGAGGGACAGTGCTCGCCGGGCTGCTACTGCCCCGACGGGAAGCTTCGGAAGGGAGACCAGTGCGTGCTGCCAGCGGACTGCTTGGACTGCACGTGTACAGGAGTCGGTACACCAGCTAAGTACACTACCTTTGAGGGCGACGACCTTCCCTTCTTGGGCAACTGTACCTACCTCGCCTCCAGAGACAGAAACCAAACTGGAGAACATAAATATCAGGTGTACGCTACGAACGGCCCGTGCGATGATAACGCTAATATTGTGTGCACTAAGATTGTGCATCTCATATACGAGAAGAATGTTATCCATATTAGCAAGGATCCCACAACCAAAAAGCTTCGTACAGTGATAGGGAAGACCGCGGTGTTCAAGTATCCGGTTAAGGAGAACTGGGGCACCATCAGCTTGCTCAACGGACAGGATGTCTCCGTCACCCTGCCGGATATACACGTGGAGCTGACAGTGTCGCAACTGAACCTGGAATTCGCCGTCCGCGTGCCGACCTTCCTGTACGGGAACCGAACTGAGGGTCTGTGCGGAGTGTGCGCCGGCTACCAGGACTTCCTCGTCACCAGCAACGGAACTGTCACCGATGACTTCGACTTGTACGGCAAGAGCTGGCAAGCGTCACCCGAGAAGCTGACGGAGCTGGAGGTGCCGAGCGACGAGCAGTGTGACGCGCCGCCCCCGCCCGCCCCCTGCACGCCGCCCCCGCCCGACAACAACACCTGCTACCACCTGTACAACGCTGACAGATTCGGAGCCTGCCACGCGCTGGTGGAGCCGCAGCCGTACGTGGAGTCGTGCGAGGCGGACGAGTGCGGCGGGCGCGGTCCGTGCGACGCGCTGCAGCGATACGCGGCCGCTTGTGCCGAGCTCGGCCTATGTCTGCCGGACTGGCGGCGGGAACTCTGCCCCTATCCTTGTGAGGAACCGTTCGTGTACCGAGCGTGCGTGGACTGCGAGAGGACGTGCGACAACTACGAGCAGTTGCAGACCAGCCCGGAGAAGTGCACCAACAAACCCGTCGAAGGATGTTTCTGTCCCGAAGGAAAGGTGCGCGTGAACAGCACGTGCATAGAGCCCGGCAAGTGCTTCCCGTGCGGCGTCGACGGACACTACGCGGGCGACGAGTGGCAGGAGGACGCGTGCACGTTGTGCGCGTGCGCCCGCAGCCCGGACGGCACGGCGCTGGTCGGCTGCCGCGCCACCAGCTGCGCGCCGCCCGTGTGCGCGCACGGTGAGGACCTGCGCACCGCGCCGCCGCCGCCCGGACAATGCTGCCCGGAATACGACTGTGTCGCGAAACCCGAGGCACAGTGTAAAGAGACCAAGAAGATTGTTTGCGATTACGGTCAAGTGCTGAAACAGAAGACAAACCCTAGCGGCTGCAAGGAATACTTCTGCGAATGCAAGCCGTCCAGTGAATGCGAAGTGATTCCTCCGGAGAGTGAGGTGGAGATAGTCGAGGCTGGTATCCATCGCGAGATCGACAACTCTGGATGCTGTCCGCGCGTGTCGCTCGTGTGTCGCCCCGAAACATGTCCAAAGCCACCGCACTGTCCCCAGTTCCAAACCCTCGCCTCTGTCAATATCACCGGCAAGTGCTGTCCGGAATACAAGTGCGAACTACCTAAAGACAAGTGCATCGTCACTCTAGAATGGGAGGCGGCTGCTAAAGGCGGCGAGAAGCCACGAGAAAAACCACAAACTGTGCTAAAAGATTTGGAAGCCGTCTGGCTGGACGGTCCGTGCCGGTCGTGCGAGTGCGCGCTGTCCGGCGCGGGTCCGGCGGCGACGTGCGCGGTGTCGGCGTGCCCGGCGGTGGTCAGCTCGGAGCTGTTCGTGCTGGAGCCGCGCCCCGTGCCCTTCGCGTGCTGCCCGGAGCCCGTGCAGGTCGCGTGCAGACACCAGGATAATGTCTACAAGGTCGGCGAGAAGTGGAAGTCCCCCACGGACGTGTGCGAGACGTACGAGTGCGCGGCGGACGGCGACGGCAAGCTGCAGCGGCTCGCGGCCGTGCAGCGCTGCGACCAGCACTGCCAGCCCGGTTGGAAATACGTCCCGGCTGAAGCCGATAGCGGTCAGTGCTGTGGGAAGTGCGAGCCCGTGGCCTGCGTGGTGGATGGAGAGGAGAAGCCAATAGGCGAGAAGTGGACGTCCTCGGACTTCTGCACGAACTTCACCTGCGTGAACCTCAATGGCACCCTCCAAGTCCAAAGCTCGAATGAGACTTGCCCGGAGATATCGGATGCTGAACGGAAGCAGTTCGTGCTGAAGGAGCAGAAGGTCCCAGGGAAGTGCTGCCCCAAGATCGAACGTGAGGCGTGTCGAGTGGGAGATCAGATATACCAGGTCGGCGAGAACTGGACGTCGACCGAGAACTTGTGTGAGAGCTACCGATGCGTGCGGGACGGGGGCGGCGGCCTGCGCTCCGTGGCCTCCGAGCAGCGCTGCGAGACCGACTGCCAACCCGGCTGGAAGTACTTCCCGGCGCCGGCGGAGTGCTGCGGCCGCTGCAAGCCCGTCGCCTGCGTCGTGGAAGGGCGGGAGAGGCCCGTCGGGGAGAGCTGGACGTCTGCCGACTTCTGCACCAACTACACCTGCGCCGACCTGCACGGGACCCTACAAGTGCAAAGCTCAAACGAGACTTGTCCTGAAGTGTCGGAGGCTGTGAAGAAACAGTTCGTGCTCAAAGAGGAGAAAATCCCGGGCAAGTGCTGCCCCAAGGTCGAGCCGGTCGCGTGTCGAGATGGCGACAAAATATATCAGGAGGGTCAGGTGTGGACCACACCGGACCCGTGCACCAACCGGACGTGCAGGCGGGAGGACGGGCAGCTGTCCGTGGGCCGCACGGTGGAGCACTGCGAGCGACAGTGCCGGCGCGGCTGGACGTACTCACCGCCCGCCGCCGACCGGTGCTGCGGGCGCTGCGTGCAGTCTGCCTGCCTCGTCGACGACCAGCTCAAGGAACCCGGCTCCACGTGGTCCTCGGCCGACAACTGCACCACATTCAGCTGCGACCGAAGCGGGGAGGAGGTGTTCGTGACGTCAGCCACCGAGCACTGCCCCGACGTGTCGGCCTGCGACCCGGCCGACATTGTCAATACCACCTGCTGTCAGATCTGCAACGAGAAACCGCAAGCCCTCAGTAAATGCGTCCCTAAGAGCATCCCGAGCTCAGAGACGGTAGGACTGATCCGCATCCCGATGGGAGCCCACGGCCTGTGCGTCAACAAGTTCCCCATCACCGGCTTCACGGAGTGCCACGGCTCCTGCGACTCCGGAACTATCTATAACAACCAGACCGGTACGCACGAGTCTGCGTGCGAGTGTTGTCAGGCCGCGAAGTACAGCGGCGTGTCCGTGCGACTCACGTGCGAAGACGGGACGGTCCGACCGCACCGGGTCGCCACGCCGGCGCGCTGCCACTGTGCCGCCTGCGGACCGGGACTCACTAAGCACCCTAAACCTGGTCACGCATCCTACACTGGTACCAAGAACCCAGTACAGCCGGAGCGGGACAGGGAGTACGTGATCCCGGATATATTCCAGCGCTTCGGGGGAAGCGAGGAGAAACCACGACTCTAA
Protein
MAAECLAADAGVDLASWRLMMDCPADCPPPLVHYDCYRKRCEETCAPYPNAARACPAQEGQCSPGCYCPDGKLRKGDQCVLPADCLDCTCTGVGTPAKYTTFEGDDLPFLGNCTYLASRDRNQTGEHKYQVYATNGPCDDNANIVCTKIVHLIYEKNVIHISKDPTTKKLRTVIGKTAVFKYPVKENWGTISLLNGQDVSVTLPDIHVELTVSQLNLEFAVRVPTFLYGNRTEGLCGVCAGYQDFLVTSNGTVTDDFDLYGKSWQASPEKLTELEVPSDEQCDAPPPPAPCTPPPPDNNTCYHLYNADRFGACHALVEPQPYVESCEADECGGRGPCDALQRYAAACAELGLCLPDWRRELCPYPCEEPFVYRACVDCERTCDNYEQLQTSPEKCTNKPVEGCFCPEGKVRVNSTCIEPGKCFPCGVDGHYAGDEWQEDACTLCACARSPDGTALVGCRATSCAPPVCAHGEDLRTAPPPPGQCCPEYDCVAKPEAQCKETKKIVCDYGQVLKQKTNPSGCKEYFCECKPSSECEVIPPESEVEIVEAGIHREIDNSGCCPRVSLVCRPETCPKPPHCPQFQTLASVNITGKCCPEYKCELPKDKCIVTLEWEAAAKGGEKPREKPQTVLKDLEAVWLDGPCRSCECALSGAGPAATCAVSACPAVVSSELFVLEPRPVPFACCPEPVQVACRHQDNVYKVGEKWKSPTDVCETYECAADGDGKLQRLAAVQRCDQHCQPGWKYVPAEADSGQCCGKCEPVACVVDGEEKPIGEKWTSSDFCTNFTCVNLNGTLQVQSSNETCPEISDAERKQFVLKEQKVPGKCCPKIEREACRVGDQIYQVGENWTSTENLCESYRCVRDGGGGLRSVASEQRCETDCQPGWKYFPAPAECCGRCKPVACVVEGRERPVGESWTSADFCTNYTCADLHGTLQVQSSNETCPEVSEAVKKQFVLKEEKIPGKCCPKVEPVACRDGDKIYQEGQVWTTPDPCTNRTCRREDGQLSVGRTVEHCERQCRRGWTYSPPAADRCCGRCVQSACLVDDQLKEPGSTWSSADNCTTFSCDRSGEEVFVTSATEHCPDVSACDPADIVNTTCCQICNEKPQALSKCVPKSIPSSETVGLIRIPMGAHGLCVNKFPITGFTECHGSCDSGTIYNNQTGTHESACECCQAAKYSGVSVRLTCEDGTVRPHRVATPARCHCAACGPGLTKHPKPGHASYTGTKNPVQPERDREYVIPDIFQRFGGSEEKPRL

Summary

Keywords
Cell adhesion   Complete proteome   Disulfide bond   Glycoprotein   Lectin   Reference proteome   Repeat   Signal  
Feature
chain  Hemocytin
EMBL
D29738    D14035    KQ459984    KPJ19090.1    ODYU01010645    SOQ55893.1    + More
KX778609    AON96581.1    AGBW02008897    OWR52242.1    RSAL01000012    RVE53530.1    GEZM01019556    JAV89630.1    KQ971372    EFA10333.2    AAZX01003330    ADTU01013061    ADTU01013062    KQ981153    KYN08979.1    KQ976424    KYM88484.1    KQ981993    KYN30795.1    GL447952    EFN85665.1    KQ982548    KYQ55284.1    NNAY01000006    OXU32124.1    KQ978251    KYM95931.1    QOIP01000006    RLU21347.1    GBYB01006924    JAG76691.1    KK107503    EZA49607.1    GL443213    EFN62410.1    GDIQ01034292    JAN60445.1    GDIP01149714    JAJ73688.1    GDIP01031192    JAM72523.1    GDIP01130165    JAL73549.1    GDIP01252951    JAI70450.1    GDIP01123469    JAL80245.1    GDIP01108717    JAL94997.1    GDIP01140215    JAL63499.1    GDIP01166409    JAJ56993.1    GDIP01215358    JAJ08044.1    GDIP01165508    JAJ57894.1    GDIP01123467    JAL80247.1    GDIP01210257    JAJ13145.1    GDIQ01183986    JAK67739.1    GDIQ01242520    JAK09205.1    GDIQ01242519    JAK09206.1    GDIQ01248568    JAK03157.1    GDIQ01242522    JAK09203.1    GDIQ01117110    JAL34616.1    LRGB01003123    KZS04410.1    GDIP01219556    JAJ03846.1    GDIQ01250618    JAK01107.1    GDIQ01242521    JAK09204.1    GDIQ01174638    JAK77087.1    GDIQ01171688    JAK80037.1    GDIQ01202779    JAK48946.1    GDIQ01207928    JAK43797.1    GDIQ01174637    JAK77088.1    GDIQ01028898    JAN65839.1    GDIQ01115912    JAL35814.1    GDIQ01113661    JAL38065.1    GDIQ01057989    JAN36748.1    GDIQ01065336    JAN29401.1    GDIQ01221573    JAK30152.1    GDIQ01129909    JAL21817.1    GDIQ01057988    JAN36749.1    GDIQ01228774    GDIQ01213728    GDIQ01047291    GDIQ01031545    JAK37997.1    GDIQ01115913    JAL35813.1    GDIQ01129908    JAL21818.1    GDIQ01022807    JAN71930.1    GDIQ01055336    JAN39401.1    GDIQ01211428    JAK40297.1    GDIQ01113660    JAL38066.1    GDIQ01115911    JAL35815.1    GDIQ01163196    JAK88529.1    GDIQ01148501    JAL03225.1    GDIQ01065335    JAN29402.1    GDIQ01220223    JAK31502.1    GDIQ01204578    JAK47147.1    GDIQ01066683    JAN28054.1    GDIQ01154097    JAK97628.1    GDIQ01224322    JAK27403.1    GDIQ01159146    JAK92579.1    GDIQ01159145    JAK92580.1    GDIP01197711    JAJ25691.1    GDIQ01213727    JAK37998.1    GDIP01197712    JAJ25690.1    GDIP01065024    JAM38691.1    GDIP01051678    JAM52037.1   
Pfam
PF00094   VWD        + More
PF01826   TIL
PF00754   F5_F8_type_C
PF08742   C8
PF13330   Mucin2_WxxW
PF00093   VWC
Interpro
IPR008979   Galactose-bd-like_sf        + More
IPR001846   VWF_type-D       
IPR001007   VWF_dom       
IPR014853   Unchr_dom_Cys-rich       
IPR000421   FA58C       
IPR006207   Cys_knot_C       
IPR002919   TIL_dom       
IPR012111   Hml       
IPR036084   Ser_inhib-like_sf       
IPR000742   EGF-like_dom       
IPR013032   EGF-like_CS       
IPR002557   Chitin-bd_dom       
IPR036508   Chitin-bd_dom_sf       
IPR002172   LDrepeatLR_classA_rpt       
IPR036055   LDL_receptor-like_sf       
IPR000571   Znf_CCCH       
IPR025155   WxxW_domain       
SUPFAM
SSF49785   SSF49785        + More
SSF57567   SSF57567       
SSF57625   SSF57625       
SSF57424   SSF57424       
Gene 3D
PDB
6N29     E-value=1.43599e-13,     Score=190

Ontologies

Topology

Length:
1253
Number of predicted TMHs:
0
Exp number of AAs in TMHs:
0.00016
Exp number, first 60 AAs:
0
Total prob of N-in:
0.00001
outside
1  -  1253
 
 

Population Genetic Test Statistics

Pi
192.166898
Theta
163.90796
Tajima's D
0.393721
CLR
0.422374
CSRT
0.48377581120944
Interpretation
Uncertain
Peptides ×
Source Sequence Identity Evalue
24402669 CFIVGADNVGSQQMQQIR 96.30 1e-12
26280517 FFFENAPR 96.00 2e-10
29197581 NTGICVDTNTCICPANYQGK 96.00 2e-10
26280517 TAPFDPR 100.00 2e-09
24402669 TANIFR 100.00 2e-09
29197581 KIVCDYGQVIK 100.00 2e-09
26280517 WIEDNHFEFIEGDVK 100.00 3e-07
24402669 WKPGEPNDSHNNEDCVVIHR 100.00 3e-07
28467696 YQVSIQDFR 100.00 3e-07
26280517 ACGVSRPIISCSITINEGSQIKPQIQTIQQEIER 100.00 3e-06
24402669 ACNQIGQFISNR 100.00 3e-06
28467696 ACNQIGQFISNR 100.00 3e-06
29197581 YVPAEADSGQCCGK 100.00 3e-06
26280517 SKVTIDCIPAR 100.00 2e-05
29197581 PVPFACCPEPVQVACR 100.00 2e-05
26280517 CDVDIR 100.00 8e-05
28467696 CENEDQYAEWVAACR 100.00 8e-05
29197581 TCDNYEQIQTSPEK 100.00 8e-05
26822097 DIEAVWIDGPCR 100.00 6e-04
26280517 NIISWTNTHEPGK 100.00 6e-04
24402669 PVIPQYFK 100.00 6e-04
28467696 PVPDQIPR 100.00 6e-04
29197581 DIEAVWIDGPCR 100.00 6e-04
26280517 CTGTVGTAQCIR 100.00 0.001
24402669 CTINGVPSIIK 100.00 0.001
28467696 CTITNPHGEASCTAQITYDSMEPHASK 100.00 0.001
29197581 ATSCAPPVCAHGEDIR 100.00 0.001
24402669 KQYIENAEK 100.00 0.002
28467696 KQVGDNFTEEQIKEFIWK 100.00 0.002
24402669 RCDCSSIGSIDNFCDATSGQCK 100.00 0.010
26280517 YVANQIDIPIDAR 100.00 0.010
24402669 YVNSDTNDGDIYK 100.00 0.010
28467696 YVNPEDSGTYTCR 100.00 0.010
29197581 SPTDVCETYECAADGDGK 100.00 0.010
Copyright@ 2018-2023    Any Comments and suggestions mail to:zhuzl@cqu.edu.cn   渝ICP备19006517号

渝公网安备 50010602502065号