SGID Silkworm Genome Informatics Database
Gene
KWMTBOMO05441  Validated by peptides from experiments
Pre Gene Modal
BGIBMGA006693
Annotation
PREDICTED:_hemocytin_isoform_X2_[Papilio_xuthus]
Full name
Hemocytin      
Alternative Name
Humoral lectin
Location in the cell
Extracellular   Reliability : 3.544
 

Sequence

CDS
ATGCTGCAGGCGGGCAACCTTACTGTCGACGTGGAGAACGTGGCCTGCTCCGGAGCCATCTCTGAGGAAATGAATCTAACACCATACAAAGGTGCAGGTTCACCTTCTTGCACCAAGGCCGTCAATTTGAACTACGGAGAAATTAAGGTCCATCTGAAGCAAGGAGGATTTATACTTGTGAATGGGAAGGAAGTCGCCACACTGCCCGTTATGATCGGGAATGTCAGAATCAGAGCGGCTTCATCTCTCTTCCTCATCGTGCAACTTCCCAACAAAGTGGATCTCTGGTGGGATGGCAACACGCGAGTCTTCATAGATGTACCACCAGAATTCAAGGATAAAACGAAGGGCCTCTGCGGAACTTTCAATTTGAACCAGAAAGATGACTTCTTAACGCCCGAGAGCGACGTGGAACAATCAGCGCTCGCGTTCGCCAATAAATGGAAGACCAGGGAGTTCTGTGCGGATATCGATAACAAAGAACCGGAGCACCCCTGCGTGGCCAACGTCGAGAACAAAGCTGCAGCCGAGAAGTACTGCAGCAAGCTCATGAGTAATCTGTTTGAAGAGTGTCACTGGTACGTGGACGTGGAGCCGTACTACCAGGCGTGCCTGTACGACATGTGCGCATGCGCGGGGGACGTGACCCGGTGCCTGTGCCCCGTGCTCGGGGACTACGCCATGGTGTGCGCCAAGAACGGGATCATCCTGCAGTGGCGGTACAACGTTAAGGAGTGCGAGCTGTCGTGTACCGGTGGCCAGCAGTACACGGTGTGTGCTGACAGCTGTCTCCGCAAGTGCTCGGACACAGCTTTGGCGGCCTCCGGGCAGTGCAAGCCTGTCTGTGTGGAGGGCTGCGCCTGCCCGCCCTCACAGCTCCTAGATGACAACGGAGTTTGTGTTCCCGTCGCGAAGTGTCCCTGCATCCATAAGGGCTTGCAGTTCAACGCTGGGTATAAGGAAATCAGACCTGGGAGACGGGAGAGAGAGTTGTGTACTTGTGTCGGTGCTCGTTGGGACTGCAAGCCTGCCACTCCTGAGGAGATCCAGAACTATCCTCCAGCTGAAGACTTGAGAAGCAATTGCACAGCCCAGAACATGGAGTTCACCACTTGCGAGACATCGGAACCACTAACATGCAAGAATATGCACCTACCCCCGAGCACGCAGACCGCGGAGTGTCGCCCCGGATGTCAGTGCAAGAAGGGCCAGGTGCTGGACACCGCGTCCAAGCGCTGCGTGCCCGCCACCCAGTGCCCGTGCCACCACGCCGGCCGGAGCTACCCCGACGGACACCTCATGCAGGAGGAGTGCAACAAATGCGAATGCAAAAACGGTAACTGGTCGTGCACGCAGCGCAAGTGCGCGGGCGTGTGCGGCGCGTGGGGCGACTCCCACGTCAACACGTTCGACGGAACACAGTACGACTTCGAAGGAGTATGTACTTATCTACTCGCGAAAGGTGCTATGGACGGAACCGATGGCTTCGATGTCGAGATACAGAACGTCCCGTGCGGCACAACGGGAGCGACCTGTTCCAAATCTGTCACATTGAAGGTCGGAGGAGCTGGAAACGAGGAGATCGTGTCTCTAACCAAAAACGCGCCCATTCCTGATATCTCTAAGCTCAAACGGATAAAGATGCGTAAGGCCGGTGCGTACGTGTTCCTGGATGTACCGTCTCTAGGTATGAGCCTGCAGTGGGATCGAGGGTTGCGGGTTTACGTCAAGATCGACACTATGTGGCAGGGACGAGTGAAAGGTCTGTGTGGTAACTACAACGGAGACATGCGAGATGACTTCCAGACTCCGTCGGGAGGCGGCATGTCGGAGTCCTCTGCTCTGATATTCGCTGACTCGTGGAAACTGAAGCCGACCTGCCCTAAACCTCAACCTGTTATTGACCATTGTAAGCAACGTCCGGAGCGCAAGGAGTGGGCGCAGAGCGTGTGCGGCGCGCTGAAGCGGTACCCGTTCAGCCTGTGCGCGGGGGAGGTGGGCGCGGGCGCGTACGTGGCGCGGTGCGAGCGCGACGCGTGCGCGTGCGACGCGGGCGCCGACTGCGAGTGTGCGTGCGCCGCGCTTGCCGCCTACGCACACGCGTGCGCGCACCGCGGCGTCACCTTCAACTGGAGAACTAACGACTTGTGCCCAATGCAATGCGACGAAGAGTGCTCGAATTACGACTCGTGCGTGTCGGCGTGTCCCGTCGAGACGTGTGACAACATACTGTACTACGCCGAGACGAAGGCTCGCTGCGAGCAGGACACGTGCGTGGAAGGTTGCAAGCCCAAAAAGTCTTGCCCGGAAGGCTCGGTCTACAAGAATGACTCCACCACGGAATGCGTGCCCCGCGCCAAGTGCAAGCCGGTCTGCATGACCCTCGACGGCGGCAGGGAGGTGCTCGAAGGAGAGATCATCGAGGAGGACGCCTGCCACACTTGCCGATGCTCCAAGAAACACAAAGTGTGCACGGGACAGCCTTGCTCTACTGAAGCGCCTCGAATCCAAGCGACCTCATCGTCTGCCGAGCCGGCTACAGAGAGGCCTCACGATGAGCCGCTGAAGTGTGTGACTGGCTGGACGCCCTGGATCAACCGGGGACCCGCCGAGATCGGCCCCGACGGACAGTCCGTCGAGAGCGAGCCGCTACCCAAACCTAATGAGCTGCAAATCGGTAAGCCAATGTGCAAACCCGAAATGATGAAGAAGATTGAGTGCCGCACCGTAAATGATCACAAGACTCCGAAAGAGACCGGACTGAACGTAGAGTGCAGCTTGGAGAACGGACTCGTGTGTGAGGAGCCAGAGAAAACCTGTCCCGACTTCGAGATCAAGGTCTACTGCGAATGTGAAGAGCCGCAAGTGTGCGTGGAGTCGGCTCGTCCCAGTGAGCCGCACCCCACGGACTGCAGCAAGTTCTACGAGTGTGGCCCCGCCGGGCCCGTGCTGAAGAACTGTGGGCCCGGCACGTGGTACAACCCTCAGTCCATGGTCTGCGACTGGCCCGCCGCTGTAGCCGCTGTCCGGTCGGACTGCGCCTTCACCACCCAAAAACCCCAGGACACCAGTCCCCCAGTGACTGTCACCAGCGAAGCTTCTTCAGAACCAGTGAGCACCACATTAGCGACAACCACATCCCGGTGCCCTCCCGGGGAGGTCTACCAGGCCTGCGCCTACAAGTGTGACCGCCTCTGTGACCACTTCAAGAAGACTCTCATTGCGAAGGGCAGATGCATTAGCGAGACTATATAA
Protein
MLQAGNLTVDVENVACSGAISEEMNLTPYKGAGSPSCTKAVNLNYGEIKVHLKQGGFILVNGKEVATLPVMIGNVRIRAASSLFLIVQLPNKVDLWWDGNTRVFIDVPPEFKDKTKGLCGTFNLNQKDDFLTPESDVEQSALAFANKWKTREFCADIDNKEPEHPCVANVENKAAAEKYCSKLMSNLFEECHWYVDVEPYYQACLYDMCACAGDVTRCLCPVLGDYAMVCAKNGIILQWRYNVKECELSCTGGQQYTVCADSCLRKCSDTALAASGQCKPVCVEGCACPPSQLLDDNGVCVPVAKCPCIHKGLQFNAGYKEIRPGRRERELCTCVGARWDCKPATPEEIQNYPPAEDLRSNCTAQNMEFTTCETSEPLTCKNMHLPPSTQTAECRPGCQCKKGQVLDTASKRCVPATQCPCHHAGRSYPDGHLMQEECNKCECKNGNWSCTQRKCAGVCGAWGDSHVNTFDGTQYDFEGVCTYLLAKGAMDGTDGFDVEIQNVPCGTTGATCSKSVTLKVGGAGNEEIVSLTKNAPIPDISKLKRIKMRKAGAYVFLDVPSLGMSLQWDRGLRVYVKIDTMWQGRVKGLCGNYNGDMRDDFQTPSGGGMSESSALIFADSWKLKPTCPKPQPVIDHCKQRPERKEWAQSVCGALKRYPFSLCAGEVGAGAYVARCERDACACDAGADCECACAALAAYAHACAHRGVTFNWRTNDLCPMQCDEECSNYDSCVSACPVETCDNILYYAETKARCEQDTCVEGCKPKKSCPEGSVYKNDSTTECVPRAKCKPVCMTLDGGREVLEGEIIEEDACHTCRCSKKHKVCTGQPCSTEAPRIQATSSSAEPATERPHDEPLKCVTGWTPWINRGPAEIGPDGQSVESEPLPKPNELQIGKPMCKPEMMKKIECRTVNDHKTPKETGLNVECSLENGLVCEEPEKTCPDFEIKVYCECEEPQVCVESARPSEPHPTDCSKFYECGPAGPVLKNCGPGTWYNPQSMVCDWPAAVAAVRSDCAFTTQKPQDTSPPVTVTSEASSEPVSTTLATTTSRCPPGEVYQACAYKCDRLCDHFKKTLIAKGRCISETI

Summary

Keywords
Cell adhesion   Complete proteome   Disulfide bond   Glycoprotein   Lectin   Reference proteome   Repeat   Signal  
Feature
chain  Hemocytin
EMBL
ODYU01003880    SOQ43191.1    KQ458793    KPJ05004.1    KQ459984    KPJ19090.1    + More
AGBW02008897    OWR52242.1    KX778609    AON96581.1    D29738    D14035    NWSH01001906    PCG69726.1    KQ971372    EFA10333.2    KQ434868    KZC09215.1    KQ435760    KOX75564.1    KQ981153    KYN08979.1    KQ978251    KYM95931.1    GL443213    EFN62410.1    KQ414756    KOC61639.1    GEZM01019556    JAV89630.1    KZ288266    PBC30167.1    KK107503    EZA49607.1    NNAY01000006    OXU32124.1    NEVH01006738    PNF36643.1    AAZX01003330    KQ981993    KYN30795.1    ADTU01013061    ADTU01013062    QOIP01000006    RLU21347.1    KQ982548    KYQ55284.1    KK852470    KDR23192.1    GBYB01006924    JAG76691.1    GEDC01009994    JAS27304.1    KQ976424    KYM88484.1    GEBQ01007062    JAT32915.1    GEBQ01006096    JAT33881.1    CVRI01000037    CRK93315.1    GL447952    EFN85665.1    KK856868    PTY26506.1    AJWK01025249    AJWK01025250    GDIP01197712    JAJ25690.1    GDIQ01237921    JAK13804.1    GDIQ01220223    JAK31502.1    GDIP01106322    JAL97392.1    GDIQ01171688    JAK80037.1    GDIQ01045444    JAN49293.1    GDIQ01129908    JAL21818.1    GDIQ01204578    JAK47147.1    GDIQ01163196    JAK88529.1    GDIQ01022807    JAN71930.1    GDIQ01028898    JAN65839.1    GDIQ01057988    JAN36749.1    GDIQ01115912    JAL35814.1    GDIQ01159146    JAK92579.1    GDIQ01055336    JAN39401.1    GDIQ01174637    JAK77088.1    GDIQ01113661    JAL38065.1    GDIQ01065335    JAN29402.1    GDIQ01129909    JAL21817.1    GDIQ01211428    JAK40297.1    GDIQ01213727    JAK37998.1    GDIQ01065336    JAN29401.1    GDIQ01113660    JAL38066.1    GDIQ01154097    JAK97628.1    GDIQ01115911    JAL35815.1    GDIQ01115913    JAL35813.1    GDIQ01174638    JAK77087.1    GDIQ01221573    JAK30152.1    GDIQ01057989    JAN36748.1    GDIQ01228773    JAK22952.1    GDIQ01034292    JAN60445.1    GDIQ01159145    JAK92580.1    GDIQ01066683    JAN28054.1    GDIQ01202779    JAK48946.1    GDIQ01228774    GDIQ01213728    GDIQ01047291    GDIQ01031545    JAK37997.1    GDIP01039030    JAM64685.1    GDIQ01115910    JAL35816.1    GDIP01106321    JAL97393.1    GDIQ01079322    JAN15415.1    GDIQ01140155    JAL11571.1    GDIP01063575    JAM40140.1    GDIP01078177    JAM25538.1    GDIP01069224    JAM34491.1    GDIP01194408    JAJ28994.1    GDIP01065024    JAM38691.1    GDIP01130165    JAL73549.1    GDIP01123468    JAL80246.1    GDIP01194409    JAJ28993.1    GDIP01228756    JAI94645.1    GDIP01165508    JAJ57894.1    GDIP01140215    JAL63499.1    GDIP01210257    JAJ13145.1    GDIP01043878    JAM59837.1    GDIP01219556    JAJ03846.1    GDIP01190558    JAJ32844.1    GDIP01123467    JAL80247.1    GDIP01031192    JAM72523.1    GDIP01108717    JAL94997.1    GDIP01166409    JAJ56993.1    GDIQ01207928    JAK43797.1    GDIP01123469    JAL80245.1    GDIP01215358    JAJ08044.1    GDIP01051678    JAM52037.1    GDIP01252951    JAI70450.1    GDIP01195659    JAJ27743.1   
Pfam
PF08742   C8        + More
PF00754   F5_F8_type_C
PF01826   TIL
PF00094   VWD
PF12456   hSac2
PF13330   Mucin2_WxxW
PF01607   CBM_14
PF00093   VWC
Interpro
IPR002557   Chitin-bd_dom        + More
IPR001846   VWF_type-D       
IPR000742   EGF-like_dom       
IPR001007   VWF_dom       
IPR008979   Galactose-bd-like_sf       
IPR013032   EGF-like_CS       
IPR000421   FA58C       
IPR036508   Chitin-bd_dom_sf       
IPR014853   Unchr_dom_Cys-rich       
IPR036084   Ser_inhib-like_sf       
IPR002919   TIL_dom       
IPR006207   Cys_knot_C       
IPR002172   LDrepeatLR_classA_rpt       
IPR012111   Hml       
IPR036201   Pacifastin_dom_sf       
IPR000571   Znf_CCCH       
IPR034753   hSac2       
IPR022158   Inositol_phosphatase       
IPR036055   LDL_receptor-like_sf       
IPR025155   WxxW_domain       
SUPFAM
SSF49785   SSF49785        + More
SSF57567   SSF57567       
SSF57625   SSF57625       
SSF57283   SSF57283       
SSF57424   SSF57424       
Gene 3D
ProteinModelPortal
PDB
6N29     E-value=2.14606e-28,     Score=317

Ontologies

Topology

Length:
1084
Number of predicted TMHs:
0
Exp number of AAs in TMHs:
0.02053
Exp number, first 60 AAs:
0.00389
Total prob of N-in:
0.00097
outside
1  -  1084
 
 

Population Genetic Test Statistics

Pi
213.756896
Theta
176.359335
Tajima's D
0.791512
CLR
0.124024
CSRT
0.601669916504175
Interpretation
Uncertain
Peptides ×
Source Sequence Identity Evalue
24402669 EENIIVIIIEK 100.00 1e-08
28467696 EFAYNEADIIAGKNEITK 100.00 1e-08
29197581 CVQPADCPCR 100.00 1e-08
28467696 WDASASEPEYMEQVR 95.24 3e-08
24402669 RCNSIATNIGQACCEECSIGR 100.00 3e-04
24402669 CVNDNNVKPR 100.00 0.002
24402669 CPDWQIPPSR 100.00 0.019
26280517 AYTFCFNDK 100.00 0.042
24402669 CENFYGGYSCQCPAGHR 100.00 0.042
Copyright@ 2018-2023    Any Comments and suggestions mail to:zhuzl@cqu.edu.cn   渝ICP备19006517号

渝公网安备 50010602502065号