SGID Silkworm Genome Informatics Database
Gene
KWMTBOMO09747  Validated by peptides from experiments
Pre Gene Model
BGIBMGA012935
Annotation
clathrin_heavy_chain_[Bombyx_mori]
Full name
Clathrin heavy chain      
Location in the cell
Cytoplasmic   Reliability : 2.158
 

Sequence

CDS
ATGGCGCAAGTATTACCGATACGGTTTCAAGAACATTTACAGCTTACCAATGTGGGAATCAATCCTGCTTCTATTTCTTTCAACACTCTCACCATGGAATCGGACAAGTTTATCTGTGTTCGCGAGAAGGTTGGTGAGACTGCAGAAGTTGTCATTATTGATATGGCAGATCCAACAAATCCAATTCGACGACCAATCAGTGCAGACTCTGCTATCATGAATCCAGCTAGTAAAGTCATTGCTTTGAAAGGCAAAGCTGGTGTTGAAGCTCAAAAAACACTTCAAATCTTCAACATCGAGATGAAATCCAAAATGAAGGCGCACACCATGACCGAAGACATTGTTTTCTGGAAGTGGATTTCACTGAACACACTCGCTCTGGTCACCAAGATGTCGGTGTACCACTGGTCGATGGAAGGCGACTCGACACCCGTAAAGATGTTCGACAGGCATTCGTCGCTTGCCGATTGCCAGATCATTAACTACAGGACGGATCCGAAGCAACAGTGGTTACTGCTGGTCGGTATTTCTGCTCAACAAAACCGGGTAGTCGGAGCTATGCAGCTGTACTCTGTCGAGCGCAAGTGCTCTCAGCCTATCGAGGGACACGCCGCGTCATTCGCCACATTCAAAGCGGAAGGCAACGCCGAGCCGTCCACTCTCTTCTGCTTCGCTGTGAGGACTGCCCAAGGCGGCAAACTACACATTATTGAGGTTGGCCAAACGCCGGCCGGCAACCAACCATTTCCGAAGAAAGCCGTCGACGTGTTCTTCCCAGCGGAGGCGCAGAATGATTTCCCAGTGGCGATGCAAGTATCGCCGAAGTATGACGTCATTTACTTGATTACCAAATATGGATACATTCACATGTACGACATCGAGACGGGAACGTGCATTTACATGAACCGTATCTCGTCGGACACGATCTTCGTGACAGCGCCCCATGAAGCGACTGGTGGTATTATCGGTGTAAACCGCAAAGGTCAAGTGCTATCAGTGACGGTAGAGGAAGACTCAATAGTTCCGTACATAAACACCGTGCTCCAGAACCCCGAGTTAGCGTTACGCATGGCCGTGCGTAACAACTTGGCAGGTGCTGAAGAACTGTTTGTGAGGAAATTCAACATGTTGTTCACGAATGGACAATACATTGAGGCTGCTAAGGTAGCAGCTATGGCACCTCGAGGAATCCTCCGTACACCGCAAACTATCCAGCGTTTCCAACAAGTGCCCACGCAGCCAGGTCAAAATTCGCCGTTGTTGCAATACTTTGGTATCCTCCTTGATCAGGCGCAACTCAACAAGTTTGAATCCCTTGAATTGTGTCGTCCTGTTTTATTGCAAGGGCGAAAACAACTGCTCGAGAAATGGTTGAAGGAAGAAAAATTGGAGTGCTCAGAAGAACTCGGAGATCTTGTAAAGCAAGTAGACCCTACACTTGCTCTCTCAGTTTACCTTCGTGCCAACGTCGCCAGTAAAGTTATTCAGTGCTTTGCTGAGACTGGGCAATTCCAGAAAATTGTATTATATGCCAAGAAGGTTGGCTACACTCCAGACTACATTTATCTTCTTCGTTCAGTGATGCGTACTAATCCTGAACAAGGGGCGGGATTTGCAGGAATGTTAGTCGCCGAAGATCCCCCTCTAGCAGACATCAATCAGATAGTAGATGTTTTCATGGAACAAAATATGGTGCAACAATGTACAGCCTTCCTTTTGGACGCTCTGAAGAATAACAGACCTGAGGAAGGACCTTTACAAACTAGGCTTCTAGAAATGAACTTGATGTCAGCGCCGCAGGTAGCAGATGCGATCCTCGGCAACGCGATGTTCACGCAGTACGATCGCGCGCACGTTGCTCAACTCTGCGAGAAGGCTGGATTGTTACAACGGGCGCTTGAGCATTACACTGACTTATATGACATCAAGAGAGCTGTTGTACATACTCACCTTTTACCAGCTGACTGGCTTGTCACATACTTCGGAAGTTTGTCTGTAGA
GGACTCCCTTGAATGTTTGAAAGCAATGCTTCAGGCTAACATCCGTCAGAATCTGCAGATTTGTGTCCAGATCGCCACTAAATACCATGAGCAACTAACAACAAAGGCTTTAATTGAATTGTTTGAGAGTTTCAAAACATATGAAGGGTTGTTCTATTTCCTGGGATCAATTGTAAACTTCAGCCAGGATCCAGAAGTCCACTTTAAATATATTCAGGCTGCATGTAAGACTGGTCAAATCAAGGAAGTGGAGCGTATCTGTCGCGAATCCAGCTGTTACAACCCAGAGCGCGTGAAGAATTTCCTCAAAGAAGCAAAGCTTCCCGACCAGTTGCCTCTTATCATAGTTTGTGATAGATTCGACTTTGTTCACGACTTGGTTTTGTACTTATATAGAAACAGTCTTCAGAAATATATTGAGATTTATGTACAAAAGGTTAATCCTTCTCGTCTGCCAGTCGTAGTTGGTGGCTTGTTAGATGTTGATTGTGCTGAAGACATCATTAAGAACCTGATCTTGGTTGTTCGGGGACAGTTTTCGACTGATGAATTGGTTGCTGAAGTGGAAAAGAGGAATAGACTGAAACTCCTCCTTCCTTGGCTGGAAACAAGAGTACATGAGGGCAGCAACGAACCAGCCACACATAATGCTCTAGCCAAAATCTACATTGATTCCAACAACAATCCTGAGAGATTCCTCAGAGAAAATCAGTGGTATGACTCCCGCGTGGTTGGCCGCTACTGCGAGAAGCGCGACCCGCACTTGGCGTGCGTGGCGTACGAGCGCGGACAGTGCGACCGCGAGCTCATCGCCGTCTGCAACGACAACTCTCTCTTTAAGACCCAGGCTCGCTACCTGGTGCGCAGGAAGGATCAAGACCTCTGGCTCGAGGTGCTCTCTGAGGAGAATCCTTACAAGAGGCAACTTATTGACCAGGTGGTGCAAACAGCTCTTTCTGAAACCCAAGATCCTGAAGATATCTCTGTTACTGTAAAAGCTTTCATGACAGCCGACCTACCCAACGAACTCATTGAGCTGCTTGAAAAGATTGTTCTTGACAATTCTGTGTTCTCTGATCACAGGAACCTTCAGAACCTTTTGATTCTTACTGCTATCAAGGCGGACCGTTCTCGTGTAATGGAATACATCAATCGTTTGGATAACTACGACGCACCTGATATCGCCAACATTGCTATCAACAATGAATTATACGAGGAGGCATTCGCTATCTTCAAAAAATTCGATGTTAATACTTCTGCTATACAAGTACTGATTGAACAAGTGAAAGATCTCGAACGTGCCTATGAATTTGCCGAGCGATGCAACGAGCCTGGTGTTTGGTCTCAGCTCGCAAAGGCTCAACTACAGCAAGGTCTGGTCAAGGAAGCCATCGACTCTTACATAAAAGCGGACGATCCCTCCGCTTACATGGACGTTGTCGCTACTGCTACCAAGCAAGAGTCTTGGGATGATCTTGTTAGGTATTTGCAGATGGCCCGGAAGAAGGCGCGCGAATCGTACATAGAGTCCGAACTGATCTACGCTTACGCCCGCACCGGTCGCCTCGCCGATCTGGAAGAGTTTATCTCTGGCCCGAATCATGCTGACATCCAGAAAATCGGTGACCGTTGTTTCGACGATAAGATGTACAACGCTGCGAAACTCTTGTATAACAACGTCAGCAACTTCGCTCGTCTTGCCATTACTTTGGTGCATCTGAAAGAATTCCAAGGTGCCGTGGACAGCGCGCGTAAGGCCAACTCTACGCGTACATGGAAGGAGGTCTGCTTCGCGTGCGTGGACGCCGGTGAATTTAGACTCGCGCAAATGTGCGGATTGCACATTGTAGTACATGCTGACGAACTTGAAGACCTCATCAATTACTACCAGGACCGAGGTCACTTTGACGAGCTGATCAGCTTGTTAGAAGCTGCCTTAGGGCTGGAGAGAGCTCACATGGGAATGTTCACTGAACTCGCCATCTTGTATTCAAAATACA
AGCCCGTGAAGATGAGGGAGCACCTCGAACTGTTCTGGTCACGCGTTAACATTCCCAAGGTACTCCGTGCTGCTGAACATGCCCATCTTTGGTCAGAGCTGGTATTCCTGTATGACAAGTATGAGGAGTATGACAACGCTGCTCTCACGATGATGCAGCATCCCACCGAGGCTTGGCGCGAGGGACACTTCAAAGACATAATTACTAAAGTCGCGAACATGGAACTGTACTACAAAGCTATCCAATTTTACCTAGACTACAAGCCACTTCTCCTTAACGATCTACTGTTAGTACTGGCCCCACGCATGGATCACACCCGAGCAGTTAACTTCTTCACCAAAGCTGGTCATCTGCAACTAGTGAAGGCCTATTTAAGATCAGTGCAGAGTTTGAACAATAAGGCTGTTAATGAAGCATTGAACTCGCTGCTTATTGATGAGGAGGATTATCAAGGCTTAAGAACATCGATCGACGCCTTCGACAACTTCGACACTATTGCGTTAGCGCAACAGTTGGAGAAACACGAACTTACAGAATTCAGACGGATTGCCGCCTACTTATACAAAGGTAATAACAGATGGAAGCAAAGCGTGGAGCTCTGTAAGAAAGACGCTTTGTACGCCGATGCAATGGAGTATGCATCGGAGTCGCGCCAGCCCGAGGTCGCCGAAGAGCTCCTCAACTGGTTCTTGGAAAGAGATAATTTCGAATGTTTCTCGGCTTGCTTATATCAGTGTTACGATCTCTTGAAACCCGATGTTGTCATCGAGCTCGCATGGAGACATAACATCATGGACTTTGCCATGCCCTATCTTATTCAGACAGTTCGCGAGTTGACAACTAAAGTAGAGAAGCTTGAAGAAGCAGACGCCAAGCGAAGCACCGAGAATGCGGAGCACGAAGCGAAGCCCACCATGATAATAGAACCACAACTCATGCTGACTGCCAGTCCATCGATGCCGTACGTGGTGCCGCCCCAGCCGTCGCAGTACGGGTACACTGCGCAGGCGCCCTCGCCCGCCCCCTACCACGGGTACGGCATGTAG
Protein
MAQVLPIRFQEHLQLTNVGINPASISFNTLTMESDKFICVREKVGETAEVVIIDMADPTNPIRRPISADSAIMNPASKVIALKGKAGVEAQKTLQIFNIEMKSKMKAHTMTEDIVFWKWISLNTLALVTKMSVYHWSMEGDSTPVKMFDRHSSLADCQIINYRTDPKQQWLLLVGISAQQNRVVGAMQLYSVERKCSQPIEGHAASFATFKAEGNAEPSTLFCFAVRTAQGGKLHIIEVGQTPAGNQPFPKKAVDVFFPAEAQNDFPVAMQVSPKYDVIYLITKYGYIHMYDIETGTCIYMNRISSDTIFVTAPHEATGGIIGVNRKGQVLSVTVEEDSIVPYINTVLQNPELALRMAVRNNLAGAEELFVRKFNMLFTNGQYIEAAKVAAMAPRGILRTPQTIQRFQQVPTQPGQNSPLLQYFGILLDQAQLNKFESLELCRPVLLQGRKQLLEKWLKEEKLECSEELGDLVKQVDPTLALSVYLRANVASKVIQCFAETGQFQKIVLYAKKVGYTPDYIYLLRSVMRTNPEQGAGFAGMLVAEDPPLADINQIVDVFMEQNMVQQCTAFLLDALKNNRPEEGPLQTRLLEMNLMSAPQVADAILGNAMFTQYDRAHVAQLCEKAGLLQRALEHYTDLYDIKRAVVHTHLLPADWLVTYFGSLSVEDSLECLKAMLQANIRQNLQICVQIATKYHEQLTTKALIELFESFKTYEGLFYFLGSIVNFSQDPEVHFKYIQAACKTGQIKEVERICRESSCYNPERVKNFLKEAKLPDQLPLIIVCDRFDFVHDLVLYLYRNSLQKYIEIYVQKVNPSRLPVVVGGLLDVDCAEDIIKNLILVVRGQFSTDELVAEVEKRNRLKLLLPWLETRVHEGSNEPATHNALAKIYIDSNNNPERFLRENQWYDSRVVGRYCEKRDPHLACVAYERGQCDRELIAVCNDNSLFKTQARYLVRRKDQDLWLEVLSEENPYKRQLIDQVVQTALSETQDPEDISVTVKAFMTADLPNELIELLEKIVLDNSVFSDHRNLQNLLILTAIKADRSRVMEYINRLDNYDAPDIANIAINNELYEEAFAIFKKFDVNTSAIQVLIEQVKDLERAYEFAERCNEPGVWSQLAKAQLQQGLVKEAIDSYIKADDPSAYMDVVATATKQESWDDLVRYLQMARKKARESYIESELIYAYARTGRLADLEEFISGPNHADIQKIGDRCFDDKMYNAAKLLYNNVSNFARLAITLVHLKEFQGAVDSARKANSTRTWKEVCFACVDAGEFRLAQMCGLHIVVHADELEDLINYYQDRGHFDELISLLEAALGLERAHMGMFTELAILYSKYKPVKMREHLELFWSRVNIPKVLRAAEHAHLWSELVFLYDKYEEYDNAALTMMQHPTEAWREGHFKDIITKVANMELYYKAIQFYLDYKPLLLNDLLLVLAPRMDHTRAVNFFTKAGHLQLVKAYLRSVQSLNNKAVNEALNSLLIDEEDYQGLRTSIDAFDNFDTIALAQQLEKHELTEFRRIAAYLYKGNNRWKQSVELCKKDALYADAMEYASESRQPEVAEELLNWFLERDNFECFSACLYQCYDLLKPDVVIELAWRHNIMDFAMPYLIQTVRELTTKVEKLEEADAKRSTENAEHEAKPTMIIEPQLMLTASPSMPYVVPPQPSQYGYTAQAPSPAPYHGYGM
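The CDS and Protein fields above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not the database's own pipeline: the function names `translate` and `expected_cds_length` are hypothetical, and it assumes the standard nuclear code applies and that the CDS ends in a single stop codon. Whitespace is stripped first because the CDS is displayed across several lines.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS to protein, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks inside the sequence
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


def expected_cds_length(protein_length: int) -> int:
    """Nucleotides needed for the protein plus one stop codon."""
    return 3 * (protein_length + 1)


# First ten codons of the CDS shown above.
print(translate("ATGGCGCAAGTATTACCGATACGGTTTCAA"))  # MAQVLPIRFQ
# A 1681-residue protein implies this many nucleotides, stop codon included.
print(expected_cds_length(1681))  # 5046
```

Pasting the full CDS into `translate` and comparing the result with the Protein field is a quick way to detect copy or line-break damage in either sequence.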

Summary

Description
Clathrin is the major protein of the polyhedral coat of coated pits and vesicles.
Similarity
Belongs to the clathrin heavy chain family.
EMBL
BABH01003440    AB470485    BAH03459.1    NWSH01000188    PCG78626.1    KZ149898
PZC78648.1    RSAL01000008    RVE54021.1    AGBW02007651    OWR54911.1    GEZM01073289    JAV65183.1    NEVH01014358    PNF27910.1    KJ476828    KQ971347    AHY84716.1    EFA04947.1    GALX01005267    JAB63199.1    KF730319    AHC70342.1    KX965603    ATD50464.1    KK852507    KDR22435.1    DS235072    EEB11299.1    AK417412    BAN20627.1    GECU01010345    JAS97361.1    KY285052    ATP16155.1    GECZ01019769    GECZ01019737    JAS50000.1    JAS50032.1    CH478071    EAT34109.1    GANO01001512    JAB58359.1    GFDL01009337    JAV25708.1    GEDC01005398    JAS31900.1    GDAI01000320    JAI17283.1    AXCM01000884    KB632404    ERL95077.1    GEDC01008158    JAS29140.1    GFDF01003575    JAV10509.1    MH143573    AZT88970.1    GFDF01003574    JAV10510.1    GFDF01003573    JAV10511.1    KK107119    QOIP01000009    EZA58717.1    RLU18464.1    ATLV01015606    KE525023    KFB40523.1    AXCP01000887    AXCP01000888    KZ288254    PBC30870.1    JR048998    AEY60829.1    GL450732    EFN80585.1    GL443983    EFN61414.1    GGFM01006474    MBW27225.1    GBGD01000067    JAC88822.1    GGFK01000636    MBW33957.1    GGFJ01000285    MBW49426.1    GFTR01008858    JAW07568.1    AAAB01008859    EAA08110.4    ADMH02001371    ETN62753.1    ADTU01011670    APCN01000612    AXCN02000193    GAMC01008609    GAMC01008608    JAB97946.1    GL764129    EFZ18330.1    GDHF01033867    GDHF01027123    GDHF01021549    GDHF01019321    GDHF01015345    GDHF01013899    GDHF01008650    GDHF01002368    JAI18447.1    JAI25191.1    JAI30765.1    JAI32993.1    JAI36969.1    JAI38415.1    JAI43664.1    JAI49946.1    DS232386    EDS40945.1    GAKP01020415    GAKP01020414    GAKP01020413    JAC38537.1    GDHF01016767    GDHF01004745    JAI35547.1    JAI47569.1    GBXI01015170    GBXI01015135    GBXI01010756    GBXI01006419    JAC99121.1    JAC99156.1    JAD03536.1    JAD07873.1    GDHC01011944    JAQ06685.1    LBMM01002221    KMQ95124.1    NNAY01000676    OXU27054.1    KQ979236    KYN22192.1    KQ981727    
KYN37177.1    AAZX01005762    UFQT01000054    SSX19052.1    CCAG010011658    AJWK01008161    AJWK01008162    AJWK01008163    AJWK01008164    AJWK01008165    KY938818    ASA46459.1    GFAH01000150    JAV48239.1    KQ777347    OAD52116.1    GGLE01005165    MBY09291.1    KQ434973    KZC12769.1    CH933810    EDW07527.1    KRF94090.1    GEFH01000661    JAP67920.1    CH916371    EDV92191.1    KK115345    KFM64868.1    MH365862    QBB01671.1    GFWZ01000077    MBW20067.1    GL888463    EGI60613.1    KQ982298    KYQ58168.1    CP012528    ALC48748.1    KQ976796    KYN08245.1    GFPF01006802    MAA17948.1    CH480852    EDW51532.1    GEDV01009864    JAP78693.1    CM000162    EDX01839.1    CH954180    EDV46951.1   
Pfam
PF09268   Clathrin-link
PF00637   Clathrin
PF01394   Clathrin_propel
PF13863   DUF4200
PF00795   CN_hydrolase
Interpro
IPR011990   TPR-like_helical_dom_sf
IPR000547   Clathrin_H-chain/VPS_repeat       
IPR016024   ARM-type_fold       
IPR015348   Clathrin_H-chain_linker_core       
IPR016341   Clathrin_heavy_chain       
IPR022365   Clathrin_H-chain_propeller_rpt       
IPR016025   Clathrin_H-chain_N       
IPR025252   DUF4200       
IPR003010   C-N_Hydrolase       
IPR036526   C-N_Hydrolase_sf       
SUPFAM
SSF48371   SSF48371
SSF50989   SSF50989       
SSF56317   SSF56317       
Gene 3D
PDB
3IYV     E-value=0,     Score=7091

Ontologies

Topology

Subcellular location
Cytoplasmic vesicle membrane  
Membrane  
Coated pit  
Length:
1681
Number of predicted TMHs:
0
Exp number of AAs in TMHs:
0.89991
Exp number, first 60 AAs:
0.0068
Total prob of N-in:
0.03969
outside
1  -  1681
 
 

Population Genetic Test Statistics

Pi
340.289008
Theta
195.207132
Tajima's D
2.340682
CLR
0.222514
CSRT
0.929653517324134
Interpretation
Uncertain
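For orientation, Tajima's D compares two estimates of diversity: the mean number of pairwise differences (Pi) and Watterson's estimate derived from the number of segregating sites. The sketch below follows Tajima's (1989) formula. The function name `tajimas_d` and its parameters are hypothetical, and the record does not give the sample size or the number of segregating sites, so the reported value cannot be recomputed from this page alone. Assuming the Theta field is Watterson's estimator on the same scale as Pi, the reported Pi (340.289008) exceeding Theta (195.207132) agrees in sign with the positive D (2.340682).

```python
import math


def tajimas_d(n: int, segregating_sites: int, pi: float) -> float:
    """Tajima's D from sample size n, segregating sites S, and mean
    pairwise differences pi (all on a per-locus scale)."""
    if n < 4:
        # The variance term is zero for n = 2 and n = 3.
        raise ValueError("need at least 4 sequences")
    if segregating_sites < 1:
        raise ValueError("need at least 1 segregating site")
    s = segregating_sites
    a1 = sum(1.0 / i for i in range(1, n))
    a2 = sum(1.0 / i**2 for i in range(1, n))
    b1 = (n + 1) / (3.0 * (n - 1))
    b2 = 2.0 * (n * n + n + 3) / (9.0 * n * (n - 1))
    c1 = b1 - 1.0 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1**2
    e1 = c1 / a1
    e2 = c2 / (a1**2 + a2)
    theta_w = s / a1  # Watterson's estimator
    return (pi - theta_w) / math.sqrt(e1 * s + e2 * s * (s - 1))


# Toy input: with n = 4 and S = 11, Watterson's estimate is 6,
# so pi = 8 gives a positive D and pi = 4 a negative one.
print(tajimas_d(4, 11, 8.0) > 0)
print(tajimas_d(4, 11, 4.0) < 0)
```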
Peptides
Source  Sequence  Identity (%)  E-value
27102218 NNIAGAEEIFVRK 96.00 3e-10
25044914 KAVAAEGDIR 100.00 2e-09
27102218 VMEYINR 100.00 2e-09
28467696 KATIEVGNVMPVGAMPEGTIVCNIEEK 100.00 2e-09
26822097 AIEHYTDIYDIKR 100.00 2e-09
26280517 ISSDGVISVIAPR 100.00 2e-09
25044914 ISSDGVITVVAPR 100.00 2e-09
27102218 ESSCYNPER 100.00 2e-09
28467696 ISSDGVITVVAPR 100.00 2e-09
28556443 EFQGAVDSAR 100.00 2e-09
28556443 HNIMDFAMPYLIQTVR 100.00 2e-09
26822097 ESYIESEIIYAYAR 100.00 8e-09
27102218 YIEIYVQK 100.00 8e-09
28467696 AVDSIVPIGR 100.00 8e-09
28556443 VGYTPDYIYLLR 100.00 8e-09
25044914 YGYGSTVSEK 100.00 1e-08
28556443 IYIDSNNNPER 100.00 6e-07
31250652 NLQNLLILTGNNR 100.00 1e-06
31250652 FNMLFTNGQYIEAAKVLIEQVK 95.24 5e-06
26822097 AEGNAEPSTIFCFAVR 100.00 9e-06
27102218 TPQTIQR 100.00 9e-06
28467696 MSVDKEEIVQR 100.00 9e-06
26822097 IYIDSNNNPER 100.00 3e-05
26280517 CSITVDNEVIK 100.00 3e-05
25044914 CSQPCGVGYQER 100.00 3e-05
27102218 AIIEIFESFK 100.00 3e-05
28467696 CSQAGHYIISNIQDK 100.00 3e-05
28556443 ALEHYTDLYDIK 100.00 8e-05
31250652 VGEWSILR 100.00 3e-04
31250652 YLQTVR 100.00 3e-04
26822097 EFQGAVDSAR 100.00 0.001
26280517 ADCDWNIEAPR 100.00 0.001
25044914 ADDINIIANIYER 100.00 0.001
27102218 WIKEEKIECSEEIGDIVK 100.00 0.001
28467696 ADDMPTFK 100.00 0.001
26822097 HEITEFR 100.00 0.002
26280517 EVANFINSK 100.00 0.002
25044914 EVASIIENIYSK 100.00 0.002
27102218 EIIAVCNDNSIFK 100.00 0.002
28467696 EVATNSEIVQSGK 100.00 0.002
26822097 AHTMTEDIVFWK 100.00 0.003
26280517 RPIGVGIIVAGYDDQGPHIYQTCPSANYFDCR 100.00 0.003
25044914 RPIIDNVSDK 100.00 0.003
27102218 HEITEFRR 100.00 0.003
28467696 RPIPASAESFDDIIIIESAR 100.00 0.003
28556443 NNRPEEGPLQTR 100.00 0.003
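Several peptides in the table show I where the protein sequence has L (for example AIEHYTDIYDIKR against ALEHYTDLYDIKR) yet are listed at 100.00 identity, which suggests that isoleucine and leucine are treated as equivalent, as is common for mass-spectrometry peptides. The sketch below locates a peptide in a protein under that assumption. The helper names `collapse_il` and `find_peptide` are hypothetical, and this is not necessarily how the database computed its identities.

```python
def collapse_il(seq: str) -> str:
    """Map isoleucine to leucine so the two residues compare as equal."""
    return seq.upper().replace("I", "L")


def find_peptide(peptide: str, protein: str) -> int:
    """Return the 1-based start of the peptide in the protein, or 0 if absent."""
    return collapse_il(protein).find(collapse_il(peptide)) + 1


# Fragment of the protein sequence shown in the Protein field.
fragment = "LLQRALEHYTDLYDIKRAVVHTHLL"
print(find_peptide("AIEHYTDIYDIKR", fragment))  # 5
print(find_peptide("ALEHYTDLYDIK", fragment))   # 5
```

Running `find_peptide` for each table row against the full Protein sequence gives the positions of the peptides that align exactly; a return value of 0 marks a peptide with no exact match under the I/L assumption.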
Copyright © 2018-2023. Comments and suggestions: zhuzl@cqu.edu.cn