SGID Silkworm Genome Informatics Database
Gene
KWMTBOMO15747
Pre Gene Modal
BGIBMGA002251
Annotation
PREDICTED:_huntingtin_[Bombyx_mori]
Location in the cell
Nuclear   Reliability : 2.698
 

Sequence

CDS
ATGAACGTCTTAGAGAAAGCTGAAAAAGCTTTAGAATATCTTAAAGCAAACGAAGGTTCTGCGAAAGTTCATGAGCTCCAAACGGCTGCTGGAACTCTGGGCAGATGCCTGGGAGCTTTAGGAACTCGCTCGAATTGCGCTCGGCATTATGCAAATTTGTTACACTCAGCAATTCCAACTTTATTGTTTTTAGCCAGCCATGACAGCGCTGAGGTGCGTTTGGTTGGCGATGAAGCTCTCAATAGAGCCGTAGTCGGAGGTTTCGCATTCCATTCACACAAAACTAATATTATACTTCAAAATCAGATTGATCACAAAAGGAATGCCAGATGGATCCGTGCAGCTTTATCTCGTATTTGCCTTGGAGAATGTTGGCTCCGTCCTGGTGTTGGCAAGATTCGAAATCAAGCCCAGTTCCTATTTCCAAAATTAAGTCAGATAGTGAAAGAGACAAATGAAATTCAACTCATTGTGGAAGCATTAGAGAGTAACCTGCCTCGGCTCCTAAATGCCTTGGGTGAATATACTTCAGATGAAGAGATTTCAGAACTGTCAAAAGCAATTCTATTGCACGTAGAAACGACTGAAGCATCTGTGAGGAGAGGCATAGCGACATGCATAGCGCACATGTGCTCACACAGAGAAGCCTTGCTTACTAATGTCCTACAGAAAGTGTTTGAAAAACTGTGGCCCATCTCTCGCGAGAGTCCAGTGGTTTTGGGCTGGTTTAGCGTCATCAAAGTCGTTTTTCAGATAAACGAATTCAAGAAATTTGCCGAAAGCGATCTTTTCAACGTGCATGATTACATGGAGCTGTACAATCTATGCGTGAACTACGTAGATCAGTCGACGGACCACAACATCCAGAACTGCGTGATGGAATGCCTTACCGTAATACTGTCCAAAGCGGCCGGAGCGTACAGGGCGGCCGTGCTGGAGAAACACCCGGCTCGGGTTGTATTGGAGGAGAGGAATTTAAATAAAGGACACAGGAGGAATATTAGTTCGTCGAGTGTCATATCCTCAAGGTACGCGCCGAGTCCGGCCTCCGAGCAGAGAGATAATCTAGAGCTTTCGTTTACTGGAGCTTTGCAACTTGGAAGTCTATTGCAGTTGAACACGCCGAGCTTGAGCGTCGAAGATCTGACCATACAGACTCCGGAGTCGCCGGCGAGCGAGGGGGAGGCACCGCCGCCCGACTCGGAGCTCCTCTCCAGAGACTTGTCGGAGAAACTTAAGGAGTTCGAAGAAACCGACAGCACCGATACTGACCTGGAGAAATGTTCCCACGACGCCGGGTTCAAGATAAACATAGGATCTGTAGCCGATGATGATGTCCCGCTAAAGTATTGTGTGAGGTTGCTGGCTTCGAAGTTTTTGTTGGCTGGCAACAAAGGAGACCTCATTCCTGATCGGTCTGTGCGGGTCTCGTTGAAGGCGTCCGCGCTTAACTGCATATCGGAAGTAGTGCAGGTGTACCCTCAAGCGATGACCTTGTATTTGGACAAAGATGCTGACGCGAAAATCCATAATATTTCCGGGTCATCAGAGGGCACAGACTCGAACCAAGATAACATTAATGAGTACATATTAGAGAGAAACGTCTGTTACCAGGATTCCCTAACAGATTCCTTAAGCCAAGAGCTGAATAGGAGCATCGAAGGTCACTCGATTTGCAAAAAAGACATAAAGTCCCTCGAAAAAAGCACAGATAGCGGCGAGAAAAGCTCCAAGGACGCACCCAGCATCGACAGTAAGCACAAAACGGACTTAATGGGAAGCGCCATGACCGATAGCACGAATCCCAACATGACCATTAGTAACTTCACCACAAACTCCAACATGACCACGAGTAACGACCAGACCAGCGGATACCCCATGTCCAGCAGTGGGGACGTACTGTCCTCGAGCATCGATTTGGACATGAAATTCGATCATTTCGGAGAATCGACAACCAATCTGGACAATCTCAAGGATTTGGAACGCAAAACTGACAGGACGAAAAGAGCCTTGAAAAGGACCGATGAAGTACAAAGCGCTGATGTGGAGGGCTTGGTCGACAGCAAAGACGAGAAAGACTTTGAGTTTCAGCATCTATCCGATGTTTTTCTGCTGCTGGAAGGTCACTGCGACCCGCAGATCAGGGGTCTGGTCAGGGTCTGCATTGGGAACTACTTGGCGGCCGCTCTGAATGCCTCCCACGGCGATTATAATAGATGGAGAAACTTTAACTCGTTGCCTAAAGCCGTCGGAGACTGCATGAGCGCTGGGGACCTGGTACAGATCATATTGAAGGGTCTAAGGGATGAAATACATTCAACAGTGAACCACAGCTTGTCAGCACTGACCGGCCTGGCCGTGGCCCTGTCGGCCAGCTCAGACTGGTCCCTGCTCGTGGATGCGTTGACCTCGTTAGTGCACGTGGAGGCCAACACGTATTGGCTGTGCAGGGTCAATCTCTGTAAGCTCTACGAAAGGATACCGTATAAGAGATTATTCACACTGTATCCGGGGTATCGCGCTAGAAGTCGAATGATAATGAACACGTTGATTGGTCTGCTCCGCGACCCGGATCAGAAGGTCAGGAGTGCCGCAGCGCACGCCATAGCGAACGTTATACCAACGATATACCCGCGAAGGAGACACACGCCGGCCACAATGGTAACTGAGTTAGCTAAGGAGTTAACGGACGAGGTGTCTTTGGATAGAGCGACGCTGTCGTTGCAAATAGTACGCGACATATACTTCTATCACGACCTCCCTCGGCAATTGACCGAAGGGAACAGAACGGACGGCGACACCCTCAAGATTGTTGTCGGAGACTTAATGGGTAGACTGATGGCCAGCAGCTGTAAACATTATTCGCAAGGCTTACTGGAGGCGCTACAAGCGATACTAATCAAGTGGACTCCTTGGAAACACATAGACGCCTACTCCGGTCAAGGGATCCTGGGCTATTGTCTAGATAACCTGGACTATTGCAACACGAACGTTTCTATGAGGACCCTTCTTCTGGACCTGTGCCGTTTGATTTATCCAGTTGAAGTGCACAACATAATGCGCAAAAGGACTGCTAGTAGAGATATATTTGAACGCGATTCGACTAGTAAAGGGAAGTGGCAACACCTCGACGGCGACAGGGTTGCCAGCTTAGCAGAGAAATTTCTCCAAGTAACATTGAAAATGCTGAATGTATTGGTTCATCTGATTGAGGAAGTTAATCCCAATGTACATTTGAATAAGGGCGGGATTGCCCTTCCCGGGAGTCCGGTTCGAAGGAAAACTCAAGATTCTATTCAAACTCGGAAGAATTCGACGGCGAGTGAAAGCTTAGATGACAAAGCCTCTCTAAAACGAAAGCCTTTAATGCCGGCAACGAGCTTGAGAGCGAACTTTGTGGGACACTTTTATAACGAACCGTTCTATATGAAATTGTACGAGGGCTTAAGAGCAGCTTATTCGAATCATAAGATAAATCTCGACCCTAAATCGAGCATATTCCACACGTTTCTATCGACCACTCTGGACTGCCTCGCCGTGCAACTGGAACTGTCCACCGAGAAGGAGTTCGGCTCCGTCACAGAAGAGATCCTGTACTACCTCAAGGCAATAATGCCCTTATGCGCCGACAATACGGTCTATTGTGTGACGCAGCTGTTGAAATGTTTATTCGGGACGAACATGGTGGCGCAGTACGACGATTACATGAGCATCAGCGACAAGAATGAGGGCAGCCGGCAGACCAGCTTCTGTGAGGACGTGTGGACTCGCGCAGACCACGGCTCCAGGAGGAGCAGCGTCGGCTCGAATGCGAGTCAGAAGCTGTGTCTGGACATCAAGAACCCCAGGATGGCGGCGGAGAGGCAACTGCTGATCGCGCTGGAGAATTTCGCCAAGAACAAATCTGATAGGAAGTGGAGTACCAATAAGGAGGAGCTGGAGAAATATATTAAGTTATTTGAACCGGTTGTCATACAGTCTTTGAAGAGCTACACAATGCAAAACGACATTCCACTCCAGCGCCAAGTGCTAAGTCTGCTGAACCAACTGTTGACCCTGAGGGTGAACTACTGCATGCTGGACAACGATCTGATCTTCATAGGCTTCCTGATGAAACAACTAGATTTATTCGAACAACATGAAATACCGAATTGTTGCGATTTGGTGAACAGCATCCTCATGTTTTTGGTGCAACTGTCGTCTTCCAAGCATCACACGAAGAAGATCATTGAAATACCGAAATTAATCCAGTTGTGCGATGGTCTGATGGCGTCGGGGGCCCACGCGGAGTGCATCGCCGGTCTAGAACCCGTCGCTGTCAAAGTCTTCAGCAGTATGGGAGGTAACGTGTTAGGCCGAGCTCAACAGCAAGAGGCGCAGGCGACCAGAGAAGTGCTCTTCTATATGCTGCAGAAGACCATGCATCAACAGAAGGTGCTAGAGCTAGTGTCGTGTATACTGTCTCTGACGCAAGAACACCCGGAGAGCTACTATCGATACTCGGAGCTGGCCACCGACACGTTGCTCAACTTGTTGTCTCAGAGGTCTGTAGAGCTGGACTCATGTCTAGCAGTGTCGGCGCTAGAGCGATTGCTAGAGACGGTGTATAAGGACGTTCTGCTAGAGCAGAGCCGGGTCGAGGTGCTACTGAAGATTCTGTTTAAAGCACCACCAGATCAGTCCACAACGCCACCGAAGCTGAAACTCCGCTATCTATCAATAATAATGGTGCTACTAAGGAAGGTCCTGGTACTGATACCGGAGAGCGAGATATTATTATCGATCAACTACTTGAAGTCGACCTGCGTGTCGCCGCAGTCGATCTTCTTCAACGCGAAAGCGGACGTGGACCCGCTCAACGTGCTCAACGTGAACGAGAACTGCGCGAATCTGTCCCCTGACGTGGTGCTCGTAAGGTTCCTTTTCAAAACGCTGACTTACGCCGTTACGGAAGTGGAGAAGAATTACGAGCCGAATCCGAAGTACGATACCGACGAGAACAGGTTGCTGTATTCCGTGTGCGTCAACATCGTCGTGCAGGTCAAGCAGATGTTGCATTTGACTACTGGCTGCCTGTTCCCTCTGATATCGAAGACGGCGCAGAGTCTCCTGCACAACGAGCAGAGCGGCCTCAACACGGGCCTGTACAGTCCGGACGAGAACATTCCACTCGACACACTCAACATGATATGCCTACGAGCTGCCCACGCGCTGCCCCTGCTCACCGCGCACTGGTCCCATTTGCTCATCAGGTCAGTGATTAAAAAGGATTGCTCAAAATAA
Protein
MNVLEKAEKALEYLKANEGSAKVHELQTAAGTLGRCLGALGTRSNCARHYANLLHSAIPTLLFLASHDSAEVRLVGDEALNRAVVGGFAFHSHKTNIILQNQIDHKRNARWIRAALSRICLGECWLRPGVGKIRNQAQFLFPKLSQIVKETNEIQLIVEALESNLPRLLNALGEYTSDEEISELSKAILLHVETTEASVRRGIATCIAHMCSHREALLTNVLQKVFEKLWPISRESPVVLGWFSVIKVVFQINEFKKFAESDLFNVHDYMELYNLCVNYVDQSTDHNIQNCVMECLTVILSKAAGAYRAAVLEKHPARVVLEERNLNKGHRRNISSSSVISSRYAPSPASEQRDNLELSFTGALQLGSLLQLNTPSLSVEDLTIQTPESPASEGEAPPPDSELLSRDLSEKLKEFEETDSTDTDLEKCSHDAGFKINIGSVADDDVPLKYCVRLLASKFLLAGNKGDLIPDRSVRVSLKASALNCISEVVQVYPQAMTLYLDKDADAKIHNISGSSEGTDSNQDNINEYILERNVCYQDSLTDSLSQELNRSIEGHSICKKDIKSLEKSTDSGEKSSKDAPSIDSKHKTDLMGSAMTDSTNPNMTISNFTTNSNMTTSNDQTSGYPMSSSGDVLSSSIDLDMKFDHFGESTTNLDNLKDLERKTDRTKRALKRTDEVQSADVEGLVDSKDEKDFEFQHLSDVFLLLEGHCDPQIRGLVRVCIGNYLAAALNASHGDYNRWRNFNSLPKAVGDCMSAGDLVQIILKGLRDEIHSTVNHSLSALTGLAVALSASSDWSLLVDALTSLVHVEANTYWLCRVNLCKLYERIPYKRLFTLYPGYRARSRMIMNTLIGLLRDPDQKVRSAAAHAIANVIPTIYPRRRHTPATMVTELAKELTDEVSLDRATLSLQIVRDIYFYHDLPRQLTEGNRTDGDTLKIVVGDLMGRLMASSCKHYSQGLLEALQAILIKWTPWKHIDAYSGQGILGYCLDNLDYCNTNVSMRTLLLDLCRLIYPVEVHNIMRKRTASRDIFERDSTSKGKWQHLDGDRVASLAEKFLQVTLKMLNVLVHLIEEVNPNVHLNKGGIALPGSPVRRKTQDSIQTRKNSTASESLDDKASLKRKPLMPATSLRANFVGHFYNEPFYMKLYEGLRAAYSNHKINLDPKSSIFHTFLSTTLDCLAVQLELSTEKEFGSVTEEILYYLKAIMPLCADNTVYCVTQLLKCLFGTNMVAQYDDYMSISDKNEGSRQTSFCEDVWTRADHGSRRSSVGSNASQKLCLDIKNPRMAAERQLLIALENFAKNKSDRKWSTNKEELEKYIKLFEPVVIQSLKSYTMQNDIPLQRQVLSLLNQLLTLRVNYCMLDNDLIFIGFLMKQLDLFEQHEIPNCCDLVNSILMFLVQLSSSKHHTKKIIEIPKLIQLCDGLMASGAHAECIAGLEPVAVKVFSSMGGNVLGRAQQQEAQATREVLFYMLQKTMHQQKVLELVSCILSLTQEHPESYYRYSELATDTLLNLLSQRSVELDSCLAVSALERLLETVYKDVLLEQSRVEVLLKILFKAPPDQSTTPPKLKLRYLSIIMVLLRKVLVLIPESEILLSINYLKSTCVSPQSIFFNAKADVDPLNVLNVNENCANLSPDVVLVRFLFKTLTYAVTEVEKNYEPNPKYDTDENRLLYSVCVNIVVQVKQMLHLTTGCLFPLISKTAQSLLHNEQSGLNTGLYSPDENIPLDTLNMICLRAAHALPLLTAHWSHLLIRSVIKKDCSK

Summary

Pfam
PF12372   DUF3652        + More
PF02985   HEAT
Interpro
IPR011989   ARM-like        + More
IPR024613   Huntingtin_middle-repeat       
IPR016024   ARM-type_fold       
IPR021133   HEAT_type_2       
IPR028426   Huntingtin_fam       
IPR000091   Huntingtin       
IPR000357   HEAT       
SUPFAM
SSF48371   SSF48371       
Gene 3D
PDB
6EZ8     E-value=2.35497e-72,     Score=698

Ontologies

Topology

Length:
1760
Number of predicted TMHs:
0
Exp number of AAs in TMHs:
0.33364
Exp number, first 60 AAs:
0.04182
Total prob of N-in:
0.00436
outside
1  -  1760
 
 

Population Genetic Test Statistics

Pi
235.476857
Theta
171.707453
Tajima's D
0.886433
CLR
0.275685
CSRT
0.632868356582171
Interpretation
Uncertain
Copyright@ 2018-2023    Any Comments and suggestions mail to:zhuzl@cqu.edu.cn   渝ICP备19006517号

渝公网安备 50010602502065号