SGID Silkworm Genome Informatics Database
Gene
KWMTBOMO04723  Validated by peptides from experiments
Pre Gene Model
BGIBMGA009929
Annotation
PREDICTED: arginine-glutamic acid dipeptide repeats protein [Papilio xuthus]
Location in the cell
Nuclear   Reliability: 3.942
 

Sequence

CDS
ATGGCTCAAAATCAAGGAGAAGTTCCAGTAGGTACTCCGAGTCAACCCGTCAAAGATAAGGATATATATGCTTGCTTGCCGGAGATGCGAATTGATGGGCCGCTGGGACCCGACGATCCTTGTCCAGGTGGTGAAGATCTACGATGGCTGCCTGCACAGGCTACGGACAGGGACCTTGTAATGTACTTACGTGCCGCACGTTCAATGGCAGCGTTTGCGGGAATGTGTGATGGTGGCTCTCCTGATGACGGCTGCGTTGCCGCCAGTCGAGACGATACGACTATCAATGCTCTTGACGTGCTGCACGACTCTGGCTACGATCCTGGTCGAGCGTTACAAGCTCTCGTGAAATGTCCGGTACCCAAAGGTATTGAGAAAAAGTGGTCAGAAGACGAAACAAAACGTTTTGTTAAAGGGATACGGCAATTTGGCAAGAATTTTTTCAAAATTAGAAAAGACCTACTGCCTCATAAAGACACTGCAGAACTTGTTGAATTTTATTATCTCTGGAAAAAGACTCCGGGTGCTAGTAGTAATAGACCTCACAGAAGAAGAAGACAAGCATCATTAAGGAGAGTCCGTAATACAAGGAATTCACGCGCAGGTACTCCCAAGGAACAAACGCCTGAAGTTGTACCTGCTTTGCCAGAGACAAATGGTTCTAGACCGTCACCAAATCCAAAGGAAGCCGGAGAAATGAGTTCAGTCACTGAAGATGAAATTTCTGAAGATGATAGTGATTCACGAGATGCCGGTGATTTAGCCAAGAGTGAAAATGGTAGAATTGTAGAAAACCCTGATGACTCACCGAGCAGAATGAGAACAAGAAATAAATCTAAAGAACAAACTACACCGAACGGTAAGAAAGCTCCGGAGGAAAGTGATATTGAACAAAAAGCAAAACCTAAACCAAAAGCTATTGTTCAAAGCAATAATAACAATATTCCCGATAAAGTAATTTCTTCACCTGTAAGTAAAGAAGTTAAGAAAAAGGTGACAAATGGAAAAGTTGATGCATCTAAAGTGAAGAAACGACTGCCAGATGATACGAAACCGGATGGAATTATGGATGGAGATGTACAAATGAAAAAGAAGAGAGCCGACCCTCCAGAAAGTCCTTCGGAGAGCCTAACTAATGATAGTTTCCCAGCCATGGATGAGACTGAAACTCAAGAACAAGAACCGGAAGCCTGTAACTTCAGTTTCAACAAAACAGATAAAGAAAGTGTAGAACAAAATAAAGAACCTGAATCTGAGCAACAACAGTCAAAACCAAATGAACATCTTGAAGTTCAAGTTAAAGTAGAAGCAAACAATGAACCACTCATTAAAACTGAAAAAGACCTTATAGCTCCTACATTCAAACTGGCTACTGATAAAGATCAAAAGAATGTTTTGGACTTAAAAACGGATACCAATGTGACTAAAAATCCTGAAAATGTGGACATGAGACCACTATCAATGCAGAATTCAGTATTCCCCAAAACAGAATTAATAATTCCTAAAGTGACACCAATGACAGTTGAAGCCATGGAAAAGATAAAAATCAAGGAAGAAATAGACCCTGAAGATCAAACTCTTAATCTACACAAGGATGAGTATCAGAAAGACCCGTTATCTCACAGTTACCCAGGACATCCTATGACTCAAGCCAATAAACCTCTAAATCTAGAAAATACTAACTTTTTTGTGAAAGATAGTCACATATACAATCCTAAACTTGGGCATGGTATTAAAATTGAAGGTGTCCCAAATATTTTCAACCCAATGAATATAACAAAAGAAAGTACAATAATCAGGAATTCAAAAGAGTATTCTGCAAGTATTCCTCCCTTTCCCTATTCATCAAATCTTAGTTTTGGCAGTGATCCTGGAAAACACAATCCTCATGGCATCAATCCATTGAAAACTGTCATAAAATTGGAACCAAGAGATGAAACAAGTGAATTAAAGAGTCAAAACGCCCCTGAAATTTTCACTGCTACAATATCAGCAGGAAA
TAAGGTTGATTCTCCAAACACACCAAGATTGGATTCCTTACAAAGGATTAGTCCTTCACCATCAAATGTACATGGTGCATCACCACCACCCATGGAACATCCTACTGCCATAGGAAATACTGAACCCATATCACCTGCAAACTCACAAGAAAAAGAATCAGATACCGATGATAAGCAACCAACAGACTTGAAAACTCAACATATATCAAAACCTGAAAATCAAAGAGACAGCTTGAGGCCACCAGCTTTTGCTCATCCAATAAGACCTGACAATTTGGGACTCAGAACAACTGAAGCTACCATTGTGTCTGTTGCCGGACAGAACATGCCTCCTCCACCTTTAAGTCAAGCTTCCATATCAGGATTATTGCACCCAGGTCCTTTGATAACTGTAGGTAGTGGTGCAAATGTTGGTCCGTATGGATTTATGGCAACGTCTCTCTATGGTCATCCAGGGCATCCTTCATTAGAAAAACCAGGTTCAATGCCACCTCTTATGCAACAAATTCCGCCTTCACATGGTCATCCTAGCAGTAATTCTATACAACAACAATCTAATCAAAATGATCCATCTATGCCACAAGATTTGAAAATTAAACAAGAAGTACCTGACAATATACCTGCCAATTTATCAACACACCCTTCAGATCCGTTGCAATCTCTGAAAGAAGTCAAAGTTCCAGGCTATCCTATAGGAAGTGCAATTGCTCAACATTTGAATACGGAACGAGATCGAGAATCTGTTTCTAGTGTTGAAAACAGTAGTCGACCACCAAGTCAACCGACAAATGAAAGTACAAATATGCCCAGTGGATTCCTCGGTCCTCGAATTGAAAGTATAAAAAAGGAACCGGAATTTTTACATCAACCCCACATTACTCCAGTATCTACCTCCCATGGACCACCTGATTCTATAAACACTATAACTCCAGTAAAAAGTCCACATACACCTACACCTATCAAGAGTCAACCAAGTCATAATGGTACACCTCATGGCTCTCATCGATCCACAACTTCACCATTTTCGAGACATCTGACAAGTCCATCACAACCAAGGCAAATATCTGCTTCTCCAGTTCAACCTCATACTCCAGTTTCTCATTCTGCCTTAAACTTGATGAATCCTACGCCTATTTCGATAGCAGCAACGATACCGGGACCTGTTATGCACTCTGGACAACCTGGTCACCCACCTCCACATCCGTTTGCGTCACCTCTGCATCACCCCCCTCACCCTTTGCTTCATCATCCCTCAATATTTCAACTATCTGCAGCAGCAGCGGCACACGCTATGCATCCTTACTATCCACATCCACACCCTGGATATTCAATGCCTTATCCTTATCCCTACGGACCTCTGCCACAACCCCACCCGATCCCTCCTATGCACCCTGCAGTCAGCACGGCTGGGCGTCACGATCCAGTAAAACCTTCAACAATAGAATCAACAACAATGCTTAGTTCTCATCACAGTACCAGTTCTTCAGTAACCACAAGATCTCTCCGGGAGATATCGGAAAGTAGTGAAGACCCGAGAAACCCGAATGCCACTACTGAAAGGCAGCTGCACGAAACGACAATGACCCATCATCATTCTACAAGCCACCACAGTGCAGTTCATACAAGTACAGAGAAGCAACCAAGTCACGCAGGAGGAGGCACTAATCATACGTTATCAATATCACATTCGACTTCGAGTAGCTCCTCGCAGTCAATACAACATAAAATTAACACTCAACAAAAATCTGCGGGACATTCAAGCTCACCTCACCATTTATCAGCAAGTGTTTCTCAGACCACTAGTTCATCATCAAGCGTGAATGTCACCAACAACCATACGCACCACCACTCCCACCATCTTGCGCATCATCCGGAAAGGCTATCTCCTGCGGACTCTATGCTGCTCCGACATCATCCTAAAATGCTACCCGGAAATCCAAGCCATCTCATGATTCCACCACCATCAATGGGACATCCAATGGGCTTGGGGCTTCCACCAG
GGCCAGGTCCCAGTTCGATAGAAAGTTTACGATTACATGCCCAAGCTGCAGCAGGACTGCCTCCGACTCACCAGAGATCTGGATCACCTCATCAAATGCCACACGGTCATCCTCATCTAAGAGGTCCGCCGCGTCAGATTCCCGATGAAAACCCTGAACTCAAACTTGAGACACAATCACAACCGGAAGAAGAAGAAATTCCAAGTCCGGCTCACATTCCACACGGACCTAGCCCAGAGCCAAAAATAGAAGATACCGAATGTCACAGATCACAGTCTGCTATATTCCTCAGACATTGGAATCGCGGTGACTATAATTCTTGTGCGCGAACTGATCTTACATTCAAACCGGTACCTGAATCAAAACTTGCTCGAAAACGAGAGGAAAGATTGAGGAAACAAGCGGAACGTGATAGAGAAGAAAGGGAGAAGATAGCGCAGCAAGCACATAGAAAAATAGCGACGCCGGAGAAGCCGGACACGAAACCACCGTCACGCGGTGCCATAGAGACGATATCATCGCCATACGACCGTTTCCCTAGACCTCCAGGCTACCCCGACACGCCTGCGTTGCGTCAGTTATCTGAGTATGCCCGACCTCACGCCGGCTTTAGCCCGGGCAATCTGCCTCGTCACTGTATGGACCAGATGTTGCAGTATCAACTGAGCTCAATGTACGGCGCACCCGGGGCCCGTGAAAGATTAGAACTCGAACACCTCGAAAGAGAGAAACGTGATAGGGAAATTCGAGAATTACGCGAACGCGAGCTCAATGATCGGTTGAAGGAAGAACTACTCAAGAACAACGTAGGACCTCGAGCACTCGATCCTCACTGGCTCGAGATGCACCGGCGGTACGGAATGCCGCCACCACCGCCCCAGGGTGCAATCCCAGTGCAGTTCGGCCTATATCCTGGCGGACACGCACCCGGGGCGCTCTCTCAACTAGAACGCGAGCGACTCGAGCGCCTCGGCATCCCTCCGTCGGGTCCCGGTCCCACAGGACCCGGTGGCGGTCCACACCACGGCCATCACCCGCACCCCGTCGCCGCGGCTCAACTCGAAGCAGCCGAACGTCTCGCTCTAGCCGCTGACCCGATGGTGCGATTGCAGATGGCCGGGATAAATCCAGAGTATCACGCGCACACGCACGCGCACACTCATGCACACTCTCACACGCACTTGCATTTACATCCGGGACAACAAGCGGCGGCGGCACAGCAGGAAGCGCTCGGCCTTGGACCGTATCGGCCGCTACCTCACCCCGACCTGTTAGGCAGGCCGTATGCTGAGCAGTTAGCGCAGCAGGCGGCGGCACACGAGCAGTTGCAGCGTCAACTGTTACTAGACCGGGAGCGGGGCTTCCTTCACCCCGCGCACCACGAAGACTTCCTGCGGCAGCAGCGCGAGCGCGAGCTCAAGGTACGCGCTCTGGAGGAGGCGGCGCGGGCCTCTCGCCCTTAG
Protein
MAQNQGEVPVGTPSQPVKDKDIYACLPEMRIDGPLGPDDPCPGGEDLRWLPAQATDRDLVMYLRAARSMAAFAGMCDGGSPDDGCVAASRDDTTINALDVLHDSGYDPGRALQALVKCPVPKGIEKKWSEDETKRFVKGIRQFGKNFFKIRKDLLPHKDTAELVEFYYLWKKTPGASSNRPHRRRRQASLRRVRNTRNSRAGTPKEQTPEVVPALPETNGSRPSPNPKEAGEMSSVTEDEISEDDSDSRDAGDLAKSENGRIVENPDDSPSRMRTRNKSKEQTTPNGKKAPEESDIEQKAKPKPKAIVQSNNNNIPDKVISSPVSKEVKKKVTNGKVDASKVKKRLPDDTKPDGIMDGDVQMKKKRADPPESPSESLTNDSFPAMDETETQEQEPEACNFSFNKTDKESVEQNKEPESEQQQSKPNEHLEVQVKVEANNEPLIKTEKDLIAPTFKLATDKDQKNVLDLKTDTNVTKNPENVDMRPLSMQNSVFPKTELIIPKVTPMTVEAMEKIKIKEEIDPEDQTLNLHKDEYQKDPLSHSYPGHPMTQANKPLNLENTNFFVKDSHIYNPKLGHGIKIEGVPNIFNPMNITKESTIIRNSKEYSASIPPFPYSSNLSFGSDPGKHNPHGINPLKTVIKLEPRDETSELKSQNAPEIFTATISAGNKVDSPNTPRLDSLQRISPSPSNVHGASPPPMEHPTAIGNTEPISPANSQEKESDTDDKQPTDLKTQHISKPENQRDSLRPPAFAHPIRPDNLGLRTTEATIVSVAGQNMPPPPLSQASISGLLHPGPLITVGSGANVGPYGFMATSLYGHPGHPSLEKPGSMPPLMQQIPPSHGHPSSNSIQQQSNQNDPSMPQDLKIKQEVPDNIPANLSTHPSDPLQSLKEVKVPGYPIGSAIAQHLNTERDRESVSSVENSSRPPSQPTNESTNMPSGFLGPRIESIKKEPEFLHQPHITPVSTSHGPPDSINTITPVKSPHTPTPIKSQPSHNGTPHGSHRSTTSPFSRHLTSPSQPRQISASPVQPHTPVSHSALNLMNPTPISIAATIPGPVMHSGQPGHPPPHPFASPLHHPPHPLLHHPSIFQLSAAAAAHAMHPYYPHPHPGYSMPYPYPYGPLPQPHPIPPMHPAVSTAGRHDPVKPSTIESTTMLSSHHSTSSSVTTRSLREISESSEDPRNPNATTERQLHETTMTHHHSTSHHSAVHTSTEKQPSHAGGGTNHTLSISHSTSSSSSQSIQHKINTQQKSAGHSSSPHHLSASVSQTTSSSSSVNVTNNHTHHHSHHLAHHPERLSPADSMLLRHHPKMLPGNPSHLMIPPPSMGHPMGLGLPPGPGPSSIESLRLHAQAAAGLPPTHQRSGSPHQMPHGHPHLRGPPRQIPDENPELKLETQSQPEEEEIPSPAHIPHGPSPEPKIEDTECHRSQSAIFLRHWNRGDYNSCARTDLTFKPVPESKLARKREERLRKQAERDREEREKIAQQAHRKIATPEKPDTKPPSRGAIETISSPYDRFPRPPGYPDTPALRQLSEYARPHAGFSPGNLPRHCMDQMLQYQLSSMYGAPGARERLELEHLEREKRDREIRELRERELNDRLKEELLKNNVGPRALDPHWLEMHRRYGMPPPPPQGAIPVQFGLYPGGHAPGALSQLERERLERLGIPPSGPGPTGPGGGPHHGHHPHPVAAAQLEAAERLALAADPMVRLQMAGINPEYHAHTHAHTHAHSHTHLHLHPGQQAAAAQQEALGLGPYRPLPHPDLLGRPYAEQLAQQAAAHEQLQRQLLLDRERGFLHPAHHEDFLRQQRERELKVRALEEAARASRP
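
The CDS and Protein entries above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and reading frame 0; the function name `translate` is illustrative and not part of SGID. It joins whitespace first because the CDS is wrapped over several lines in this record. If the record is consistent, a 1820-residue protein corresponds to a CDS of 1820 × 3 + 3 = 5463 nucleotides including the stop codon.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace (including line breaks) is removed before translation;
    codons containing non-ACGT characters are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 12 codons of the CDS above.
print(translate("ATGGCTCAAAATCAAGGAGAAGTTCCAGTAGGTACT"))  # MAQNQGEVPVGT
# Last 15 nucleotides of the CDS above (four codons plus the TAG stop).
print(translate("GCCTCTCGCCCTTAG"))  # ASRP
```

Both fragments agree with the start (`MAQNQGEVPVGT`) and end (`ASRP`) of the Protein entry. Passing the full CDS text to `translate` and comparing the result with the full protein string would extend the same check to the whole record.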

Summary

Pfam
PF03154   Atrophin-1
Interpro
IPR009057   Homeobox-like_sf
IPR017884   SANT_dom
IPR002951   Atrophin-like
IPR001005   SANT/Myb
IPR000949   ELM2_dom
SUPFAM
SSF46689   SSF46689       
ProteinModelPortal
PDB
2YQK   E-value=1.14099e-19, Score=244

Ontologies

Topology

Subcellular location
Nucleus  
Length:
1820
Number of predicted TMHs:
0
Exp number of AAs in TMHs:
0.00039
Exp number, first 60 AAs:
0
Total prob of N-in:
0.00001
outside
1  -  1820
 
 

Population Genetic Test Statistics

Pi
200.223886
Theta
181.267931
Tajima's D
0.488828
CLR
0.326155
CSRT
0.512174391280436
Interpretation
Uncertain
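
For reference, Tajima's D compares two estimates of diversity: Pi (mean pairwise differences) and a Theta estimate based on the number of segregating sites. The sketch below uses the standard Tajima (1989) formula and assumes that the Theta reported above is Watterson's estimator; the record does not say how SGID computed these values, and it gives neither the sample size nor the number of segregating sites, so the reported D cannot be reproduced from Pi and Theta alone. The function name and the example inputs are hypothetical.

```python
import math


def tajimas_d(n: int, s: int, pi: float) -> float:
    """Tajima's D from sample size n, segregating sites s, and
    mean pairwise differences pi (standard Tajima 1989 formula)."""
    if n < 2:
        raise ValueError("need at least two sequences")
    if s == 0:
        return float("nan")  # D is undefined without segregating sites

    a1 = sum(1.0 / i for i in range(1, n))
    a2 = sum(1.0 / (i * i) for i in range(1, n))
    b1 = (n + 1) / (3.0 * (n - 1))
    b2 = 2.0 * (n * n + n + 3) / (9.0 * n * (n - 1))
    c1 = b1 - 1.0 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / (a1 * a1)
    e1 = c1 / a1
    e2 = c2 / (a1 * a1 + a2)

    theta_w = s / a1  # Watterson's estimator
    return (pi - theta_w) / math.sqrt(e1 * s + e2 * s * (s - 1))


# Hypothetical inputs, not taken from this record.
print(round(tajimas_d(n=10, s=16, pi=3.888889), 3))  # -1.446
```

Under this formula D is positive whenever Pi exceeds the Watterson estimate, which matches the sign of the values listed above (Pi 200.223886 > Theta 181.267931, D = 0.488828).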
Copyright © 2018-2023. Send comments and suggestions to: zhuzl@cqu.edu.cn