SGID Silkworm Genome Informatics Database
Gene
KWMTBOMO00016
Annotation
Transposon_Ty3-G_Gag-Pol_polyprotein_[Papilio_machaon]
Location in the cell
Nuclear (Reliability: 3.822)

Sequence

CDS
ATGGAGGACGACACTAGCCGAGGGGAAGGGTCTAGTGCTCGTCTAGTAAAAGGAAGTGGCTTAATGATGAAAGGAGCCCATGTGCTTAGTCGAGGGGAAGGACTAACACATGGGAATGACCAAGGAGCATCCGAGGCTAAGGATGTAGGTCAACAGGAAAGGATTAGGAGGATGAGCTCGAGGATAGAGCATGGAGCATCCGAGGTTGAGGATGTAGGTCGACAGGAAAGGATTAGGAGGATGAGCTCGAGGATAGAGCATGGAGCATCCGAGGTTGAGGATGTAGGTCGACAGGAAAGAATTAGGAGGAGGAGCTCGAGCATAGAGCAATCGGTACCATTGGATAAAATGGACAAGGACGCTATGATTCGAAGAGAACGGTTGCGTGGAAAGCGATCGTGTGAACGACGATCGCGTAAGCAGCGGTCGAGCAGGCAACAATCAAGCGGCTCTGATGACAGGGACGAGTACGAAAGAAGGAGATGGAAAACAAGGACAAGAACAGGACAGAGGAGATGGCATGCGTCTTCCTCTTCAGGATCTGATATGGGGTCACCGCAGCTTCCGCGCAAGCTTAAGCGTGGTAAGGAGGACGATGGTTCAGATACAAAGGTATTAGTAGATAAATTCTTGAATATTCTTGACAAAATAAAAGGTTCAGATAAGCCCAAACTTACGTTTAACACAAATGTTATCCCCGAATTTGACCCAATGGCGAAGGAACAAACTATTCTTACTTGGCTGACGAAAGTGGAAGAATGTGCTCACATATATGGGTGGGAAGACAGGGAAATTATCCACTATTCCTTGCCCAAGCTTTCTGGTGTCGCTAAAACTTGGTATCAGGGCTTATCTAGCCTATTATTTACATGGACTGAGTGGAAACAAAAACTTATAGAATCTTTTCCCCAAAGCGAGGACTATGCTGAGTTATTGCATAAAATGTTAGCAAAAACAGTTAGATACGGTGAGTCGCTTGAACATTACTATTATGCGAAAATTAACTTGCTAAATCGTTGCAAAATTTATGGCAAGGAAGCGGTAGACTGTTTGCTGTACGGGGTTGAAGATCGTGCCGTTAAAGTGGGTGCTCAAGCCGCTCAATTTAAGCAACCTGAGCAAGTGCTCAAATACTTTAGAACCGTGAAGGTAGGCAAAGTTCGTGACTCCAGTAACGATGCCAACCTGAAAAAAACAAATCAATCTGGCACGAGCAGGGCGGGCCTTAGTAATAACGGAATCCGTTGCTTCAATTGCAACGAAATAGGTCACCCTAGTTTTAAATGTACAAAACCTATCGTAAAGTGTACGACCTGCGGGAGGATTGGTCATCAGGCGATTAATTGTTTTAAAAATAAAACTCCGATTAATAAAGACGATAGTCAGGCCTTAGATAAAAATAATGTACAAAAACAAGTATCTAATGTAACTGTCGATGATCGCGCTATTGATAAATATATACTGGATATAAAGATTAACAACTCAAGGGTCAAGTGTCATGTAGATTTAGGGAGTCAATGCTCGTTAATTAGACAGAGTACCGCTAAGGAGTTGAACCTGAATATCGATGTGAGGGAAGGTATGCCGGTATTAAGAGGAATAGGAGGAAACCTGACTTGTCCTTTAGGTGTAGCTACCGTGAGTGTAGAAGTTCAAGGGTTAAAAGAAACTATTGAAATATATGTAGTAGAAGATTATGTACTAAGTTACTCAGCATTACTTGGACATTCCTTTACAGAGAAGCCTGATATTATAATGACTAAGACTCCGACTAAGATCGTATTTGAAAGACTCGCGTGTACTAAAATAAGACTCATAGTGGAAGATGACACGGAAATAGATGGTAACGTTGTACGTCCCATAATTGTTAATGCTGATGGGCAGGGAAACGGTAGCATCTACGTCCATGGCTCGTTAAGGGGTGCTGAAGGAAAAGAGTACTACCTTTTTCCCGGTGCGTACGAGATAAAAGACAGTCGCACAGCGCTTTTAATTTATAA
TGCCTCGTCAAACAAGGTTTGTATAAAGAAGGACACTCTCTTAACACGAGCACTGCGTAATACGCATCCAACACCTTTTTTCCAATCTTGTAGCGTTTTTACCGATAGGGGACTGGATGAAGAGATCAGTTGCAATCCTGAGCTAACAAAACAGCAACGCGAAGAACTACAAAAGTTATTACTTCAGTATAGCGATTGCTTCTCAAGTGGTCTCAAGGACCTTGGGTTTACCAACATCACTGAGATGGTGATCGACCTTAACGACACGGAACCAGTCGTGTATCGCCCATACCGTATGTCACATACTGAGCGACAATTGGTAAGAGACATGGTTAAGGAGATGGTAGATAATGGTATAGCAAGGGAGTCATCTTCATCATATGCCAGTCCTATAGTCCTGGTACAAAAAAAGAATGGTGAGAAACGATTGTGTGTGGACTATCGAGCTCTAAATAGAAAGACCAAAAAGGAACACTACCCCCTACCAAGAGTCGAGGATCAGTTGGACCTATTATCCGGTAACACCCTATTTACGTCATTGGACCTTGCTTCCGGGTACTACCAAATACCAGTCGCCGAGGCCTCCCGTAGTAAAACTGCTTTTGTAACACCTGATGGGCAGTATGAATACAACCGCATGCCTTTCGGACTAGTTAATGCACCATCTGTGTTCCAAAGGACAATCCATAAAATATTAAAGGAGGCCCAAATTAAATATGCTGTTGTCTACATGGATGACATACTGATACCATCCAAGGATTTTAGAGAAGGCATCCAAAGACTAGGCGAAGTTCTACAGCTTCTGAAGAAAGGCGGACTCACACTGAAATTAAGTAAGTGCCATTTCTTTTTGAATGTCATAGACTTTCTAGGGTTTGAAGTGAGTGCTAATGGTATCCGCCCAGGTAGTAAAAAAACTAAAGCCGTCTCAAAATTCCCAACACCTACCAATCAGCACGAATTGCGCCAATTTCTGGGTCTGTCTGGGTTCTTCAGACGTTTTATAAAGGGATATGCTACGATAACTGCACCGTTAACGGATTTACTGAAGAAGGACGCTAACTGGAGTTGGAAACCCGAACAGGATCAGGCGTTCACAGGCACTAAAAGCGCTCTGGTGGAGCGACCGGTGCTAGCTCTATACGACCCTAAGTCCGAAACCGAGCTACACACGGACGCTTGCAAAGAAGGCCTAGCGGGTATTTTATTGCAGCGTAACTGTAATGGTGCATTGCAACCAGTTTCCTATTTCAGCAGGAAAACTACCGCTGACGAACGCAAGTTTCATTCGTACGAGCTCGAAACTTTAGCGGTCATTGCATCTCTCAATCGGTTTAGGGTTTATTTAGTGGGAATACCATTTAAAATTTTGACTGATTGCAATGCACTCAGGTCCACTCTAACTAAACGTGACTTGATACCTCGCATAGCTAGATGGTGGGTCCAACTTCAGGAGTACGACTGTACAATCGAGTACCGGCCTGGTGTTAGCATGGCACACGCGGATGCCTTAAGCAGAAACCCAGTAAATTCTTCTTCAGGTAATACACACGTACTAGATGTTTTACAAATACAGCAAATCGAACAGGACTGGATTGCGACGGTACAAACTGCAGATGACGAGATAAATAAAATCAAAAGGATGCTCTCTGACCCTTCATCGGCGGAGGTAGCGGACATACACAAAAACTACAGACTGAAGAATGACCGTGTTTACCGCGTGGTAGGTGACGAAATAAAATGGCTGGTACCCAAAAGTGTACGCTGGCAAATTTTAAAAATGAACCACGATGACGTTGGCCATTGTGGTTTTGAAAGAACCCTACAGCGCATCCGAGGCCATTATTGGTTCGCGAAAATGCGGCGTTTCGTCCGGAAGTATGTTACCTCCTGCCTTGAGTGCGCGTACCACAAAGCTCCAGGTGGTAAACGAGAAGGAGAGCTCCACCCAATCGAAAAGGTGAGCGTGCCGTTCCACACAGTCCACGCGGACCATTTGG
GACCGTTTGTTAAAAGCAAGAAAGGCAATTGTTATTTGTTCGTTCTCGTTGACGGGTTTACAAAATTCGTCAATATAACCCCGGTTAGGAACACCAAATCCAGTACGACCGTGCGAGTACTAAAAGACCATGTAAGTTATTTTGGCGTTCCTACTCGTCTCATTACAGATAAAGGTACGAGTTTCACCAGTAAGGCGTTTCAAGATTTTACGACCTCTTACGGGATCAAACATATTGTAAATGCCGTTGCAACGCCTCGAGCAAACGGGCAAGTAGAAAGGTTTAATCGAACGATTCTAGATGCTCTATCGACAGCCAGTCACGGCGGCGACGAAAAGTCGTGGGACAACCACATCACTGATATACAGGTTGGTATGAACACAGCTCGACATAAAACTACCCAAAAGAGCCCCTCTGAGCTACTATTTGGATTTAACATAATTAGTAGAACGGAAGGCATACTTAGTACGGTAATCAATGACACCCTAAATAAGACTCCCATTGAGGAATTGGCGGAAGTCAGACAACAGGCTAGTAAAAAAATAAAACGGCAACAAGTTAAAGATGCAGTCAATTTTAATAAGCATAGGAAACCAGCGAGGCAGTACCAAGAAGGGGATTTAGTACGCTTAGCAAGGCAGGTTCCGCACGATGGAAAATCTCAAAAGCTCTCTGTTAAGTACCAAGGCCCTTATCGTATACTAAAGGTTCTTCCCAATGATAGGTTCGTGGTAGAAGATACTCCAATGACTCGAAAAAAAGGTAGAAGGTATGAAGCAGTTGTATCTATTGATAAGATTCAGCCTTGGATGTCGTTCTGTAGAGAATTGGAGTCTGATGAATCTAGTAATAGTGTCGATGACAGTGATTAG
Protein
MEDDTSRGEGSSARLVKGSGLMMKGAHVLSRGEGLTHGNDQGASEAKDVGQQERIRRMSSRIEHGASEVEDVGRQERIRRMSSRIEHGASEVEDVGRQERIRRRSSSIEQSVPLDKMDKDAMIRRERLRGKRSCERRSRKQRSSRQQSSGSDDRDEYERRRWKTRTRTGQRRWHASSSSGSDMGSPQLPRKLKRGKEDDGSDTKVLVDKFLNILDKIKGSDKPKLTFNTNVIPEFDPMAKEQTILTWLTKVEECAHIYGWEDREIIHYSLPKLSGVAKTWYQGLSSLLFTWTEWKQKLIESFPQSEDYAELLHKMLAKTVRYGESLEHYYYAKINLLNRCKIYGKEAVDCLLYGVEDRAVKVGAQAAQFKQPEQVLKYFRTVKVGKVRDSSNDANLKKTNQSGTSRAGLSNNGIRCFNCNEIGHPSFKCTKPIVKCTTCGRIGHQAINCFKNKTPINKDDSQALDKNNVQKQVSNVTVDDRAIDKYILDIKINNSRVKCHVDLGSQCSLIRQSTAKELNLNIDVREGMPVLRGIGGNLTCPLGVATVSVEVQGLKETIEIYVVEDYVLSYSALLGHSFTEKPDIIMTKTPTKIVFERLACTKIRLIVEDDTEIDGNVVRPIIVNADGQGNGSIYVHGSLRGAEGKEYYLFPGAYEIKDSRTALLIYNASSNKVCIKKDTLLTRALRNTHPTPFFQSCSVFTDRGLDEEISCNPELTKQQREELQKLLLQYSDCFSSGLKDLGFTNITEMVIDLNDTEPVVYRPYRMSHTERQLVRDMVKEMVDNGIARESSSSYASPIVLVQKKNGEKRLCVDYRALNRKTKKEHYPLPRVEDQLDLLSGNTLFTSLDLASGYYQIPVAEASRSKTAFVTPDGQYEYNRMPFGLVNAPSVFQRTIHKILKEAQIKYAVVYMDDILIPSKDFREGIQRLGEVLQLLKKGGLTLKLSKCHFFLNVIDFLGFEVSANGIRPGSKKTKAVSKFPTPTNQHELRQFLGLSGFFRRFIKGYATITAPLTDLLKKDANWSWKPEQDQAFTGTKSALVERPVLALYDPKSETELHTDACKEGLAGILLQRNCNGALQPVSYFSRKTTADERKFHSYELETLAVIASLNRFRVYLVGIPFKILTDCNALRSTLTKRDLIPRIARWWVQLQEYDCTIEYRPGVSMAHADALSRNPVNSSSGNTHVLDVLQIQQIEQDWIATVQTADDEINKIKRMLSDPSSAEVADIHKNYRLKNDRVYRVVGDEIKWLVPKSVRWQILKMNHDDVGHCGFERTLQRIRGHYWFAKMRRFVRKYVTSCLECAYHKAPGGKREGELHPIEKVSVPFHTVHADHLGPFVKSKKGNCYLFVLVDGFTKFVNITPVRNTKSSTTVRVLKDHVSYFGVPTRLITDKGTSFTSKAFQDFTTSYGIKHIVNAVATPRANGQVERFNRTILDALSTASHGGDEKSWDNHITDIQVGMNTARHKTTQKSPSELLFGFNIISRTEGILSTVINDTLNKTPIEELAEVRQQASKKIKRQQVKDAVNFNKHRKPARQYQEGDLVRLARQVPHDGKSQKLSVKYQGPYRILKVLPNDRFVVEDTPMTRKKGRRYEAVVSIDKIQPWMSFCRELESDESSNSVDDSD
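The Protein entry above should be the conceptual translation of the CDS entry. The following is a minimal sketch for checking that relationship, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene. The names `CODON_TABLE` and `translate` are illustrative and are not part of SGID. Because the CDS is displayed across several lines in this record, the function strips whitespace before translating.

```python
from itertools import product

# Standard genetic code (assumed here), written in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # join wrapped lines, normalize case
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the CDS shown in this record.
print(translate("ATGGAGGACGACACTAGC"))  # MEDDTS
```

The six residues printed match the start of the Protein entry (MEDDTS...). Passing the full CDS, with its displayed lines concatenated, gives a string that can be compared against the whole protein sequence.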

Summary

Pfam
PF17921   Integrase_H2C2
PF17919   RT_RNaseH_2
PF00078   RVT_1
PF00098   zf-CCHC
PF00665   rve
PF14392   zf-CCHC_4
PF17917   RT_RNaseH
PF03732   Retrotrans_gag
PF16087   DUF4817
Interpro
IPR001584   Integrase_cat-core
IPR041577   RT_RNaseH_2       
IPR036875   Znf_CCHC_sf       
IPR012337   RNaseH-like_sf       
IPR001878   Znf_CCHC       
IPR021109   Peptidase_aspartic_dom_sf       
IPR041588   Integrase_H2C2       
IPR036397   RNaseH_sf       
IPR000477   RT_dom       
IPR034122   Retropepsin-like_bacterial       
IPR001995   Peptidase_A2_cat       
IPR025836   Zn_knuckle_CX2CX4HX4C       
IPR001969   Aspartic_peptidase_AS       
IPR041373   RT_RNaseH       
IPR005162   Retrotrans_gag_dom       
IPR032135   DUF4817       
SUPFAM
SSF50630   SSF50630
SSF53098   SSF53098       
SSF57756   SSF57756       
Gene3D
PDB
4OL8   E-value=3.58895e-73, Score=705

Ontologies

Topology

Length: 1623
Number of predicted TMHs: 0
Exp number of AAs in TMHs: 0.00271
Exp number, first 60 AAs: 0.00107
Total prob of N-in: 0.00007
outside: 1-1623

Population Genetic Test Statistics

Pi
3.686452
Theta
16.432924
Tajima's D
-1.202849
CLR
0.008053
CSRT
0.102744862756862
Interpretation
Uncertain