SGID Silkworm Genome Informatics Database
Gene
KWMTBOMO10761  Validated by peptides from experiments
Previous Gene Model
BGIBMGA008399
Annotation
PREDICTED: LOW QUALITY PROTEIN: symplekin [Bombyx mori]
Full name
Symplekin      
Location in the cell
Nuclear (Reliability: 2.796)
 

Sequence

CDS
ATGTCGAATAAACACGGACACACGTCTAATACTACAGTACAAAATCAGCTAATACGATGGATAAACGATACTGGAATGGCGGAAGGAAACAAAAAAGCAGGCTTATTGAGAAAAGTCATAGAAGTTCTGCTACATCAAGGCTCGCAAATGATTCCAGTATACATGGAGAATATACTAAGCTACATCAGCGATAAAAATACAGACGTAAAGAAGCAGGTCGCGTATTTTGTTGAAGAGTTGAGTAAAAGTCACCCTGAACTTCTTCCCAAGATTGTCGCTCAGTTGCGTCTCCTACTCCTTGACCCAGTCATTGCGGTGCAGAAGAGGGCCATCCAAGCAGCCAGTATACTGTACCGTAATACGTTAATGTGGATCTGTAAAGGTGACGCTGAAGTATCGGAAATGAAACATGTATGGGAACATTTAACTGAATTAAAGTTGATGGTTTTAAACATGATCGACAGTGAGAATGAAGGTATCAGAACGCATTCCATTAAGTTCCTGGAAGAGGTTGTGCTCCGGCAAAGTCCCGGTGACATTGTTGATGCTGAATCCAGCTTGGATAGTCTACCATCAGATGTTCCTTTTATCAATAGAAAAGCTCTCGAAGAGGAATCTGACCATATCTTCAAACTACTGGTCAAATTTCATAATTCACAGCACATATCAAGTGTAAACCTGATGGCTTGTATGACAACGCTATGTTCTTTAGCAAAATTCCGTCCTAAATACTTGCCCCGAGTCATACAAGCACTAGGCGATCTTCACACGACACTACCTCCAACTCTATCGCAGTCTCAAGTCAATTCAGTCAGGAAACATCTCAAAATGCAGATATTGAATTTAATAAAACAATCAACTTCTAATGAGATGATGCCGCAGTTGACACAATTGCTAATTGATATCGGAATGACAGCACAAGAGATTAATAAGGCGGTACCGAAAGAGAAAAGAAACAAACGCTTAGGGGAAATTAGGAATGGGGATACACCATCTAAACGGTTCAGAATCGATTCACCTCAAAGTAATAGTCAAGGTAGCGACAGTAACAGCCGTAGTGAATTCATGTATGACGATGACAGCCAGTCAAGCATCACGAAGCCTTCGGCAACAGAAGACAGTATCCTCGAAGGCTTAAATTCTGTGGACAATGTTGTTAATCTTGTCATGAAAACATTGCAGAATTGTCTACCAGATAACATGCCGCCCTCGTTCACAAATGATTACAAGCCAATACCTAACTCTGGTAGCAAAGTACAAAAGAAGAACTTGGCTAAAATGTTGATGAGCTTAATAAGAGGTGAATCAGAAAATACACAAATAGTTACACAATCAATGGAGGTAGATACACAACCTAACCAGACAGTATTACAACAGAGCTTATCCGATATAGCTGCGAAAATACCGTTAATAAGGGAAGATGATGAGAAAACTACCCTCAAAAATGCTGTAGCTAAATTACAAGAGTCAACAAAAGACAGGCAGATAGAGAGTGCGGTTTCTAAACTAATGGAAGAAACGAGACAGGAACATTTGAAGGAGGAAGAGAGGAAAGCAAAGGAGAAAGAGAAACCGGTTCCTCCACCTACTCCGACGGTGCCTAAACTGAAACAGAAAGTGAAGCTATTGAAACTACAGGAAATAACACGGCCCATCCCAAAAGAAATAAAGCAAAAGTTGATACTGCAGGCCGTTGAGAGGATTTTAAATGCGGAAAAAGACGCCGCTATCGGTGGAGCCGAACAGATCCGGGCCAAATTCATCACGATATTCGCTTCGAGTTACACGCCGGAAACGCGCGATCTAGTACTGAATTACATTTTAGACAATCCCGCCGAGCGCATCAACCTCGCTTTGAAGTGGTTATATGAAGAATACGCATTCATGCAGGGCTTTAACCGACATCCGGTGACCCTACAACCGAAACTGCACGAGAAGCACGATCAAAACTACAACCAGTTGCTCTGTGCTCTGGTCACCCAGATATCCGAGAGAGGCGA
CCCTGTCATGGAAGGATGCAAAGATGTTTTACTAAGGAAGATATATTTGGAGAGTCCGGTGATCACGGACGAGGCCCTGGATTATTTGAAGCACTTGGTCACTGAAGATCAGGTGGCTCTGCTGGCTTTGGAGTTGCTAGAGGAATTGTGTTTGATGAGACCACCTCGGGCACACAAATATGTGACATCGTTGGTCTGTCATGTTTTGAGCGAAAATGAGGAAATCCGCGATATAGCTTTGAAAGCTACCATCAAAGTTTACAAACGTGGCTCAGAAGCTGTTAAGAAAGTCATCGAGAAACACGCTCTCCTGTACTTAGGGTTCATATCGTTGAATCATCCGCCCCCTGAACTTTCAAGCACCAGAACTAGGATCATGTGGAATGAGAATTTACATAAGCTGTGCTTGAATTTAGTTGTGGCGCTCTTCCCAGAAAAAGAAGGACTACTGATGGAAATAGCAAGGATATACGGCAGCTCCGGTGCAGAGGCGAAGCGCGTTGTACTGCGAACCATCGAGGGTCCGATCCGGGCGATGGTGTCCAAGGACCTGCCCGACAACGAGTTCGACGAGGACGACGCCACGCTGCGCCCCGCCTTCAGGGAACTGCTCGAGAACTGTCCCCGAGGAGCAGAGACTCTGCTCACCAGACTAGTGCATATACTGACCGAGAAAGTGCCTCCAACAGCCGAACTAGTGTCGAAAGTACGAGACATATACGCCACCAGAGTGTCCGACGTTAGATTCTTGATCCCCGTGCTCACCGGACTGTCAAAGAGAGAGATCGTGGCAGCCCTACCGAAACTAATCAAGCTCAACCCTGCCGTCGTGAAGGAGGTGTTCAACAAGCTGTTGGGAATCAACGGCACGTACGACGAGCAGTCGCCCGGTAGGTGGCGCCACCGACGCGGGTCTGAGAGAGAGAGAGAAAGAGAGAGCATACTTTATTGCTCACCAAAACACTATTAA
Protein
MSNKHGHTSNTTVQNQLIRWINDTGMAEGNKKAGLLRKVIEVLLHQGSQMIPVYMENILSYISDKNTDVKKQVAYFVEELSKSHPELLPKIVAQLRLLLLDPVIAVQKRAIQAASILYRNTLMWICKGDAEVSEMKHVWEHLTELKLMVLNMIDSENEGIRTHSIKFLEEVVLRQSPGDIVDAESSLDSLPSDVPFINRKALEEESDHIFKLLVKFHNSQHISSVNLMACMTTLCSLAKFRPKYLPRVIQALGDLHTTLPPTLSQSQVNSVRKHLKMQILNLIKQSTSNEMMPQLTQLLIDIGMTAQEINKAVPKEKRNKRLGEIRNGDTPSKRFRIDSPQSNSQGSDSNSRSEFMYDDDSQSSITKPSATEDSILEGLNSVDNVVNLVMKTLQNCLPDNMPPSFTNDYKPIPNSGSKVQKKNLAKMLMSLIRGESENTQIVTQSMEVDTQPNQTVLQQSLSDIAAKIPLIREDDEKTTLKNAVAKLQESTKDRQIESAVSKLMEETRQEHLKEEERKAKEKEKPVPPPTPTVPKLKQKVKLLKLQEITRPIPKEIKQKLILQAVERILNAEKDAAIGGAEQIRAKFITIFASSYTPETRDLVLNYILDNPAERINLALKWLYEEYAFMQGFNRHPVTLQPKLHEKHDQNYNQLLCALVTQISERGDPVMEGCKDVLLRKIYLESPVITDEALDYLKHLVTEDQVALLALELLEELCLMRPPRAHKYVTSLVCHVLSENEEIRDIALKATIKVYKRGSEAVKKVIEKHALLYLGFISLNHPPPELSSTRTRIMWNENLHKLCLNLVVALFPEKEGLLMEIARIYGSSGAEAKRVVLRTIEGPIRAMVSKDLPDNEFDEDDATLRPAFRELLENCPRGAETLLTRLVHILTEKVPPTAELVSKVRDIYATRVSDVRFLIPVLTGLSKREIVAALPKLIKLNPAVVKEVFNKLLGINGTYDEQSPGRWRHRRGSERERERESILYCSPKHY
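
The CDS and Protein entries above are related by translation in reading frame 0. The following is a minimal sketch of that relationship, assuming the standard genetic code (NCBI translation table 1); the record does not say which tool or table SGID used, and the function names `translate` and `expected_cds_length` are hypothetical helpers introduced for illustration only.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: SGID used the standard table; the record does not state this.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so a sequence that was
    wrapped across several lines can be pasted in as-is.
    Codons containing non-ACGT symbols are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


def expected_cds_length(protein_length: int) -> int:
    """Nucleotides needed for a protein of this length plus one stop codon."""
    return 3 * protein_length + 3


# First eight codons of the CDS above.
print(translate("ATGTCGAATAAACACGGACACACG"))  # MSNKHGHT
# Last three codons of the CDS above (the final codon, TAA, is a stop).
print(translate("CACTATTAA"))  # HY
```

As a sanity check, a complete CDS for the 989-residue protein reported under Topology would be expected to span `expected_cds_length(989)` = 2970 nucleotides, stop codon included.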

Summary

Description
Component of a protein complex required for cotranscriptional processing of 3'-ends of polyadenylated and histone pre-mRNA.
Subunit
Interacts with Cpsf73 and Cpsf100 forming a core cleavage factor required for both polyadenylated and histone mRNA processing. Interacts with Slbp and Lsm11.
Similarity
Belongs to the Symplekin family.
Keywords
3D-structure   Coiled coil   Complete proteome   mRNA processing   Nucleus   Reference proteome   Repeat   RNA-binding  
Feature
chain  Symplekin
EMBL
BABH01001574    BABH01001575    BABH01001576    BABH01001577    NWSH01002143    PCG69058.1
ODYU01001878    SOQ38697.1    AGBW02009168    OWR51479.1    KQ460556    KPJ13980.1    GDQN01003839    JAT87215.1    AGBW02009901    OWR49608.1    KQ459606    KPI90948.1    JTDY01002196    KOB71902.1    NEVH01014841    PNF27475.1    DS235387    EEB15463.1    KQ978642    KYN29485.1    GL887655    EGI70515.1    AAZX01016310    JXUM01069354    KQ562548    KXJ75644.1    CH477242    EAT46374.1    KQ976543    KYM80966.1    GDHC01015888    JAQ02741.1    AJWK01005336    AJWK01005337    AJWK01005338    AJWK01005339    AJWK01005340    DS232125    EDS35552.1    GGLE01001393    MBY05519.1    CM000364    EDX11778.1    ATLV01020836    KE525308    KFB45847.1    CM000160    EDW97783.1    CH480821    EDW55010.1    GBXI01010624    JAD03668.1    GFDL01013546    JAV21499.1    CH954181    EDV47943.1    GFDL01014089    JAV20956.1    AE014297    AY118592    AAF51962.2    AAM49961.1    AAAB01008859    EAA08083.5    APCN01000279    CH902617    EDV43780.1    GDIQ01086780    JAN07957.1    CH479179    EDW25129.1    GDIQ01146616    GDIQ01111917    GDIQ01110927    GDIQ01108126    GDIQ01047903    GDIQ01044298    JAL40799.1    GDIQ01178601    JAK73124.1    GDIQ01061791    JAN32946.1    LJIG01022555    KRT79951.1    GDIQ01064943    JAN29794.1    CH964272    EDW84025.1    GDIQ01265369    JAJ86355.1    CM000070    EAL28192.1    GDIQ01006568    JAN88169.1    GDIP01227758    JAI95643.1    CH940650    EDW66978.1    GDIQ01052305    JAN42432.1    GDIP01050734    JAM52981.1    GDIQ01032203    JAN62534.1    GAKP01012583    GAKP01012578    GAKP01012575    GAKP01012572    JAC46369.1    GFAA01001529    JAU01906.1    LRGB01000868    KZS15691.1    GDIQ01213538    JAK38187.1    GL732564    EFX77164.1    GBBM01000426    JAC34992.1    GAMC01010553    JAB96002.1    OUUW01000007    SPP83152.1    GDIP01021908    JAM81807.1    CCAG010012181    JRES01000562    KNC30191.1    GFAC01003193    JAT95995.1    GDIQ01148077    JAL03649.1    GDIP01073839    JAM29876.1    GBHO01016200    GBRD01006947    JAG27404.1    JAG58874.1   
 GDIQ01166845    JAK84880.1    JXJN01025888    GDIP01066776    JAM36939.1    KB320473    ELW71131.1    GFWV01021481    MAA46209.1   
Pfam
PF12295   Symplekin_C
PF11935   DUF3453
PF09797   NatB_MDM20
Interpro
IPR022075   Symplekin_C
IPR021850   Symplekin/Pta1       
IPR011989   ARM-like       
IPR032460   Symplekin/Pta1_N       
IPR016024   ARM-type_fold       
IPR019183   N-acetylTrfase_B_cplx_non-cat       
IPR011990   TPR-like_helical_dom_sf       
IPR013026   TPR-contain_dom       
SUPFAM
SSF48371   SSF48371
SSF48452   SSF48452       
Gene3D
PDB
6NPW     E-value=2.05113e-58,     Score=576

Ontologies

Topology

Subcellular location
Nucleus   Concentrates in the histone locus body.   With evidence from 5 publications.
Length:
989
Number of predicted TMHs:
0
Exp number of AAs in TMHs:
0.01956
Exp number, first 60 AAs:
0.00495
Total prob of N-in:
0.00099
outside
1  -  989
 
 

Population Genetic Test Statistics

Pi
334.86676
Theta
202.631333
Tajima's D
2.101755
CLR
0.146095
CSRT
0.897955102244888
Interpretation
Uncertain
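
For readers unfamiliar with the statistics above, the following is a minimal sketch of how Tajima's D is conventionally computed from a set of aligned sequences, using the standard formula from Tajima (1989). It is not SGID's pipeline: the record does not state the sample size, the genomic window, or how Pi and Theta were scaled, and the function name `tajimas_d` and the toy alignments are hypothetical. Assuming Theta here is Watterson's estimator, D is positive when Pi exceeds Theta, which matches the sign of the values reported for this gene.

```python
from itertools import combinations
from math import sqrt


def tajimas_d(seqs):
    """Return (pi, theta_w, D) for aligned, equal-length sequences.

    pi      : mean number of pairwise differences
    theta_w : Watterson's estimator, S / a1, with S = segregating sites
    D       : Tajima's D = (pi - theta_w) / sqrt(e1*S + e2*S*(S-1))
    """
    n = len(seqs)
    if n < 4:
        raise ValueError("need at least 4 sequences")
    if any(len(s) != len(seqs[0]) for s in seqs):
        raise ValueError("sequences must be aligned to equal length")

    # Mean pairwise differences over all n*(n-1)/2 pairs.
    n_pairs = n * (n - 1) / 2
    pi = sum(
        sum(a != b for a, b in zip(s, t)) for s, t in combinations(seqs, 2)
    ) / n_pairs

    # Segregating sites: alignment columns with more than one allele.
    seg_sites = sum(len(set(column)) > 1 for column in zip(*seqs))
    if seg_sites == 0:
        raise ValueError("no segregating sites; D is undefined")

    # Constants from Tajima (1989).
    a1 = sum(1 / i for i in range(1, n))
    a2 = sum(1 / i ** 2 for i in range(1, n))
    b1 = (n + 1) / (3 * (n - 1))
    b2 = 2 * (n ** 2 + n + 3) / (9 * n * (n - 1))
    c1 = b1 - 1 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1 ** 2
    e1 = c1 / a1
    e2 = c2 / (a1 ** 2 + a2)

    theta_w = seg_sites / a1
    d = (pi - theta_w) / sqrt(e1 * seg_sites + e2 * seg_sites * (seg_sites - 1))
    return pi, theta_w, d


# Toy alignments (made up for illustration, not silkworm data).
balanced = ["AA", "AA", "TT", "TT"]   # intermediate-frequency variants
singleton = ["AA", "AA", "AA", "TT"]  # one rare haplotype
print(tajimas_d(balanced)[2] > 0)   # True
print(tajimas_d(singleton)[2] < 0)  # True
```

An excess of intermediate-frequency variants pushes D above zero, while an excess of rare variants pushes it below zero; a single value cannot separate selection from demography, which is in line with the "Uncertain" interpretation given in the record.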
Copyright © 2018-2023. Comments and suggestions: zhuzl@cqu.edu.cn