SGID Silkworm Genome Informatics Database
Gene
KWMTBOMO10368  Validated by peptides from experiments
Pre Gene Modal
BGIBMGA007052
Annotation
PREDICTED:_transcription_initiation_factor_TFIID_subunit_1_isoform_X1_[Amyelois_transitella]
Full name
Transcription initiation factor TFIID subunit       + More
Transcription initiation factor TFIID subunit 1      
Alternative Name
TAFII250
TBP-associated factor 230 kDa
Transcription initiation factor TFIID 230 kDa subunit
Location in the cell
Nuclear   Reliability : 4.194
 

Sequence

CDS
ATGTGTGATAGTGACGAACAAGGTGACCACAGCCGTTCAGGCTTGGATCTGACCGGGTTCTTGTTTGGTAATATTGACGAAAGGGGTCAGTTAGAAGATGATGGATTGCTAGATGGTGAATCGAAGCGAATGTTATCATCGCTTAGCCGCTTAGGTTTAGGGTCTATGCTCTCTGAAGTCCTGGAGGTTGAGGAACCAGTTAAAGAAGAGGAAGAAAAAGACTATAATGAGAAAAGTCCATCGGCTGTGGACTTTTTTGACATTGAAGATGCAGCAGAGGATATAAAAGTAGAAAATAATGTGATAAAATTTGAAAATACTTCAGATGTACAAACTGTGCATAGCAGCAATGTTCCTGTGATTATTGATTCACAGAGTATTAAAGAGGAATTAAAAGAAGATACTTGGAGGGACACTGATATTGCAGGAGAACAAGAGGTACAGAGTACCAATGATGATGGCTATGAAGGTGATATGGAAGGTGATGGTGGACTTATGCCCCCTCCTTCGACAGTTCCACCCCCCAAACAAGAAAAACCCAAAAAACTAGATACTCCATTGGCTGCAATGCTTCCATCTAAATATGCCAATGTTGATGTCACAGAACTGTTTCCGGATTTCAGACCAGATAGAGTACTTCTTTTCTCTCGACTTTTTGGACCTGGAAAACTGTCCAGCTTACCGCAAATATGGCGCGGTGTTAAAAAACGTAGACGTCGAAAGCGCAGTGGAAGTACAGGCAGTGACACACCACCGATGGTTCCAGACAGTCAGGAACCTCAATATGCAAGTGACGATGAAGAGAAACTTCTCAAGCCTATGGAAGAAAATCAACAGATGTCTAATGACGATAAGGGATTAGGCTCAGGAGACAAAGCACAAAGGCCAACACCGGCCCACTGGCGTTTTGGTCCTGCTCAAGTTTGGTATGATATGTTGAATGTACCAGAGACAGGTGATGGGTTTGACTACGGATTCAAATTAAAGACTCAAGGAACAGATCGTGAAGAAAAATCAAAGGAAGTTGAAGAAGAAATTAGAGACACTGTGGAGAGTGCAGATAGCCCTGATAGTGGCTTTCCCGATGATTCTTTCCTTATGCTGTCACAGTTGCAGTGGGAAGATGATGTTATATGGGATGGAAGTGAAATGAAACATAAGGTGCTTGCCCGTTTGAACAGCAAAAGTAATGCAGCCGGATGGGTTCCTACATCAGTAAGCCGCACTGCACAGCAATTCTCACATCCTAGACCACAGCCACCAACCACATTACAGACTAAAACACCCGCTGCTACAGCCACTTCTGCTAATAACCAACCTGGAGGGAATTCAGCTGATGGTGAAGATAACACTTGGTACAGTATATTCCCAGTTGAAAACGAAGAGTTGGTATACGGTACTTGGGAGGATGAAGTTATATGGGATGCAGAGAACATGTCCAAAATACCGAAACCTAAGATACTTACATTGGATCCTAACGATGAGAACATTATATTGGGTATTCCAGATGATGTCGATCCTTCGAAGATTACTAGAGAGCGAGGTCCAGCTCCTAAAGTAAAGATACCTCATCCTCACGTTAAGAAATCCAAGATCCTGCTTGGAAAAGCTGGTGTAATTAATGTACTTCAAGAAGACGCTCCACCGCCTCCCCCTAAGTCGCCTGACAGAGATCCTTTCAATATATCGAATGATGTATATTACCAGCCAAAATCGCAGAACCCACGTTTGAAAGTGGGCGGCGGTCAGCTGATACAGCACAGCACGCCGGTGGTGGAGCTGCGCGCGCCCTTCATCCAGACGCACATGGGGCCGGCCCGACTGCGCGCGTTCCACCGGCCCGCGCTCAGGAAGCTGACGTACGGGCCGCTCTCTGCCCCCGGACCGCACCCCGTCCAACCCTTGCTGAAGCATATTCGAAAGAAAGCTAAGCAACGTGAAGCAGAAAGACTAGCGTCTGGTGGCGGTGACGTGTTCTTCATGCGGACGCCGGAAGATCTGAGCGGTAGAGACGGTGAAATAATACTGGTCGAGTTCTGCGAAGAGCATCCGCCACTGTTGAGTCAAGTTGGCATGTGCACTAAGATCAAAAATTACTACAAGCGGACAGCTACCAAGGACAACGGTCCCAAGCCACTGAAGTACGGTGAAGTAGCGTACGCTCACACGTCGCCGTTCCTCGGGATCCTGGCGCCGGGAGCCACGCAGCCGGTCGTCGAGAACAACATGTACAGAGCTCCTATATACGAACACACGTTGCCGCAGACTGACTTCATCATTATACGTACCAGACAAGCATATTATATACGTGAGATCGACGCTGTGTACGTTGCCGGACAAGAGTGTCCGCTGTATGAAGTCCCCGGACCAAATTCAAAGAGAGCTAACAATTTCGTTAGAGATTTCTTGCAGGTGTTCATATACAGGCTATTTTGGAAATCTCGTGACAAGCCACGTCGTATAAAAATGGATGATATAAAGAGGGCGTTCCCGTCGCATTCTGAAAGTTCGATTAGAAAGAGATTGAAGCTATGCGCTGATTTTAAGCGTACCGGATTGGATTCGAATTGGTGGGTGATAAAGCCGGATTTCCGTCTACCCTCCGAAGAAGAAATCCGAGCCATGGTGTCGCCCGAACAGTGCTGCGCCTACTTCAGTATGGCGGCCGCAGAACAAAGACTGAAAGACGCTGGCTACGGAGAAAAGTTCCTGTTTACCCCGCAAGAGGACGATGACGAGGAAATGCAGCTGAAGATAGATGATGAAGTAAAAGTGGCTCCGTGGAACACGACCCGCGCCTACATCCAGGCCATGCGCGGCAAGTGTCTGCTCCAGCTGACCGGCCCGGCGGACCCCACCGGCTGCGGCGAGGGGTTCTCGTACGTGCGGGTGCCGAACAAGCCCACGCAGCAACCGAACGAGGAGCAACAACCCAAACGAACCGTCACCGGGACCGACGCTGACCTGAGGAGGCTGAGTTTGAACAACGCTAAGGCACTGTTGAGGAAGTTCGGTGTTCCCGAAGAGGAGATCAAAAAGCTGTCGCGCTGGGAAGTAATCGATGTTGTACGAACATTATCGACCGAGAAAGCAAAAGCTGGCGAGGAAGGCATGACGAAGTTCTCCAGAGGAAATAGATTCTCTATTGCAGAGCATCAAGAAAGGTACAAAGAAGAATGTCAGCGTATATTCGAGCTCCAGAACCGCGTCCTGCAGAGTACAGAGGTGCTAAGTACCGACGAGGCCGAGAGTTCGGTCAGCGAGGAATCCGACTTGGAAGAAATGGGCAAGAATCTCGAAAATATGCTAGCAAATAAAAAGACCACAGAACAGCTGTCAATGGAAAGAGAGGAGGCAGAGAGAGCGGAACTTCGGAAGATGATAATGGGACAGTCCGAGAAGAAACCGCAAATTAATCCAAGCGATCAACAACAAACATCGAGTCAAGCAGGTCGAGTGCTCCGCATAGTGCGCACGTTCCGCGACGCTTCCGGCCAGCGGTACACGCGCGTCGAGCTCGTGCGGAAGGCGGCCGTCATAGAGGCCTACACCAAGATACGGTCCACCAAGGACGACATGTTCATACGGCAGTTCGCCTCTATGGATGAAACGCAGAAAGAAGAAATGAAGAGGGAGAAGAGAAGAATACAGGAACAATTGAGACGTATCAAGCGAAATCAGGAGAGAGAAAGACTGGCCGGCAATGTCACCGTACCTGGCCTTGGGAACGACAGTGACCTCGGGCCCTCCTCCAGCAACCTCATACCTCTGGGCCAAATCAAACAAGAGCCGGATCTCCACACGCCATCCCGAAGACGTGCCAAACTAAAGCCTGACTTGAAACTAAAGTGTGGCGCCTGCGGCCAGGTTGGTCACATGAGGACGAACAAAGCGTGTCCGCTATACACCGGCTCGGTTACGGGTGGACCGGCCTCTCCTCCGGACATAGACGTGGAGCCTCCTCCCGTTGAACCAGAAGATGATGATCTTGGATATGTGGATGGAACAAAGCTAACGCTACCAACCAAGATTATTAAGCAAACCGCCGAGGAGCTGAAGCGTCGCGGCGGCGGCAGGCGCGGCAGCGAGTTCGGACGCAGCAAACGCCGCGGCGCCACCGCGGACTCGTGCGAGTACTTGGTCCGGCGGCCGGCCGAGAGGCGCCGCACCGATCCGCTGGTCACCCTGTCGTCGTTGCTGGAAGACGTCCTAAATCACATGCGGCATTTGCCCGATGTACAGCCCTTCCTGTTTCCTGTCAATCATAAGCTTGTTGCAGATTACTATCGCATAGTTTCCCGACCGATGGATCTGCAAACTATACGAGATAATCTCCGACAGAAACATTATCAAAGCCGCGAAGAATTCTTGGCCGATGTCAATCAGATCGTCGAGAACAGCACGCTTTATAATGGTCCAACAAGCAGTCTCACTGTGGCCGCCCAGAGAATGTTACAGCGCTGCGTTGAGAAATTAGCAGAAAAAGAGGAACAGCTAATGAAACTCGAGAAACAAATTAATCCATTACTTGATGATAACGATCAGGTGGCCTTGTCGTTTATATTCGAGAATCTTCTGACAACTAAACTGAAGATAATGCCCGAATCGTGGCCCTTCTTGAAGCCCGTCAACAAGAAACAAGTTAAAGACTATTACAACGTCATCAAGCAACCTATAGACATGGAAACTATGGGAAAGAAGATTCAAGCTCACAAATATCACAGCCGTGAAGAATTCCTCCGGGACGTACAACTTTTGGTGGATAATTGTCGGGCTTACAACGGCCCGAACTCTCAGTTCACGAGACAAGCTGAGGCCGTATTGAAGGTCACGCAGGACGCTTTAGGCCAGTTCGACGAGCATGTAAGTCAGTTGGAAGCGAACATAGTTAGGGTTCAGCAGAAAATGCTCGAAGACGCCCAGCACTCGGAGCCGGAGGGCGACGACCTCCCCCCCACGGACGAGAAGCGCGGCCGGGGGCGCCCGCGGAAACACAAGCCCTCAACATCAGACGGTGCTACTCCTAGAAAACGAGGCAGACCACGTAAAGATCAGAACAGTTTGGAAGAAGATCTTCAATACTCGGACAGTGGTAGTTCGGAATTGGAAGAGGTGGAACAGAAAGACATCGTTGACACCATTCAACTCTGTGTCAAGCCAGAAGATACAACGCTGGACGAGCCGAGCATGATATCGGATCCGTCCGATTTCCCGACGACGATCAAACGAGAAGTGACGGACGACCTGCCTCCGCCTTCGGACGACTTCGTGGACCTCGATCAGCCGACTGACTTTACTTTCCCGTCCATCGTAAAGGAAGAGCCGATCGAACCTGACATGAACATGGTTCCCGCGATGATAGAGAGTCACATGGAGCCGCCTCAGCAGTTCAAGGAGGAGCCGCCAGAGAATTGGCAGCCCGATCCAGCCATTCAAGATGATCTCCAGGTGACTGATAGCGAGGAAGAAGCAGAAGATGGGTTGTGGTTCTAA
Protein
MCDSDEQGDHSRSGLDLTGFLFGNIDERGQLEDDGLLDGESKRMLSSLSRLGLGSMLSEVLEVEEPVKEEEEKDYNEKSPSAVDFFDIEDAAEDIKVENNVIKFENTSDVQTVHSSNVPVIIDSQSIKEELKEDTWRDTDIAGEQEVQSTNDDGYEGDMEGDGGLMPPPSTVPPPKQEKPKKLDTPLAAMLPSKYANVDVTELFPDFRPDRVLLFSRLFGPGKLSSLPQIWRGVKKRRRRKRSGSTGSDTPPMVPDSQEPQYASDDEEKLLKPMEENQQMSNDDKGLGSGDKAQRPTPAHWRFGPAQVWYDMLNVPETGDGFDYGFKLKTQGTDREEKSKEVEEEIRDTVESADSPDSGFPDDSFLMLSQLQWEDDVIWDGSEMKHKVLARLNSKSNAAGWVPTSVSRTAQQFSHPRPQPPTTLQTKTPAATATSANNQPGGNSADGEDNTWYSIFPVENEELVYGTWEDEVIWDAENMSKIPKPKILTLDPNDENIILGIPDDVDPSKITRERGPAPKVKIPHPHVKKSKILLGKAGVINVLQEDAPPPPPKSPDRDPFNISNDVYYQPKSQNPRLKVGGGQLIQHSTPVVELRAPFIQTHMGPARLRAFHRPALRKLTYGPLSAPGPHPVQPLLKHIRKKAKQREAERLASGGGDVFFMRTPEDLSGRDGEIILVEFCEEHPPLLSQVGMCTKIKNYYKRTATKDNGPKPLKYGEVAYAHTSPFLGILAPGATQPVVENNMYRAPIYEHTLPQTDFIIIRTRQAYYIREIDAVYVAGQECPLYEVPGPNSKRANNFVRDFLQVFIYRLFWKSRDKPRRIKMDDIKRAFPSHSESSIRKRLKLCADFKRTGLDSNWWVIKPDFRLPSEEEIRAMVSPEQCCAYFSMAAAEQRLKDAGYGEKFLFTPQEDDDEEMQLKIDDEVKVAPWNTTRAYIQAMRGKCLLQLTGPADPTGCGEGFSYVRVPNKPTQQPNEEQQPKRTVTGTDADLRRLSLNNAKALLRKFGVPEEEIKKLSRWEVIDVVRTLSTEKAKAGEEGMTKFSRGNRFSIAEHQERYKEECQRIFELQNRVLQSTEVLSTDEAESSVSEESDLEEMGKNLENMLANKKTTEQLSMEREEAERAELRKMIMGQSEKKPQINPSDQQQTSSQAGRVLRIVRTFRDASGQRYTRVELVRKAAVIEAYTKIRSTKDDMFIRQFASMDETQKEEMKREKRRIQEQLRRIKRNQERERLAGNVTVPGLGNDSDLGPSSSNLIPLGQIKQEPDLHTPSRRRAKLKPDLKLKCGACGQVGHMRTNKACPLYTGSVTGGPASPPDIDVEPPPVEPEDDDLGYVDGTKLTLPTKIIKQTAEELKRRGGGRRGSEFGRSKRRGATADSCEYLVRRPAERRRTDPLVTLSSLLEDVLNHMRHLPDVQPFLFPVNHKLVADYYRIVSRPMDLQTIRDNLRQKHYQSREEFLADVNQIVENSTLYNGPTSSLTVAAQRMLQRCVEKLAEKEEQLMKLEKQINPLLDDNDQVALSFIFENLLTTKLKIMPESWPFLKPVNKKQVKDYYNVIKQPIDMETMGKKIQAHKYHSREEFLRDVQLLVDNCRAYNGPNSQFTRQAEAVLKVTQDALGQFDEHVSQLEANIVRVQQKMLEDAQHSEPEGDDLPPTDEKRGRGRPRKHKPSTSDGATPRKRGRPRKDQNSLEEDLQYSDSGSSELEEVEQKDIVDTIQLCVKPEDTTLDEPSMISDPSDFPTTIKREVTDDLPPPSDDFVDLDQPTDFTFPSIVKEEPIEPDMNMVPAMIESHMEPPQQFKEEPPENWQPDPAIQDDLQVTDSEEEAEDGLWF

Summary

Description
TFIID is a multimeric protein complex that plays a central role in mediating promoter responses to various activators and repressors. Largest component and core scaffold of the complex. Contains N- and C-terminal Ser/Thr kinase domains which can autophosphorylate or transphosphorylate other transcription factors. Possesses DNA-binding activity. Essential for progression of the G1 phase of the cell cycle. Negative regulator of the TATA box-binding activity of Tbp.
Catalytic Activity
ATP + L-seryl-[protein] = ADP + H(+) + O-phospho-L-seryl-[protein]
ATP + L-threonyl-[protein] = ADP + H(+) + O-phospho-L-threonyl-[protein]
Cofactor
Mg(2+)
Subunit
Belongs to the TFIID complex which is composed of TATA binding protein (Tbp) and a number of TBP-associated factors (Tafs). Taf1 is the largest component of the TFIID complex. Interacts with Tbp, Taf2, Taf4 and Taf6.
Similarity
Belongs to the TAF1 family.
Keywords
3D-structure   Alternative splicing   ATP-binding   Bromodomain   Cell cycle   Complete proteome   Direct protein sequencing   DNA-binding   Kinase   Nucleotide-binding   Nucleus   Phosphoprotein   Polymorphism   Reference proteome   Repeat   Transcription   Transcription regulation   Transferase  
Feature
chain  Transcription initiation factor TFIID subunit 1
splice variant  In isoform A.
EC Number
2.7.11.1
EMBL
BABH01014660    BABH01014661    BABH01014662    BABH01014663    AGBW02014522    OWR41530.1    + More
NWSH01000129    PCG79261.1    ODYU01001168    SOQ37001.1    RSAL01000036    RVE51281.1    KQ414855    KOC60122.1    GBYB01007685    JAG77452.1    KQ434924    KZC11582.1    KQ976725    KYM76533.1    GECZ01009127    JAS60642.1    KQ971330    EEZ99738.1    GEZM01074872    GEZM01074870    GEZM01074867    GEZM01074865    JAV64433.1    KQ978957    KYN27196.1    KQ981636    KYN38772.1    NEVH01003506    PNF40466.1    GEZM01074876    JAV64426.1    GL888255    EGI63784.1    PNF40462.1    ADTU01016114    KK107078    EZA60385.1    KQ977622    KYN01312.1    KQ435732    KOX77334.1    KK853018    KDR12445.1    PNF40464.1    GL438389    EFN69024.1    GL762111    EFZ22135.1    GECZ01023445    JAS46324.1    KZ288212    PBC32805.1    GBBI01002511    JAC16201.1    GL451254    EFN79615.1    KB632005    ERL87928.1    GBXI01003689    JAD10603.1    KQ982813    KYQ50337.1    APGK01034498    KB740914    ENN78346.1    KQ779356    OAD52000.1    GAKP01022293    JAC36659.1    GDHF01026579    JAI25735.1    GDHC01014590    GDHC01000568    JAQ04039.1    JAQ18061.1    DS235854    EEB18871.1    GDHF01032673    GDHF01029694    GDHF01008225    JAI19641.1    JAI22620.1    JAI44089.1    AAZO01006526    UFQS01001591    UFQT01001591    SSX11524.1    SSX31091.1    JRES01001300    KNC23821.1    UFQT01002946    SSX34330.1    JXUM01132778    JXUM01132779    KQ567891    KXJ69235.1    JXUM01127242    KQ567165    KXJ69586.1    GDUN01000966    JAN94953.1    CH477373    EAT42378.1    GFDL01012941    JAV22104.1    GFDL01012704    JAV22341.1    DS231865    EDS40372.1    AXCN02000311    GFXV01003166    MBW14971.1    ABLF02029287    AXCM01005046    CCAG010011328    CM000160    KRK03157.1    CH940650    KRF83073.1    KRK03159.1    KRK03160.1    KRK03161.1    KRF83072.1    GGMS01008306    MBY77509.1    EDW96463.1    CH933806    KRG00555.1    KRG00553.1    KRK03158.1    CH902617    KPU79560.1    KRF83071.1    JXJN01016598    KPU79561.1    S61883    AE001572    AE014297    BT004888    KRF83070.1    BT150382    ACZ94830.1    AGW52192.1    AHN57200.1    EDW13664.1    EDV42088.1    KPU79562.1    KRG00554.1    GDIQ01000177    JAN94560.1    GBHO01010466    JAG33138.1    AFH06271.1    AAAB01008805    EAA03907.5   
Pfam
PF15288   zf-CCHC_6        + More
PF09247   TBP-binding
PF00439   Bromodomain
PF12157   DUF3591
PF00651   BTB
PF01607   CBM_14
Interpro
IPR036741   TAFII-230_TBP-bd_sf        + More
IPR036427   Bromodomain-like_sf       
IPR040240   TAF1       
IPR009067   TAF_II_230-bd       
IPR041670   Znf-CCHC_6       
IPR000637   HMGI/Y_DNA-bd_CS       
IPR018359   Bromodomain_CS       
IPR022591   TFIID_sub1_DUF3591       
IPR001487   Bromodomain       
IPR017956   AT_hook_DNA-bd_motif       
IPR011177   TAF1_animal       
IPR000210   BTB/POZ_dom       
IPR036236   Znf_C2H2_sf       
IPR011333   SKP1/BTB/POZ_sf       
IPR013087   Znf_C2H2_type       
IPR036508   Chitin-bd_dom_sf       
IPR002557   Chitin-bd_dom       
SUPFAM
SSF47370   SSF47370        + More
SSF47055   SSF47055       
SSF54695   SSF54695       
SSF57667   SSF57667       
SSF57625   SSF57625       
Gene 3D
PDB
5FUR     E-value=0,     Score=3448

Ontologies

Topology

Subcellular location
Nucleus  
Length:
1830
Number of predicted TMHs:
0
Exp number of AAs in TMHs:
0.00117
Exp number, first 60 AAs:
0
Total prob of N-in:
0.00006
outside
1  -  1830
 
 

Population Genetic Test Statistics

Pi
225.937036
Theta
19.007611
Tajima's D
0.060809
CLR
0.362669
CSRT
0.393530323483826
Interpretation
Uncertain
Peptides ×
Source Sequence Identity Evalue
28467696 KPQDNIRPEGDFEK 100.00 1e-04
28467696 VPNEMGSINPIR 100.00 0.002
Copyright@ 2018-2023    Any Comments and suggestions mail to:zhuzl@cqu.edu.cn   渝ICP备19006517号

渝公网安备 50010602502065号