SGID Silkworm Genome Informatics Database
Gene
KWMTBOMO04342  Validated by peptides from experiments
Pre Gene Model
BGIBMGA005269
Annotation
PREDICTED:_NFX1-type_zinc_finger-containing_protein_1-like_isoform_X1_[Amyelois_transitella]
Location in the cell
Nuclear   Reliability : 2.875
 

Sequence

CDS
ATGGATAAACCAACAATGCGGATAGATTGGTTTGATGGTACCGTAGTAGAAGATTCCAGATCAATAGCAACAACGTCGCAATTTGCTGACGAGGAAGAAGACATAGATGTTGCAAAAAAACCTACACATAAACAAACAAACACAGAAAAGAAACCAATTGGCTTCAAAAGGCTACTCGATATATCATTATTGAAACCACCCATTTTAACATTAGAAGTCTCTCATAGACCTGGTTTTTGGAATCTTCTAGACGAGGAAGAGCTAAAGGGCGATTTTATAGTTTTAATCGTAAAAATATTGGCTTCCATTTACAAATCTTTGAGAAGTGGCGAGAATAGTAAAATTGTTACAATGCTCAGGACACGATTGAGAAATTCGAAGTTTTTATTGAATTTGAAAAACTACATCACTGATCTACCAGCTGTTCGTATAGTAGAAAAGAGATTGAATACACACATGTGGGATGATGCCGAGGGATTTTTTATTGAAATGGTTGAATTGTGTGAAGGAATCCTTAATTTTGGTGGGAACAGTGAAGAATATTTACAAAACATTTTTGAGTTGTTAGAAGCTACTGAAATTAGTGCTTTAGGAGTTATGGAAGAGCATTCTGAAAGGTTTAGTAATGAACTGTTTGACAGAATCGAAAAAATTAAGATAAGGTTTGCTGAAGAACTTAGTTCAAAGGGTCAGACTATAACAAATGAGACAAGGATAAATGAAGACCCGAACAGTTTCAGAAATCTAAACATATTCCCGACTCAACGAGACCTACTGAGCACTGATCATATTAATTTAAAACCTAACATAATCAACGGAGCGTACCCAAATGTGGAGCATTATCTGGATGTTCAGTTCAAATTGTTAAGGGAAGATTACTTTGGACCTTTAAGAGATGGAATAAGTTGAATACATCCAAAAGTTCGTATAATCCGTAACTATGTGACAAACAACAAAATAGGGTACCTAGTCGATATAGCATGGAATGAGCGATACAACCAAAGTACAATCAACAACACAACACTAAAGTACACCAAGCAACTGATGTTTGGGTCACTTTTACTGTTCACAAATGATAATTTTGAAACGATATTGTGCGCAACTGTACTGGACTCTAGTTATAGTTTATTGGCTGAAGGCTATATTGCTGTAGCCTTTCAAAATCCAGTATCCAATAAAATATTCTCAGAAGCATATTTAATGGTTGAAAGCGAAGTATTTTTTGAGCCTTACCATAGAGTGCTTAAAGTTTTACAAAATTCAAGACCTGACGAGCTTCCCATGAAGAAATACATTGTGGATGTGGACAATGAAACTCAACCACCAAGATATTTGAACTCTGATACGCGATACGCCATAACCGAACCAGGGTGTACTGAAGAAATACAATTCCCAGTCTTGCGACCAGACCTATGGCCAGATGAATCAATAGGCCTAGACGAATCTCAGTTAAATGCATACAAATTCGCTTTGACCAAAGAGTTCGCCGTCATCCAAGGTCCTCCTGGTACCGGAAAAACGTTTTTAGGGGTCAAAATTGCTTCAACGCTACTCAGAAATTTATCCCTAGAAGGCACACCGATGTTGATTATTTGTTATACAAATCACGCCCTTGATCAATTCTTGGAGGGATTGCTCTCTGTGACCCAAAGTATAGTCAGATTAGGAAGCCAAAGTAAAAGTAAGGTTTTGGAGCGGTATAGCTTACACAACGCCAGGATGGGGGTCAAATCTAAGTATTCATATTTGTATGGGACTAAGAGGGCGGAAATAGAAAGGGTTTTTAAAGAAATGTGTGAAGTTCAGAGCGAGATTGAGTTATGCGAAAGAGAGGTGCTCTCATATAAATGCATTAGACCATTCCTGAAGATTGATGGTAAGAGTTATGAATTGAAGAGTTCAAAAGGTGATTCCGTATTGGATTGGCTGTTTGGTTGTGACAATGGAGCTTTTGAACCGGACACAAGTGACGATTGGGAAAATTTAGACGAATCTAGCAC
AGATATACTAGAAAGTGTATTTTCAGAACATTATGCTTTAAAGGAAATAGATTCAATGTTAAACAGCATAAAATATGTTCAAGATATAACTGATGATGCAGAGGAAAGAAAGCAAATGGTGGATAAGTTTAAAGCACAGATTGATAAAGTTAAAAGTCGATTGGATTGTTTTAAGAAATACCGTACTCTATACCAGATTTACAATGAACCAAAATTAACCATAGACAAAGTTGAAGATCTATACTGTTTGAAATTAGAACAAAGATGGCAACTTTATTACAAAACAGTTGGATGTGTTAAAAGTAACTTACTGTCGAAAATGAATACGCTTTTGGAAAAGCACAAGACGTTGAATCAGGAGTTGGAACAGGTGACGACGTTGACTGATGCCGCCGTAATGGAGAAGGTCCGGGTGGTGGGCGCGACCACCAGCGCGGCGGCCCGGAGACGCGACCTGCTGACCACGCTGCGCCAGTCCGTAGTGATAATTGAAGAAGCAGCCGAAGTCTTAGAGGCTCACATCGTCGCTTCTCTGACCAACGACTGTCAGCACCTGATACTGATAGGTGACCACAAGCAACTTCGTCCTACGGCCGCTAACTATATCCTAGCGAAGCGATACAACCTGGAGGTATCTCTGTTCGAACGAATGATTAGGAACGGTGTACACGCTCGAGTGCTCACAGTGCAGCGCCGCATGAGGCCGCGTGTCGCCGAGCTGCTGGTGCCGACCGTGTACCCTGCCCTAGACTCGCACCCGGACGTGTACAACTACCCGGACGTCAGGGGGATGGCCGATAATCTATACTTCTTCACCCACGAAGTCCTAGAAGATACCGAGGGTTTGGAAGGGAGCTGGAGTCACCGAAATACCTTTGAAGCCAGATGGTGCGTGGTCTTGGCGAATTATTTACGACAAATGAAATATTTGGCTCACGAAGTGACTATACTGTCTACTTATATGGAACAAGTGTCATTAATAAAGCAGTTGAGCTTAAAGTACAATTCGCTACGCGATATAAAGATAACGGCCGTCGATAATTACCAAGGCGAAGAGAGCAGGATAGTGATATTGTCTCTTGTGAGGAGTAACAAGGACGGCAAGATTGGCTTCCTGAATGCCACCAATAGGATCTGCGTGGCTCTATCTAGAGCTAGAGAAGGTTTCTACATATTTGGCAATATGAATTTATTAAAGACTGCGAATAATGTGTGGAGCTCGATCAATGAAAAGCTGATAAAACAAAACGCTATCGGCAATAAATTAACATTGGTGTGTAAACATCATGCGAAACGTTTGGAAGTGAGGACCGTGAACGAATTTGAAAAATACATTTCTGGAACGTGCCCGATGAATTGTGGAAAGAAATAA
Protein
MDKPTMRIDWFDGTVVEDSRSIATTSQFADEEEDIDVAKKPTHKQTNTEKKPIGFKRLLDISLLKPPILTLEVSHRPGFWNLLDEEELKGDFIVLIVKILASIYKSLRSGENSKIVTMLRTRLRNSKFLLNLKNYITDLPAVRIVEKRLNTHMWDDAEGFFIEMVELCEGILNFGGNSEEYLQNIFELLEATEISALGVMEEHSERFSNELFDRIEKIKIRFAEELSSKGQTITNETRINEDPNSFRNLNIFPTQRDLLSTDHINLKPNIINGAYPNVEHYLDVQFKLLREDYFGPLRDGISXIHPKVRIIRNYVTNNKIGYLVDIAWNERYNQSTINNTTLKYTKQLMFGSLLLFTNDNFETILCATVLDSSYSLLAEGYIAVAFQNPVSNKIFSEAYLMVESEVFFEPYHRVLKVLQNSRPDELPMKKYIVDVDNETQPPRYLNSDTRYAITEPGCTEEIQFPVLRPDLWPDESIGLDESQLNAYKFALTKEFAVIQGPPGTGKTFLGVKIASTLLRNLSLEGTPMLIICYTNHALDQFLEGLLSVTQSIVRLGSQSKSKVLERYSLHNARMGVKSKYSYLYGTKRAEIERVFKEMCEVQSEIELCEREVLSYKCIRPFLKIDGKSYELKSSKGDSVLDWLFGCDNGAFEPDTSDDWENLDESSTDILESVFSEHYALKEIDSMLNSIKYVQDITDDAEERKQMVDKFKAQIDKVKSRLDCFKKYRTLYQIYNEPKLTIDKVEDLYCLKLEQRWQLYYKTVGCVKSNLLSKMNTLLEKHKTLNQELEQVTTLTDAAVMEKVRVVGATTSAAARRRDLLTTLRQSVVIIEEAAEVLEAHIVASLTNDCQHLILIGDHKQLRPTAANYILAKRYNLEVSLFERMIRNGVHARVLTVQRRMRPRVAELLVPTVYPALDSHPDVYNYPDVRGMADNLYFFTHEVLEDTEGLEGSWSHRNTFEARWCVVLANYLRQMKYLAHEVTILSTYMEQVSLIKQLSLKYNSLRDIKITAVDNYQGEESRIVILSLVRSNKDGKIGFLNATNRICVALSRAREGFYIFGNMNLLKTANNVWSSINEKLIKQNAIGNKLTLVCKHHAKRLEVRTVNEFEKYISGTCPMNCGKK
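
The Protein entry is the conceptual translation of the CDS entry. The following is a minimal Python sketch of that translation, assuming the standard genetic code (the record does not state which translation table was used). The names `translate` and `CODON_TABLE` are illustrative and are not part of any SGID tool. The fragments are copied from the CDS above; on them, the in-frame TGA codon appears to be what the Protein entry displays as X.

```python
from itertools import product

# Standard genetic code in TCAG order (assumption: the record does not
# name the translation table it used).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="*"):
    """Translate a CDS string codon by codon, starting at position 0.

    Stop codons are written as `stop_symbol`; codons containing
    characters other than A/C/G/T are written as 'X'. Trailing bases
    that do not fill a codon are ignored.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First nine codons of the CDS in this record.
print(translate("ATGGATAAACCAACAATGCGGATAGAT"))  # MDKPTMRID

# Fragment around the in-frame TGA, rendering stops as 'X'.
print(translate("GATGGAATAAGTTGAATACATCCAAAAG", stop_symbol="X"))  # DGISXIHPK
```

To check the whole record, the two CDS lines would need to be joined into one string before calling `translate`.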

Summary

Similarity
Belongs to the peptidase S1 family.
EMBL
BABH01028334    BABH01028335    NWSH01000895    PCG73651.1    ODYU01007234    SOQ49838.1
RSAL01000030    RVE51716.1    KQ459302    KPJ01847.1    AGBW02011602    OWR46444.1    JTDY01003429    KOB69567.1    KQ460211    KPJ16525.1    RSAL01000025    RVE52130.1    KQ460397    KPJ15236.1    ODYU01008646    SOQ52407.1    KQ459589    KPI97707.1    GEZM01088890    JAV57845.1    NEVH01005883    PNF38643.1    GEDC01025993    GEDC01003586    JAS11305.1    JAS33712.1    GEDC01004975    JAS32323.1    GEDC01021385    JAS15913.1    KQ971321    EFA00122.1    NEVH01005006    PNF39219.1    KK852655    KDR19284.1    GEBQ01011287    JAT28690.1    GEDC01025857    GEDC01025145    JAS11441.1    JAS12153.1    GEDC01018110    JAS19188.1    GECZ01005540    JAS64229.1    GECZ01001543    JAS68226.1    GEDC01015131    JAS22167.1    GGMR01008816    MBY21435.1    GECU01007038    JAT00669.1    GECU01012252    JAS95454.1    GEDC01026474    JAS10824.1    GECU01035096    GECU01029016    JAS72610.1    JAS78690.1    GEBQ01030399    GEBQ01016845    JAT09578.1    JAT23132.1    GCES01068928    JAR17395.1    AAZX01003253    NNAY01001131    OXU25033.1    GCES01068927    JAR17396.1    JARO02000941    KPP76956.1    ADON01112229    ADON01112230    GEDC01020153    JAS17145.1    KK116477    KFM67868.1    AKCR02000047    PKK23763.1    KL218292    KFP03950.1    GG666604    EEN49930.1    KL447510    KFO74299.1    AYCK01000328    MWRG01000218    PRD36703.1    KL225346    KFW71273.1    KL226064    KFM07706.1    LSYS01002950    OPJ85064.1    KL351094    KFZ58957.1    FR904817    CDQ72212.1    KK735429    KFR14057.1    KK513965    KFP64109.1    AGCU01006266    AGCU01006267    AGCU01006268    AGCU01006269    AGCU01006270    AGCU01006271    AGCU01006272    KL896797    KGL83860.1    GFXV01006410    MBW18215.1    RCHS01001807    RMX51300.1    HAAD01000606    CDG66838.1    KB564564    EMP28273.1    AAGJ04048993    KB201205    ESO98811.1    AAGJ04001507    AMQM01000597    KB096324    ESO06284.1    PZQS01000014    PVD18632.1    AAGJ04128395    QUSF01000037    RLV98769.1    QKKE01000086    RGB38984.1    
LLXL01000837    PKK68482.1    LLXK01000979    PKY26566.1    LLXH01000521    PKC65690.1    JEMT01017251    EXX68564.1    LLXJ01000894    PKC05288.1    KI301740    BDIQ01000044    AUPC02000399    ERZ94797.1    GBC17823.1    POG60277.1    AAGJ04001505    LLXI01000373    PKY45278.1    QKYT01000543    RIA83764.1    ESO98812.1    AMQN01008687    KB303737    ELU02825.1   
Pfam
PF13086   AAA_11
PF13087   AAA_12
PF00089   Trypsin
PF00017   SH2
PF03372   Exo_endo_phos
PF00266   Aminotran_5
Interpro
IPR027417   P-loop_NTPase
IPR041677   DNA2/NAM7_AAA_11       
IPR041679   DNA2/NAM7-like_AAA       
IPR000967   Znf_NFX1       
IPR001254   Trypsin_dom       
IPR009003   Peptidase_S1_PA       
IPR001314   Peptidase_S1A       
IPR033116   TRYPSIN_SER       
IPR003593   AAA+_ATPase       
IPR016024   ARM-type_fold       
IPR013087   Znf_C2H2_type       
IPR014013   Helic_SF1/SF2_ATP-bd_DinG/Rad3       
IPR036860   SH2_dom_sf       
IPR015424   PyrdxlP-dep_Trfase       
IPR005135   Endo/exonuclease/phosphatase       
IPR015421   PyrdxlP-dep_Trfase_major       
IPR000980   SH2       
IPR000192   Aminotrans_V_dom       
IPR036691   Endo/exonu/phosph_ase_sf       
IPR015422   PyrdxlP-dep_Trfase_dom1       
IPR000300   IPPc       
IPR014001   Helicase_ATP-bd       
SUPFAM
SSF52540   SSF52540
SSF50494   SSF50494       
SSF48371   SSF48371       
SSF53383   SSF53383       
SSF56219   SSF56219       
SSF55550   SSF55550       
PDB
6QDV     E-value=2.27497e-21,     Score=257

Ontologies

Topology

Length:
1123
Number of predicted TMHs:
0
Exp number of AAs in TMHs:
0.021
Exp number, first 60 AAs:
0.0005
Total prob of N-in:
0.00073
outside
1  -  1123
 
 

Population Genetic Test Statistics

Pi
202.526801
Theta
168.130114
Tajima's D
1.595488
CLR
0.78236
CSRT
0.808409579521024
Interpretation
Uncertain
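
For orientation, Tajima's D compares the mean pairwise diversity (Pi) with a diversity estimate based on the number of segregating sites. The sketch below implements the standard Tajima (1989) estimator in Python. It assumes that the Theta value in this record is Watterson's estimator, which the record does not state. The record also does not report the sample size or the number of segregating sites, so the function cannot reproduce the value 1.595488 from the numbers shown here; the name `tajimas_d` and its parameters are illustrative only. The record's positive D is at least consistent with Pi (202.526801) exceeding Theta (168.130114).

```python
import math


def tajimas_d(pi, num_segregating_sites, sample_size):
    """Tajima's D from mean pairwise differences, S, and sample size n.

    pi and S must be on the same scale (both per region, not per site).
    """
    n, s = sample_size, num_segregating_sites
    if n < 4 or s < 1:
        # For n < 4 the variance term below is zero, so D is undefined.
        raise ValueError("need at least 4 sequences and 1 segregating site")

    a1 = sum(1.0 / i for i in range(1, n))
    a2 = sum(1.0 / i ** 2 for i in range(1, n))
    b1 = (n + 1) / (3.0 * (n - 1))
    b2 = 2.0 * (n * n + n + 3) / (9.0 * n * (n - 1))
    c1 = b1 - 1.0 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1 ** 2
    e1 = c1 / a1
    e2 = c2 / (a1 ** 2 + a2)

    theta_w = s / a1  # Watterson's estimator
    variance = e1 * s + e2 * s * (s - 1)
    return (pi - theta_w) / math.sqrt(variance)


# Hypothetical inputs, not taken from this record: 10 sequences,
# 10 segregating sites, mean pairwise difference of 5.0.
print(tajimas_d(5.0, 10, 10))
```

A positive result means pairwise diversity exceeds the segregating-site estimate, and a negative result means the reverse.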
Copyright © 2018-2023    Send comments and suggestions to: zhuzl@cqu.edu.cn    Chongqing ICP Registration No. 19006517

Chongqing Public Network Security Registration No. 50010602502065