SGID Silkworm Genome Informatics Database
Gene
KWMTBOMO05081  Validated by peptides from experiments
Pre Gene Model
BGIBMGA014089
Annotation
PREDICTED: ovarian serine protease isoform X1 [Bombyx mori]
Location in the cell
Nuclear (Reliability: 3.237)

Sequence

CDS
ATGATTGAAACTATGAAAACAGAACGATCCTTCGAAAAACTTGGACTTCCACCTATGGAGTTGGCAGAATCAACAACTAATCGTCAAAATAATGAGGTTAAAAATAAATGGATACTTTTAATACAAAAACTAAAAACGACAATTTTACTTATTGCACTCGCTGCAGGGGTATATGTTTTATCGCTGAGATTGTTGGAAATTTTTGTCGGTACACGTAGCACTGAAATATTTGTAGTAACAGGAACATACGAATACCCGAACATTACATTATACGATGGCGACATGGAAATATTTTTACGATTGAAAAACAATACTTTAGAAGCAAACAAAAGGATGAAAAGACATGTCAGCGAACTAAGCGATAGCATAATAGAGCCTGTAGAAGAGTTCGTTTTAACCGATGAAGACTTACAGAATGCAAGAGAAATAATAGTAAAACATGATTTACTATGTAAACATGGTGACAATGATGAAACTTGTATAAAAATAACTAACCGCTTAAAAGAAATGGTTTTGAAGCCGAAGCTTAATAAAGATCAAAGCAACGAAATTCCTGTCATTCCTAAAATAAAAATTAATAGTTTGCATAGCTTAGAAGCCGATCGTGATTTAAATTCTAACAATCATGCTATTCGTAATTCTGAGGCAGTAAAAAAACGTGAAATACATTCCGTAGCAAAAACACAAAATATATTGGAATCTGTTCCTCGCACACAACTAGCATTTGCAGCTCAACCTTCAGAGAATCATATCACTGATCCATGTCTAGAGCGTCTCTTGAAGCAATATACTCAAGCACGTAACAATTATGAATTGCCTCATTCTGAATACGCACCACCTCAGTATGAGAACCCAACACTCAACAACCTACACCCAAGATACTACGATCCGGTCTCGCACCCAGCATTTCTCCAAGGTCGCGAGGTTGAAGCTAGGAAAAGCAAACTTCATCCACAAGACGTTGATATATTCATGAGCCTACTAACTCCCAAGCCCGAAGCTATTTCCACCAAAAAAATTGATGCAGAAGCACGAACTACACTTTGCTCTGATGGATCAAAACCATGTGATAATGGTGAAGGATGTATAACTGAAAAACAATGGTGTGATGGGAATGTTGATTGTTCTGATGTAAGCGACGAAGCTAAGTGTGACTGTAAATCTAGAGTAGACAAATCGAGGCTTTGTGATGGATACTTTGACTGTCCATTTGGAGAAGACGAAATGGGGTGTTTTGGGTGCAGTGAGACTACTTTCAGTTGTTTGGACGTGGATATGAATTCACAGAGTACTTGTTTCACGAAAGAGCAAAGATGCGATAATATCGAGCATTGTCCGAATCATAGAGATGAACTTGAATGTAATATGTTAGCTCCAAGCCTTTTAAAAAAGCCTCTATTTGCAGTGTCAAATACAGAAGGGTTTTTGCATAGAAATCTTAAAGGAGATTGGTACGCCGTATGCCATAATCCATATATGTGGGCGCATGACACTTGCCGTCGAGAAACTGGACTTATAATAAGACCACCTTTTATACAAATAGTCCCTGTTGATCCGATGTCGAAAGTCAGTTACTTGAGCACGGGTCCAGGTGGTATATTACAAACAAGTGAGACTTGCTTCAACTCTTCCGCCGTTTACGTCACCTGCCCAGATCTGCTATGTGGCACTCGAGTCCCAAGCACGTCACAGCTACTGAGAGAAAACGCTGCTATGGAAAATAGACTTTTTGGAAGAAATAAAAGATTCCTTATAGATGGCCATCCATATCCATTTATGTTTTTCGGCAACCGACAAAAACGATCATTAGTATATCATAACTACGACCTATTAAATTACTGGAGAAATCGCAATGAGTTAAATCGACCTTGGTATATTAGAAATATGCGATCGGAAAGCCGAGTGGTTGGTGGAAAACCAAGTCAACCCACGGCTTGGCCATGGACGGTAGCCATATACAGAAACGGCATGTTTCATTGTGGTGGAGTTATTATCACTCA
AAACTGGGTTATATCTGCTGCGCATTGCGTACATAAATTTTGGGATCACTACTATGAAGTTCAAGCAGGTATGCTTCGGCGTTTCTCGTTTTCACCGCAAGAACAAAACCATCAAGTAACACATGTGATTGTAAACCAGCATTACAAACAAGACGATATGAAAAATGACTTGTCTCTACTAAGGGTCGAGCCAATTATTCAGTTCAGTCGATGGGTTCGTCCAATATGCTTACCGGGACCTGATACTGCTGGTCCTGATTGGTTATGGGGCCCATCACCCGGTACTATCTGCACTGCTGTAGGCTGGGGAGCTACAGTTGAACATGGACCAGATCCTGATCATCTTCGCGAAGTGGAAGTACCAATATGGGACAAGTGTAAGCATGAAGAGGATAGAGCCGGTAAAGAAATTTGCGCTGGACCTTCAGAAGGTGGAAAAGATGCATGCCAAGGCGACAGTGGGGGACCATTATTATGTCGAAACCCTACGAATTCTCATCAGTGGTATTTAGCCGGAATTGTGAGTCATGGGGATGGTTGCGCTCGTAAAGGTGAACCCGGGGTTTATACAAGAGTTAGTCTTTTTGTAAAGTGGATTAAGCATCATATAGCCTCGAATTCCTTACCAATAGTCAAGCCTTTACAAGAATGTCCTGGATTCAAATGTAAATCTGGAATTTCAAAGTGCATATTTAATAAAAGAAAATGTGACCAAATTATCGATTGTCTCGGAGGAGAAGATGAAATTGATTGCAATTTTGTAAAATCTACTACAACCTTAAGCGATAACGAGTTTCTTTCTGAAATGTTTAGAAGCAATGAAAGAAGTGACCGAATTGAAATAAATCGCATAACAAAAGGCGCTAATATTGAAAATAACACTTTAAAAATTATCCCAAATACCGAGGCCAGTACAGAAGAAGCGACATCAACTTCTGTTGCAGTAAATCGTTATGATGACAAAACCACTATGGAAAATAACTTTAAACATGCGTCAACACTCGAACTTCCATTTTCTTCAAGTTCTGATGAATCCCATTCTGAATCACATGAAAAATCTATAGGTTTACCTTCAATGGAAGGAACCCTATCAATTTCAGAAGAAAAAACAACAGCAAGCACTACAACAACTTTCGAAGAAATTTTCAGTAGTACAATTTTATCTACAACTGAGGAATATCGCGGAGACACAGCAATTGTCAATGAATCACTAGAATCAGATTCTACAGATTTCCAAAATGCAACAATGGATATGGAAAGCAGTGCTAATGATTTTTTTAAAAACCTTAATATACTTAATGCAAGTAATGTGGAAAATACAACAAATACTAGACCAACAGACGAAACTACTATAACAGATTCAACGACAAATATTCCAACAGAAAGTCAGTTTTCTCAGTTTACTTTGTCGACGGCTGTTACCATCGACGAATACATAAGCATGAATGCAAATGAAATTGAAAAACTTCACTTGTTAAATTTAATTACCAATATATCCAATACAAATTCAAATAATACTGATTTTAGAAATAAATCTGAAGGTTTATATATCGAACTACTTCCAAAATCTAATATAAGCATTATACAAAATACCGATGAGGATGTTAAAGAAACAACTGTAATCGAACAGCATATTACGAGCTCTACTGACAAAGGGCCTCTAAAAACTCTGATCGATATCGAATCAAAGCATGATATCATAAAAAAAATCGAAAAAATAGTTTTCTCTCAAAAAATGCCAGCAAAAATAAGAAGAAGGCACCGTATTCCCAAGGACTTCGAATGTGGAAAGATTCACCAAGTCATTTCGTACAATCTAAGATGCGACAATAAAGCGGACTGTGAAGATGGAACTGATGAACTTGGATGCACATGTATTGATTATCTATCGACTATAAATGAGAAATTTCTTTGTGATGGCAGTTTTCATTGCGCCGATGGACAAGATGAACTTGATTGCTTCAGCTGCCCTGAAGACCATTTTCTGTGTAAAAGAAGTA
AATTGTGCATACCACTGAGTAATGTATGCGATGGAGTACCTGAATGTCCTCAAAATGACGACGAGTCGGACTGCTTTGCTTTAACGAATGGCAAAGAACTCCAATACGAAATCGATGATAGACCGCAAATCAATCTAGAAGGATTTGTGACGAAGAAACATTTGAATCAGTGGCATGTTGTTTGCGAGGATAAACTGACTATTGAGCAAATAGAACAAGAAGCTAATCATATTTGCCATTACTTGGGATTCAGTTCAGCAAGAACATATTCTGTGAAGTATATTAACATTAAAGAAGATGACGTTTTATTGATTGATAAAAGAACGAAAAGACAAATAGTTACTACAGTCCCTGTGCATTTTGCTTATAAAAATATTAGCGATGGTGTTAATGCGACTGAACATGTACTCATTCAAGAACCACAATTACTCAAAGAACAATGTGTCCCAAATGTAACGAAAACATGCAAATCTTTGTATGTTTTCTGTGATAGAACATTATATACTGATTTTGATGAGACACCACTTTTCTTGCGTGAAACAGAAGTTGTACAAACATTTAAGTGGCCTTGGATTGCGAAAGTATATGTAGAGGGAAATTATAGATGTACTGGCGTTTTAGTTGACTTGTCATGGGTGTTAGTCAGCCACGCGTGTCTCTGGGATACGTCACTTCACAGCTATATTTCTGTGGTTCTTGGATCACATAAAACTTTGAAGTCTGTAAAGGGACCATATGAACAAATCTACAAAGTCGACGCTAGGAAAGATTTATACAGAAGTAAAATTTCGTTGTTGCATTTAAAAAGTCCCGCAACTTATTCAAATATGGTAAAACCGATGATTGTAGCATCCACCCGCAATCATTTGGAGAAAAATAATAAATGTGTTACTGTGGGGCAATTCGATAACAACGAAACTATTAGTATTTTCCTGGAGGAAACTAATGAAAACTGCAGCTCTCATAACATATGTTTCAAACGCAAATCAATTGGAAATACTGCTGGAATGACATCTAATCATGAATGGGCTGGAATTATAAGCTGTCATACTGAACAAGGCTGGGTGCCAGTTGCATCTTTTGTAGATGGTAGAGGAGAGTGCGGCATAGGCGATCATATACTTGCTACTGATATAGACAACCTGAAAAACGAAATGAAACATTACTCGAGTAAGAAAATATTTTCGGACTCAATGGAGCAGGTAGACGTAGATACTTGTGAAGGGACAAGGTGTGGTCGGGGCTCGTGTATTGGCTTGGAGCGAATATGTGACGGTGTAAGACAGTGTGAAGATGGAAACGACGAATCCGAAGAATCCTGTCATAAAAAAGAACACATCTGTAACGGCGATCCGTTCCATTCGGGATGTGGGTGTTCATCGGGTCAAATGAAATGCCGTAACGGTAAATGCTTGTCTAAAGAGTTATTCAGAGATGGTCATGATGACTGTGGGGATGGAACAGATGAGCCTGGTCACACCACGTGCTCGGATTATTTGGCAAGAGTTATGCCTTCGAGACTTTGTGATGGAATCTTACACTGCCATGACAGAAGTGATGAAGATCCGAGCTTCTGTAAATGCTTTGCTAAAAAGGCATACAGATGTAACAGAAAATCGCGTCAATCAGAACAGTGTGTAGCTCCTGATATGTTGTGTGATGGTGTAAGAGATTGTCCAAATGGAGAAGACGAACAAACTTGTATAGGTCTCAGCGCACCAGAGGGAACTCCCTACGGAACTGGACAAGTAATTGTTCGGTCACACGGCGCATGGCATTCTAAGTGCTATCCCACGCAAAACCACACTAAATCAGAGTTAGAAGCCATTTGCAGAGAACTTGGTTTTATAAGCGGCCACGCGAAGGAAATCAAAGGGATGAAAATCCTCCACCCACACAATAGCCTACTTTTAGATCCTTTCAGAGAAGTCGTTTTGAACAACAACACTGTTATTAGAATGAGAAACACGCACGAGCCACTGGCAAAACCCATTTTTAAC
AAAGACATAATTAATTGTTACCCAGTTTTTATAGAATGTTATTAA
Protein
MIETMKTERSFEKLGLPPMELAESTTNRQNNEVKNKWILLIQKLKTTILLIALAAGVYVLSLRLLEIFVGTRSTEIFVVTGTYEYPNITLYDGDMEIFLRLKNNTLEANKRMKRHVSELSDSIIEPVEEFVLTDEDLQNAREIIVKHDLLCKHGDNDETCIKITNRLKEMVLKPKLNKDQSNEIPVIPKIKINSLHSLEADRDLNSNNHAIRNSEAVKKREIHSVAKTQNILESVPRTQLAFAAQPSENHITDPCLERLLKQYTQARNNYELPHSEYAPPQYENPTLNNLHPRYYDPVSHPAFLQGREVEARKSKLHPQDVDIFMSLLTPKPEAISTKKIDAEARTTLCSDGSKPCDNGEGCITEKQWCDGNVDCSDVSDEAKCDCKSRVDKSRLCDGYFDCPFGEDEMGCFGCSETTFSCLDVDMNSQSTCFTKEQRCDNIEHCPNHRDELECNMLAPSLLKKPLFAVSNTEGFLHRNLKGDWYAVCHNPYMWAHDTCRRETGLIIRPPFIQIVPVDPMSKVSYLSTGPGGILQTSETCFNSSAVYVTCPDLLCGTRVPSTSQLLRENAAMENRLFGRNKRFLIDGHPYPFMFFGNRQKRSLVYHNYDLLNYWRNRNELNRPWYIRNMRSESRVVGGKPSQPTAWPWTVAIYRNGMFHCGGVIITQNWVISAAHCVHKFWDHYYEVQAGMLRRFSFSPQEQNHQVTHVIVNQHYKQDDMKNDLSLLRVEPIIQFSRWVRPICLPGPDTAGPDWLWGPSPGTICTAVGWGATVEHGPDPDHLREVEVPIWDKCKHEEDRAGKEICAGPSEGGKDACQGDSGGPLLCRNPTNSHQWYLAGIVSHGDGCARKGEPGVYTRVSLFVKWIKHHIASNSLPIVKPLQECPGFKCKSGISKCIFNKRKCDQIIDCLGGEDEIDCNFVKSTTTLSDNEFLSEMFRSNERSDRIEINRITKGANIENNTLKIIPNTEASTEEATSTSVAVNRYDDKTTMENNFKHASTLELPFSSSSDESHSESHEKSIGLPSMEGTLSISEEKTTASTTTTFEEIFSSTILSTTEEYRGDTAIVNESLESDSTDFQNATMDMESSANDFFKNLNILNASNVENTTNTRPTDETTITDSTTNIPTESQFSQFTLSTAVTIDEYISMNANEIEKLHLLNLITNISNTNSNNTDFRNKSEGLYIELLPKSNISIIQNTDEDVKETTVIEQHITSSTDKGPLKTLIDIESKHDIIKKIEKIVFSQKMPAKIRRRHRIPKDFECGKIHQVISYNLRCDNKADCEDGTDELGCTCIDYLSTINEKFLCDGSFHCADGQDELDCFSCPEDHFLCKRSKLCIPLSNVCDGVPECPQNDDESDCFALTNGKELQYEIDDRPQINLEGFVTKKHLNQWHVVCEDKLTIEQIEQEANHICHYLGFSSARTYSVKYINIKEDDVLLIDKRTKRQIVTTVPVHFAYKNISDGVNATEHVLIQEPQLLKEQCVPNVTKTCKSLYVFCDRTLYTDFDETPLFLRETEVVQTFKWPWIAKVYVEGNYRCTGVLVDLSWVLVSHACLWDTSLHSYISVVLGSHKTLKSVKGPYEQIYKVDARKDLYRSKISLLHLKSPATYSNMVKPMIVASTRNHLEKNNKCVTVGQFDNNETISIFLEETNENCSSHNICFKRKSIGNTAGMTSNHEWAGIISCHTEQGWVPVASFVDGRGECGIGDHILATDIDNLKNEMKHYSSKKIFSDSMEQVDVDTCEGTRCGRGSCIGLERICDGVRQCEDGNDESEESCHKKEHICNGDPFHSGCGCSSGQMKCRNGKCLSKELFRDGHDDCGDGTDEPGHTTCSDYLARVMPSRLCDGILHCHDRSDEDPSFCKCFAKKAYRCNRKSRQSEQCVAPDMLCDGVRDCPNGEDEQTCIGLSAPEGTPYGTGQVIVRSHGAWHSKCYPTQNHTKSELEAICRELGFISGHAKEIKGMKILHPHNSLLLDPFREVVLNNNTVIRMRNTHEPLAKPIFN
KDIINCYPVFIECY
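The CDS and Protein entries above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the names `CODON_TABLE` and `translate` are hypothetical helpers, not part of SGID.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS listed in this record.
cds_start = "ATGATTGAAACTATGAAAACAGAACGATCC"
# Final wrapped line of the CDS, including the TAA stop codon.
cds_end = "AAAGACATAATTAATTGTTACCCAGTTTTTATAGAATGTTATTAA"

protein_start = translate(cds_start)
protein_end = translate(cds_end)
```

Here `protein_start` is `MIETMKTERS` and `protein_end` is `KDIINCYPVFIECY`, which match the beginning and the last line of the Protein entry. To check the whole record, pass the concatenated CDS lines to `translate` and compare the result with the concatenated Protein lines.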

Summary

Pfam
PF00089   Trypsin
PF09342   DUF1986
PF00057   Ldl_recept_a
Interpro
IPR033116   TRYPSIN_SER
IPR023415   LDLR_class-A_CS       
IPR034381   Nudel       
IPR015420   Peptidase_S1A_nudel       
IPR018114   TRYPSIN_HIS       
IPR001190   SRCR       
IPR001254   Trypsin_dom       
IPR036772   SRCR-like_dom_sf       
IPR009003   Peptidase_S1_PA       
IPR036055   LDL_receptor-like_sf       
IPR002172   LDrepeatLR_classA_rpt       
SUPFAM
SSF57424   SSF57424
SSF56487   SSF56487       
SSF50494   SSF50494       
Gene 3D
ProteinModelPortal
PDB
3W94     E-value=1.30582e-42,     Score=442

Ontologies

Topology

Length: 2014
Number of predicted TMHs: 1
Exp number of AAs in TMHs: 22.03099
Exp number, first 60 AAs: 19.17444
Total prob of N-in: 0.98560
POSSIBLE N-term signal sequence
inside    1 - 36
TMhelix   37 - 59
outside   60 - 2014
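The topology regions listed above should cover the full protein length without gaps or overlaps. The following is a minimal sketch of that consistency check; `check_topology` and `REGIONS` are hypothetical names, and the coordinates are copied from this record.

```python
def check_topology(regions, length):
    """Return True if the regions tile residues 1..length exactly."""
    expected_start = 1
    for _label, start, end in sorted(regions, key=lambda r: r[1]):
        # Each region must begin right after the previous one ends.
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == length + 1


# (label, first residue, last residue), 1-based and inclusive.
REGIONS = [("inside", 1, 36), ("TMhelix", 37, 59), ("outside", 60, 2014)]

is_consistent = check_topology(REGIONS, 2014)
tm_length = sum(end - start + 1
                for label, start, end in REGIONS if label == "TMhelix")
```

With these coordinates `is_consistent` is `True` and `tm_length` is 23 residues, in line with the roughly 22 expected transmembrane residues reported above.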
 
 

Population Genetic Test Statistics

Pi: 276.552646
Theta: 220.018322
Tajima's D: 0.808007
CLR: 0.514493
CSRT: 0.606619669016549
Interpretation: Uncertain
Copyright © 2018-2023. Send comments and suggestions to: zhuzl@cqu.edu.cn