SGID Silkworm Genome Informatics Database
Gene
KWMTBOMO01973  Validated by peptides from experiments
Pre Gene Modal
BGIBMGA014040
Annotation
PREDICTED:_collagen_alpha-2(IV)_chain_[Bombyx_mori]
Location in the cell
Cytoplasmic   Reliability : 2.703
 

Sequence

CDS
ATGTCGGGCCCGGCGCGTGCGCGCCTTCACGCGCTACATCTTGTGCCCGCTGCTCTTCTGATATTCTTCTGGAGTCCAACTCCTGTTTCTACAGTGGTCTGTAATCAGACTCTGTGTGATTGTCGTGGAATAAAAGGAGATCAAGGAAGCATAGGCCCTCCGGGCATCCCAGGCCTTCAAGGGGACTACGGAGCAGATGGACTTGAAGGAAGAATTGGTTTACCCGGTGTGCCGGGAGACTGGGGAGAACGCGGCGATTTCGGGGACAAAGGAGAACGAGGAACCGAAGGATCTTATGGCCGACGAGGAGACGAAGGACCACAGGGTCTGCCTGGGTTAGAAGGAGTATTTGGATTCCCAGGATTGGACGGCTGTAGTGGTATTGATGGAGTTCAAGGCGTTCCCGGTATACAAGGTCCACCTGGTGATAGAGGTATTCCAGGACCTTACGGAGAAAAAGGCCTTCAAGGCTTGGCAGGAGAAGGTGGAGTTAACTCAAGAGGTGCAAAAGGAGATCAAGGAGATGGTGGTAGACCAGGTAGCTATGGACTTCCTGGACCCCAGGGTTGGAGAGGAGACAGTGGTCTACCCGGAGACGAGGGTGATCAAGGCTCAATGGGCTTAAAAGGAGAACCTGGATATAGTGGCGACTTGGGTGAAGATGTCATAGGACCAGCTGGAGAAAAGGGAGATCAAGGCAATGTTGGTGATAGAGGAAGACCTGCCTCTCTAGTAATTGTCGATGAAGTACGAGAGTACAACATTTCAATAACAAGGGGGTCAAAAGGTGAAAAAGGATTACGTGGACAGCAAGGTGTAAAAGGAATTAAAGGAGAAACAGGTTCTGTGGGTCTAAGGGGAACTTATGGACCTAATGGACTACAAGGCTACAAAGGAGATCAGGGTGACGACGGACCTCGGGGCAAGCCTGGACACCGAGGACCACAAGGGCCGCCTGGACAAAAAGGCATTAAAGGTGCTCCTGGATATGCGGGACCGGATGGTGCGAATGGTTTAACAGGACCCAGTGGAGAAGATGCAAGACCGGGAAATCCTGGCCCGCAAGGCTTTTCCGGAGAACCTGGAATATTCGATGAGACACTAAACGAACCTTTACTACCAGGTTCGATGGGTCCACAAGGTCCAGTTGGCTTTGTCGGGCCTATGGGAGCACCTGGATTAGACGGTGCAGTTGGCCAGCCTGGCTTAATGGGACCACCAGGGCTACCAGGAGCTAAAGGCCTACCTGGTCCACAAGGTGCTCCAGGTATATCGCCCAAGGGAGAACCCGGAGATGATGGTTTTAAAGGTCTGCCTGGCCCACGTGGCCCTAACGGATATCCAGGAATACAAGGACCTGTTGGTCCTAAAGGTTTTAAAGGAAACACAGTAATTGGGGAACCTGGTGAAGAGGGCACCCCAGGAATAGATGGCATTCCAGGTTTAAGGGGTGACAGGGGTGAACCAGGTGCAATCGGACCAGCAGGATATCCTGGCAGGGGCGTTTACGGTGCTGGACCGCCGGGTGATCGAGGCCCACCCGGAAGACCTGGAATAGCTGGAAGCCCTGGTAATCCGGGTCGTCCTGGATCGCCTGGTCTAAAAGGAGAACATGGAGATGACTGTCCATTTTGTCCTTCGGGTCTGCCTGGCAGTAAAGGGCAGAAAGGAGACGATGGTTTTACTGGTCGTCAAGGATTCCCAGGTTTTGTTGGCCTACCAGGACCTCGTGGTCAACGTGGAGCTCCAGGGATACCTGGAATAAATGGTCCAAAGGGGCCAAAGGGTAATAAAGGGCTTGCTGGTATGGCAGGACCTCAAGGACCAAAAGGACCTGAAGGACGTTTGAATACACCTGCATATGCCTTAACCGTAGCAGACCGCGGTCCTCGCGGTGAACCAGGTTTTATCGGACCACCGGGTTTTCCTGGGGACACTGGTCGACAAGGACCTCACGGAGATTCAGGAATCACTGGATTCAAAGGTGTGCGCGGTGACTTCGGCCAACCCGGAACTCCTGGAAGAAACGGATCCCGTGGAAGGGATGGAGTACCTGGACGGCCCGGAGCTACTCCGGATATTCCGATGGCTTTTCTTTTCGGTGAAAAAGGAGATATGGGTTTAACTGGCGAAGAAGGAGATAAGGGAGAAATTGGCCCACCTGGAGAAACTGGAGAATCACTGATTGATGGCATTTATGCTAAAGGAGAAAAAGGATATCCAGGATACCCTGGCTTCAATGGTGTACCCGGGAGAAAAGGTATTCGAGGGGATCAGGGTTCAGATGGGTACCCAGGTATGGGAGGTGATATTGGAATACAGGGTACATCACAACAAGGACCAAAGGGCTTCAAAGGATTTAGTGGAGAAAAAGGTAGCATTGGGCCATACGGTGATCCAGGTAGTCCTGGTTACCGGGGACCCTTTGGCAGCGATGGTCAAAAAGGACAAAAAGGTATGCGTGGTGACGGTGGCTACGCAACTTTGCCAGGAGAAAGAGGATCAAACGGAATAAACGGAGAACGAGGCGATAATGGAGAGCCTGGTTATCCTGGGACTCCCGGAAGATCTGGAGTTATAGGTCAGAAAGGCGTAACTGGAGCGCCTGGAGATATAGGACCTGATGGACCACCTGGACCTCGGGGACGAAAAGGTTTCCCCGGGATAATAATACAAGGTGCACCTGGACTACCTGGACGTCCAGGATCACCTGGTCCGATGGGACCTATTGGTGCACAAGGCTTACAAGGAAGTAACGGCCTACAAGGTTTTACAGGACCTAAAGGAATGAAAGGAAACTCAGGAAGATATGGGCGTCGCGGTGAGCCTGGTGTACCTGGGTCGATAGGGTTCACTGGATTTTCTGGATTAATGGGATTACCTGGAGCCAGTGGAAAGCCAGGAGAGCGAGGTGATAAAGGTGCTCCTGGTTTTAATGGAGAGCCTGGTCAAGATGGTTTAGTAGGGCCTCTAGGACCTAAAGGCTATATAGGTAACCCAGGCTTAACAGGAAAACCAGGTATTCCAGGCTTACCAGGAGGAAAGGGAGAACGTGGAATTTATGGTTATCAAGGACCAAAAGGAGACCGAGGAGATAACGCGGTTTCTGGATCAAAAGGAGAACCCGGTGAACATGGCATAAATGGTTATACAGGGAGAGATGGAATACCCGGACAAAAAGGAGTTAAGGGTCCTCAAGGAATGCCTGGTTTAAGCATACGAGGTCCGCCTGGTACTAAAGGTGATAAAGGAGATACAGGTTTCTCAGGGACACCTGGTTTCCAAGGACCAAGAGGTGCTATAGGTGACAAAGGTTTCCCTGGGCAAACAGGAGAACCAGGTGAAAAAGGTTTGCCGGGTACTGATGGATTCGCAGGTAAAAATGGATTAATTGGTACGAAAGGACAGCAGGGTCTATATGGAGCATCAGGTAGAAAGGGTGACCGTGGCGACAGAGGAATGGATGGTGAGCCTGGACGAATCGGCATTCCGGGATTACCTGGTAATAAAGGTTTTCAAGGACCACAAGGACGTATAGGCCCCACAGGAATCAAGGGTGATCGCGGTGATATTGGTAGAACCATTTATCTACCAGCTACTAAAGGAGACATTGGTGATATTGGGTTCCCCGGCCTTCCTGGTAATCCAGGAGAATATGGAGACCCTGGATATACTGGACATATTGGTAGAAAAGGAGAGCAGGGTGATGTTGGAATCAAAGGAATGGTCGGCAACGAGGGTCTTCCAGGTTTCATTGGAGAAAGAGGTGACCAAGGACCTCGTGGTTTACCTGGTCGAAACGGTGAATTTCCTGAAAGAGGCGAACAAGGTAAAGCAGGAATTGACGGATTACCAGGATGGCCTGGTCCCATGGGTCAAAAAGGAGCTCCAGGAGAATATGGAGTAGATGGTCCTGAAGGCACACCAGGTCAACCGGGATTAACATATGAAGGAGCGAAAGGTGATCGTGGTATGCCTGGTTTTACGGGACGCCGAGGATACCTAGGGATACCAGGACCGCGGGGGATGGAGGGACATCCTGGATTAAAAGGTGAAAAAGGAAATGCAGGTGAAATAGGTTTTGCCATTAGTCCCAAGGGCCAACCTGGATATCCGGGTTTGGCTGGTTACAATGGATTAAAAGGTGAAAAAGGTACAATCGGAGAAACTGGAAGAATTGGAATTTACGGAGAACAAGGCTTAGTCGGGGTCAAAGGAGAGACAGGCGACGAAGGTGAAGCTGGATTCATCGGACCCCCCGGTCTACAAGGAATAGAAGGAGACAAAGGAGATACTCTGCTTCCATCAGATATTTTACCTGGACAACCTGGTGATGTTGGATTGCCAGGATTTGATGGACGGGATGGTGTAGCAGGAATCCCAGGGTCTTTTGGTCGAAATGGTTACCTGGGTGTCAAAGGGCAAAGGGGAGAAATAGGATTACAAGGTCCTCCTGGAGAACCAGGCCCTCAAGGGCTACAAGGTATCAAAGGTAAACGAGGGCTTAAAGGTTTCGTTGGATTGCATGGCAGCGTCGGTCGTCCCGGTCCACCAGCACCTCCACCTCCAATACCCAAATCTAGAGGATTTTATTTCACGGTTCATTCACAAACAAGATTGATACCGAAATGTCCATCTGGAACATCACCGCTTTGGGAAGGCTTTTCGTTAATTCATATAATCGCTAATGGAAAAGCTCACGGTCAAGATCTCGGCGCTCCAGGTAGTTGTCTCCGTAAATTCTCCACAATGCCGTATTTGTTTTGCAATCTAAATAATGTTTGCGACTTTGCCCAACGAGAAGACTACAGCTTTTGGCTATCTACTCCAGAACCAATGCCCATGGCAATGACGCCGATTCAAGCCAGAGACGTCGGCACATATATTTCTCGATGTCAAGTATGCGAAGCTCCAACTAGAACAATCGCTATACACAGTCAGAGCAGTGAGGTGCCGACATGTCCAGATAAATGGAAGGAGTTGTGGGTGGGCTACAGCTTTTTAATGCATACCGCGGGCGCAGACGCATCAGGACAGAGCCTCATCTCACCTGGTTCATGCTTACGAGAGTTCAGGACACGACCGTTCATTGAATGTAATGGTCTCGGACGTTGTAACTATTTCGCTACTGCTGTCTCATACTGGCTGTCTACAATAAACGACAACTGGATGTTCCAAAAACCCATACAGGAAACATTAAAAACTGATAAAGACTCAAGAGTCAGCAGATGTACGGTGTGCATGAGACAATGGGCATCTCAGACGTATAGCACCGCAACGAGAACCGGTGATGTTAGCTTTGTACCCAACGCCCAACGACGACCTTTCCGTGGGCCAAGCCCTACAGGCAGAGTACCACGGTGGCACCGCAGAGGTTATCGTGTCGCCTCGCGGAAGGTACCTTCGTTTAACAAACATTAA
Protein
MSGPARARLHALHLVPAALLIFFWSPTPVSTVVCNQTLCDCRGIKGDQGSIGPPGIPGLQGDYGADGLEGRIGLPGVPGDWGERGDFGDKGERGTEGSYGRRGDEGPQGLPGLEGVFGFPGLDGCSGIDGVQGVPGIQGPPGDRGIPGPYGEKGLQGLAGEGGVNSRGAKGDQGDGGRPGSYGLPGPQGWRGDSGLPGDEGDQGSMGLKGEPGYSGDLGEDVIGPAGEKGDQGNVGDRGRPASLVIVDEVREYNISITRGSKGEKGLRGQQGVKGIKGETGSVGLRGTYGPNGLQGYKGDQGDDGPRGKPGHRGPQGPPGQKGIKGAPGYAGPDGANGLTGPSGEDARPGNPGPQGFSGEPGIFDETLNEPLLPGSMGPQGPVGFVGPMGAPGLDGAVGQPGLMGPPGLPGAKGLPGPQGAPGISPKGEPGDDGFKGLPGPRGPNGYPGIQGPVGPKGFKGNTVIGEPGEEGTPGIDGIPGLRGDRGEPGAIGPAGYPGRGVYGAGPPGDRGPPGRPGIAGSPGNPGRPGSPGLKGEHGDDCPFCPSGLPGSKGQKGDDGFTGRQGFPGFVGLPGPRGQRGAPGIPGINGPKGPKGNKGLAGMAGPQGPKGPEGRLNTPAYALTVADRGPRGEPGFIGPPGFPGDTGRQGPHGDSGITGFKGVRGDFGQPGTPGRNGSRGRDGVPGRPGATPDIPMAFLFGEKGDMGLTGEEGDKGEIGPPGETGESLIDGIYAKGEKGYPGYPGFNGVPGRKGIRGDQGSDGYPGMGGDIGIQGTSQQGPKGFKGFSGEKGSIGPYGDPGSPGYRGPFGSDGQKGQKGMRGDGGYATLPGERGSNGINGERGDNGEPGYPGTPGRSGVIGQKGVTGAPGDIGPDGPPGPRGRKGFPGIIIQGAPGLPGRPGSPGPMGPIGAQGLQGSNGLQGFTGPKGMKGNSGRYGRRGEPGVPGSIGFTGFSGLMGLPGASGKPGERGDKGAPGFNGEPGQDGLVGPLGPKGYIGNPGLTGKPGIPGLPGGKGERGIYGYQGPKGDRGDNAVSGSKGEPGEHGINGYTGRDGIPGQKGVKGPQGMPGLSIRGPPGTKGDKGDTGFSGTPGFQGPRGAIGDKGFPGQTGEPGEKGLPGTDGFAGKNGLIGTKGQQGLYGASGRKGDRGDRGMDGEPGRIGIPGLPGNKGFQGPQGRIGPTGIKGDRGDIGRTIYLPATKGDIGDIGFPGLPGNPGEYGDPGYTGHIGRKGEQGDVGIKGMVGNEGLPGFIGERGDQGPRGLPGRNGEFPERGEQGKAGIDGLPGWPGPMGQKGAPGEYGVDGPEGTPGQPGLTYEGAKGDRGMPGFTGRRGYLGIPGPRGMEGHPGLKGEKGNAGEIGFAISPKGQPGYPGLAGYNGLKGEKGTIGETGRIGIYGEQGLVGVKGETGDEGEAGFIGPPGLQGIEGDKGDTLLPSDILPGQPGDVGLPGFDGRDGVAGIPGSFGRNGYLGVKGQRGEIGLQGPPGEPGPQGLQGIKGKRGLKGFVGLHGSVGRPGPPAPPPPIPKSRGFYFTVHSQTRLIPKCPSGTSPLWEGFSLIHIIANGKAHGQDLGAPGSCLRKFSTMPYLFCNLNNVCDFAQREDYSFWLSTPEPMPMAMTPIQARDVGTYISRCQVCEAPTRTIAIHSQSSEVPTCPDKWKELWVGYSFLMHTAGADASGQSLISPGSCLREFRTRPFIECNGLGRCNYFATAVSYWLSTINDNWMFQKPIQETLKTDKDSRVSRCTVCMRQWASQTYSTATRTGDVSFVPNAQRRPFRGPSPTGRVPRWHRRGYRVASRKVPSFNKH

Summary

EMBL
BABH01041529    BABH01041530    BABH01041531    KQ459601    KPI93819.1    KQ460398    + More
KPJ15137.1    AGBW02013994    OWR42429.1    NWSH01000276    PCG77831.1    KZ150090    PZC73685.1    PCG77832.1    ODYU01011068    SOQ56569.1    JTDY01003136    KOB70085.1    ADTU01022807    ADTU01022808    ADTU01022809    KQ976745    KYM75513.1    KQ981931    KYN32828.1    GL887844    EGI69784.1    KQ971343    EFA04092.2    GL450531    EFN80869.1    GBYB01006548    JAG76315.1    QOIP01000011    RLU17105.1    KQ434889    KZC10378.1    KQ980487    KYN15887.1    GEZM01070756    JAV66296.1    DS235093    EEB11814.1    GL441551    EFN64628.1    GFXV01005227    MBW17032.1    ABLF02028011    ABLF02028012    ABLF02057543    KK107575    EZA48733.1    KK852643    KDR19587.1    KQ982409    KYQ56760.1    NNAY01000806    OXU26414.1    DS231895    EDS44340.1    LJIG01000819    KRT86121.1    KQ977649    KYN00806.1    GFDL01006616    JAV28429.1    GFDF01000875    JAV13209.1    NEVH01016289    PNF26261.1    GANO01000796    JAB59075.1    GFDF01000876    JAV13208.1    KZ288225    PBC31926.1    ATLV01025642    KE525396    KFB52537.1    GGLE01001748    MBY05874.1    HACA01020288    CDW37649.1    GDIQ01070032    JAN24705.1    GDIQ01220086    GDIQ01115712    JAK31639.1    GDIQ01115715    JAL36011.1    GAKT01000057    JAA93005.1    GDIQ01220088    GDIQ01115714    JAL36012.1    GDIQ01168143    JAK83582.1    GDIQ01270945    GDIQ01245876    GDIQ01187284    GDIQ01132071    JAJ80779.1    GDIQ01070031    JAN24706.1    KQ414621    KOC67613.1    GDIQ01128222    JAL23504.1    GDIQ01198784    JAK52941.1    GDIQ01200087    JAK51638.1    GDIQ01102021    JAL49705.1    KK855806    PTY23941.1    LRGB01002580    KZS07033.1    GDIQ01220087    JAK31638.1    GDIQ01270399    GDIQ01115713    JAJ81325.1    GDIQ01200086    GDIQ01200084    JAK51639.1    GDIP01254151    JAI69250.1    GDIQ01200085    JAK51640.1    GDIP01022206    JAM81509.1    GDIQ01116922    JAL34804.1    GDIP01167260    JAJ56142.1    GDIQ01081124    JAN13613.1    GDIQ01188538    JAK63187.1    GDIQ01060391    JAN34346.1    GDIQ01099066    JAL52660.1    GDIQ01097618    JAL54108.1    GDIQ01084079    JAN10658.1    GDIQ01115717    JAL36009.1    JH431704    GDIQ01051802    JAN42935.1    GDIQ01068190    JAN26547.1    GDIQ01084081    JAN10656.1    GDIQ01218932    JAK32793.1    GDIQ01084083    JAN10654.1    GDIQ01084082    JAN10655.1    GDIQ01082679    GDIQ01065387    JAN29350.1    GDIQ01265618    JAJ86106.1    GDIQ01161946    JAK89779.1    GDIQ01243379    GDIQ01206038    JAK08346.1    GDIQ01066748    JAN27989.1    GDIQ01084080    JAN10657.1    GDIQ01220089    GDIQ01081126    JAK31636.1    GDIQ01019022    JAN75715.1    MF688643    AXC43910.1    GDRN01059691    JAI65476.1    GEDV01004639    JAP83918.1    GDIQ01066749    JAN27988.1    GDIP01195711    JAJ27691.1    GFTR01008880    JAW07546.1    GDIQ01060393    JAN34344.1    GDIQ01060392    JAN34345.1    GDIQ01097620    JAL54106.1    GFAC01000921    JAT98267.1    GL379798    EGT33736.1    GDIP01136319    JAL67395.1   
Pfam
PF01413   C4        + More
PF01391   Collagen
Interpro
IPR001442   Collagen_IV_NC        + More
IPR036954   Collagen_IV_NC_sf       
IPR008160   Collagen       
IPR016187   CTDL_fold       
SUPFAM
SSF56436   SSF56436       
Gene 3D
PDB
5NAZ     E-value=3.60216e-83,     Score=791

Ontologies

Topology

SignalP
Position:   1 - 31,         Likelihood:  0.991891
 
 
Length:
1806
Number of predicted TMHs:
0
Exp number of AAs in TMHs:
5.59743999999999
Exp number, first 60 AAs:
5.58673
Total prob of N-in:
0.26474
outside
1  -  1806
 
 

Population Genetic Test Statistics

Pi
269.140083
Theta
184.206299
Tajima's D
1.278271
CLR
0
CSRT
0.730013499325034
Interpretation
Uncertain
Peptides ×
Source Sequence Identity Evalue
28556443 RGEPGVPGSIGFTGF 100.00 1e-09
26822097 TRPFIECNGIGR 100.00 5e-08
26280517 TGYIPPFNSFYYPFAQR 100.00 5e-08
28467696 TIAGSHTHTTPVIISFGER 100.00 5e-08
26822097 GYIGIPGPR 100.00 8e-06
28556443 TGDVSFVPNAQR 100.00 8e-06
28556443 QWASQTYSTATR 100.00 0.022
Copyright@ 2018-2023    Any Comments and suggestions mail to:zhuzl@cqu.edu.cn   渝ICP备19006517号

渝公网安备 50010602502065号