SGID The Silkworm Genome Information Database
Gene
KWMTBOMO06845
Pre Gene Modal
BGIBMGA012062
Annotation
PREDICTED:_collagen_alpha-1(XVIII)_chain-like_[Amyelois_transitella]
Location in the cell
Nuclear   Reliability : 2.972
 

Sequence

CDS
ATGTTTACCAGCGTATGCGCCAAAAATGTGTATTGCAAAGAGGAACTTGAACACACGGATTTAAGTGTTATCTGTATTGACAATGACGTGAGTGTCATATCCAAGATGGCGATGGTTATTTCCCAGACGACAAAAGCGGCAATTGGCGGGTTTTTAGCTTTAATGATCCTTGGTGCTGTGCTAGTTGTCGGAGCCGCTTTCGGGTGGTTCGACCCGAAAAGAGAAGATGAGGATACGCCTGCAAAGATCAGCGCTAGATTGCAGGGGAGGACCTCGTTCGTCACGCATAGAGCGGTGAACCCAGACGAGAATCATCTACCGGTTGTCTCGCACTCGTCGCCGTTTTACGTTGGAGATAAAAGTGGAAGCACAGATGCAACCGCAGTACAAGGAACGTGCCGATGCAGTCTAAATGACATCTCTCAAATCTTGGAAGTCATGCCAGAGTTAAGAGGACCTCCTGGACCACAAGGACCAACCGGGGCTGACGGCACTACGGGCGCTCCAGGGAAAACGGGACAAATCGGAGACCCCGGACCTGCAGGGCCTCCTGGATCTAAAGGCGATCGAGGGGAAAGAGGGGATACAGGGAATTCAGGTCCAGAAGGTCAACCTGGTCCCAAAGGCGACCCTGGCGCAGATGGTTCTCCTGGGCCACAGGGCCCTCAAGGCCCACCCGGGCTACCAGGCCCTGTGACATCTGCACTAATAGAATCAACAGGATTATATGGGTCTGTTAATCCTGGTATCCAAGGACCTCAAGGCGAGAGGGGTCCAATGGGCCTTCCAGGGCCTCAAGGGGAGAGAGGCTACCCTGGGAATAAGGGCGAGAGGGGATTACTTGGCCCTAAAGGAGACAAAGGCGACCGGGGGTACGTGGGTCTTAGAGGTCCACACGGTGCCAAAGGCGAAAAAGGCCCTCCAGGCCGAGACGGGGTCCCAGGCCTGCCGGGAGCGCACGGCCGTCCCTCAGAAAAGGGTGAAAAAGGTGCTAGAGGTCTTCCTGGTCTGCCTGGACCTCCAATGCCGACAATATCTGACAATGCTGTGCTGAGTGACATCACACGAACAGAGGCTTCGCTCTTGAAGGGTCCGAAGGGAGACCGCGGGGAGACTGGTGAAAAGGGTGAGAAAGGAAATAGGGGCCTCGAGGGACCACAAGGATTCCCAGGCGTTGATGGAAAGCCTGGTGAACGGGGTGACATAGGGCCGTCTGGCTTACCAGGAACACAGGGCTCGACTGGTCTTCCAGGGTCTAAAGGTGAACGCGGCGAACCTGGCCCGCCTGGACCAGTGGCTATATCTCGAGACGATGCTATGGTTATGACCAAGGGCGATAAAGGTGATGCAGGGCCTCGCGGTAAAAGAGGCCATCCAGGGCCACCTGGACCGAGAGGTCCTCCAGGACTACCAGGACCTCCAGGTACTCCAGGTGTGAATGGGCCCACTGGAGACATTGGACTTCCTGGATGGGCGGGCCCGCCCGGAGCCACGGGTCAGCCCGGTCCCCCAGGCCCGAAAGGTGAAAAAGGGGACTCCGCCGTTAACCCACTTGACTTGGAAAAGATAAAAGGTGACAAAGGTGATAGAGGATTCGATGGTAATCCTGGTGCGCCGGGCAAGGATGGGCCGCGTGGGCCTCCAGGGCCTCCTGGGACTCCAGCATCGAACGTCCAATACGTCCCAGTACCTGGACCTCCAGGCCCCCCTGGTCCACCTGGAACTTCACTAGGGCTACCAAAAGATTCAGTCGATACACTCACAGATAGTCCTGGAGTTAGAAGGGAGCCCAATACAGGCAAACAGCGGGATCCTCTACAAATTCTACGAAGCCTGAACCATCTGGTGCAGTACCGACAAGAGCCGCACGGTTTTCGAGATTCATTAGATCCACTTGGTGAAGGCTCAGATTTAGATGACGAAGAGGACGGAAGGGCTATTGTTGGAACCATCCTATTTAAAACTACTGATTCGTTGAGACGTCTAGGAGTAAATAGTCCTCGAGGAACTTTGGCCTACGTGTTACAAGAACAAGCTTTGCTGATTCGTGTCAACAACGGCTGGCAATACGTAGCGATGGGGTCATTACTCGCTTTGCACGCTCCGCCCACGAACGGCCCTACCAGGTCGCCATTGCAGAACATTCTGGAAACGTCCAGTTTGGTTCACCACAAGAATCCAGCTGTCGACGGACCAGTCTTACGTCTGGCTGCATTAAACGAGCCTCGCACGGGCGACATGCACGGCGTTAGCAGCACAAATTACGAATGCAAAAGACAAGCTCAGAGAGCTAACATGGACGGAACATTCCGAGCGTTTATATCGTCGAGGGTTCAAGCAATAGATTCCATCGTCAGCTGGGTAGACAGAGAGATTCCAGTCGTGAACACAAGAGGGGATGTTCTCTTTAATTCTTGGGGTGAAATGTTCGATGGTTCAGGAGCATTGTTCGCTCATGCCCCCAGGATATACAGCTTTAGTGGTCAGAATGTTCTAATGGACCCTGGTTGGCCTACAAAAGCCGTCTGGCACGGCGCTAGTTCCACCGGCGAGCCAGCCATGGATGCGTACTGTGACGCATGGCATAGCAGCAGCCCAGATAAATTTGGTCTCGCATCTTCACTGCACACGAATAAACTGCTGGATCAAGAGACATATTCGTGCAGCAGTAGGTTGATAGTATTATGCGTGGAAGCAACTCCGGCTGATACGGTTCGAAGAAAAAAACGATCCAGGTATCGAACAAACGAAAAGCTCAAGTTCTTGAAAGATTCAGAAAAATCGAATGAAACTAGTCAAAGCTTATAG
Protein
MFTSVCAKNVYCKEELEHTDLSVICIDNDVSVISKMAMVISQTTKAAIGGFLALMILGAVLVVGAAFGWFDPKREDEDTPAKISARLQGRTSFVTHRAVNPDENHLPVVSHSSPFYVGDKSGSTDATAVQGTCRCSLNDISQILEVMPELRGPPGPQGPTGADGTTGAPGKTGQIGDPGPAGPPGSKGDRGERGDTGNSGPEGQPGPKGDPGADGSPGPQGPQGPPGLPGPVTSALIESTGLYGSVNPGIQGPQGERGPMGLPGPQGERGYPGNKGERGLLGPKGDKGDRGYVGLRGPHGAKGEKGPPGRDGVPGLPGAHGRPSEKGEKGARGLPGLPGPPMPTISDNAVLSDITRTEASLLKGPKGDRGETGEKGEKGNRGLEGPQGFPGVDGKPGERGDIGPSGLPGTQGSTGLPGSKGERGEPGPPGPVAISRDDAMVMTKGDKGDAGPRGKRGHPGPPGPRGPPGLPGPPGTPGVNGPTGDIGLPGWAGPPGATGQPGPPGPKGEKGDSAVNPLDLEKIKGDKGDRGFDGNPGAPGKDGPRGPPGPPGTPASNVQYVPVPGPPGPPGPPGTSLGLPKDSVDTLTDSPGVRREPNTGKQRDPLQILRSLNHLVQYRQEPHGFRDSLDPLGEGSDLDDEEDGRAIVGTILFKTTDSLRRLGVNSPRGTLAYVLQEQALLIRVNNGWQYVAMGSLLALHAPPTNGPTRSPLQNILETSSLVHHKNPAVDGPVLRLAALNEPRTGDMHGVSSTNYECKRQAQRANMDGTFRAFISSRVQAIDSIVSWVDREIPVVNTRGDVLFNSWGEMFDGSGALFAHAPRIYSFSGQNVLMDPGWPTKAVWHGASSTGEPAMDAYCDAWHSSSPDKFGLASSLHTNKLLDQETYSCSSRLIVLCVEATPADTVRRKKRSRYRTNEKLKFLKDSEKSNETSQSL

Summary

Pfam
PF01391   Collagen        + More
PF06482   Endostatin
Interpro
IPR016187   CTDL_fold        + More
IPR008160   Collagen       
IPR016186   C-type_lectin-like/link_sf       
IPR010515   Collagenase_NC10/endostatin       
SUPFAM
SSF56436   SSF56436       
Gene 3D
PDB
1DY2     E-value=2.52683e-40,     Score=419

Ontologies

Topology

Length:
935
Number of predicted TMHs:
1
Exp number of AAs in TMHs:
22.73448
Exp number, first 60 AAs:
13.84784
Total prob of N-in:
0.99980
POSSIBLE N-term signal
sequence
inside
1  -  47
TMhelix
48  -  70
outside
71  -  935
 
 

Population Genetic Test Statistics

Pi
27.870939
Theta
25.938975
Tajima's D
0.115028
CLR
1.107716
CSRT
0.403529823508825
Interpretation
Uncertain
Copyright@ 2018-2023    Any Comments and suggestions mail to:zhuzl@cqu.edu.cn   渝ICP备19006517号

渝公网安备 50010602502065号