SGID Silkworm Genome Informatics Database
Gene
KWMTBOMO03116  Validated by peptides from experiments
Pre Gene Model
BGIBMGA005772
Annotation
neuroglian_[Mythimna_separata]
Full name
Neuroglian      
Location in the cell
Cytoplasmic     Reliability: 1.05
Extracellular   Reliability: 1.055

Sequence

CDS
ATGGATACGAGCAGAATTCTGTGTTTCATAGCGTTAGTCTCGCAGGCCGCTGCTTTATTAACATCTCCACCAAAAATCGTGAAGCAACCGACTCAGGAAGAAATCCTGTTCCAAGTGGCGCAACAGGGAGAAGTGGATAAACCTTTTATTATAGAGTGCGAGGCTGAAGGAGAACCAGCACCTAAGTACAAATGGATCAAAAACGGCAAATCATTCGAATACTCGAGCTACGACAACCGTATCTCGCAACAAACCGGTCGAGGTACTCTCGTCATCAGCAAGCCCAAGGAGGAAGATCTGGGACAGTACCAGTGCTTCGCCTACAACGAGTGGGGAACCGCTGCATCCAACTCGGTCTTTGTAAGGAGAGCCGAGCTCAATTCCTTCAAGGAAACCGATGGTCAGCAAGTTGTGAGAGCTGAAGAAGGTAAACCTTATAAGCTGACGTGCGAGCCCCCGGACGGTCATCCGAAGCCCAAGGTCTACTGGATGTTGCAAGGGGACCAGGGTCAGCTGAAGACGATCAACAACTCTCGTATGACACTCGACCCGGAAGGAAACCTCTGGTTCTCCAACGTGACCCAAAACGACGCCAGCGAAGATTACGCGTACATCTGCACGGCCAACTCCGGCTTCAGAAATGAATACAAAGTTGGGAACAAGATCTACCTTCAGGTCATCCCGACCGGCATCTCGCCGACACTCAACCGACACGAGCCTGTGCGCCAGTACACCACCAGGAAGATCGAGAGAGGCCTCAAGGGTCAGCGCGTGGAGCTCTACTGCATCTACGGCGGAACGCCGCTCCCTCAGATCGTATGGAAGAAGAACGGGCGCAGCATCTTGTGGTCTCAGAGCATCACCCAGGACAACTACGGGAAGACTTTAGTGATCAAGTATCCGACGTACGACGACCAGGGCACGTACACGTGCGAGGTCAGCAACGGTGTGGGCTCGGCGCAGTCCAACTCCATCCAACTCAACATCGAAGCTGCGCCATTCTTCACGGTGGAACCAGAGATCCAGAACCTAGCCGAGGGCGAGACCGCGGTCATCCGCTGCGAGGCGGACGGCACCCCGGCGCCGAAGATATCCTGGATCCACAACGGGAAGCCGATCGAGCAGGCCGAGCTGAACCCGCGCCGCCAAGTCAGCGCCAACTCCATCGTCATCACCGACCTCGTGAAGAAGGACACGGGCAACTACGGCTGCAACGCCACCAGCGCCATCGGCTACGTCTACAAGGACGTCTACATCAACGTGCAGTCCATTCCCCCTGAAATCAAGGAGGGTCCCATGAACCTGACCAACGTGGACGGCTCCGAGGCTGTGCTCAAGTGCAGAGTCTTTGGAGCACCGAAACCTATCGTTAAATGGATGAAGGACAACGTCGACGTCACAGGAGGGAAATACAACATAACGGCGGAGGGCGACCTCGTGATCCGCGACGTGTCGTTCACTGACGTGGGCACGTACCAGTGCTACGCCAAGAACAAGTTCGGCGAGACCTCCGCCTACGGATCGCTGGTGGTCAAGAAGCACACGGTGATCACGGACAAGCCGGAGCACTACGAGGTGGCGGCGGGCTCGTCGGCCACGTTCCGCTGCAACGCCAACGCGGACGACTCGCTGCCGCTGCAAATCATCTGGCTCAACGACGGCCAGCAGATCGACTTCGAGAGCCAGCCGCGCTTCCGCATGACCAACGACTACTCGCTGCTCATCTCCGACACCAGCGAGCTCGACTCCGGCCAGTACACCTGCATCGCCAGGACGCCCGTCGACGAGGCGCGCGCCCAGGCCACGCTCACCGTCCAAGGGTGTGGAGCTTGGTATTTGAGTGTGAGTGTGTGTGCAGACAAGCCGAACCCGCCGGTGCTGACGGGCGCGGAGTGCGGCGCGCGCACGGCCACGGTGCGGTGGCGGGCGGCGGGCGACAACCGCGCGCCCATCCTGCGCTACTCGCTGCACTACAACACCAGCTTCACGCCGCACGTGTG
GGCCGTCGCCGCCGCCGCCGTGCCGCCAGTAGACTCCGCCTGGACCGCCGCGCTCAGCCCCTGGGCCAACTACACCTTCCGCGTGGTGGCGCACAACAAGATCGGCCCGTCAGCGCCCTCGGGCCACTCCGACGTGTGCACCACGCAGCCCGACGTGCCCTACAAGAACCCCGACAACGTGAAGGGCGAGGGCACCGACCCCACCAACATGCTCATATCGTGGACGAAAATGCCTCAAATCGAACACAACGGCCCCGGGTTCTACTATCTGGTGTCGTGGCGCCGGAACATCACGGGACAACCCTGGGAAGAGCACCAGGTCCGGGACTGGCAGCAAACCGAACACCTGGTGACCAACACCCCCACCTTCCAGCCCTACAAGATCAAGGTGACGGCGGTGAACTTCAAAGGAACATCGAACGTGACTCCCATTGAAGTGATCGGCTGGTCCGGAGAGGACCGACCCACTCAGGCTCCGGGCAACCTCAGCCTGTCCCAGGTGACGTCCGGCACCAGCGCCGTGCTCAGCTGGACTCCCGTGGCGCCGGAGTCACTGCGGGGGCACTTCAAAGGATACAAAATCCAGACCTGGACCGACGCCGAGGAGGACCGGCTCAAGGAGATCTTTGTGGAGTCCGACGCGACCAGCGCCCTCGTCAACAAGTTCAAGCCCTTCAAGAAGAACAACGTGCGCATCCTCGCCTACAACGGCCGCTTCAACGGGCCCGCCAGCGACACCCTCAGCTTCGTCACCCCCGAGGGACAGCCCTCCACCGTTCGCTCCTTCGAGGCCTACCCTATCGGCTCCTCCGCCATGCTGCTCAAGTGGGAGAAGCCCATGGAGGAGAACGGCGTCCTCACCGGCTACAAGATCTACTACAGCAAGGTGATCGCCACGGCCGTGGAGACTCCGAAGGAGCGCAAGAAGGAGATCGACCCCAAGTTCGACCGCGCCAAGCTCGCCGGCCTCGAGCCCAACACCAAGTACAGGATCGAGATCCGCGCCAAGACCAAGGCCGGCGAGGGCAAGGGCTACTACGTGGAGCAGTCCACCAACAACGTGCTCACCGCGCAGCCCGACGTGCCCGCCTTCGAGACCGAGACGCTGCCCGGGAAGGAGGGCACGGCCCACGTCATAGTCCGCTGGATCCCCTCGCTCGACGGACACGCGGGGACGCACTTCGTGGCCTGGTACCGGCTCAAGGGGCACCCCGACTGGCAGCGGACCAACGACATCACCGAGGACGACTTCGTGATCCTGACGGGGCTGGAGCCCGGCCAGCTGTACGAGGTCAAGGTCACGGTGCACGACGGCCACTACTTCAGCACCAGCGAGCTGCGGACGGTCGACACCTCCATAGTAGACGGACCGATAGTGAAGCCGGACGAGAAGATCGCGACCGCGGGCTGGTTCATCGGCGTGATGCTGGCGCTGGCCTTCCTGCTGCTGCTGCTGGTGCTGGTGTGCGTGCTGCGCAGGAACCGCGGCGGCAAGTACGACGTGCACGACCGCGAGCTGGCGCACGGCCGCAGGGACTACCCCGACGCCGGCTTCCACGAGTACACGCACCCATTGGACAACAAGTCTCGCCATTCGATGAGCAGCGGCACCAAGCCGGGGCCCGAGAGCGACACGGACTCGATGGCGGAGTACGGCGACGGAGAGACGGCCGGCATGAACGAGGACGGCTCCTTCATCGGACAGTACGGCCGGAAACGACGTCCGCCAACATCCGGCCAGCCGGACTCAGCCTCGCAAGCATTTGCAACGCTCGTTTAA
Protein
MDTSRILCFIALVSQAAALLTSPPKIVKQPTQEEILFQVAQQGEVDKPFIIECEAEGEPAPKYKWIKNGKSFEYSSYDNRISQQTGRGTLVISKPKEEDLGQYQCFAYNEWGTAASNSVFVRRAELNSFKETDGQQVVRAEEGKPYKLTCEPPDGHPKPKVYWMLQGDQGQLKTINNSRMTLDPEGNLWFSNVTQNDASEDYAYICTANSGFRNEYKVGNKIYLQVIPTGISPTLNRHEPVRQYTTRKIERGLKGQRVELYCIYGGTPLPQIVWKKNGRSILWSQSITQDNYGKTLVIKYPTYDDQGTYTCEVSNGVGSAQSNSIQLNIEAAPFFTVEPEIQNLAEGETAVIRCEADGTPAPKISWIHNGKPIEQAELNPRRQVSANSIVITDLVKKDTGNYGCNATSAIGYVYKDVYINVQSIPPEIKEGPMNLTNVDGSEAVLKCRVFGAPKPIVKWMKDNVDVTGGKYNITAEGDLVIRDVSFTDVGTYQCYAKNKFGETSAYGSLVVKKHTVITDKPEHYEVAAGSSATFRCNANADDSLPLQIIWLNDGQQIDFESQPRFRMTNDYSLLISDTSELDSGQYTCIARTPVDEARAQATLTVQGCGAWYLSVSVCADKPNPPVLTGAECGARTATVRWRAAGDNRAPILRYSLHYNTSFTPHVWAVAAAAVPPVDSAWTAALSPWANYTFRVVAHNKIGPSAPSGHSDVCTTQPDVPYKNPDNVKGEGTDPTNMLISWTKMPQIEHNGPGFYYLVSWRRNITGQPWEEHQVRDWQQTEHLVTNTPTFQPYKIKVTAVNFKGTSNVTPIEVIGWSGEDRPTQAPGNLSLSQVTSGTSAVLSWTPVAPESLRGHFKGYKIQTWTDAEEDRLKEIFVESDATSALVNKFKPFKKNNVRILAYNGRFNGPASDTLSFVTPEGQPSTVRSFEAYPIGSSAMLLKWEKPMEENGVLTGYKIYYSKVIATAVETPKERKKEIDPKFDRAKLAGLEPNTKYRIEIRAKTKAGEGKGYYVEQSTNNVLTAQPDVPAFETETLPGKEGTAHVIVRWIPSLDGHAGTHFVAWYRLKGHPDWQRTNDITEDDFVILTGLEPGQLYEVKVTVHDGHYFSTSELRTVDTSIVDGPIVKPDEKIATAGWFIGVMLALAFLLLLLVLVCVLRRNRGGKYDVHDRELAHGRRDYPDAGFHEYTHPLDNKSRHSMSSGTKPGPESDTDSMAEYGDGETAGMNEDGSFIGQYGRKRRPPTSGQPDSASQAFATLV
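The CDS and Protein entries above can be cross-checked by translating the nucleotide sequence with the standard genetic code. The following is a minimal sketch, not part of the database record: the function name `translate` and the 30-nucleotide prefix are illustrative choices, and only the first ten codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace is ignored; unknown codons are reported as 'X'.
    (Hypothetical helper written for this example.)
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS and first 10 residues of the protein above.
cds_prefix = "ATGGATACGAGCAGAATTCTGTGTTTCATA"
protein_prefix = "MDTSRILCFI"

print(translate(cds_prefix))                    # MDTSRILCFI
print(translate(cds_prefix) == protein_prefix)  # True
```

Passing the full CDS string to the same function should reproduce the complete protein entry if the two are consistent.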

Summary

Description
The long isoform may play a role in neural and glial cell adhesion in the developing embryo. The short isoform may be a more general cell adhesion molecule involved in other tissues and in imaginal disk morphogenesis. Vital for embryonic development. Essential for septate junctions. Septate junctions, the equivalent of vertebrate tight junctions, are characterized by regular arrays of transverse structures that span the intermembrane space and form a physical barrier to diffusion. Required for formation of the blood-brain barrier.
Subunit
Forms a complex with Nrx and Cont.
Keywords
3D-structure   Alternative splicing   Cell adhesion   Cell junction   Cell membrane   Complete proteome   Developmental protein   Direct protein sequencing   Disulfide bond   Glycoprotein   Immunoglobulin domain   Membrane   Reference proteome   Repeat   Signal   Tight junction   Transmembrane   Transmembrane helix  
Feature
chain  Neuroglian
splice variant  In isoform A.
EMBL
AB490501    BAI49425.1    KZ150156    PZC72775.1    ODYU01003405    SOQ42125.1
U50719    AAC47451.1    AGBW02013246    OWR43836.1    KQ459597    KPI95162.1    KQ460970    KPJ10193.1    GECU01002740    JAT04967.1    GFDF01006522    JAV07562.1    NEVH01024950    PNF16340.1    GANO01002435    JAB57436.1    PNF16341.1    PNF16342.1    DS235818    EEB17669.1    GBHO01002530    GBRD01018117    GDHC01017353    GDHC01003765    JAG41074.1    JAG47710.1    JAQ01276.1    JAQ14864.1    GBHO01015288    GBRD01017682    GDHC01016665    JAG28316.1    JAG48145.1    JAQ01964.1    JRES01000755    KNC28615.1    GFDL01007303    JAV27742.1    GFDL01007330    JAV27715.1    GFDL01007275    JAV27770.1    GFDL01007295    JAV27750.1    GFDL01007323    JAV27722.1    GBHO01015279    JAG28325.1    GECL01001809    JAP04315.1    GBHO01002526    JAG41078.1    CH964239    KRF99636.1    CH954180    EDV46357.1    AAAB01008847    EAA45046.5    M28231    AF050085    AF050084    AE014298    AY058284    BT024971    X76243    X76244    KZ288194    PBC33965.1    EDW82547.1    ACZ95239.1    ACZ95241.1    AGB95180.1    KQS29945.1    CM000162    EDX02240.1    KRK06659.1    CH379066    KRT07328.1    ACZ95240.1    AGB95181.1    EAL31379.2    KRT07327.1    CH902640    KPU77357.1    KRK06660.1    OUUW01000003    SPP78106.1    SPP78105.1    EDV38314.2    KPU77356.1    GFTR01008644    JAW07782.1    GFDL01007336    JAV27709.1    ATLV01017216    KE525157    KFB42062.1    GFDL01007361    JAV27684.1    CH479235    EDW36693.1    UFQS01002439    UFQT01002439    SSX14044.1    SSX33463.1    CH933810    KRF94266.1    GGFM01000594    MBW21345.1    EGK96582.1    KQ414667    KOC64766.1    GBXI01000988    JAD13304.1    GGFJ01000760    MBW49901.1    EDW07901.2    KRF94267.1    GAMC01019325    GAMC01019324    GAMC01019323    GAMC01019322    JAB87233.1    ADMH02001982    ETN60086.1    KQ434809    KZC06285.1    CH916371    EDV92457.1    KQS29947.1    KQ435863    KOX70321.1    AFH07295.1    GGFM01000603    MBW21354.1    GGFM01000599    MBW21350.1    GGFK01001442    MBW34763.1    CH940651    EDW65583.2    
KRK06658.1    GGFJ01000748    MBW49889.1    ADTU01008311    ADTU01008312    GDHF01030716    GDHF01014633    GDHF01004025    JAI21598.1    JAI37681.1    JAI48289.1    GDHF01029359    GDHF01024218    GDHF01005058    JAI22955.1    JAI28096.1    JAI47256.1    GGFJ01000761    MBW49902.1    GGFK01001463    MBW34784.1    GAKP01020419    GAKP01020418    JAC38534.1    SPP78107.1    GGFM01000602    MBW21353.1    CH480850    EDW51334.1    CP012528    ALC49261.1    CCAG010021144    JXJN01005350    AJVK01026363    AJVK01026364    AJVK01026365    AJVK01026366    AJVK01026367    KQ982174    KYQ59371.1    CH477503    EAT39893.1    AAZX01014469    KRF94268.1    UFQT01002340    SSX33234.1   
Pfam
PF00041   fn3
PF07679   I-set
PF13882   Bravo_FIGEY
PF00047   ig
PF02820   MBT
PF12140   SLED
Interpro
IPR003598   Ig_sub2
IPR036179   Ig-like_dom_sf       
IPR026966   Neurofascin/L1/NrCAM_C       
IPR036116   FN3_sf       
IPR013098   Ig_I-set       
IPR007110   Ig-like_dom       
IPR003599   Ig_sub       
IPR003961   FN3_dom       
IPR013783   Ig-like_fold       
IPR003529   Hematopoietin_rcpt_Gp130_CS       
IPR013151   Immunoglobulin       
IPR013106   Ig_V-set       
IPR037604   Scm-like-4MBT1/2_SAM       
IPR004092   Mbt       
IPR021987   SLED       
IPR038348   SLED_sf       
IPR013761   SAM/pointed_sf       
SUPFAM
SSF49265   SSF49265
SSF48726   SSF48726       
SSF47769   SSF47769       
Gene3D
PDB
1CFB     E-value=6.38674e-64,     Score=624

Ontologies

Topology

Subcellular location
Cell membrane  
Cell junction  
Tight junction  
SignalP
Position: 1 - 18, Likelihood: 0.975086

Length: 1259
Number of predicted TMHs: 1
Exp number of AAs in TMHs: 23.20957
Exp number, first 60 AAs: 0.35459
Total prob of N-in: 0.01828
outside    1 - 1135
TMhelix    1136 - 1158
inside     1159 - 1259
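The topology listing above can be sanity-checked by confirming that the three segments cover residues 1 to 1259 with no gaps or overlaps. The sketch below is illustrative only: `check_topology` is a hypothetical helper, and the ectodomain length assumes that the predicted signal peptide (positions 1 - 18) is cleaved from the mature protein, which the record itself does not state.

```python
from typing import Dict, List, Tuple

Segment = Tuple[str, int, int]  # (label, start, end), 1-based inclusive


def check_topology(segments: List[Segment], length: int) -> Dict[str, int]:
    """Check that segments tile 1..length exactly; return residues per label.

    (Hypothetical helper written for this example.)
    """
    expected_start = 1
    counts: Dict[str, int] = {}
    for label, start, end in segments:
        if start != expected_start or end < start:
            raise ValueError(f"gap or overlap at segment {label} {start}-{end}")
        counts[label] = counts.get(label, 0) + (end - start + 1)
        expected_start = end + 1
    if expected_start != length + 1:
        raise ValueError("segments do not reach the end of the protein")
    return counts


# Values copied from the record above.
segments = [("outside", 1, 1135), ("TMhelix", 1136, 1158), ("inside", 1159, 1259)]
counts = check_topology(segments, length=1259)
print(counts)  # {'outside': 1135, 'TMhelix': 23, 'inside': 101}

# Assumption: signal peptide 1-18 is removed, leaving the mature ectodomain.
signal_peptide_end = 18
mature_ectodomain = counts["outside"] - signal_peptide_end
print(mature_ectodomain)  # 1117
```

The single 23-residue helix is in line with the expected number of amino acids in transmembrane helices reported above (23.20957).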
 
 

Population Genetic Test Statistics

Pi
20.714032
Theta
19.793308
Tajima's D
0.007913
CLR
1.667313
CSRT
0.378831058447078
Interpretation
Uncertain
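As a quick consistency check on the statistics above: the numerator of Tajima's D is the difference between nucleotide diversity (Pi) and Watterson's estimator (Theta), so the sign of D should match the sign of Pi minus Theta. The sketch below assumes the Theta value listed here is Watterson's estimator and that all three values were computed over the same region. It does not recompute D itself, because the sample size and number of segregating sites needed for the variance term are not given in the record.

```python
# Values copied from the record above.
pi = 20.714032
theta = 19.793308      # assumed to be Watterson's theta
tajimas_d = 0.007913


def sign(x: float) -> int:
    """Return -1, 0, or 1 according to the sign of x."""
    return (x > 0) - (x < 0)


numerator = pi - theta
consistent = sign(numerator) == sign(tajimas_d)

print(f"pi - theta = {numerator:.6f}")          # pi - theta = 0.920724
print(f"sign consistent with D: {consistent}")  # sign consistent with D: True
```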
Peptides
Source      Sequence              Identity   E-value
28467696    HTVIIEPVREGEFAVK      100.00     2e-08
28467696    FNFINSGDPYHAYYQHK     95.45      6e-08
26280517    DVQIPPVIVGVIGNDPKK    100.00     3e-04
Copyright © 2018-2023. Send comments and suggestions to: zhuzl@cqu.edu.cn