Biology:CXorf36

From HandWiki
Revision as of 22:38, 10 February 2024 by Ohm (talk | contribs) (simplify)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Short description: Protein-coding gene in humans


A representation of the 3D structure of the protein myoglobin showing turquoise α-helices.
Generic protein structure example

Chromosome X open reading frame 36 (CXorf36) is a gene that in humans encodes a protein “hypothetical protein LOC79742”. This protein has a function that is not currently very well understood.[1][2] Other known aliases are “FLJ14103, DKFZp313K0825, FLJ55198, PRO3743, FLJ55198, hCG1981635, bA435K1.1,” and “4930578C19Rik.”[3]

Gene

The CXorf36 gene is located at Xp11.3.

The location of the CXorf36 gene on chromosome X

It can be transcribed into 8 different transcript variants, which in turn can produce 6 different isoforms of the protein.[4]

The genomic DNA is 52,529 base pairs long,[1] while the longest mRNA that it produces is 4,735 bases long.

Gene Neighborhood

CXorf36 is closely surrounded by the following genes on chromosome X:[1]

  • DUSP21
  • KDM6A
  • MIR222
  • TBX20

CXorf36 is also surrounded by two other genes on chromosome X that have been implicated in X-linked mental retardation.[5]

Protein

The longest protein isoform that is produced by the CXorf36 gene is termed hypothetical protein LOC79742 isoform 1 and is 433 amino acids long.[6] The protein has a predicated molecular weight of 48.6 kDa and isoelectric point of 8.11.[7]

Domains

The CXorf36 gene protein product contains a region of low complexity from position 16 to position 40.[8]

Post-translational Modification

The CXorf36 protein is predicted to undergo phosphorylation at several serines, threonines, and tyrosines throughout the structure.[9] However, many of these sites are predicted at serines. There is also a predicted N-linked glycosylation site at position 100 on the protein product.[10]

Expression

CXorf36 is shown to be expressed ubiquitously at low levels in various tissues throughout the body. It is expressed highly in the ciliary ganglion, ovary, and uterus corpus. However, highest expression is seen in the trigeminal ganglion tissue.[11]

Conservation

CXorf36 has one paralog in humans known as C3orf58.[12] Orthologs have been found in all mammals and through numerous eukaryotes.[13] However, conservation of the full gene halts past this, most likely a result of duplication from the ancestral gene into CXorf36 and C3orf58. The full list of organisms in which orthologs have been found is given below.

  • Pongo abelii
  • Macaca mulatta
  • Callithrix jacchus
  • Canis familiaris
  • Ailuropoda melanoleuca
  • Equus caballus
  • Oryctolagus cuniculus
  • Mus musculus
  • Rattus norvegicus
  • Monodelphis domestica
  • Ornithorhynchus anatinus
  • Taeniopygia guttata
  • Gallus gallus
  • Danio rerio
  • Bos taurus


References

External links

Further reading