Biology:C1orf122

From HandWiki
Short description: Protein-coding gene in the species Homo sapiens


A representation of the 3D structure of the protein myoglobin showing turquoise α-helices.
Generic protein structure example

C1orf122 (Chromosome 1 open reading frame 122) is a gene in the human genome that encodes the cytosolic protein ALAESM..[1] ALAESM is present in all tissue cells and highly up-regulated in the brain, spinal cord, adrenal gland and kidney.[2] This gene can be expressed up to 2.5 times the average gene in its highly expressed tissues. Although the function of C1orf122 is unknown, it is predicted to be used for mitochondria localization.[3]

Gene

C1orf122 is located on chromosome 1 at 1p34.3. The gene is 1,665 nucleotides long, covering 37,808,405 to 37,809,454.[4] It contains three exons[5] with boundaries between amino acids 12 and 13, and amino acids 79 and 80.

mRNA

C1orf122 has two isoforms. Variant one contains 1,329 nucleotides with three exons.[6] Variant two contains 1,226 nucleotides with three exons.[7] Variant two lacks an in-frame portion of the 5' coding region, resulting in a shorter N-terminus.[8]

Protein

ALAESM has a molecular weight of 1100 kDa and an isoelectric point of 6.29.[9] It is a cytosolic protein without a transmembrane domain.

This conceptual translation shows various points of interest in the C1orf122 gene.

Predicted post-translational modifications

There are few predicted kinase phosphorylation sites in this protein. Position 7 is predicted to be phosphorylated by CK1, VRK, and VRK2. Position 10 is predicted to be phosphorylated by CRK1, VRK, PKC, PLK, and AGC. Position 82 has a possible phosphorylation by TKL and MLK. Position 94 is predicted to be phosphorylated by PKC, AGC, MAPK, NEK, CMGC and IKK.[10]

ALAESM does have a few predicted reactive sites. It is predicted to be palmitoylated at position 10, allowing the covalent attachment of fatty acids.[11] It is predicted to undergo glycation at positions 21 and 101 which attaches a sugar molecule to the amino acid.[12] It is predicted to have a nuclear export signal strand from position 55-64 which signals the protein to leave the nucleus.[13] It likely can be glycosylated at position 82 and 94 which attaches a carbohydrate to the amino acid.[14] It is predicted to be phosphorylated by an unspecified actor at position 10, 82, and 94 in the nucleus.[15]

Structure

The secondary structure of ALAESM is predicted to be structured as 55% random coil, 35% alpha helix and 9% extended strand. There are two alpha helices between positions 11-18 and 36-68. There are three 2 amino acid sections after position 80 and one 4 amino acid section at position 20 of extended strand.[16] The rest of the protein is random coil. There is no transmembrane domain within ALAESM[17]

Expression

ALAESM is expressed throughout all tissue cells in the body.[18] It is also expressed up to 2.5 times higher than its average level in the brain, spinal cord, adrenal gland and kidneys. The protein is expressed in the cytoplasm and since it is predicted to have a nuclear export signal, it is kept in the cytoplasm even in telophase when the nuclear envelope disassembles.

Homology

This graph shows the date of divergence, sequence length, and sequence identity for the orthologs of the human gene C1orf122.

Human C1orf122 does not have any paralogs, however it has multiple orthologs amongst placental mammals. These species range from cats, horses, rabbits, alpacas, and elephants.[19] The sequence across these species are highly conserved.

References

  1. "DAS-TMfilter server". http://mendel.imp.ac.at/sat/DAS/DAS.html. 
  2. "C1orf122 chromosome 1 open reading frame 122 [Homo sapiens (human) - Gene - NCBI"]. https://www.ncbi.nlm.nih.gov/gene/127687#gene-expression. 
  3. "AceView: Gene:C1orf122, a comprehensive annotation of human, mouse and worm genes with mRNAs or ESTsAceView.". https://www.ncbi.nlm.nih.gov/IEB/Research/Acembly/av.cgi?db=human&term=c1orf122&submit=Go. 
  4. "Genome Data Viewer". https://www.ncbi.nlm.nih.gov/genome/gdv/browser/gene/?id=127687. 
  5. "C1orf122 chromosome 1 open reading frame 122 [Homo sapiens (human) - Gene - NCBI"]. https://www.ncbi.nlm.nih.gov/gene/127687. 
  6. "Homo sapiens chromosome 1 open reading frame 122 (C1orf122), transcript variant 1, mRNA" (in en-US). Int. J. Mol. Med. 24 (2): 233–246. 2009. http://www.ncbi.nlm.nih.gov/nuccore/NM_198446.3. 
  7. "Homo sapiens chromosome 1 open reading frame 122 (C1orf122), transcript variant 2, mRNA" (in en-US). Int. J. Mol. Med. 24 (2): 233–246. 2009. http://www.ncbi.nlm.nih.gov/nuccore/NM_001142726.1. 
  8. "C1orf122 chromosome 1 open reading frame 122 [Homo sapiens (human) - Gene - NCBI"]. https://www.ncbi.nlm.nih.gov/gene/127687. 
  9. "ExPASy: SIB Bioinformatics Resource Portal - Categories". https://www.expasy.org/proteomics/families__patterns_and_profiles. 
  10. "HTTP in PHP". Multi-Tier Application Programming with PHP. Elsevier. 2004. pp. 21–43. doi:10.1016/b978-012732350-3/50003-x. ISBN 978-0-12-732350-3. 
  11. "CSS-Palm - Palmitoylation Site Prediction". http://csspalm.biocuckoo.org/showResult.php. 
  12. "NetGlycate 1.0 Server" (in en). http://www.cbs.dtu.dk/services/NetGlycate/. 
  13. "NetNES 1.1 Server". http://www.cbs.dtu.dk/services/NetNES/. 
  14. "DictyOGlyc 1.1 Server". http://www.cbs.dtu.dk/services/DictyOGlyc/. 
  15. "NetPhosK 1.0 Server". http://www.cbs.dtu.dk/services/NetPhosK/. 
  16. "NPS@ : GOR4 secondary structure prediction". https://npsa-prabi.ibcp.fr/cgi-bin/npsa_automat.pl?page=npsa_gor4.html. 
  17. "TMpred Server". https://embnet.vital-it.ch/software/TMPRED_form.html. 
  18. "AceView: Gene:C1orf122, a comprehensive annotation of human, mouse and worm genes with mRNAs or ESTsAceView.". https://www.ncbi.nlm.nih.gov/IEB/Research/Acembly/av.cgi?db=human&term=c1orf122&submit=Go. 
  19. "C1orf122 chromosome 1 open reading frame 122 [Homo sapiens (human) - Gene - NCBI"]. https://www.ncbi.nlm.nih.gov/gene/127687.