Biology:C4orf17

From HandWiki

A representation of the 3D structure of the protein myoglobin showing turquoise α-helices.
Generic protein structure example

Chromosome 4 open reading frame 17 (C4orf17), is a protein-coding gene in humans. C4orf17 (accession: NP_115525.2) spans approximately 31,289 base pairs on the plus strand of chromosome 4 at gene locus 4q23.[1] It has a molecular weight of approximately 39.6 kDa.[2]

C4orf17
Structure of C4orf17 in Homo sapiens[1]
Identifiers
Aliases[3] dkfzp434g072, Q53FE4, chromosome 4 open reading frame 17, Ci-Spatial
External IDs UniProt: Q53FE4; GeneCards: GC04P099511; HGNC ID: HGNC:25274; NCBI Gene ID: 84103; Ensembl: ENSG00000138813
Accession numbers[4] (Homo sapiens) NM_032149 (mRNA) and NP_115525 (protein)
Gene location (human)
Human Chromosome 4
Chromosome Chromosome 4 (human)[3]
Location of the C4orf17 gene in humans.[3]
Band 4q23 Start chr4: 99,511,012[3] bp
Orientation Plus (+) strand End chr4: 99,542,303[3] bp
Gene location[5] (mouse)
The C4orf17 ortholog is found on chromosome 3 in mice.
Chromosome Chromosome 3 (mouse)
Band 3 G3; 3 64.1 cM Start chr3: 137,869,839 bp
Orientation Complement (-) strand End chr3: 137,899.892 bp
RNA Expression pattern
Human Mouse (ortholog)
Bgee[6] Top expressed in
  • Left ventricle myocardium
  • Buccal mucosa cell
  • Sperm
  • Male germline stem cell (in testes)
Top expressed in
  • Left ventricle myocardium
  • Buccal mucosa cell
  • Sperm
  • Male germline stem cell (in testes)
BioGPS[7]
  • Testes
  • Sperm
  • Testes

mRNA

C4orf17 has a total of three known splice variants.[8][2] There are 9 exons found within C4orf17's mRNA sequence. The two additional isoforms have 10 and 7 exons, respectively.

Table 1: mRNA Variants[9]
mRNA variant Variant identifier mRNA length Protein length Exons
NM_032149.3 Q53FE4 ~31K bp 359 aa 9
XM_011532315.3 Q53FE4-1 ~31K bp 186 aa 10
XM_054350982.1 Q53FE4-2 ~23K bp 93 aa 7

Protein

The C4orf17 gene encodes the Q53FE4 protein, otherwise knowns as the C4orf17 protein.[10]

Expression

In Ciona intestinalis, Ci-Spatial/C4orf17 is an essential part of beta-catenin signaling pathways in early embryonic development. Researchers isolated various cDNA clones and injected them with specific morpholinos. As a result, this study suggests that it is possible that Ci-Spatial/C4orf17 is involved in nuclear translocation of Ci-beta-catenin or enhancement of transcriptional activation.

Figure 1: C4orf17 expression in various tissues across 15 primates.

In humans, there is notable expression of C4orf17 in the testes and early spermatids.[9]

Promoter

One of the promoters for the human C4orf17 gene is Alcohol Dehydrogenase 6 (ADH6),[2] located on chr4:99,404,578-99,405,078 (500 nucleotides.) It acts as a catalyst dependent on nicotinamide adenine dinucleotide (NAD) oxidation of primary alcohols to corresponding aldehydes. It also oxidizes secondary alcohols to the corresponding ketones.

Figure 2: Immunofluorescent staining of human cell line. Sperm shows localization to mid piece

Localization

In humans, C4orf17 is specific to the testes.[10] During human fetal development, however, C4orf17 expression appeared in adrenal, intestinal, and renal tissues[11] at 20 weeks after conception.

Interactions

The C4orf17 protein interacts with numerous other proteins, such as ATRX, CHD3, KAT2B, KDM1A, PRMT1, and SUV39H1. These specific interactions suggest that C4orf17’s function is associated with chromatin modifications through histone methylation.[12]

In a 2006 study[13] regarding Abetalipoproteinemia (ABL,) an autosomal recessive disorder caused by mutations of the MTP gene (and thus the microsomal triglyceride transfer protein,) researchers hypothesized their subject was homozygous for a genomic deletion. This deletion included the entire MTP gene, spanning the 4q23 region, which ultimately deleted C4orf17 among eight other genes. Many of these genes are a part of an alcohol dehydrogenase (ADH) gene cluster that essentially metabolizes ethanol, retinol, aliphatic alcohols, hydroxysteroids, and lipid peroxidation products. However, C4orf17 is not included in this cluster.

A 2022 study,[14] nevertheless, further investigated the ADH gene cluster. According to the researchers, “Nucleotide variation in ADH genes can affect the catalytic properties of these enzymes and is associated with a variety of traits, including alcoholism and cancer,” (McQuillan et al., 2022). C4orf17 is one of the many genetic mutations listed as having a strong effect on ADH traits.

Evolutionary aspects

Orthologs

C4orf17 has a total of 86 orthologs recorded in the Ensembl database as well as 331 listed in NCBI. There were no orthologs found among invertebrates, protists, fungi, plants, or bacteria and archaea.[15] The C4orf17 gene likely arose approximately 463 million years ago[15]

Table 2: Orthologs of C4orf17 protein in humans
Taxonomic Class Genus and species Organism Common Name NCBI Accession Percent Identity (%) Percent Similarity (%) Length (aa) Date of Divergence (MYA)
Mammalia Homo sapiens Human NP_115525 100 100 359 0
Reptilia Anolis carolinensis Green anole XP_062838383 25.1 86 327 319
Amphibia Ambystoma mexicanum Axolotl XP_069486409.1 25.9 86 396 352
Aves Apteryx mantelli North Island brown kiwi XP_013806968.2 33 85 372 319
Chondrichthyes Heptranchias perlo Sharpnose sevengill shark XP_067828475.1 28.8 55 379 462
Mammalia

(Marsupialia)

Petaurus breviceps papuanus Sugar glider XP_068955299.1 45.9 62 333 160
Mammalia

(Primate)

Papio anubis Olive baboon XP_017814371.1 79.7 100 313 28.8
Mammalia

(Rodentia)

Castor canadensis North American beaver XP_020036598.2 67.2 99 366 87

Table 2: Percent identity accounts for exact matches, such as a DNA base or amino acids. Percent similarity accounts for identical and biochemically similar substitutions, such as substituted amino acids.

Paralogs

The C4orf17 protein is a SPATIAL (stromal protein associated with thymic and lymph node) domain-containing protein,[16] specifically at the interval 27-223 aa.[17] This suggests that it may be involved in spermatid differentiation. This family may also be referred to as “TBATA” or “TBATA-like.”

Protein Analysis Through Evolutionary Relationships (PANTHER) Classification System categorizes C4orf17 into the PTHR33772:SF2 protein family.[18]

Figure 3: Date of Divergence (MYA): C4orf17 (yellow), cytochrome c (orange), fibrinogen alpha chain (blue)
Error creating thumbnail:
Figure 4: The following figures exhibit conserved regions of C4orf17 among the orthologs listed in Table #. The size of the letters at the top of each figure represents the degree of conservation. Larger letters indicate more conservation compared to smaller letters.
Conserved regions of the C4orf17 protein
Error creating thumbnail:
Conserved regions of the C4orf17 protein

Protein Divergence

The figure to the right named "Date of Divergence..." exhibits the evolution of human C4orf17, cytochrome c, and fibrinogen alpha chain. Cytochrome C has a weaker slope indicating fewer mutations over time, while the fibrinogen alpha chain has a taller slope, meaning it has evolved quicker than both human C4orf17 and cytochrome c. [10]

Multiple sequence alignment

Figure 4 displays the most conserved regions. Accession numbers on the left side of the images indicate the species including sea lamprey, turbot, cyclopterus, sharpnose sevengill shark, giant oceanic manta, rhinatrema, axolotl, Montequma quail, Norther Island brown kiwi, Komodo dragon, green anole, red-eared slider, Chinese softshell turtle, sugar glider, koala, Bennet's brown lemur, gray mouse lemur, North American beaver, Olive baboon, and human.

Conceptual translation

Error creating thumbnail:
Figure 5: Conceptual translation of (human) C4orf17 mRNA

Figure 5 exhibits a conceptual translation of C4orf17 in humans. It is annotated to highlight exons, start and stop codons, and disordered regions among other genetic features.

SNPs

C4orf17 has a total of 14,681 SNPs recorded in the NCBI Variant Viewer, though only two are clinically significant.[19] Variants rs1397320372 and rs1417597081 are both single nucleotide, missense variants.

Table 3: SNPs
SNP Position Alleles Type
rs1397320372 chr4:99529854 A>C / A>G missense
rs1417597081 chr4:99529903 G>A / G>C / G>T missense

References

  1. 1.0 1.1 "UniProt" (in en). https://www.uniprot.org/uniprotkb/Q53FE4/entry. 
  2. 2.0 2.1 2.2 GeneCards Human Gene Database. "C4orf17 Gene - GeneCards | CD017 Protein | CD017 Antibody" (in en). https://www.genecards.org/cgi-bin/carddisp.pl?gene=C4orf17. 
  3. 3.0 3.1 3.2 3.3 3.4 "Gene: C4orf17 (ENSG00000138813) - Gene expression - Homo_sapiens - Ensembl genome browser 115". https://useast.ensembl.org/Homo_sapiens/Gene/ExpressionAtlas?g=ENSG00000138813;r=4:99511012-99542303. 
  4. "Genes for Homo sapiens (human)" (in en). https://www.ncbi.nlm.nih.gov/datasets/gene/id/84103/products/. 
  5. "4930579F01Rik RIKEN cDNA 4930579F01 gene [Mus musculus (house mouse) - Gene - NCBI"]. https://www.ncbi.nlm.nih.gov/gene/67741. 
  6. "C4orf17 ENSG00000138813 expression in Homo sapiens (human)" (in en). https://www.bgee.org/gene/ENSG00000138813. 
  7. "BioGPS - your Gene Portal System". http://biogps.org/?full#goto=genereport&id=84103. 
  8. "Homo sapiens chromosome 4 open reading frame 17 (C4orf17), mRNA" (in en-US). 2025-04-28. http://www.ncbi.nlm.nih.gov/nuccore/NM_032149.3. 
  9. 9.0 9.1 "AceView: Gene:C4orf17, a comprehensive annotation of human, mouse and worm genes with mRNAs or ESTsAceView.". https://www.ncbi.nlm.nih.gov/IEB/Research/Acembly/av.cgi?db=human&term=c4orf17&submit=Go. 
  10. 10.0 10.1 10.2 "Chromosome 4 open reading frame 17 [Homo sapiens - Protein - NCBI"]. https://www.ncbi.nlm.nih.gov/protein/AAH74759.1. 
  11. "C4orf17 chromosome 4 open reading frame 17 [Homo sapiens (human) - Gene - NCBI"]. https://www.ncbi.nlm.nih.gov/gene/84103. 
  12. "AlphaFold Protein Structure Database". https://alphafold.ebi.ac.uk/entry/Q53FE4. 
  13. "Abetalipoproteinemia in Israel: evidence for a founder mutation in the Ashkenazi Jewish population and a contiguous gene deletion in an Arab patient". Molecular Genetics and Metabolism 90 (4): 453–457. April 2007. doi:10.1016/j.ymgme.2006.12.010. PMID 17275380. 
  14. "Signatures of Convergent Evolution and Natural Selection at the Alcohol Dehydrogenase Gene Region are Correlated with Agriculture in Ethnically Diverse Africans". Molecular Biology and Evolution 39 (10). October 2022. doi:10.1093/molbev/msac183. PMID 36026493. 
  15. 15.0 15.1 "C4orf17 orthologs" (in en). https://www.ncbi.nlm.nih.gov/gene/84103/ortholog/. 
  16. "CDSearch: Navigate results". https://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi?INPUT_TYPE=live&SEQUENCE=NP_115525.2. 
  17. "CDSearch: Navigate results". https://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi?INPUT_TYPE=live&SEQUENCE=NP_115525.2. 
  18. "Transcript: ENST00000326581.9 (C4orf17-201) - Protein summary - Homo_sapiens - Ensembl genome browser 109". http://feb2023.archive.ensembl.org/Homo_sapiens/Transcript/ProteinSummary?g=ENSG00000138813;r=4:99511021-99542303;t=ENST00000326581. 
  19. "Variation Viewer". https://www.ncbi.nlm.nih.gov/variation/view.