Biology:Chromosome 16 open reading frame 13



Chromosome 16 open reading frame 13 (C16orf13), also known as JFP2,[1] is a protein-coding gene of unknown function. Although its function is unknown, expression data show that it is expressed at high levels in several cancerous tissues.[2][3] Underexpression of the gene has also been linked to disease in humans.[4]

Gene

Exon breakdown of C16orf13 transcript variant 1. The AdoMet-MTase domain is also included in the diagram.

C16orf13 is located on the short arm of chromosome 16 in humans, in the thirteenth open reading frame.[5] The gene has five transcript variants, named 1, 2, 3, 4, and 7. The longest cDNA transcript (transcript variant 1) contains 854 base pairs.[6] This transcript is composed of six exons, all of which contribute to the protein's main conserved domain, which belongs to the methyltransferase superfamily.[7] The primary transcript of the gene is 1,919 base pairs long.[8]
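The 1,919 base pair figure is consistent with the genomic coordinates in the NC_000016.9 record cited for the primary transcript (684429 to 686347), assuming those coordinates are 1-based and inclusive. A minimal sketch of that arithmetic follows; the function name `span_length` is illustrative and not taken from any cited tool.

```python
def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive genomic interval."""
    if end < start:
        raise ValueError("end must not precede start")
    return end - start + 1


# Coordinates taken from the NC_000016.9 link cited for the primary transcript.
primary_transcript_bp = span_length(684429, 686347)
print(primary_transcript_bp)  # 1919
```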

Species distribution

File:C16orf13DotPlot.tiff Using the Dotlet program, a dot plot was constructed comparing the human gene with its chimpanzee ortholog.

The plot indicates sequence conservation at the beginning and end of the gene, suggesting that the 5' and 3' untranslated regions (UTRs) are conserved.

This similarity in the 5' UTR and 3' UTR does not extend beyond mammals: a dot plot of the human gene against a distantly related species, such as Xenopus tropicalis, shows almost no similarity.
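The idea behind a dot plot can be illustrated with a short sketch. This is a simplified version that marks exact matches between fixed-length words; Dotlet itself scores sliding windows with a substitution matrix and a threshold, so the code below is an approximation of the technique, not the program's algorithm. The function names and the toy sequences are hypothetical and are not C16orf13 data.

```python
def dot_plot(seq_a: str, seq_b: str, window: int = 3) -> list[list[bool]]:
    """Cell [i][j] is True when the length-`window` words starting at
    seq_a[i] and seq_b[j] are identical."""
    rows = len(seq_a) - window + 1
    cols = len(seq_b) - window + 1
    return [
        [seq_a[i:i + window] == seq_b[j:j + window] for j in range(cols)]
        for i in range(rows)
    ]


def render(matrix: list[list[bool]]) -> str:
    """Draw the matrix as text: '*' for a match, '.' otherwise."""
    return "\n".join(
        "".join("*" if cell else "." for cell in row) for row in matrix
    )


# Toy sequences for illustration only.
a = "ACGTACGT"
b = "ACGTTCGT"
print(render(dot_plot(a, b)))
```

Conserved regions appear as runs of matches along the main diagonal, which is how similarity at the two ends of a gene would show up in such a plot.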

A multiple sequence alignment conducted using the SDSC Biology Workbench[9] reveals little sequence similarity in the upstream region of the gene among species more distantly related than primates. Near the transcription start site of the human C16orf13 gene, conservation is high among the primates for which upstream data were available, specifically the human, orangutan, and rhesus monkey orthologs. High sequence similarity among primates is evident throughout the promoter region, the 5' UTR, and the C16orf13 gene.

The table below lists selected gene orthologs of C16orf13 transcript variant 1. These data were collected from NCBI BLAST.

| Species | Organism common name | Gene common name | NCBI accession number | Sequence identity | Expected value | Sequence length (bp) | Time since split from humans, MYA (data from TimeTree.org) |
|---|---|---|---|---|---|---|---|
| Homo sapiens | Human | C16orf13 | NM_032366.3 | 100% | 0 | 854 | 0 |
| Pan troglodytes | Chimpanzee | LOC467858 | NM_032366.3 | 98% | 0 | 784 | 6.4 |
| Canis lupus familiaris | Dog | C6H16orf13 | XM_547214.3 | 88% | 0 | 865 | 94.4 |
| Mus musculus | Mouse | 0610011F06Rik | NM_026686.2 | 86% | 0 | 825 | 92.4 |
| Xenopus (Silurana) tropicalis | Western clawed frog | c16orf13 | NM_001039734.1 | BLAST search found no significant similarity | BLAST search found no significant similarity | 993 | 371.2 |

Tissue distribution

The human expression profile from NCBI UniGene indicates that this gene is expressed in many different tissues.[10] This widespread expression suggests that C16orf13 is a “housekeeping gene,” one that performs a function needed in all cells regardless of tissue. Expression is highest in the adrenal gland, lung, and parathyroid,[11] though many other tissues also show high expression. The few tissues in which the gene is not expressed share no obvious common feature. Beyond suggesting a housekeeping role in nearly all cells, these expression data give no clues to a specific function.

Gene neighborhood

File:Chromosome16Schematic.tiff File:C16orf13GeneNeighborhood.tiff

The C16orf13 gene is located near the end of chromosome 16, where it may be subject to deletion mutations.

The genes surrounding C16orf13 include hypothetical protein LOC100287175 and LOC100138285 to the right and RAB40C and WFIKKN1 to the left. C16orf13 lies on the minus strand, as does LOC100138285; the other neighboring genes lie on the plus strand. The gene neighborhood is represented in the accompanying schematic, originally from NCBI Gene.

Protein

The protein encoded by this gene is known as UPF0585, where UPF stands for uncharacterized protein family, signaling that the function is unknown. There are five isoforms of this protein, corresponding to the five splice variants of the gene.[12] The isoforms are named a, b, c, d, and g.[12] As mentioned above, the conserved domain detected in a BLAST search of the amino acid sequence belongs to the methyltransferase superfamily.

Conservation

A multiple sequence alignment conducted using the protein tools in the SDSC Biology Workbench[9] reveals some sequence similarity among distantly related orthologs, including archaeal proteins, in the region corresponding to the methyltransferase domain. This domain is more highly conserved among the more closely related orthologs, which still span a diverse array of species.
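One simple way to quantify conservation along a multiple sequence alignment is to measure, for each column, the fraction of sequences that share the most common residue. The sketch below illustrates that idea only; it is not the scoring used by the SDSC Biology Workbench, and the function name and toy alignment are hypothetical.

```python
from collections import Counter


def column_conservation(alignment: list[str]) -> list[float]:
    """Fraction of sequences sharing the most common residue in each column.

    Gap characters ('-') never count toward the most common residue, but
    they remain in the denominator, so gapped columns score lower.
    """
    if not alignment:
        return []
    length = len(alignment[0])
    if any(len(row) != length for row in alignment):
        raise ValueError("all aligned sequences must have equal length")
    scores = []
    for col in range(length):
        residues = [row[col] for row in alignment if row[col] != "-"]
        top = Counter(residues).most_common(1)[0][1] if residues else 0
        scores.append(top / len(alignment))
    return scores


# Toy alignment with made-up residues, not the real orthologs.
toy = ["MKLV-A", "MKIVGA", "MRLVGA"]
print(column_conservation(toy))
```

A conserved domain would appear as a stretch of columns with scores close to 1.0, while poorly conserved regions score lower.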

Species distribution

C16orf13 has homologs in many species, including distant orthologs in fungi and plants.[13][14] There are no known paralogs of this protein.[15][16] The gene and its protein are very highly conserved in primates and other mammals, particularly in the functional methyltransferase domain.

The table below lists selected protein orthologs of C16orf13 transcript variant 1. These data were collected from NCBI BLAST.

| Species | Organism common name | Protein common name | NCBI accession number | Sequence identity | Expected value | Sequence length (aa) | Time since split from humans, MYA (data from TimeTree.org) |
|---|---|---|---|---|---|---|---|
| Homo sapiens | Human | UPF0585, isoform a | NP_115742.3 | 100% | 0 | 204 | 0 |
| Pan troglodytes | Chimpanzee | LOC467858 | XP_001154838.1 | 98% | 1E-150 | 204 | 6.4 |
| Canis lupus familiaris | Dog | LOC490093 | XP_547214.3 | 91% | 4E-141 | 204 | 94.4 |
| Mus musculus | Mouse | 0610011F06Rik | NP_080962.1 | 87% | 5E-134 | 204 | 92.4 |
| Xenopus (Silurana) tropicalis | Western clawed frog | UPF0585 protein C16orf13 homolog | NP_001034823.2 | 58% | 1E-82 | 203 | 371.2 |
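As a rough illustration of what a sequence identity percentage measures, the sketch below counts identical residues across a pairwise alignment and divides by the alignment length, gap columns included. This is one common convention and is assumed here for illustration; BLAST computes its value over its own local alignment, so results need not match the figures in the table. The function name and example strings are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of alignment columns in which both sequences carry the
    same residue. Columns containing a gap ('-') count as mismatches."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        return 0.0
    matches = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-"
    )
    return 100.0 * matches / len(aligned_a)


# Toy aligned peptides, not real ortholog sequences.
print(round(percent_identity("MKLVGA", "MKIVGA"), 1))
```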

Predicted properties

File:C16orf13SecondaryStructure.tiff

The protein's secondary structure can be predicted with algorithms that estimate where alpha helices and beta sheets occur. An analysis was conducted using the CHOFAS, GOR4, and PELE algorithms in the SDSC Biology Workbench.[17] The results were combined in the adjacent diagram, which includes only structures that appeared in more than one output.
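The rule of keeping only structures predicted by more than one algorithm can be sketched as a per-residue vote. The code assumes each predictor's output has been reduced to a string of one-letter states ('H' helix, 'E' strand, 'C' coil); that encoding, the function name, and the toy predictions are assumptions for illustration and do not reproduce the actual CHOFAS, GOR4, or PELE output formats.

```python
from collections import Counter


def consensus_structure(predictions: list[str], min_votes: int = 2) -> str:
    """Per-residue consensus of secondary-structure strings.

    A position keeps 'H' or 'E' only when at least `min_votes` predictors
    agree on it; every other position is written as '-'.
    """
    if not predictions:
        return ""
    length = len(predictions[0])
    if any(len(p) != length for p in predictions):
        raise ValueError("all predictions must cover the same residues")
    consensus = []
    for calls in zip(*predictions):
        state, votes = Counter(calls).most_common(1)[0]
        keep = votes >= min_votes and state in ("H", "E")
        consensus.append(state if keep else "-")
    return "".join(consensus)


# Hypothetical outputs from three predictors for a nine-residue stretch.
toy = ["HHHHCCEEE", "HHHCCCEEC", "CHHHHCCEE"]
print(consensus_structure(toy))  # HHHH--EEE
```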

Interactions

There are few known interactions for this protein. No interactions were found in the GeneCards database[5] or in the MINT database.[18] A STRING search returned two genes.[19] Both associations, however, fall in the gene neighborhood evidence category, which does not necessarily mean that the genes interact in any meaningful way or are even expressed at the same time. There is currently no strong evidence for interactions with this protein.

Disease linkage

Data from microarray experiments have linked overexpression of this gene to cancer in various tissues, particularly breast and gastric cancer. Underexpression of the gene is also linked to disease, particularly connective tissue disease, nutritional and metabolic disorders, and digestive disorders.[20] The canSAR Workbench database contains microarray data that may link overexpression or underexpression of the C16orf13 gene to various carcinomas.[21]

References

  1. "C16orf13 - UPF0585 protein C16orf13 - human protein (Identifiers)". Nextprot.org. http://www.nextprot.org/db/entry/NX_Q96S19/identifiers. Retrieved 2012-05-18. 
  2. "Breast Cancer Database". Itb.cnr.it. http://www.itb.cnr.it/breastcancer/php/geneReport.php?id=84326. Retrieved 2012-05-18. 
  3. "Transcriptome analysis of human gastric cancer". Mamm. Genome 16 (12): 942–54. December 2005. doi:10.1007/s00335-005-0075-2. PMID 16341674. 
  4. "C16orf13 Disease Atlas". NextBio. http://www.nextbio.com/b/search/da/C16orf13?type=gene&id=74059&name=C16orf13&openCategory=146930. Retrieved 2012-05-18. 
  5. GeneCards Human Gene Database. "C16orf13 Gene - GeneCards | CP013 Protein | CP013 Antibody". GeneCards. https://www.genecards.org/cgi-bin/carddisp.pl?gene=C16orf13&search=c16orf13#genomic_location. Retrieved 2012-05-18.
  6. "Homo sapiens chromosome 16 open reading frame 13 (C16orf13), transcrip - Nucleotide - NCBI". Ncbi.nlm.nih.gov. 2012-04-04. https://www.ncbi.nlm.nih.gov/nuccore/NM_032366.3. Retrieved 2012-05-18. 
  7. "Homo sapiens chromosome 16 open reading frame 13 (C16orf13), transcrip - Nucleotide - NCBI". Ncbi.nlm.nih.gov. 2012-04-04. https://www.ncbi.nlm.nih.gov/nuccore/93102386?report=genbank&to=854. Retrieved 2012-05-18. 
  8. "Homo sapiens chromosome 16, GRCh37.p5 Primary Assembly - Nucleotide - NCBI". Ncbi.nlm.nih.gov. 2012-04-04. https://www.ncbi.nlm.nih.gov/nuccore/NC_000016.9?from=684429&to=686347&report=genbank&strand=true. Retrieved 2012-05-18. 
  9. "SDSC Biology Workbench". Workbench.sdsc.edu. http://workbench.sdsc.edu/. Retrieved 2012-05-18.
  10. "EST Profile - Hs.239500". Ncbi.nlm.nih.gov. https://www.ncbi.nlm.nih.gov/UniGene/ESTProfileViewer.cgi?uglist=Hs.239500. Retrieved 2012-05-18. 
  11. https://www.ncbi.nlm.nih.gov/UniGene/ESTProfileViewer.cgi?uglist=Hs.239500
  12. "C16orf13 chromosome 16 open reading frame 13 [Homo sapiens] - Gene - NCBI". Ncbi.nlm.nih.gov. https://www.ncbi.nlm.nih.gov/gene/84326. Retrieved 2012-05-18.
  13. GeneCards Human Gene Database. "C16orf13 Gene - GeneCards | CP013 Protein | CP013 Antibody". GeneCards. https://www.genecards.org/cgi-bin/carddisp.pl?gene=C16orf13&search=c16orf13#ortholog. Retrieved 2012-05-18. 
  14. "Ensembl genome browser 67: Homo sapiens - Orthologues - Gene: C16orf13 (ENSG00000130731)". Useast.ensembl.org. http://useast.ensembl.org/Homo_sapiens/Gene/Compara_Ortholog?g=ENSG00000130731;r=16:684429-686358. Retrieved 2012-05-18. 
  15. GeneCards Human Gene Database. "C16orf13 Gene - GeneCards | CP013 Protein | CP013 Antibody". GeneCards. https://www.genecards.org/cgi-bin/carddisp.pl?gene=C16orf13&search=c16orf13#paralogs. Retrieved 2012-05-18. 
  16. "Ensembl genome browser 67: Homo sapiens - Comparative Genomics - Gene: C16orf13 (ENSG00000130731)". Useast.ensembl.org. http://useast.ensembl.org/Homo_sapiens/Gene/Compara?g=ENSG00000130731;r=16:684429-686358. Retrieved 2012-05-18. 
  17. "Prediction of the secondary structure of proteins from their amino acid sequence". Adv. Enzymol. Relat. Areas Mol. Biol. 47: 45–148. 1978. doi:10.1002/9780470122921.ch2. PMID 364941. http://seqtool.sdsc.edu/CGI/BW.cgi#!. [dead link]
  18. "HomoMINT database". Mint.bio.uniroma2.it. http://mint.bio.uniroma2.it/HomoMINT/search/search.do. Retrieved 2012-05-18. [dead link]
  19. "STRING: functional protein association networks". String-db.org. http://string-db.org/. Retrieved 2012-05-18. 
  20. http://www.nextbio.com/b/search/da/C16orf13?type=gene&id=74059&name=C16orf13&openCategory=146930
  21. https://cansar.icr.ac.uk/cansar/index2.php?redirect=treport&redirect_value=Q96S19&focus=&context=&goto=&tab=target_report_expression#main_tab_holder:target_report_tab:target_report_expression