Biology:CFAP206

From HandWiki
Revision as of 13:14, 29 June 2023 by MainAI

Cilia and Flagella Associated Protein 206 (CFAP206) is a gene that in humans encodes the protein “DUF3508”. The function of this protein is not yet well understood.[1][2] Other known aliases are “dJ382I10.1” and “UPF0704 Protein C6orf165”.[3] In humans, the gene coding sequence is 56,501 base pairs long, with an mRNA of 2,215 base pairs and a protein sequence of 622 amino acids. The C6orf165 gene is conserved in chimpanzee, rhesus monkey, dog, cow, mouse, rat, chicken, zebrafish, mosquito, frog, and other species.[4] C6orf165 is expressed at low levels overall in humans, with relatively higher expression in brain, lungs (trachea) and testis.[5] The molecular weight of UPF0704 is 71,193 Da[6] and the pI is 6.38.[6]

Gene Locus

The CFAP206 gene is located on chromosome 6 at 6q15, from position 88119558 to 88173965.[7] It contains 12 exons.[8] The genomic DNA is 54,407 base pairs long, while the longest mRNA that it produces is 2,215 bp long.[8]
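As a quick consistency check, the quoted genomic length can be reproduced from the two coordinates. The sketch below is illustrative only: the variable names are hypothetical, and it assumes the length was taken as the simple difference of the coordinates (an inclusive count would be one base longer).

```python
# Coordinates of CFAP206 on chromosome 6 as quoted in the text.
start = 88119558
end = 88173965

# Assumption: the quoted genomic length is the plain difference of the
# two coordinates; an inclusive count would add 1 to this value.
span_bp = end - start
print(f"Genomic span: {span_bp:,} bp")  # Genomic span: 54,407 bp
```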

Homology and Evolution

Orthologs

This protein is well conserved across a range of distantly related organisms, including mammals, birds, amphibians, tunicates, bony fish, lancelets, insects, and sea urchins. The organisms in which orthologs have been found are listed below.

| Scientific name | Common name | Divergence from human lineage (MYA) | Accession number | Sequence length (aa) | Sequence identity to human protein |
|---|---|---|---|---|---|
| Homo sapiens | Human | 0 | | 622 | 100% |
| Macaca mulatta | Rhesus macaque | 92.3 | XP_001089007.2 | 658 | 98% |
| Rattus norvegicus | Brown rat | 92.3 | NP_001073169.1 | 622 | 81% |
| Felis catus | Cat | 94.2 | XP_003986405.2 | 629 | 85% |
| Chrysochloris asiatica | Cape golden mole | 98.7 | XP_006870694.1 | 622 | 85% |
| Elephantulus edwardii | Cape elephant shrew | 98.7 | XP_006902101.1 | 608 | 79% |
| Anolis carolinensis | Arboreal lizard | 296 | XP_003215583.1 | 621 | 70% |
| Gallus gallus | Chicken | 296 | XP_004940450.1 | 621 | 58% |
| Xenopus (Silurana) tropicalis | Western clawed frog | 371.2 | XP_002938343.1 | 635 | 65% |
| Danio rerio | Zebrafish | 400.1 | NP_991180 | 624 | 55% |
| Branchiostoma floridae | Lancelet | 713.2 | XP_002603798.1 | 626 | 63% |
| Oikopleura dioica | Oikopleura dioica | 722.5 | CBY12373.1 | 631 | 44% |
| Ciona intestinalis | Sea squirt | 722.5 | XP_002128218.1 | 624 | 60% |
| Helobdella | Leech | 725.5 | ESO10267.1 | 620 | 37% |
| Aedes aegypti | Mosquito | 725.5 | | 630 | 30% |
| Crassostrea gigas | Japanese oyster | 782.7 | EKC36332.1 | 624 | 61% |
| Anopheles gambiae str. PEST | | 782.7 | | 642 | 28% |
| Albugo laibachii | Oomycetes | 1317.5 | | 642 | 26.8% |
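Sequence identity of the kind reported in the last column is the share of aligned positions at which two proteins carry the same residue. A minimal sketch of that calculation follows. It assumes the two sequences have already been aligned to equal length with "-" marking gaps. The function name, the choice of denominator (columns where neither sequence has a gap) and the toy sequences are assumptions for illustration, not the method or data behind the table.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of gap-free aligned columns that hold identical residues."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Keep only columns where neither sequence has a gap character.
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if a != "-" and b != "-"]
    if not columns:
        return 0.0
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


# Invented toy fragments, not taken from any real ortholog.
toy_a = "ACDEFGHIKL"
toy_b = "ACDQFGH-KL"
print(round(percent_identity(toy_a, toy_b), 1))
```

Different tools use different denominators (for example the full alignment length or the shorter sequence), so values from this sketch need not match published figures.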

Paralogs

C6orf165 has no paralog.

Phylogeny

The rooted phylogenetic tree for the gene is given in the cited source.[9]

Protein

The protein produced by the C6orf165 gene is termed DUF3508 and is 622 amino acids long.[10] The protein has a predicted molecular weight of 71.20 kDa and an isoelectric point of 6.38.[11]
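A rough plausibility check on these figures is to divide the predicted mass by the number of residues. The sketch below uses hypothetical variable names, and the figure of about 110 Da per residue is a common rule of thumb rather than a value from the cited tool.

```python
# Values quoted in the text.
length_aa = 622
predicted_mass_da = 71_200  # 71.20 kDa

# Average mass contributed by each residue in the prediction.
mean_residue_mass = predicted_mass_da / length_aa
print(f"Mean residue mass: {mean_residue_mass:.1f} Da")

# Rule-of-thumb estimate (assumption: about 110 Da per residue).
rough_estimate_da = length_aa * 110
print(f"Rule-of-thumb mass: {rough_estimate_da:,} Da")
```

The two figures lie within a few percent of each other, which is what one would expect for a protein of this length.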

Domains

The C6orf165 protein product contains a well-conserved domain, DUF3508.[7] This presumed domain is functionally uncharacterized, is found in eukaryotes, and is about 280 amino acids in length.[12]

Motifs

This domain has two conserved sequence motifs: GFC and GLL.[13]

Post-translational modifications

Phosphorylation is the only post-translational modification predicted for this protein by the tools listed under the post-translational modification category on expasy.org.[14] Three phosphorylation sites are predicted with a score over 0.8: Ser 176, Thr 232 and Ser 310. These sites are annotated on the conceptual translation.

Secondary structure

The consensus prediction from the PELE software[15] indicates that protein UPF0704 is dominated by alpha helices with interspersed regions of random coil.

PSORT II analysis[16] predicts a coiled-coil region spanning residues 88 to 117, with the sequence MNYTNRVEFLEEHHRVLESRLGSVTREITD.
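The residue range and the quoted sequence can be cross-checked against each other. The sketch below uses hypothetical variable names and assumes 1-based, inclusive residue numbering.

```python
# Coiled-coil region as reported in the text (PSORT II prediction).
region_start, region_end = 88, 117
region_seq = "MNYTNRVEFLEEHHRVLESRLGSVTREITD"

# With 1-based inclusive numbering the range covers end - start + 1 residues.
expected_len = region_end - region_start + 1
print(expected_len, len(region_seq))  # 30 30
```

Both values agree, so the quoted sequence is consistent with the stated coordinates under that numbering convention.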

Location

PSORT II analysis,[16] which is trained on yeast data, predicts that the most likely subcellular location of this protein is the cytoplasm (56%). Less likely locations are the mitochondria (21%), the nucleus (17%) and vacuoles (4%).

Gene expression

Gene expression data

According to the UniGene EST profile, expression of the gene in humans is weak: the ratio of gene ESTs to total ESTs in the pool is very low, below 0.01%. This low-level expression is found in brain, connective tissue, kidney, lungs, parathyroid, pharynx, placenta, testis and trachea. In mouse, expression of C6orf165 is even lower and is detected in only two tissues, ovary and testis. In chicken, weak expression is found in two tissues, brain and testis. In zebrafish, expression is also low, with very weak expression in eye, kidney and the reproductive system. In sea squirt, expression is found in gonad, heart and neural complex. In summary, C6orf165 expression in testis is conserved across species, and expression in brain or neural complex is partially conserved.[17]

Promoter

The promoter region for human C6orf165 was identified with ElDorado (Genomatix).[18] In addition, the start codon lies in the second exon of the mRNA, which indicates that the first exon is non-coding (part of the 5′ untranslated region).

Transcript variants

In humans, the C6orf165 gene produces 4 different transcripts, 2 of which form a protein product; of the other two, one undergoes nonsense-mediated decay and the other is a retained-intron transcript. The main transcript in humans is transcript ID ENST00000369562, or C6ORF165-001; it has 13 exons, 12 of which are coding, and a translation length of 622 residues.[19] The second protein-coding transcript in humans is transcript ID ENST00000480123, or C6ORF165-002; it contains 7 exons, only 6 of which are protein coding, and has a translation length of 252 residues.[20]

Interactions

Two-hybrid experiments revealed interacting proteins, including the myogenic repressor I-mf.[21] This repressor is highly expressed in the sclerotome; it inhibits the transactivation activity of the MyoD family and represses myogenesis.[22] Protein complex co-immunoprecipitation (Co-IP) experiments revealed the interacting protein NRF1 (nuclear respiratory factor 1).[23] The NRF1 gene encodes a protein that homodimerizes and functions as a transcription factor, activating the expression of some key metabolic genes regulating cellular growth and of nuclear genes required for respiration, heme biosynthesis, and mitochondrial DNA transcription and replication. Two-hybrid experiments also revealed the interacting protein RNF138 (ring finger protein 138),[21] an E3 ubiquitin protein ligase. Affinity Capture-Western experiments revealed an interaction with TP73 (tumor protein p73),[24] a protein related to the p53 tumor protein.

Clinical significance

C6orf165 has no currently known disease associations or mutations.

References

  1. "Entrez Gene: C6orf165". 17 July 2006. https://www.ncbi.nlm.nih.gov/nuccore/34191365?report=genbank. Retrieved 2014-03-01. 
  2. "The DNA sequence and analysis of human chromosome 6.". Nature 425 (6960): 805–811. Oct 2003. doi:10.1038/nature02055. PMID 14574404. Bibcode: 2003Natur.425..805M.
  3. "GeneCards: C6orf165 Gene". https://www.genecards.org/cgi-bin/carddisp.pl?gene=C6orf165. Retrieved 2014-02-28. 
  4. "NCBI gene: C6orf165 Gene". https://www.ncbi.nlm.nih.gov/gene?linkname=protein_gene&from_uid=72534780. Retrieved April 27, 2014. 
  5. "NCBI EST: C6orf165 Gene". https://www.ncbi.nlm.nih.gov/UniGene/ESTProfileViewer.cgi?uglist=Hs.82921. Retrieved April 27, 2014. 
  6. "PhosphoSitePlus". http://www.phosphosite.org/proteinAction.do?id=21671&showAllSites=true. Retrieved 2014-05-08.
  7. "NCBI: C6orf165 Gene". https://www.ncbi.nlm.nih.gov/gene/154313. Retrieved 2014-03-09.
  8. "UCSC: C6orf165". http://genome.ucsc.edu/cgi-bin/hgc?hgsid=365752405&g=htcUserAli&i=..%2Ftrash%2FhgSs%2FhgSs_genome_496d_cd8300.pslx+..%2Ftrash%2FhgSs%2FhgSs_genome_496d_cd8300.fa+NP_001026913.1&c=chr6&l=88119557&r=88173965&o=88119557&aliTable=user&table=hgUserPsl. Retrieved 2014-02-28.
  9. "Gene: C6ORF165 ENSG00000272514". SDSC Biology Workbench. http://useast.ensembl.org/Homo_sapiens/Gene/Compara_Tree?collapse=none;db=core;g=ENSG00000272514;r=6:88117731-88145775;t=ENST00000296885. Retrieved 27 April 2014. 
  10. "NCBI Protein: protein DUF3508 C6orf165". https://www.ncbi.nlm.nih.gov/protein/72534780?report=genpept. Retrieved 2013-03-09. 
  11. "Compute pI/Mw". http://web.expasy.org/cgi-bin/compute_pi/pi_tool. Retrieved 2014-03-09. [permanent dead link]
  12. "C6orf165 chromosome 6 open reading frame 165 [Homo sapiens (human)]". https://www.ncbi.nlm.nih.gov/gene/154313. Retrieved 2014-03-09.
  13. "Conserved domains on [Homo sapiens (human)]". https://www.ncbi.nlm.nih.gov/gene/154313. Retrieved 2014-03-09.
  14. "post-translational_modification". http://www.expasy.org/proteomics/post-translational_modification3. Retrieved 2014-05-06. [permanent dead link]
  15. "PELE". SDSC Biology Workbench. http://seqtool.sdsc.edu/CGI/BW.cgi#!. Retrieved 27 April 2014. [permanent dead link]
  16. "PSORT II: Results of Subprograms". http://psort.hgc.jp/cgi-bin/runpsort.pl. Retrieved 2014-05-08. [permanent dead link]
  17. "Unigene". National Center for Biotechnology Information. https://www.ncbi.nlm.nih.gov/unigene. Retrieved April 27, 2014. 
  18. "Eldorado". http://www.genomatix.de/. Retrieved April 27, 2014. 
  19. "Ensembl: gene c6orf165". Ensembl. http://useast.ensembl.org/Homo_sapiens/Transcript/Summary?db=core;g=ENSG00000272514;r=6:88117731-88145775;t=ENST00000369562. Retrieved April 27, 2014.
  20. "Ensembl: gene c6orf165". Ensembl. http://useast.ensembl.org/Homo_sapiens/Transcript/Summary?db=core;g=ENSG00000272514;r=6:88117731-88145775;t=ENST00000480123. Retrieved April 27, 2014.
  21. Rual, Jean-François, et al. "Towards a proteome-scale map of the human protein–protein interaction network." Nature 437.7062 (2005): 1173-1178.
  22. Chen, C-M. Amy, et al. "I-mf, a novel myogenic repressor, interacts with members of the MyoD family." Cell 86.5 (1996): 731-741.
  23. Satoh, Jun-ichi, Natsuki Kawana, and Yoji Yamamoto. "Pathway Analysis of ChIP-Seq-Based NRF1 Target Genes Suggests a Logical Hypothesis of their Involvement in the Pathogenesis of Neurodegenerative Diseases." Gene Regulation and Systems Biology 7 (2013): 139.
  24. Lunardi, Andrea, et al. "A genome-scale protein interaction profile of Drosophila p53 uncovers additional nodes of the human p53 network." Proceedings of the National Academy of Sciences 107.14 (2010): 6322-6327.