Biology:FAM71F2

From HandWiki
Short description: Protein-coding gene in the species Homo sapiens



FAM71F2, or Family with Sequence Similarity 71 member F2, is a protein that in humans is encoded by the FAM71F2 gene. The gene is highly active in reproductive tissues, specifically the testis, and may serve as a biomarker for detecting metastatic testicular cancer.

Gene

Location

Figure 1. Location of the gene FAM71F2, marked with a blue rectangle, on Chromosome 7. Neighboring genes in the area are labelled as well.

The FAM71F2 gene is located on chromosome 7 in humans (7q32.1),[1][2] spanning positions 128,671,636 to 128,702,262 on the positive strand.[2] Its paralog FAM71F1 and the gene LINC01000 directly neighbor FAM71F2 on chromosome 7.

Size of gene

The gene spans 30,627 base pairs[2] and contains 12 exons.[1]
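As a quick consistency check, the span follows from the start and end coordinates given in the Location section when both end positions are counted. The following is a minimal sketch; the function name is illustrative, and 1-based inclusive coordinates are assumed.

```python
def gene_span(start: int, end: int) -> int:
    """Length in base pairs of a region given 1-based, inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    # Both the first and the last position belong to the region, hence the +1.
    return end - start + 1


# Coordinates of FAM71F2 on chromosome 7, as cited in the Location section.
fam71f2_span = gene_span(128_671_636, 128_702_262)
print(fam71f2_span)  # 30627
```

Under this convention the result agrees with the 30,627 base pairs stated above.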

Common aliases

FAM71F2 is also known as family with sequence similarity 137 member B (FAM137B).[2]

mRNA

FAM71F2 has 14 transcript variants.[1] Isoform a is the longest of the mRNA transcripts, spanning 5,775 base pairs and translating into a sequence of 309 amino acids.[1] It comprises 5 exons.[1] The other alternatively spliced isoforms are labelled in the diagram in Figure 2.

Figure 2. Diagram of the 14 annotated transcript variants for FAM71F2 with the protein isoform names on the left.

Isoform b, the form found in most reptiles that have the FAM71F2 protein, omits the nine amino acids that follow the first splice site and resumes at the valine at position 61 of the human protein. Isoform c uses an alternative start site downstream of that of Isoform a and adds another exon between the first and second exons of Isoform a.

General properties

The molecular weight of FAM71F2 is 34.5 kilodaltons.[3] The isoelectric point is 6.15.[4]
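The reported weight is in line with a common rule of thumb of roughly 110 daltons per amino acid residue. The sketch below applies that approximation to the 309-residue isoform a. The 110 Da average and the function name are assumptions for illustration, not the method used by the cited tool, and an exact value depends on the actual sequence.

```python
# Rule-of-thumb average mass of one amino acid residue, in daltons.
# This is an assumption for a rough estimate, not a sequence-specific value.
AVERAGE_RESIDUE_MASS_DA = 110.0


def estimate_mass_kda(n_residues: int,
                      avg_residue_mass_da: float = AVERAGE_RESIDUE_MASS_DA) -> float:
    """Rough protein mass in kilodaltons from the number of residues."""
    if n_residues <= 0:
        raise ValueError("n_residues must be positive")
    return n_residues * avg_residue_mass_da / 1000.0


# Isoform a of FAM71F2 is 309 amino acids long.
estimate = estimate_mass_kda(309)
print(f"{estimate:.1f} kDa")  # 34.0 kDa
```

The estimate of about 34 kDa is close to the 34.5 kDa given above; the small difference is expected because real residues vary in mass.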

Domains and motifs

The FAM71F2 protein contains only one domain, a domain of unknown function named DUF3699.[1][2] It spans amino acids 114 to 185 of the human FAM71F2 protein.[1] This domain family is found only in eukaryotes and is approximately 71 amino acids in length.[1] There is also a potential R-2 mitochondrial pre-sequence cleavage site[5] that would target the protein to the mitochondria. These sites are labelled in Figure 4 below.

Secondary structure

The secondary structure of FAM71F2 contains alpha helices and beta sheets.[6][7] These structures are identified in the generated image of the FAM71F2 protein in Figure 3. Highly conserved amino acid residues, such as the Val61-Thr62-Lys63 sequence in the second exon where the reptile orthologs and isoform b resume, are labelled on this figure as well.

Figure 3. Diagram of the FAM71F2 protein[8] with secondary structures and highly conserved amino acid residues. Alpha helices are labelled in orange, beta sheets in cyan, the N-terminus is magenta, and the C-terminus is green.

Post-translational modifications

FAM71F2 has seven highly conserved phosphorylation sites. There is also one acetylation site[9] and one N-glycosylation site,[10] which play a role in stabilizing the protein.

Figure 4. Schematic diagram of FAM71F2 protein.[11] The DUF3699 domain is shown as the large, grey hexagonal shape. A potential R-2 mitochondrial targeting sequence at the N-terminus is marked with a red rectangle. Conserved phosphorylation sites are marked with red stop sign images, and an N-terminal acetylation site on the second amino acid is marked with a grey stop sign image. An N-glycosylation site at Met98 is shown as a thin, green rectangle, and two cysteine bonds are marked with green brackets.

Sub-cellular localization

The FAM71F2 protein localizes primarily to the cytoplasm,[5] but may also localize to the nucleus and mitochondria.[5]

Expression

Tissue expression pattern

FAM71F2 is highly expressed in reproductive structures, such as the testis, epididymis, uterus and ovaries.[1][12] There is some expression in the brain and connective tissue as well.[13] Transcript levels increase as development progresses and are highest in adults.[13] In the mouse, transcript levels of FAM71F2 increase dramatically during spermatogenesis and development of the testis.[14]

Cellular expression

FAM71F2 protein expression has been detected in the cytoplasm of Leydig cells of the testis and of cells of the epididymis, and also in the cytoplasm of ovarian follicle cells.[12]

Figure 4. Immunohistochemical staining of FAM71F2 in the human testis.[12]

Expression level

FAM71F2 is moderately expressed in comparison to other human proteins, with an average protein expression level of 8.47 parts per million.[15]

Clinical significance

FAM71F2 is repressed in males with non-obstructive azoospermia[16] and teratozoospermia,[17] conditions that involve abnormalities in sperm quantity and morphology, respectively. These conditions lead to fertility issues. In addition, FAM71F2 gene expression is up-regulated together with dopamine receptor D1 expression in testicular cancer patients, and may be an important biomarker for metastatic forms of this cancer.[18][19][16]

Homology

FAM71F2 has 91 orthologs in other animal species.[1] Its evolutionary history goes as far back as the reptiles, and its most distant relative is the homolog in the West Indian Ocean coelacanth.[20][21][22] The time of divergence of eight orthologs from human FAM71F2 is shown in Figure 5. The gene is not found in birds, including Gallus gallus (chicken).[21][1] FAM71F2 has six paralogs in humans: FAM71A, FAM71B, FAM71C, FAM71D, FAM71E1, and FAM71F1.[1]

Figure 5. Date of divergence of FAM71F2 orthologs from the human protein.

References

  1. "NCBI (National Center for Biotechnology Information) entry on FAM71F2". https://www.ncbi.nlm.nih.gov/gene/346653.
  2. GeneCards Human Gene Database. "FAM71F2 Gene - GeneCards | F71F2 Protein | F71F2 Antibody". https://www.genecards.org/cgi-bin/carddisp.pl?gene=FAM71F2.
  3. Kramer, Jack (1990). "AASTATS". http://seqtool.sdsc.edu/CGI/BW.cgi#!. [permanent dead link]
  4. Toldo, Luca. "PI: Isoelectric point determination". http://seqtool.sdsc.edu/CGI/BW.cgi#!. [permanent dead link]
  5. "PSORT II Prediction". https://psort.hgc.jp/form2.html.
  6. "Phyre 2 Results for FAM71F2__". http://www.sbg.bio.ic.ac.uk/phyre2/phyre2_output/4b50b5f1dafb7270/summary.html. [permanent dead link]
  7. Pappas, Georgios Jr. "PELE Protein Structure Prediction". http://seqtool.sdsc.edu/CGI/BW.cgi#!. [permanent dead link]
  8. "UCSF Chimera Home Page". https://www.cgl.ucsf.edu/chimera/.
  9. Kiemer, Lars (2004). "NetAcet 1.0 Server". http://www.cbs.dtu.dk/services/NetAcet/.
  10. Gupta, R. (2004). "Prediction of N-glycosylation sites in human proteins". http://www.cbs.dtu.dk/services/NetNGlyc/.
  11. Castro, Edouard de. "PROSITE". http://prosite.expasy.org/cgi-bin/prosite/mydomains/.
  12. "Tissue expression of FAM71F2 - Summary - The Human Protein Atlas". http://www.proteinatlas.org/ENSG00000205085-FAM71F2/tissue.
  13. Schuler Group. "EST Profile - Hs.445236". https://www.ncbi.nlm.nih.gov/UniGene/ESTProfileViewer.cgi?uglist=Hs.445236.
  14. "4778451 - GEO Profiles - NCBI". https://www.ncbi.nlm.nih.gov/geoprofiles/4778451.
  15. "FAM71F2 protein abundance in PaxDb". http://pax-db.org/protein/1861459.
  16. "Novel gene biomarkers of spermatogenesis-potential for spermatogenesis assessment and treatment monitoring". Fertility and Sterility 102 (3): e349. 2014. doi:10.1016/j.fertnstert.2014.07.1179. http://www.fertstert.org/article/S0015028214018068/pdf.
  17. "38158770 - GEO Profiles - NCBI". https://www.ncbi.nlm.nih.gov/geoprofiles/38158770.
  18. "Global incidence and outcome of testicular cancer". Clinical Epidemiology 5: 417–27. October 2013. doi:10.2147/CLEP.S34430. PMID 24204171.
  19. "Predicting metastasized seminoma using gene expression". BJU International 110 (2 Pt 2): E14–20. July 2012. doi:10.1111/j.1464-410X.2011.10778.x. PMID 22243760.
  20. San Diego Supercomputer Center. "Biology Workbench". http://seqtool.sdsc.edu/.
  21. Kent, Jim. "BLAT Search Genome". https://genome.ucsc.edu/cgi-bin/hgBlat?hgsid=586822855_KcuB7ByxDMU5hFjv34IFfSeueZQC&command=start.
  22. "TimeTree :: The Timescale of Life". http://www.timetree.org/.