Biology:FAM76A

From HandWiki
Short description: Protein-coding gene in the species Homo sapiens


A representation of the 3D structure of the protein myoglobin showing turquoise α-helices.
Generic protein structure example


FAM76A is a protein that in Homo sapiens is encoded by the FAM76A gene.[1] Notable structural characteristics of FAM76A include an 83 amino acid coiled coil domain as well as a four amino acid poly-serine compositional bias.[2] FAM76A is conserved in most chordates but it is not found in other deuterostrome phlya such as echinodermata, hemichordata, or xenacoelomorpha—suggesting that FAM76A arose sometime after chordates in the evolutionary lineage. Furthermore, FAM76A is not found in fungi, plants, archaea, or bacteria.[3] FAM76A is predicted to localize to the nucleus and may play a role in regulating transcription.[4]

Gene

Location

FAM76A gene locus[5]

FAM76A is located on the (+) strand of the short arm of chromosome 1 (1p35.3), with the genomic sequence starting at 27725979 and ending at 27762915. The coding region is made up of 3462 base pairs and is translated into 341 amino acids.[1][6]

Gene neighborhood

Genes that flank FAM76A on the telomeric side include IFI6, CHMP1AP1, and RPEP3, while genes that flank FAM76A on the centromeric side include STX12, PPP1R8, and L0C105376894.[1]

Common aliases

In Caenorhabditis elegans, FAM76A is referred to as K04F10.7.[7] Outside of this, FAM76A does not have any significant alternative names.

mRNA

In Homo sapiens, the FAM76A gene produces 9 different mRNAs, 7 of which are alternatively spliced and 2 of which are unspliced. Of the alternatively spliced mRNAs, isoform 1 is the longest variant of the gene and is the subject of this article.[1]

Protein

General properties

The molecular weight of FAM76A is 38.4 kDa, making it possible for this protein to diffuse through nuclear pores.[8] The isoelectric point is 9.28. FAM76A does not have any significant positive, negative, or mixed charge clusters. In addition, FAM76A does not have any predicted hydrophobic or transmembrane segments, suggesting that this protein is not found within the cell membrane.[9]

Composition

The amino acid composition of FAM76A protein showed amino acid frequencies within 1.5% of that of normal human proteins for all but cysteine, valine, and lysine. Cysteine and lysine have higher frequencies compared to a normal Homo sapiens protein, while valine has a lower frequency compared to a normal Homo sapiens protein. These same amino acid frequency differences are seen in FAM76A orthologs such as Gallus gallus (H. sapiens sequence identity 84%), Serinus canaria (H. sapiens sequence identity 77%), and Crassostrea gigas (H. sapiens sequence identity 57%).

Domains and motifs

Pictorial representation of FAM76A domains

NCBI conserved domains search identified an uncharacterized conserved protein (YqiK) that contains the Band7/PHB/SPFH domain, whose function is unknown and is conserved in various species ranging from humans to bacteria.[6] In Homo sapiens, the Band7/PHB/SPFH domain spans from amino acids 252-326. The molecular weight of this domain is 8.9 kDa, and it has an isoelectric point of 9.23. The Band7/PHB/SPFH domain does not have any amino acids frequency composition that differs from a normal Homo sapiens protein.[9] This domain is yet to be assigned to any domain superfamily.

Secondary structure

I-TASSER predicted tertiary structure for FAM76A. This structure has a C-score of -3.27 (measured on a scale of -5 to 2, with higher values equating higher confidence) and a cluster density of .35 (on a scale of 0 to 1, with higher values indicating greater protein prediction coverage). The C-score takes into account both significance of the model's structure and the quality of the prediction coverage from other proteins.

FAM76A is predicted to only have alpha helices. In total, there are 17 alpha helices predicted, the longest of which contains the Band7/PHB/SPFH domain.[10] From this, only 8 alpha helices are located within conserved regions of FAM76A (see conceptual translation).

Tertiary/quaternary structure

FAM76A contains a coiled-coil domain, which is located within the Band7/PHB/SPFH domain. No significant ligand-binding sites or active sites were predicted from I-TASSER.[11] There is no evidence to suggest that FAM76A interacts with other proteins to form a quaternary structure.

Sub-cellular localization

The protein subcellular localization prediction tool, PSORT II, predicts FAM76A to be located within the nucleus. This prediction is observed in orthologs such as Gallus gallus and Callorhinchus milii.[12] Further evidence for FAM76A localizing to the nucleus is provided by the presence of a nuclear localization signal.[4]

Expression

According to NCBI Geo Profile, FAM76A is expressed in Homo sapiens parathyroid, lymph node, esophagus, and bone marrow tissue. Developmental stages where FAM76A expression is detected include the embryoid body, fetus, and adult.[13]

Brain atlas

Allen human brain atlas predictions for FAM76A expression are depicted below. FAM76A appears to have higher expression within the cerebral cortex and lower expression in parts of the reptilian brain such as the pontine tegmentum (see expression table for further details).[14]

Expression Table
Brain Area Function FAM76A Expression Level
Frontal lobe Planning, organizing, problem solving, selective attention, personality, and higher cognitive functions High
Occipital lobe Visual processing High
Temporal lobe Auditory processing High
Parietal lobe Sensation, perception, and integration of sensory input High
Cerebellum Coordination of voluntary movements High
Hippocampal formation Memory/spatial coding Low
Pontine tegmentum Sensory and motor function, sleep stage control, and arousal Low
Myelencephalon cuneate nucleus Receive fine touch and proprioceptive information from upper body Low

Experimental data

Select data from three experiments involving FAM76A are shown below. In one experiment, CLDN1 over-expression in lung adenocarcinoma cells decreased FAM76A expression.[15] In another experiment, androgen insensitive prostate cancer cells were shown to have reduced expression of FAM76A compared to androgen sensitive cells.[16] Another experiment demonstrated that metaphase II oocyte cells were shown to have more expression of FAM76A compared to control cells.[17]

Regulation of expression

Post-translational modifications

FAM76A is predicted to undergo a variety of post-translational modifications. Post-translational modifications found within conserved regions include 7 phosphorylation sites, 2 sumoylation sites, and 1 nuclear localization signal.[18] These modifications indicate that FAM76A is localized to the nucleus. Refer to conceptual translation for a visual representation of the aforementioned modifications.

Promoter

Genomatrix's ElDorado program predicts a promoter for FAM76A that is named GXP_71042 and is 679 base pairs. It is located on chromosome 1, starting at 27725479 and ending at 27726157. GXP_71042 overlaps with the start of the coding sequence of FAM76A.[19] There are several transcription factors that bind to this promoter. Many of the transcription factors that bind to the promoter region of FAM76A have function dealing with blood cells, the immune system, and leukocytes—perhaps suggesting that FAM76A is involved in immune function. It would also appear that the most common matrix families include C2H2 zinc fingers and myeloid zinc fingers, suggesting that these matrix families may be heavily involved in FAM76A transcription.

FAM76A Transcription Factor Binding Sites
Transcription Factor Binding Site Table

RNA binding proteins

Common RNA binding proteins within the 3’ UTR of FAM76A include PABPC1, ELAVL1, and PUM2—each with predicted binding frequencies of 32, 18, and 16 times, respectively.[20]

Interacting proteins

FAM76A was found to have a physical interaction with ELAVL1. The interaction was detected by immunoprecipitation by Abdelmohsen et al., 2009.[21] ELAVL1 is involved in regulating gene expression.

Homology/evolution

Paralogs

FAM76B is a paralog of FAM76A. It is estimated that FAM76A and FAM76B diverged from each other around 17.5 MYA.[1] Structural similarities that are conserved between FAM76A/B include a coiled coil domain as well as a poly serine compositional bias.[6] FAM76A and FAM76B both exhibit high expression in tissues such as lymph node, whole blood, testis, ovary, brain, kidney, liver, and lung.[22] FAM76B has about 62% sequence identity with FAM76A.[6]

Orthologs

Genus and Species Common Name Date of Divergence (MYA) Amino Acid Sequence Identity (%)
Homo sapiens humans 0 100
Macaca fascicularis crab-eating macaque 29.1 95
Tarsius syrichta Philippine tarsier 67.6 85
Dipodomys ordii Ord's kangaroo rat 90.9 85
Nannospalax galili blind mole rat 90.9 88
Gallus gallus red junglefowl 320.5 84
Nipponia nippon crested ibis 320.5 83
Egretta garzetta little egret 320.5 75
Anolis carolinensis Carolina anole 320.5 73
Oryzias latipes Japanese rice fish 429.6 59
Callorhinchus milii Australian ghostshark 482.9 64
Crassostrea gigas pacific oyster 847 57
Cryptosporidium parvum Iowa II N/A 1724.7 27
Cryptosporidium hominis N/A 1724.7 26

Shown here is a table of a select number of orthologs for Homo sapiens FAM76A. The table includes closely, intermediately, and distantly related orthologs. Mammals are shown to have greater similarity, while aquatic vertebrates such as actinopterygii/chondrichthyes have lesser similarity. Orthologs of Homo sapiens protein FAM76A are listed above in descending order of date of divergence and then by sequence identity.

Evolution

FAM76A appears to have a moderate rate of mutation when compared to fibrinogen (fast mutating) and cytochrome c (slow mutating).[3][23] This suggests that FAM76A has been at least somewhat resistant to mutation during the course of evolution.

Mutation rate of FAM76A relative to cytochrome C and fibrinogen. Mutation rate, m, which is the corrected percent of amino acid changes between the Homo sapiens protein and its orthologs is plotted against the log (millions of years since the date of divergence of Homo sapiens lineage and the lineage of the species in which the ortholog is seen. Here, "m" is calculated from the observed number of amino acid changes observed between the two organisms as follows: m/100=-ln(1-n/100) where m is the total number of amino acid changes occurred in a 100-amino-acid segment of a protein and n is the observed number of amino acid changes per 100 residues compared to the Homo sapiens protein sequence

Clinical significance

Disease association

FAM76A expression is highest in adrenal tumors, esophageal tumors, and soft tissue/muscle tissue tumors.[1][24] Copy number gain/loss of FAM76A—along with neighboring genes—has shown to produce detrimental phenotypes. In one case report, a patient with a copy number gain from 1p36.11-34.2 was shown to have developmental delays.[25] Another patient, who had a copy number gain from 1p36.1-35, showed similar delays.[26] In another case report, a patient with a copy number loss of 1p35.3, the exact location of FAM76A, developed macrocephaly.[26]

Multiple sequence alignment (MSA)

The MSA, shown below and generated with Biology Workbench CLUSTALW, arranges orthologs by the first letter of genus and then the first two letters of species.[9] There are 3 domains that are highly conserved across orthologs. Two of these domains have an unknown function, while the third domain is a coiled-coil domain. Conservation of these regions was traced back to Cryptosporidium parvum Iowa II, which diverged from Homo sapiens 1724.7 MYA. Conserved region 1 contains mostly polar amino acids; conserved region 2 contains both polar and non-polar amino acids; and the coiled-coil domain contains mostly polar amino acids.

References

  1. 1.0 1.1 1.2 1.3 1.4 1.5 "FAM76A family with sequence similarity 76 member A [Homo sapiens (human) - Gene - NCBI"]. https://www.ncbi.nlm.nih.gov/gene/199870. 
  2. "FAM76A Uniprot Entry". https://www.uniprot.org/uniprot/Q8TAV0. 
  3. 3.0 3.1 "NCBI BLAST". http://blast.ncbi.nlm.nih.gov/Blast.cgi?PAGE_TYPE=BlastSearch&BLAST_SPEC=OGP__9606__9558. 
  4. 4.0 4.1 "NLS Mapper". http://nls-mapper.iab.keio.ac.jp/cgi-bin/NLS_Mapper_form.cgi. 
  5. "NCBI FAM76A Gene Entry". https://www.ncbi.nlm.nih.gov/gene/199870. 
  6. 6.0 6.1 6.2 6.3 "protein FAM76A isoform 1 [Homo sapiens - Protein - NCBI"]. https://www.ncbi.nlm.nih.gov/protein/NP_001137384.1. 
  7. "Wormbase K04F10.7 profile". http://www.wormbase.org/species/c_elegans/gene/WBGene00019399#19-c8a-3. 
  8. "A small ubiquitin-related polypeptide involved in targeting RanGAP1 to nuclear pore complex protein RanBP2" (in English). Cell 88 (1): 97–107. January 1997. doi:10.1016/S0092-8674(00)81862-0. PMID 9019411. 
  9. 9.0 9.1 9.2 "SDSC Biology Workbench". http://workbench.sdsc.edu. 
  10. "Phyre2". http://www.sbg.bio.ic.ac.uk/phyre2/phyre2_output/f60b250ed920151a/summary.html. [yes|permanent dead link|dead link}}]
  11. "I-TASSER Protein Structure & Function Predictions". http://zhanglab.ccmb.med.umich.edu/I-TASSER/. 
  12. "PSORT II Prediction". http://psort.hgc.jp/form2.html. 
  13. "NCBI Geo Profile". https://www.ncbi.nlm.nih.gov/geoprofiles/?term=fam76a. 
  14. "FAM76A Allen Brain Atlas". http://human.brain-map.org/microarray/search/show?exact_match=false&search_term=FAM76A&search_type=gene&page_num=0. 
  15. "Claudin-1 is a metastasis suppressor and correlates with clinical outcome in lung adenocarcinoma". American Journal of Respiratory and Critical Care Medicine 179 (2): 123–33. January 2009. doi:10.1164/rccm.200803-456OC. PMID 18787218. http://ntur.lib.ntu.edu.tw//handle/246246/188821. 
  16. "Genome-wide characterization of gene expression variations and DNA copy number changes in prostate cancer cell lines". The Prostate 63 (2): 187–97. May 2005. doi:10.1002/pros.20158. PMID 15486987. 
  17. "The transcriptome of human oocytes". Proceedings of the National Academy of Sciences of the United States of America 103 (38): 14027–32. September 2006. doi:10.1073/pnas.0603227103. PMID 16968779. Bibcode2006PNAS..10314027K. 
  18. "ExPasy Post-trasnlational Modifications". http://www.expasy.org. 
  19. "Genomatrix ElDorado". http://www.genomatix.de/cgi-bin//sessions/login.pl?s=a987e05f8296ab67ca47a3f3425241d6. [yes|permanent dead link|dead link}}]
  20. "RBPDP: The database of RNA-binding protein specificities". http://rbpdb.ccbr.utoronto.ca. 
  21. "Ubiquitin-mediated proteolysis of HuR by heat shock". The EMBO Journal 28 (9): 1271–82. May 2009. doi:10.1038/emboj.2009.67. PMID 19322201. 
  22. "NCBI AceView". https://www.ncbi.nlm.nih.gov/ieb/research/acembly/. 
  23. "Time Tree". http://www.timetree.org. 
  24. "FAM76A Gene Cards". https://www.genecards.org/cgi-bin/carddisp.pl?gene=KIAA1841&search=KIAA1841. 
  25. "Consensus statement: chromosomal microarray is a first-tier clinical diagnostic test for individuals with developmental disabilities or congenital anomalies". American Journal of Human Genetics 86 (5): 749–64. May 2010. doi:10.1016/j.ajhg.2010.04.006. PMID 20466091. 
  26. 26.0 26.1 "An evidence-based approach to establish the functional and clinical significance of copy number variants in intellectual and developmental disabilities". Genetics in Medicine 13 (9): 777–84. September 2011. doi:10.1097/GIM.0b013e31822c79f9. PMID 21844811. 

Further reading