Biology:C2orf72
C2orf72 (Chromosome 2, Open Reading Frame 72) is a gene in humans (Homo sapiens) that encodes a protein currently named after its gene, C2orf72.[1] It is also designated LOC257407[1] and can be found under GenBank accession code NM_001144994.2.[2] The protein can be found under UniProt accession code A6NCS6.[3]
This gene is primarily expressed in the liver, brain, placental, and small intestine tissues.[4] C2orf72 is an intracellular protein that has been predicted to reside within the nucleus, cytosol, and plasma membrane of cells.[1] The function of C2orf72 is unknown, but it is predicted to be involved in very-low-density lipoprotein particle assembly and also involved in the regulation of cholesterol esterification.[5] This prediction also matches with the fact that both estradiol[6] and testosterone[7] have been reported to upregulate expression of C2orf72.[8]
Gene
File:Ideogram human chromosome 2.svg

Locus
C2orf72 is a protein-coding gene found on the forward (+) strand of chromosome 2 at the locus 2q37.1, on the long arm of the chromosome.[1]
mRNA
C2orf72's mRNA transcript is reported to be about 3,629 base pairs long.[2] It appears to have two polyadenylation sites near the 5′ end of the mRNA transcript, each preceded by their respective regulatory sequences, such as ATTAAA or AATAAA.[2]
There are three predicted exons reported for human C2orf72.[2]
Expression pattern
C2orf72 is preferentially expressed in brain, liver, placenta, colon, small intestine, gallbladder, stomach, and prostate, and to a lesser extent in adrenal gland, appendix, pancreas, lung, kidney, testis, and urinary bladder.[4]
Predicted Biological Functions

It is predicted via Archs4[9] (July 16, 2022) that the function of this gene may be related to very-low-density lipoprotein particle assembly[10] and also involved in the regulation of cholesterol esterification.[5]
Regulation
Gene-level regulation
Gene perturbation data
In a study of embryonic liver samples lacking hepatocyte nuclear factor 4 alpha (HNF4α), the expression of C2orf72 was downregulated.[11]
Both estradiol[6] and testosterone[7] upregulate expression of C2orf72.[8]
Expression pattern
C2orf72 mRNA and protein products are found preferentially in the liver, kidney, and placenta.[12] The protein is localized to the cell membrane and cytoplasm in liver, brain, and placental tissues.[12]
Transcript-level regulation
miR-1271-5p is a microRNA that could bind to the 3′ untranslated region of the C2orf72 mRNA transcript at 5′-...GUGCCAA...-3′.[2][13][14]
Protein-level regulation
Predicted phosphorylation sites
There are at least two predicted phosphorylation sites for the human C2orf72 protein, one at threonine-286 and the other at serine-294.[3]

Protein




Human protein
The predicted molecular weight of C2orf72 is 30.5 kDa,[15] and it has a predicted isoelectric point (pI) of pH 8.7.[16]
There are eight cysteine residues, for a potential of four disulfide bonds.[17] Most of the cysteine residues are positioned next to a polar amino acid (uncharged or positively or negatively charged).[17]
At physiological pH, there are 33 positively charged amino acid residues, including histidine, most of which are arginines.[17] Likewise, there are 33 negatively charged amino acid residues, most of which are glutamates.[17]
There are 14 hydroxyl-containing residues (tyrosine, threonine or serine) that could serve as typical phosphorylation sites; most of these are serines.[17]

Interacting proteins
These proteins have been reported to interact with human C2orf72: RASN (GTPase NRas),[19] RASK (GTPase KRas),[19] and CD81.[20][21]
Homology
There are at least 203 organisms with an ortholog of C2orf72.[22] The most evolutionarily distant reported ortholog of C2orf72 is in the Australian ghost shark (Callorhincus milii);,[23][24][25] and it is broadly conserved from Actinopterygii (bony fish) to Mammalia.
| Genus and species | Common name | Order | Date of divergence from human
(million years ago) |
GenBank accession
code |
Sequence
length |
Sequence identity (%) | Sequence
similarity (%) |
|---|---|---|---|---|---|---|---|
| Pan troglodytes | Chimpanzee | Primates | 6.7 | XP_516141.5 | 295 | 98.6 | 98.6 |
| Pongo abelii | Sumatran orangutan | Primates | 15.76 | XP_024099683.1 | 295 | 95.3 | 96.9 |
| Castor canadensis | American beaver | Rodentia | 90 | XP_020011841.1 | 282 | 77.6 | 82.4 |
| Oryx dammah | Scimitar-horned oryx | Artiodactyla | 96 | XP_040084064.1 | 285 | 74.6 | 79.3 |
| Sus scrofa | Wild boar | Artiodactyla | 96 | XP_005657646.1 | 282 | 75.3 | 80.7 |
| Tursiops truncatus | Common bottlenose dolphin | Cetacea | 96 | XP_033715450.1 | 285 | 76.9 | 80.7 |
| Felis catus | Domestic cat | Carnivora | 96 | XP_023115562.1 | 286 | 80.1 | 83.1 |
| Eptesicus fuscus | Big brown bat | Chiroptera | 96 | XP_027993078.1 | 151 | 36.1 | 38.9 |
| Corapipo altera | White-ruffed manakin | Passeriformes | 312 | XP_027503457.1 | 181 | 26.7 | 34.0 |
| Pipra filicauda | Wire-tailed manakin | Passeriformes | 312 | XP_027606890.1 | 243 | 34.7 | 45.2 |
| Taeniopygia guttata | Zebra finch | Passeriformes | 312 | XP_030136117.3 | 255 | 35.1 | 45.4 |
| Corvus cornix cornix | Hooded crow | Passeriformes | 312 | XP_039412719.1 | 245 | 36.0 | 45.3 |
| Hirundo rustica | Barn swallow | Passeriformes | 312 | XP_039930397.1 | 243 | 37.0 | 46.7 |
| Aythya fuligula | Tufted duck | Anseriformes | 312 | XP_032049188 | 251 | 36.3 | 46.7 |
| Anas platyrhynchos | Mallard | Anseriformes | 312 | XP_038039556.1 | 251 | 36.3 | 46.7 |
| Protobothrops mucrosquamatus | Brown-spotted pit viper | Squamata | 312 | XP_029139335.1 | 278 | 22.9 | 34.5 |
| Python bivittatus | Burmese python | Squamata | 312 | XP_025023716.1 | 279 | 23.3 | 35.9 |
| Pseudonaja textilis | Eastern brown snake | Squamata | 312 | XP_026577460.1 | 272 | 31.6 | 41.0 |
| Pantherophis guttatus | Corn snake | Squamata | 312 | XP_034263860.1 | 252 | 33.0 | 42.5 |
| Pogona vitticeps | Central bearded dragon | Squamata | 312 | XP_020657305.1 | 295 | 24.1 | 34.0 |
| Zootoca vivipara | Common lizard | Squamata | 312 | XP_034989711.1 | 285 | 37.9 | 48.6 |
| Lacerta agilis | Sand lizard | Squamata | 312 | XP_033004091.1 | 289 | 38.0 | 49.5 |
| Podarcis muralis | Common wall lizard | Squamata | 312 | XP_028587763.1 | 272 | 38.7 | 50.8 |
| Gopherus evgoodei | Goode's thornscrub tortoise | Testudines | 312 | XP_030431493.1 | 481 | 24.2 | 31.1 |
| Terrapene carolina
triunguis |
Three-toed box turtle | Testudines | 312 | XP_029766982.1 | 262 | 35.1 | 43.2 |
| Chrysemys picta bellii | Painted turtle | Testudines | 312 | XP_023966073.1 | 306 | 36.6 | 47.4 |
| Dermochelys coriacea | Leatherback sea turtle | Testudines | 312 | XP_038272534.1 | 271 | 38.1 | 48.1 |
| Mauremys reevesii | Reeves' turtle | Testudines | 312 | XP_039344659.1 | 277 | 39.5 | 51.4 |
| Nanorana parkeri | High Himalaya frog | Anura | 351.8 | XP_018432004.1 | 304 | 27.3 | 40.1 |
| Xenopus tropicalis | Tropical clawed frog | Anura | 351.8 | XP_002937397.3 | 289 | 30.7 | 42.4 |
| Rhinatrema bivittatum | Two-lined caecilian | Gymnophiona | 351.8 | XP_029473197.1 | 358 | 30.3 | 36.1 |
| Geotrypetes seraphini | Gaboon caecilian | Gymnophiona | 351.8 | XP_033814148.1 | 233 | 33.9 | 44.2 |
| Parambassis ranga | Indian glass fish | Perciformes | 435 | XP_028260036.1 | 334 | 19.7 | 34.5 |
| Acanthochromis polyacanthus | Spiny chromis | Perciformes | 435 | XP_022050415.1 | 317 | 21.8 | 35.6 |
| Acanthopagrus latus | Yellowfin seabream | Perciformes | 435 | XP_036971960.1 | 309 | 22.0 | 35.5 |
| Cyprinodon tularosa | White Sands pupfish | Cyprinodontiformes | 435 | XP_038147473.1 | 296 | 20.1 | 33.1 |
| Esox lucius | Northern pike | Esociformes | 435 | XP_012990404.1 | 332 | 20.6 | 33.1 |
| Thunnus maccoyii | Southern bluefin tuna | Scombriformes | 435 | XP_042273029.1 | 329 | 20.2 | 34.0 |
| Syngnathus acus | Greater pipefish | Syngnathiformes | 435 | XP_037106050.1 | 274 | 19.5 | 34.9 |
| Callorhinchus milii | Australian ghost shark | Chimaeriformes | 473 | XP_007887618.1 | 413 | 17.6 | 26.5 |
References
- ↑ 1.0 1.1 1.2 1.3 "C2orf72 GeneCards". https://www.genecards.org/cgi-bin/carddisp.pl?gene=C2orf72.
- ↑ 2.0 2.1 2.2 2.3 2.4 "Homo sapiens chromosome 2 open reading frame 72 (C2orf72), mRNA" (in en-US). 2020-12-12. https://www.ncbi.nlm.nih.gov/nuccore/NM_001144994.2.
- ↑ 3.0 3.1 "iPTMnet Report A6NCS6 C2orf72". https://research.bioinformatics.udel.edu/iptmnet/entry/A6NCS6/.
- ↑ 4.0 4.1 "C2orf72 chromosome 2 open reading frame 72 [Homo sapiens (human) - Gene - NCBI"]. https://www.ncbi.nlm.nih.gov/gene/257407.
- ↑ 5.0 5.1 "ARCHS4". https://maayanlab.cloud/archs4/gene/C2ORF72.
- ↑ 6.0 6.1 "Gene Set - estradiol_homo sapiens_gpl570_gds3283". https://maayanlab.cloud/Harmonizome/gene_set/estradiol_homo+sapiens_gpl570_gds3283/GEO+Signatures+of+Differentially+Expressed+Genes+for+Small+Molecules.
- ↑ 7.0 7.1 "Gene Set - testosterone_mus musculus_gpl1261_gse17553". https://maayanlab.cloud/Harmonizome/gene_set/testosterone_mus+musculus_gpl1261_gse17553/GEO+Signatures+of+Differentially+Expressed+Genes+for+Small+Molecules.
- ↑ 8.0 8.1 "Gene - C2ORF72". https://maayanlab.cloud/Harmonizome/gene/C2ORF72.
- ↑ "ARCHS4". https://maayanlab.cloud/archs4/gene/C2ORF72.
- ↑ "QuickGO". https://www.ebi.ac.uk/QuickGO/term/GO:0034379.
- ↑ "Gene Set - hnf4a_16714383_e18dot5_liver_lof_mouse_gpl1261_gds1916". https://maayanlab.cloud/Harmonizome/gene_set/hnf4a_16714383_e18dot5_liver_lof_mouse_gpl1261_gds1916/GEO+Signatures+of+Differentially+Expressed+Genes+for+Transcription+Factor+Perturbations.
- ↑ 12.0 12.1 "Tissue expression of C2orf72 - Summary - The Human Protein Atlas". https://www.proteinatlas.org/ENSG00000204128-C2orf72/tissue.
- ↑ "miRDB - MicroRNA Target Prediction Database". http://mirdb.org/mirdb/index.html.
- ↑ "TargetScanHuman 7.2". http://www.targetscan.org/vert_72/.
- ↑ "C2orf72 protein expression summary - The Human Protein Atlas". https://www.proteinatlas.org/ENSG00000204128-C2orf72#gene_information.
- ↑ "Compute pI/MW - SIB Swiss Institute of Bioinformatics | Expasy". https://www.expasy.org/resources/compute-pi-mw.
- ↑ 17.0 17.1 17.2 17.3 17.4 "uncharacterized protein C2orf72 [Homo sapiens - Protein - NCBI"]. https://www.ncbi.nlm.nih.gov/protein/NP_001138466.1?report=fasta.
- ↑ "I-TASSER server for protein structure and function prediction". https://zhanggroup.org/I-TASSER/.
- ↑ 19.0 19.1 "The Functional Proximal Proteome of Oncogenic Ras Includes mTORC2". Molecular Cell 73 (4): 830–844.e12. February 2019. doi:10.1016/j.molcel.2018.12.001. PMID 30639242.
- ↑ "Hepatitis C virus enters liver cells using the CD81 receptor complex proteins calpain-5 and CBLB". PLOS Pathogens 14 (7). July 2018. doi:10.1371/journal.ppat.1007111. PMID 30024968.
- ↑ "HitPredict - High confidence protein-protein interactions". http://www.hitpredict.org/htp_int.php?Value=9676.
- ↑ "C2orf72 orthologs" (in en). https://www.ncbi.nlm.nih.gov/gene/257407/ortholog/.
- ↑ "LOC103176070 uncharacterized protein C2orf72 homolog [Callorhinchus milii (elephant shark) - Gene - NCBI"]. https://www.ncbi.nlm.nih.gov/gene/?term=c2orf72+shark.
- ↑ "uncharacterized protein C2orf72 homolog [Callorhinchus milii - Protein - NCBI"]. https://www.ncbi.nlm.nih.gov/protein/XP_007887618.2.
- ↑ "PREDICTED: uncharacterized protein C2orf72 homolog Callorhinchus mili - Protein - NCBI". https://www.ncbi.nlm.nih.gov/protein/XP_007887618.1.
