Biology:Chimera (EST)

From HandWiki

In genetics and molecular biology, a chimera is a single DNA sequence originating from multiple transcripts or parent sequences. It can occur in various contexts. Chimeras are generally considered contaminants, because a chimera can be mistaken for a novel sequence when it is in fact an artifact. However, the deliberate formation of artificial chimeras can also be a useful tool in molecular biology. For example, in protein engineering, "chimeragenesis (forming chimeras between proteins that are encoded by homologous cDNAs)"[1] is one of the "two major techniques used to manipulate cDNA sequences".[1]

Transcript chimera

A chimera can occur as a single cDNA sequence originating from two transcripts. It is usually considered to be a contaminant in transcript and expressed sequence tag (EST) databases.[2] It is estimated that approximately 1% of all transcripts in the National Center for Biotechnology Information's Unigene database contain a "chimeric sequence".[3]

PCR chimera

A chimera can also be an artifact of PCR amplification. It occurs when the extension of an amplicon is aborted, and the aborted product functions as a primer in the next PCR cycle. The aborted product anneals to the wrong template and continues to extend, thereby synthesizing a single sequence sourced from two different templates.[4]
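
The mechanism described above can be sketched as a toy model. The function name `form_pcr_chimera`, the parameter `abort_at`, and the two example templates are hypothetical and chosen only for illustration; the sketch also assumes the two templates are homologous, already aligned, and of equal length, which real amplicons need not be.

```python
def form_pcr_chimera(template_a: str, template_b: str, abort_at: int) -> str:
    """Toy model of one chimera-forming event spanning two PCR cycles.

    Illustrative assumption: both templates are aligned and of equal
    length, so position i in one corresponds to position i in the other.
    """
    if len(template_a) != len(template_b):
        raise ValueError("toy model expects equal-length, aligned templates")
    if not 0 < abort_at < len(template_a):
        raise ValueError("abort_at must fall strictly inside the template")

    # First cycle: extension along template A is aborted early.
    aborted_product = template_a[:abort_at]

    # Next cycle: the aborted product anneals to template B, acts as a
    # primer, and extension continues by copying the remainder of B.
    return aborted_product + template_b[abort_at:]


# Hypothetical example templates (not real sequences).
template_a = "ACGTACGTAAAA"
template_b = "ACGTTCGTCCCC"

chimera = form_pcr_chimera(template_a, template_b, abort_at=6)
print(chimera)  # ACGTACGTCCCC
```

The resulting sequence matches neither input template in full, which is why it can be mistaken for a novel sequence.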

PCR chimeras are an important issue to take into account during metabarcoding, where DNA sequences from environmental samples are used to determine biodiversity. A chimera is a novel sequence that will most probably not match any known organism. Hence, it might be interpreted as a new species, thereby inflating the estimated diversity.

Chimeric read

A chimeric read is a digital DNA sequence (i.e. a string of letters in a file that can be read as a DNA sequence) that either originates from an actual chimera (i.e. a physical DNA sequence in a sample) or is produced by misreading the sample. The latter is known to occur when sequencing from electrophoresis gels.[5]

Several tools have been devised to detect chimeras, including:

  • CHECK_CHIMERA of the Ribosomal Database Project[6]
  • ChimeraSlayer in QIIME[7][4]
  • uchime in vsearch[8]
  • removeBimeraDenovo() in dada2[9]
  • Bellerophon[10]
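
To give a feel for what de novo chimera detection involves, the following is a minimal sketch of a two-parent ("bimera") check. It is not the algorithm used by any of the tools listed above: it assumes equal-length sequences and exact matching, and it ignores alignment, abundance, and scoring, all of which real tools may take into account. The function names `shared_prefix_len` and `find_parents` and the example sequences are hypothetical.

```python
from itertools import permutations


def shared_prefix_len(a: str, b: str) -> int:
    """Length of the longest common prefix of a and b."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


def find_parents(query: str, candidates: list[str]):
    """Return (left_parent, right_parent, breakpoint) if query can be
    rebuilt exactly as left_parent[:k] + right_parent[k:], else None.

    Simplifying assumptions (for illustration only): all sequences have
    the same length and are compared without alignment or mismatches.
    """
    for left, right in permutations(candidates, 2):
        if query in (left, right):
            continue  # an exact copy of a parent is not a chimera
        if not (len(query) == len(left) == len(right)):
            continue
        prefix = shared_prefix_len(query, left)
        suffix = shared_prefix_len(query[::-1], right[::-1])
        # If the matching prefix and suffix together cover the query,
        # some breakpoint k explains it as a two-parent chimera.
        if prefix + suffix >= len(query):
            return left, right, len(query) - suffix
    return None


# Hypothetical example sequences (not real data).
parent_a = "ACGTACGTAAAA"
parent_b = "ACGTTCGTCCCC"
suspect = "ACGTACGTCCCC"

result = find_parents(suspect, [parent_a, parent_b])
print(result)  # ('ACGTACGTAAAA', 'ACGTTCGTCCCC', 5)
```

Because the two example parents agree at several positions around the junction, more than one breakpoint would explain the suspect sequence; this sketch reports the leftmost one.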

Examples

  • "The first mRNA transcript isolated for..." the human gene C2orf3 "...was part of an artificial chimera..."
  • CYP2C17 was thought to be a human gene, but "...is now considered an artefact based on a chimera of CYP2C18 and CYP2C19."[11]
  • Researchers have created receptor chimeras in their studies of Oncostatin M.

References

  1. Lajtha, Abel; Reith, Maarten E. A. (2007). Handbook of Neurochemistry and Molecular Neurobiology: Neural Membranes and Transport. Boston, MA: Springer Science+Business Media, LLC. pp. 485. ISBN 978-0-387-30347-5. p. 424
  2. Unneberg, P; Claverie, JM (2007). Hoheisel, Jörg. ed. "Tentative Mapping of Transcription-Induced Interchromosomal Interaction using Chimeric EST and mRNA Data". PLoS ONE 2 (2): e254. doi:10.1371/journal.pone.0000254. PMID 17330142. Bibcode: 2007PLoSO...2..254U.
  3. Charlie Nelson. "EST Assembly for the Creation of Oligonucleotide Probe Targets". Agilent Technologies. http://www.chem.agilent.com/Library/applications/5989_0750_EST_final72.pdf. Retrieved May 12, 2009. 
  4. Birren, Bruce W.; Knight, Rob; Petrosino, Joseph F.; The Human Microbiome Consortium; DeSantis, Todd Z.; Methé, Barbara; Sodergren, Erica; Highlander, Sarah K. et al. (2011-03-01). "Chimeric 16S rRNA sequence formation and detection in Sanger and 454-pyrosequenced PCR amplicons" (in en). Genome Research 21 (3): 494–504. doi:10.1101/gr.112730.110. ISSN 1549-5469. PMID 21212162. 
  5. "Sequencing a Genome, part VI: Chimeras are not just funny-looking animals | ScienceBlogs". https://scienceblogs.com/digitalbio/2007/02/01/sequencing-a-genome-part-vi-ch. 
  6. Maidak, B. (1996). "The Ribosomal Database Project (RDP)". Nucleic Acids Research 24 (1): 82–85. doi:10.1093/nar/24.1.82. PMID 8594608. PMC 145599. http://nar.oxfordjournals.org/cgi/content/full/24/1/82#tbl01. Retrieved May 12, 2009. 
  7. "Chimera checking sequences with QIIME — Homepage". http://qiime.org/tutorials/chimera_checking.html. 
  8. "UCHIME algorithm". http://drive5.com/usearch/manual/uchime_algo.html. 
  9. "removeBimeraDenovo function | R Documentation". https://www.rdocumentation.org/packages/dada2/versions/1.0.3/topics/removeBimeraDenovo. 
  10. Hugenholtz, Philip; Faulkner, Geoffrey; Huber, Thomas (2004-09-22). "Bellerophon: a program to detect chimeric sequences in multiple sequence alignments" (in en). Bioinformatics 20 (14): 2317–2319. doi:10.1093/bioinformatics/bth226. ISSN 1367-4803. PMID 15073015. 
  11. "Entrez Gene: CYP2C18 cytochrome P450, family 2, subfamily C, polypeptide 18". National Center for Biotechnology Information. https://www.ncbi.nlm.nih.gov/sites/entrez?Db=gene&Cmd=ShowDetailView&TermToSearch=1562. Retrieved May 12, 2009.