Biology:Oligotyping (sequencing)

From HandWiki

Oligotyping is the process of correcting DNA sequence measured during the process of DNA sequencing based on frequency data of related sequences across related samples.

History

DNA sequences were originally read from sequencing gels by eye. With the advent of computerized base callers, humans no longer 'called' the bases and instead 'corrected' the called bases. The bases were called by the software using the relative intensity of each putative basepair signal and the local spacing of the signals.

With the advent of high throughput sequencing, the volume of sequence to be corrected exceeded human capacity for sequence correction.

Use

Multiple applications require single-base pair accuracy across populations of closely related sequences. An example is amplicon sequencing to assess the relative contribution of DNA from diverse organisms to a sample.

The requirement for single basepair accuracy led to the development of methods which drew on frequency data distributed across several samples to identify variant sequences which shared the same frequency profile and were thus likely errors from the same original sequence.[1][2] The ability to use higher-order statistics to correct sequences is an important element in decreasing the burden of error in DNA sequence datasets.

See also

References

  1. Eren, A. Murat; Maignien, Loïs; Sul, Woo Jun; Murphy, Leslie G.; Grim, Sharon L.; Morrison, Hilary G.; Sogin, Mitchell L. (1 December 2013). "Oligotyping: differentiating between closely related microbial taxa using 16S rRNA gene data" (in en). Methods in Ecology and Evolution 4 (12): 1111–1119. doi:10.1111/2041-210X.12114. ISSN 2041-210X. PMID 24358444. 
  2. Preheim, Sarah P.; Perrotta, Allison R.; Martin-Platero, Antonio M.; Gupta, Anika; Alm, Eric J. (1 November 2013). "Distribution-Based Clustering: Using Ecology To Refine the Operational Taxonomic Unit" (in en). Applied and Environmental Microbiology 79 (21): 6593–6603. doi:10.1128/AEM.00342-13. ISSN 0099-2240. PMID 23974136. Bibcode2013ApEnM..79.6593P. 

External links