Biology:Fiber-Seq

From HandWiki

Fiber-seq is a molecular biology multiplexed sequencing assay capturing genomic and epigenomic information for individual chromatin fibers.  It achieves this by combining labelling of accessible DNA with long-read sequencing to map cis-regulatory elements onto the DNA template.

Background

Fiber-seq is a novel method in chromatin biology designed to map chromatin fibers onto their underlying DNA template to capture cis-regulatory architectures revealing chromatin structure at near single nucleotide resolution. Fiber-seq utilizes the non-specific DNA N6-adenine methyltransferase Hia5 (an enzyme that adds a methyl group to the N6 position of adenine nucleotides) to precisely stencil the structure of chromatin fibers onto their DNA templates.[1] Adenines in the DNA sequence that are not protected by a bound protein are methylated, creating a DNA stencil of protein occupancy. These DNA stencils are then sequenced through single-molecule long-read circular consensus sequencing, a long-read sequencing method that generates highly accurate reads by sequencing a circularized DNA molecule multiple times.[2] Regions of DNA with non-methylated adenines indicate binding of proteins, and the protein type can be inferred by the size of the unmethylated 'footprint' as well as the underlying DNA sequence. Readout of Fiber-seq includes the DNA sequence, endogenous DNA methylation, and protein occupancy footprints, revealing the primary architecture of multikilo-base segments of individual chromatin fibers.[1] There are existing Fiber-seq data analysis pipelines publicly available such as Fibertools to perform quality control and infer regulatory elements.[3] The Fiber-seq workflow involves the following steps:[1]

  1. Treatment of extracted cell nuclei with non-specific m6A-MTase Hia5 + methyl donor SAM
  2. PCR-free library construction on high-molecular weight DNA extracted from nuclei
  3. Single-molecule circular consensus sequencing
  4. Data processing and analysis

Development and history

Fiber-seq was first developed and published by Stergachis et al. in the 2020 paper, "Single-molecule regulatory architectures captured by chromatin fiber sequencing."[1] Its breakthrough is in coupling the use of non-specific DNA N6-adenine methyltransferases to stencil the chromatin fiber architecture onto the underlying DNA with long-read single-molecule sequencing technology. Together, this generates multiple layers of genomic data including the DNA sequence, endogenous DNA methylation states, and a protein occupancy map.[1] While both major components of this method, exogenous DNA methylation to tag accessible chromatin[4] and single-molecule long-read sequencing, existed prior to Fiber-seq, its novelty is integrating these components together.[2]

Distinction from existing methods

Many chromatin profiling methods fragment chromatin prior to sequencing through enzymatic digestion, transposition, or sonication.[5] Short-read chromatin sequencing methods that fragment DNA prior to sequencing prevent recovery of long-range regulatory architectures present on individual chromatin fibers.[5] For example, ChIP-seq, CUT&RUN, and CUT&Tag rely on antibodies and generate short fragments,[6] while ATAC-seq, DNase-seq, and MNase-seq depend on enzymatic cleavage of accessible DNA[6][7]. NOMe-seq similarly uses an exogenous methyltransferase to label accessible chromatin, but relies on bisulfite conversion and short-read sequencing[4], limiting long-range single-molecule resolution. Fiber-seq is distinct in its use of a non-specific adenine methyltransferase (m6A-MTase) to methylate accessible adenine nucleotides coupled with long-read sequencing. Fiber-seq does not require immunoprecipitation, targeted cleavage, or intentional fragmentation prior to sequencing, enabling direct reconstruction of multi-kilobase regulatory architectures on individual DNA molecules.[1]

Scientific principles

  • Chromatin architecture and gene expression: In every eukaryotic cell, DNA must be highly compacted to fit into the nucleus. This high level of compaction means that at any given time, only a small portion of the genome is accessible to transcription machinery. Gene accessibility is dynamic and influenced by multiple factors, including the structure of chromatin, which has several levels of organization. The fundamental repeating unit is the nucleosome, comprising 146 base pairs of DNA wrapped around a histone octamer, often referred to as the nucleosome array or 11 nm fiber.[8] Gene expression can be dynamically modulated by histone modifications, DNA methylation, histone variants, transcription factor binding, and nucleosome positioning.[9][10] Depending on how accessible the DNA is, regions of chromatin can be classified as euchromatin or heterochromatin . This complex primary chromatin architecture of cis-regulatory elements can be captured by Fiber-seq at single-molecule resolution.[1]
  • Hia5 enzyme: A key part of the fiber-seq workflow is incubating nuclei with the enzyme Hia5 and SAM, a methyl donor. Hia5 is a non-sequence-specific adenine methyltransferase that modifies adenine residues to N6-methyladenine.[1][11] In animal genomes, adenines occur at an average frequency approaching one in every two base pairs[1] and are typically devoid of methylation.[12] Therefore, Hia5 can be used to accurately mark regions of DNA that are not protected by protein by using methyl groups donated by SAM and catalyzing exogenous methylation of adenines.[1][11]
  • Long-read sequencing and CCS: Long-read sequencing describes modern DNA sequencing methods that allow for single molecule resolution of long DNA fragments typically in the range of tens of kilobases, with some ultralong sequencing platforms able to sequence up to 1 Mb.[13] Multiple methods of long-read sequencing exist, but for the foundational Fiber-seq experiments, circular consensus sequencing (CCS) was used.[1] CCS involves circularizing long fragments of native DNA with epigenetic modifications still intact and sequencing them several times in order to generate a consensus sequence that has extremely high accuracy.[13] The long-read aspect of Fiber-seq is what allows for the novel information of cis-regulatory architecture along individual chromatin fibers that underlies gene activation or repression.[1]

Properties

One of the main properties of Fiber-seq is that it does not use nucleases unlike other common methods for mapping chromatin like DNAse-seq which uses the nuclease DNAse I.  Using Hia5, a non-specific N6-adenine DNA methyltransferases, Fiber-seq marks accessible DNA regions without fragmenting and compromising the structure of the DNA molecule.[1] As a result, Fiber-seq enables the capture of long-range regulatory architecture.

To complement this, Fiber-seq utilizes CCS, a high-fidelity long-read sequencing method with nucleotide resolution. As such, Fiber-seq provides high base calling accuracy across chromatin fibers averaging 10 - 15 kilobases in length.[1] This method is able to distinguish between methylated and unmethylated adenine residues based on the kinetics of the DNA polymerase used during sequencing.[1]  Thus, Fiber-seq is able to capture three distinct layers of information from a single read: the primary DNA sequence, endogenous 5-methylcytosine (5mC) patterns, and exogenous 6mA marks representing protein occupancy.[1] 

Another feature of Fiber-seq is single-molecule resolution. This method is able to map the primary chromatin architecture of individual DNA fibers rather than an average of large chromatin fiber populations needed for other methods.[1]

In summary, Fiber-seq has:

  • Resolution: nucleotide resolution of individual chromatin fibers
  • Read length: average 10 - 15 kilobases
  • Data layers: genome sequence, endogenous methylation marks, exogenous m6A marks

By integrating non-destructive enzymatic labeling with high-fidelity long-read sequencing, Fiber-seq provides a multi-layered and continuous map of the genome that resolves individual chromatin architectures.[1]

Limitations

One limitation of Fiber-seq is that while it is able to observe regions of protein binding, it is unable to identify which proteins bound the DNA at that region.[14]

Since the method uses an m6A methyltransferase, there may also be a labelling bias as accessibility can only be mapped where adenine residues are present. In regions with extremely low adenine content, the resolution of may be reduced.[1][15]

As well, due to Fiber-seq's reliance on amplification-free long-read sequencing, the assay requires a significant amount of input material.[15] Along with the fact that Fiber-seq does not provide single-cell resolution [15][16], Fiber-seq is limited for studies using rare cell types.[15]

Applications

Transcription factor footprinting

Fiber-seq can detect where transcription factors are bound to DNA, shielding the region from methylation. This allows for the investigation of the effect of specific TF binding and long-range chromatin interactions to study how transient TF binding impacts regulatory DNA accessibility.[15][1] For example, Grasberger et al. utilized Fiber-seq to study thyroid specimens from participants with resistance to thyrotropin, and revealed that mutated short tandem repeats was associated with an increase in TF footprinting at a thyroid-specific enhancer cluster.[17]

Chromatin mapping of complex tissues

In complex tissues, other chromatin mapping methods like DNase-seq and ATAC-seq have cleavage biases, decreased resolution, and overestimated size calculation of nucleosome-depleted sequences.[15][18]  To overcome these challenges, Peter et al. applied Fiber-seq in sequencing neuronal and non-neuronal cells from human brain tissue to identify 20,000 accessible chromatin regions that could not be mapped using short-read epigenomic sequencing.[15]

"Fiber-seq allows resolution of individual nucleosomes (147 bp) and internucleosomal linker DNA on the single chromatin fibers, which enabled us to confirm principles of nucleosomal organization hitherto elusive to assess in complex tissues and established only in simple eukaryotes and cells in culture." [15]

Chromatin mapping of highly repetitive sequences

By using long-read sequencing, Fiber-seq has improved accuracy in mapping repetitive genomic regions. In studies on the maize genome, researchers using Fiber-seq were able to identify and map a significantly greater number of repetitive transposable elements (TEs) compared to ATAC-seq.[19][16] In another study on centromeres, which are flanked by highly repetitive regions, Dubocanin et al. were able to use Fiber-seq to identify chromatin architectures within centromeres at single-molecule and near-single-nucleotide resolution.[14]

References

  1. 1.00 1.01 1.02 1.03 1.04 1.05 1.06 1.07 1.08 1.09 1.10 1.11 1.12 1.13 1.14 1.15 1.16 1.17 1.18 1.19 Stergachis, Andrew B.; Debo, Brian M.; Haugen, Eric; Churchman, L. Stirling; Stamatoyannopoulos, John A. (2020-06-26). "Single-molecule regulatory architectures captured by chromatin fiber sequencing" (in en). Science 368 (6498): 1449–1454. doi:10.1126/science.aaz1646. ISSN 0036-8075. PMID 32587015. Bibcode2020Sci...368.1449S. https://www.science.org/doi/10.1126/science.aaz1646. 
  2. 2.0 2.1 Wenger, Aaron M.; Peluso, Paul; Rowell, William J.; Chang, Pi-Chuan; Hall, Richard J.; Concepcion, Gregory T.; Ebler, Jana; Fungtammasan, Arkarachai et al. (October 2019). "Accurate circular consensus long-read sequencing improves variant detection and assembly of a human genome" (in en). Nature Biotechnology 37 (10): 1155–1162. doi:10.1038/s41587-019-0217-9. ISSN 1087-0156. PMID 31406327. Bibcode2019NatBi..37.1155W. 
  3. Jha, Anupama; Bohaczuk, Stephanie C.; Mao, Yizi; Ranchalis, Jane; Mallory, Benjamin J.; Min, Alan T.; Hamm, Morgan O.; Swanson, Elliott et al. (November 2024). "DNA-m6A calling and integrated long-read epigenetic and genetic analysis with fibertools" (in en). Genome Research 34 (11): 1976–1986. doi:10.1101/gr.279095.124. ISSN 1088-9051. PMID 38849157. 
  4. 4.0 4.1 Kelly, Theresa K.; Liu, Yaping; Lay, Fides D.; Liang, Gangning; Berman, Benjamin P.; Jones, Peter A. (December 2012). "Genome-wide mapping of nucleosome positioning and DNA methylation within individual DNA molecules" (in en). Genome Research 22 (12): 2497–2506. doi:10.1101/gr.143008.112. ISSN 1088-9051. PMID 22960375. 
  5. 5.0 5.1 Giani, Alice Maria; Gallo, Guido Roberto; Gianfranceschi, Luca; Formenti, Giulio (2020). "Long walk to genomics: History and current approaches to genome sequencing and assembly" (in en). Computational and Structural Biotechnology Journal 18: 9–19. doi:10.1016/j.csbj.2019.11.002. PMID 31890139. PMC 6926122. https://linkinghub.elsevier.com/retrieve/pii/S2001037019303277. 
  6. 6.0 6.1 Chawla, Anjali; Nagy, Corina; Turecki, Gustavo (2021-07-16). "Chromatin Profiling Techniques: Exploring the Chromatin Environment and Its Contributions to Complex Traits" (in en). International Journal of Molecular Sciences 22 (14): 7612. doi:10.3390/ijms22147612. ISSN 1422-0067. PMID 34299232. 
  7. Mansisidor, Andrés R.; Risca, Viviana I. (2022-12-31). "Chromatin accessibility: methods, mechanisms, and biological insights" (in en). Nucleus 13 (1): 238–278. doi:10.1080/19491034.2022.2143106. ISSN 1949-1034. PMID 36404679. 
  8. Luger, Karolin; Mäder, Armin W.; Richmond, Robin K.; Sargent, David F.; Richmond, Timothy J. (September 1997). "Crystal structure of the nucleosome core particle at 2.8 Å resolution" (in en). Nature 389 (6648): 251–260. doi:10.1038/38444. ISSN 0028-0836. PMID 9305837. Bibcode1997Natur.389..251L. https://www.nature.com/articles/38444. 
  9. Janssen, Sanne M.; Lorincz, Matthew C. (March 2022). "Interplay between chromatin marks in development and disease" (in en). Nature Reviews Genetics 23 (3): 137–153. doi:10.1038/s41576-021-00416-x. ISSN 1471-0056. PMID 34608297. https://www.nature.com/articles/s41576-021-00416-x. 
  10. Talbert, Paul B.; Henikoff, Steven (February 2017). "Histone variants on the move: substrates for chromatin dynamics" (in en). Nature Reviews Molecular Cell Biology 18 (2): 115–126. doi:10.1038/nrm.2016.148. ISSN 1471-0072. PMID 27924075. https://www.nature.com/articles/nrm.2016.148. 
  11. 11.0 11.1 Drozdz, Marek; Piekarowicz, Andrzej; Bujnicki, Janusz M.; Radlinska, Monika (March 2012). "Novel non-specific DNA adenine methyltransferases" (in en). Nucleic Acids Research 40 (5): 2119–2130. doi:10.1093/nar/gkr1039. ISSN 1362-4962. PMID 22102579. 
  12. Kong, Yimeng; Cao, Lei; Deikus, Gintaras; Fan, Yu; Mead, Edward A.; Lai, Weiyi; Zhang, Yizhou; Yong, Raymund et al. (2022-02-04). "Critical assessment of DNA adenine methylation in eukaryotes using quantitative deconvolution" (in en). Science 375 (6580): 515–522. doi:10.1126/science.abe7489. ISSN 0036-8075. PMID 35113693. Bibcode2022Sci...375..515K. 
  13. 13.0 13.1 Logsdon, Glennis A.; Vollger, Mitchell R.; Eichler, Evan E. (October 2020). "Long-read human genome sequencing and its applications" (in en). Nature Reviews Genetics 21 (10): 597–614. doi:10.1038/s41576-020-0236-x. ISSN 1471-0056. PMID 32504078. 
  14. 14.0 14.1 Dubocanin, Danilo; Hartley, Gabrielle A.; Sedeño Cortés, Adriana E.; Mao, Yizi; Hedouin, Sabrine; Ranchalis, Jane; Agarwal, Aman; Logsdon, Glennis A. et al. (April 2025). "Conservation of dichromatin organization along regional centromeres" (in en). Cell Genomics 5 (4). doi:10.1016/j.xgen.2025.100819. PMID 40147439. 
  15. 15.0 15.1 15.2 15.3 15.4 15.5 15.6 15.7 Peter, Cyril J.; Agarwal, Aman; Watanabe, Risa; Kassim, Bibi S.; Wang, Xuedi; Lambert, Tova Y.; Javidfar, Behnam; Evans, Viviana et al. (December 2024). "Single chromatin fiber profiling and nucleosome position mapping in the human brain" (in en). Cell Reports Methods 4 (12). doi:10.1016/j.crmeth.2024.100911. PMID 39631398. 
  16. 16.0 16.1 Li, Lei; Chen, Guanyu; Zhu, Guangquan; Wu, Jili; Kui, Xinglong; Zhang, Zihui; Feng, Lingyang; Huang, Minghan et al. (2025-12-02). "Single-molecule views of chromatin accessibility and structure during photomorphogenesis" (in en). Proceedings of the National Academy of Sciences 122 (48). doi:10.1073/pnas.2516708122. ISSN 0027-8424. PMID 41296722. PMC 12685059. https://pnas.org/doi/10.1073/pnas.2516708122. 
  17. Grasberger, Helmut; Dumitrescu, Alexandra M.; Liao, Xiao-Hui; Swanson, Elliott G.; Weiss, Roy E.; Srichomkwun, Panudda; Pappa, Theodora; Chen, Junfeng et al. (May 2024). "STR mutations on chromosome 15q cause thyrotropin resistance by activating a primate-specific enhancer of MIR7-2/MIR1179" (in en). Nature Genetics 56 (5): 877–888. doi:10.1038/s41588-024-01717-7. ISSN 1061-4036. PMID 38714869. 
  18. Minnoye, Liesbeth; Marinov, Georgi K.; Krausgruber, Thomas; Pan, Lixia; Marand, Alexandre P.; Secchia, Stefano; Greenleaf, William J.; Furlong, Eileen E. M. et al. (2021-01-21). "Chromatin accessibility profiling methods" (in en). Nature Reviews Methods Primers 1 (1). doi:10.1038/s43586-020-00008-9. ISSN 2662-8449. PMID 38410680. 
  19. Bubb, Kerry L.; Hamm, Morgan O.; Tullius, Thomas W.; Min, Joseph K.; Ramirez-Corona, Bryan; Mueth, Nicholas A.; Ranchalis, Jane; Mao, Yizi et al. (2025). "The regulatory potential of transposable elements in maize" (in en). Nature Plants 11 (6): 1181–1192. doi:10.1038/s41477-025-02002-z. PMID 40360747. Bibcode2025NatPl..11.1181B.