Abstract
Repeat expansions are responsible for over 40 monogenic disorders, and undoubtedly more pathogenic repeat expansions remain to be discovered. Existing methods for detecting repeat expansions in short-read sequencing data require predefined repeat catalogs. Recent discoveries emphasize the need for methods that do not require pre-specified candidate repeats. To address this need, we introduce ExpansionHunter Denovo, an efficient catalog-free method for genome-wide repeat expansion detection. Analysis of real and simulated data shows that our method can identify large expansions of 41 out of 44 pathogenic repeats, including nine recently reported non-reference repeat expansions not discoverable via existing methods.
Original language | English |
---|---|
Article number | 102 |
Number of pages | 14 |
Journal | Genome Biology |
Volume | 21 |
Issue number | 1 |
DOIs | |
Publication status | Published - 28 Apr 2020 |
Keywords
- Fragile X syndrome
- Friedreich ataxia
- Genome-wide analysis
- Huntington disease
- Myotonic dystrophy type 1
- Repeat expansions
- Short tandem repeats
- Whole-genome sequencing data
- Fragile X Syndrome/genetics
- Humans
- Myotonic Dystrophy/genetics
- Case-Control Studies
- Whole Genome Sequencing
- DNA Repeat Expansion
- Huntington Disease/genetics
- High-Throughput Nucleotide Sequencing
- Software
- Friedreich Ataxia/genetics
- Microsatellite Repeats