diff --git a/README.md b/README.md index 9065bb3b8..257bc4fcc 100644 --- a/README.md +++ b/README.md @@ -1,38 +1,38 @@ -# ![nfcore/test-datasets](docs/images/test-datasets_logo.png) -Test data to be used for automated testing with the nf-core pipelines +# Rare Disease Test Datasets -> ⚠️ **Do not merge your test data to `master`! Each pipeline has a dedicated branch (and a special one for modules)** +This repository contains subsampled long-read sequencing datasets tailored for rare disease analysis. -## Introduction -nf-core is a collection of high quality Nextflow pipelines. This repository contains various files for CI and unit testing of nf-core pipelines and infrastructure. +--- -The principle for nf-core test data is as small as possible, as large as necessary. Please see the [guidelines](https://nf-co.re/docs/contributing/test_data_guidelines) for more detailed information. Always ask for guidance on the [nf-core slack](https://nf-co.re/join) before adding new test data. +## Contents -## Documentation +- `bam_pass/` – subsampled aligned BAM files for variant calling tests +- `spectre/` – VCF files and BED regions for whole-genome CNV testing +- `straglr/` – Chromosome 22 STR test regions +- `test.exclude.bed` – CNV test exclude regions +- `reference/` – reduced human genome references +- `samplesheet_*.csv` – metadata for pipeline test runs -nf-core/test-datasets comes with documentation in the `docs/` directory: +--- -01. [Add a new test dataset](https://github.com/nf-core/test-datasets/blob/master/docs/ADD_NEW_DATA.md) -02. [Use an existing test dataset](https://github.com/nf-core/test-datasets/blob/master/docs/USE_EXISTING_DATA.md) +## Sample Overview -## Downloading test data +| Sample ID | File type | Size (approx.) | Purpose | +|-----------|-------------|----------------|----------------------------------------------| +| Test | BAM | ~100 MB | End-to-end pipeline testing from alignment (minimap2) through variant analysis | +| Reference | FASTA / BED | <5 MB | Subset references (Chromosome 22) for rare disease test runs | -Due the large number of large files in this repository for each pipeline, we highly recommend cloning only the branches you would use. +--- -```bash -git clone --single-branch --branch -``` - -To subsequently clone other branches[^1] +## Usage -```bash -git remote set-branches --add origin [remote-branch] -git fetch -``` +These datasets are intended for automated testing of long-read rare disease pipeline (https://github.com/nf-core/longraredisease). -## Support +The data in this repository will be used to test the pipeline starting from unaligned BAM files (using minimap2). +The associated parameters and settings to run the pipeline can be found in the **test.config** file. -For further information or help, don't hesitate to get in touch on our [Slack organisation](https://nf-co.re/join/slack) (a tool for instant messaging). +Example run: -[^1]: From [stackoverflow](https://stackoverflow.com/a/60846265/11502856) +```bash +nextflow run nf-core/nanoraredx -profile test,docker diff --git a/genome_22/genome_22.fasta b/genome_22/genome_22.fasta new file mode 100755 index 000000000..b0ea69be0 --- /dev/null +++ b/genome_22/genome_22.fasta @@ -0,0 +1,668 @@ +>chr22 +ACTCAAGATAATGATGAGTAAAGAATATATTTCTAACAACAAAAAGGAAATTTGATAGTA +TTTCTAAAGACAAAAAGGAAATTTGTATTCACATTCAGTTAGTCATTCCACCAGAATGAC +TTCATCACACAATATTTTGTGACAAGAACCTGAACAGCCTCATGTTTTACAATATTCTTT +TCATCTTTTATTATATGCACCAAAATTTTCTTTTTTAAATTTTCTTGAACCTCTAAATCT +ACTTTAAAAATTTACCTGATACACTTTTTAAATGGACAAATGCTGAAGGTAGCTGTGTAT +ACAAATGTGACTAGAAGGAAAAAGATGATGTAGAAATACAATAACTCCTTGAGTTGATCA +TTCTGATTGGCATTTATAGAGTAGAAATGTTTTGTAATTACAGAGGAAAAAAGATGGCCT +TTCCTTCAACAGTTATGAGCCGTCAGAATTTTCAAAAATATTGCATTTTGACAATGTAGT +TTCTAGTTTGACAATGATATATTTATCTTCAAAACCAGGAAAATGTAGATAAGGATTTGG +TTTTATAATATTTAAATTCTTATTAAAATGTATAATAAAATTGTTTTCCCCATCACTTTA +TTCTTCTGTAAGTTATTTTACGTTTAAAATGTAAACAAATAAAAATAAGTAAATAAACAG +TAGCAGCTTCTTTTCCTGGTGAATCGAGGATTGAGTATGTATTATATCTTTCCTGGACTA +TTGGAATAACCTCTCCCTCCTTCCACAGAGAAGCCATAATAATCTTTATGAAATACAAAT +CAAATCATGGTATTCATTCTTTAAATAGTTATCAATAAAAATAAAATCCCAACTTTATAC +CCTGTTCTGCAAATTTTAACGTGGTCTGAATTCAGCTTACATTTCTTCTTTCCCTTGTCT +ATTGCCCATCAGGCTCACTGGCCTTATTCCTTCACACCAAACTAGTTATTTCCGGGGTGG +GAGGAAGGCTTGCAGTGTTTTCTCCATCTGCAATAGTCTTTCCCAAATCTTAGTGTGGAT +AAAGTTTCCTTCTTGTTACTTGAATCACAAATACTATGTTCTTAGTCATTCTCTGTTACA +TCATCCAGAGTACATTATATCAATTTTCCAATATTTTTATTTATTTGATTTCCCACTATA +ACAGAGGCTCTGTTAGTGCAGGGTCTTTTACTCTTTTGTAATCCCAACAGCAAGAACAAA +ACAAGGTACATAGTACATATTTAATAAATACCTGTTGAACAAATATGTGCCAGTAATATT +TCTTCATGCTGCTGAATAAGTTAACAGCATATAAACACATACAAACCAAGTGGCATGGAT +GTCTGCTTTGATTTTTAGCCATTTAAAAATATACGTAACCCATCCTAAGGGGTTTATATT +TGTTTTGCATAATACATTAATATGTACTCATTATTCATTACACAGTTAATATATCTATAT +TTGCAGGGAATATACATTGCTTGGAATTATACAAAAAAATATTATTTTTCGTTTTCTAAT +ATTCAGGATACAGTGTTTTAATGGGGGTGTTTCTTCATTCTTTTTTTCTTACTGGTTTTT +ACTTTTTAAATTTGAAAGCCTTGCAGTGATCATAAGGATCTGTTCAGGCAAAGAACATGA +AAGAGTTTAAATTTTTATCATTTTAGTGTTTCTTATTCTCTATATCAAAAACATTCACAG +GTAAGTTAACAAGATCCTCATCAGGAGGAAAAGTAAATTGTTCACTACCATCCTCTAGTA +TCCTAATCTGGTCTTGTTGTTGGCTAACTTCAGCAGTTACTATTCTGTGATTGGTGTAAT +ATTAACCAAATAAATTACTGGATTTGTTCCACAAATATTATATCTTAGATTGGTTCTTTC +CTGTCTCTGAAAATAAAGTCTTGCAATGAGAATAAATTATTTTACAACAGTTAATTAGCA +ATGTAAAGTTTATTGAAAATGTATTTGCTTTTTTTGTAAATCATCTGTGAATCCAGAGGG +GAAAAATATGACAAAGAAAGCTATATAAGATATTATTTTATTTTACAGAGTAACAGACTA +GCTAGAGACAATGAATTAAGGGAAAATGACAAAGAACAGCTCAAAGCAATTTCTACACGA +GATCCTCTCTCTGAAATCACTGCGCAGGAGAAAGATTTTCTATGGACCACAGGTAAGTGC +TAAAATGGAGATTCTCTGTTTCTTTTTCTTTATTACAGAAAAAATAACTGACTTTGGCTG +ATCTCAGCATGTTTTTACCATACCTATTAGAATAAATGAAGCAGAATTTACATGATTTTT +AAACTATAAACATTGCCTTTTTAAAAACAATGGCTGTAAATTGATATTTGTAGAAAATCA +TACTACATTTGTAGTTGGCACATTAAATGCTTTTTCTTACTCTGAATTCCTGATATGACT +TTCTTTAGGATTGTTTAAAATATTCTAGTAGTTTTAGGTCAATTTAGATGTGATTTAGTT +GCTCTAGATATTATAATTTTTAGGGGTTCCCTTTCATTTTTTTCTTACGTTTCTTCAAAT +AGTATAATGCCTTATTTTCATTTATGAAGAAATTACCCTGCTGTTGGTGATACGGGTATA +TTTAAATAAACCAGTTGCAGTGCATTTTTGCAGAAAGTCCATTAAGACATAAATTTTGTC +CAGTAACCACAGTAGAAGTGGTGACTCTATGATTCATTCATGTTGCATAAGTAGGTGAAA +AATATGAGCTATATTCTGTCTGTTAAATGGAATTCTAGAGATGAAGTAGCCCAGGTAAAT +GTATGTTTGAGATTACTAGATAACTGTTGTACAAATTGGTATGTCACTTAAATTGTTTTC +TCTCAGAAAGTCCACATAAATAAATGAAATAGACTAATAATAGTAATATGGTGTAGAAAA +AACTCCCTTAACATTATTTCCATAGATAAAACTAATTAGAACTGTAAATTCTAAGGAGAT +TATTTATCTAAACTAATTTTAAAATCAGAAGTTAAGGCAGTGTTTTAGATGGCTCATTCA +CAACTATCTTTCCCCTTTAAATATGATTTATTGTCTTTCTCATACACAGATGTATTGCTT +GGTAAAAGATTGGCCTCCAATCAAACCTGAACAGGCTATGGAACTTCTGGACTGTAATTA +CCCAGATCCTATGGTTCGAAGTTTTGCTGTTCAGTGCTTGGAAAAATATTTAACAGATAA +CAAACTTTCTCAGTATTTAATTCAGCTAGTACAGGTAAAATAATGTAAAATAGTGAATAA +TGTTTAATTACAATAATAATTTATTTTAGATCCATACAACTTCCTTTTAAAAAACCTACT +GCACTAACTAGTTTTATGCTTAAAAAAAATTATTACCAGTAATATCCACTTTCTTTCTGA +AAAAATTTTCTTTAGATCGGCCATGCAGAAACTGAACCTGATTTGTTTTTTTTGAATCAC +CTAGGTCCTAAAATATGAACAATATTTGGATAACTTGCTTGTGAGATTTTTACTGAAGAA +AGCATTGACTAATCAAAGGATTGGGCACTTTTTCTTTTGGCATTTAAAGTAAGTCTAATT +ATTTTCCCATTAAATTCTTAAGGTACATATTACTTGCTTTCTTAATAGATTTATAAATAT +GTATTACTTATATACTTTTGTTTATGTTTGGCTGGAAGAGTTTTCCATACTAAAACTATT +TTGTACCAGTGATGAGCTTCTCAACTTTTGCTCTTTGAAATTTAAAAAGTAATAAATTCA +AAACTAAATTTCAGTCATGAATGAGAGCTTAAATATTTTTAAAGATTTTTGTTCTACTTA +AGTAAAATTTTCTAGGTCCAGATGAATATTGCTGTAGGTTTCACTGTGTGTATGGATTAA +AATATCCCCAAAAAAAGAAAAAAAATGTTTTACCTTGAGATTCAGAACAATAATGTCAAA +CTCCCGTGGTTCTTACTGAAAAACAAGCTAATTAAGAATAAAAAATGTTTTGTAGAATGT +GATATATGCAGTACTCAAAAGTTACAGGTCATAAACCATATAACTTTTCATAAATTTAGA +AACAGATTTATATCTAATATGATATTTTAAGTGTTAAAATTTAAAAATGGAACCCAGAAG +TTAAGTTGAAAACAAGAAGCGTAGACGTGTGTCAGAAGAGTCAAACAGCATTCACTGAGC +GCTTTGTTCCCTCCCTCTTCATTTGATTATTTTTGTGCTCAATTTCCTTTTTTCATGCTT +TTATATCTTGTACTGAGATTAGTCAATGAAAACTAGTTGAAATAAACCTAAAAACTAGAT +GTTTATTTAATCACATATTCAGGAACTACCTGAAACTCATGGTGGTTTTGCTTCTAAATT +ACAGGTTTTGAATAATGTTATTATTAGTATGATTGTAACATTTATTGGATTTCAAAAATG +AGTGTTTAAATTGTTTAGCAAAGATTATTTGTATACTGATTTAAGACTATATATATATTT +TTCTAATTTTGCATGATTCTTTTAGATCTGAGATGCACAATAAAACACTTAGCCAGAGGT +TTGGCCTGCTTTTGGAGTCCTATTGTCGTGCATGTGGGATGTATTTGAAGCACCTGAATA +GGCAAGTCGAGGCAATGGAAAAGCTCATTAACTTAACTGACATTGTCAAACAGGAGAAGA +AGGATGAAACACAAAAGTTGTGTGACTCTAGTCTGTGTTTGAGACTCTTTTCACTGCAGT +GGGGCAGAGTTGTTTAGAAGCCCAGTGTATATACAGATCATGGTCCTTGGAATCAAGCAG +ATTAGGATTTGGAACCAAGTTCCACTGCCTCTCATCTGTGTAGTGTTAGACACGTTATGC +AGGCTCTCAAGACTCATTTTCTTTGTCTGTAAAATGGGAATAATACCTGCTTCGTAAGGC +CATTGTGAGAATTAAATTACATGAGATATGCAAAGAACCTATCACAATCCTTGGAACACA +GAAGGTGCCCAATAAATGTTAGATCCCTTTACTTTCCCTTCCTTTCTCTTATTCAGGTCC +CTAAGTATTTACAGTGATTATTTCCTTATTCTGTCATTTATTATCTCTCAGTAATGACCC +TGAAAATGAGTGGAAAGAAGTTAGTTTTTACATTTCCAAGTTTAAAATGGATTTCGAGTC +ACTCAGTAAATATATCACACCCTCTAGTCATCTGCTGTCTAGCTTAGTGTAACTAAGAGT +AGGAAATACAATGTAAACTTTTTTTTTTGAGACAGGGTCTGGCTCTTTTGCCCGGCCTGG +AATGCAGTGGTGCAATTTCGGCTCACTGCAGCCTTGACCTCCTGGGTTCAAGCCATCCTC +CCACCTCAGCCTCCTGAGTAGCTAGGACTATAGGAGCATGCCACCACTCCCAGCTAATTT +TTGTATTTTTAGTAGAGACAGTGTTCTATTCTGCTTTATATTAAAAGCCCCTTAGAAAAT +GGGAACCTGGTGAATATATAATGAATTGTAAAATATTTTAATGTGTAACTTTTTCAACTG +TGAAACTGACTACTGATTTTTTGATGAAAACAGCTGCTGATAAAGTATTTTGTGTAAAGT +GTAGTTCTTATTAATCAGGAAAATGATGACTTGATTAGACTGTATATGCCCTCTTGGATT +TTATTTTAAATGGATTGGTGACTTTCACATAGGTAAAACACAGTCCATCTGTATTCTTTT +TTCCATCAAAAAGCGAGTGATTTAGAATTATAAAAAAATTTGTGAGCAGCCTATTTGAAA +GGCATCATGGAAATTTCACAGCACAATAACATGGATTTGTTTTTTTCTTAATGATGTAAA +TCCGTTTAATTCATATTTTGATCAATAGCCCATGCTTGCCAACTCTGAAGAAATTTAATT +TCCAGCAGTATTTTAAAGCTAGCCTGTTAACTTTTTCTGAATATTTAAAGTTCCTCTTTT +TTCTATGTCTGCACAAACTGCAGACCTGGGCTGGACCCACATACTCAAGAGTCCACCTTA +AGAAATTATTTTGATGTCCAAGACATCACTAAAATATTTCAGTTTAAAGATAACATGTGG +TGTTAATAGATTGTGGTGCTTTTACTATTTAAAGACAACTTTCATACTTCAGATGTTTTT +GAGAAGAGGGGAATGTGAGGGGAGGGGGCAGAACAGGGAGGAGTTTGAATGAATTACATT +CTTTATATCCATCCTGCTCATTTGGGGCATGTCTTTAAGAGAAGGCTGAAAGTTGTGAGA +GTATATTGTATACCGTAAGAGAATCAACTCTTCATCATGGATGGGATTGTGAAGGCTGAA +CTGTAAAAGTCAGCATTGACAGCATCCTCAATTAATAATTCTTGGTGACAGAATAATACA +GCTGGGCTGTTTTATAAATATAAACAATACCATTTTTAATTATTACATTAAAAATTTTAA +ATATATCTATGTGCCATGGCCTGGGAAGCCTGTTTTCTATTTTCATAAAAATTATTTTTA +CTGTATGAAAAGATTATGGGGTTTAGCTCAAAATATCTGTGGTCCTGATAAAATTGGATT +GGTAACTCTACCTCAGAAGGAAAATGGGAAAAAAAAATAGATGAGTCACAATTCAATACT +TCAAGCTCAGAAACTGTGCAGATCACTGAATTTTAGATTTATAAAGTCAGAGTTGGCATG +CGTTGTTTTTAATGATATGGAAGACCTTAAGAAAAAAACTTGGCTGAAGTTTAATCGTTG +GTCCAGCCATTTGAAAAAGGCAATAGTTCGAGGAGGTTTCCGAATTCGGCATTTGAAATT +CATTTTGTTCTCTCTTCTTCATTATTAGTGCATTTGGTGTGTGTATACTTGCACACAATT +CTGTTTGTGTACACACTGCTTGCTAAGCCCTAGTCAAGAGGCATCTTTTATAAAAGGTGT +AAAGAAATATCAAGGTTCTAAAATTCGGAAGAGTTTAGAATTTATTAGGAGTTTCCCAAG +TTGGGATGTTAGTCTTTAAATAAACTTCATGCACCTATTCCACTTAAGGTTTTGCACCTC +CTTTTTATTAGTGCAGTGCCATTTCTTCTGCTTGATTTTAGGTATGTTAATATTCCAGCC +TTGCTAGTTAGCATAAAGTGACAGGTGTGAGCCATGAGGAAATTTTCTGACTTAATTTTT +ATACAACTACATATGAGTTTTAGTGGAGAAAAAAAATTAGTCCCTTGTGCATATATAGTA +GTTAGGTAAATGATTTTTCTACCAACAGTGTACTCCATTCCTCATGTAGGTAAGTACAGA +AAAGGTTTTTAAATGTATTTTGTTAGCCAGTTAAAGTCTATGAATCTATCTGCAACCTTA +TTTAATCTGTCACTACAATAATTTTGTGGTTATGCTAAGAACCATGTATACTTTTAGGTA +TTCTTATTTTTGTCAATTTTTCTAGGTTAGCAAGGAGGCAGAAAAGCTTCACTGTTTCAT +ATTAAAATATAATTAGACTAAACTTAATTCTAGTATGAATTTCCAAAATCATTATCTATT +TATTTCATTTTTATTTAATTTTGTTTTTAGTTCATTTTTAAAAGTCCCTTGTTCAATTTA +ATTTATGTTCCTAAGAGTGGTTGGAGAACTTGGCCTTCATCTGATTTCAAAAACATTTTG +AGTTTCAAATGAAGTTAATGGTTTCAGTGTGATTCAGTCCTCAGACCTAATTGGGTTGAA +TAAAATCTAAAAGAATATACCCTTTTGGAGCATAACATTTTAATACCTTGAGGAATGTGG +CACTACCAAAAGAAGACTACTAACACGTCAGATGTTCACCTGGAAGCTTTAACAAGAAAT +TCGAACCACCCTTTTGGCCCCATTAATTGTAGCAAGTTTATTTCTCTATATTTTGTCATT +CAGTGAATTGAAGTCCTGTGGTATACTGCATTCATTAGAAGAAAAACGTTTTTAATGTCC +TTTTAATGATGGCCCAGAAAGCATTTGACACAGCAAGATGCATGTATTATTATATTGAGA +ATACAGAATAATAACAGTATCACTAAATTTAAGACCTCTTCCCAGTCTTGCTGTTCCTAG +CAAGAAGTTTGGCCCGTGACTGCACTTACTGTTTATGCTCATCAGAAACTGTCAATGTCT +GCTTTTCTTTAACTCTGCAGTCTGTAACATCATGCTGTTTATTAAAAAAAAAGAAAAATT +ACTTTGACTTGTGTCCAAACAATCCTTAGTGTACTACATAAGCAAAAAACTGTGATAATT +CTCTTTTGCCATTCCTTTTGAAAAGCAAGCCAGTGTTGCTAAAATCAAAATTTAGCTGAA +TTTGAGTTCTTTTCAGTAATGACTAAGAATACTTGATTGAAAATCTGAAACTATTATACC +TTAAAAGCCAATTTTTCTGCCCCAGTAAAGTGATGAATATTAAAGAAATGTATGTTTAAA +TATTTACTTCCTTTAAGCATAAAGAATTATATGCTTGTATTTTAAGAAATATATGTATGT +ATACATACATATGAATGTATGTATATGCAATAGGTAAGTGGACTTTTTTCCAAGTCATTT +GAAGATCAGAACCTAGAAATGAAGTTAGGCTACAAGCAAACTGGTTTTGCTTTCAGTTCT +CATAAACATTGCAAAAGGTAAGTGTGGGCTTTTCTTTGACCATTAATGCACATAGGCATT +AACAACTTAGTATTTCTGAGCAATTAAGCAAATAATTACTTACATTTTATTTATTTGCCA +AATGGTTTAAATAATTTTGAATTGACTTTGCTCTCCAGGGATAATATCTCTCTTTGCTGG +AATGATTCAGGTAGCTCCTATCTAAATGGAAAACTGTGGTAATTGAAACACACACTTTAC +ATTTTAAATTAGCAGTTTTGAATTTGTTAGGGAAAAAAATCCCAGCAATTGCATATTGTT +AGGTAGAAGTCAAATTTACAAAGAAACGGAATAGAGATGTGCCCTTGAGAAAAGTGTAGA +ATCTCAATGTGCAGATGATTTAAAATGTGCGTGCATATAAAATGTTCATGTGTACTTACA +TACTTTATTACAGAGAAGTCTTTGGTATACAAAATAGTTTACCACAACCTTTTAAACAGC +AGGTTCTGGGCCTTAAATGCGTATCACATTTAGCCAAGAGAACTCGGGTAGGGGCATGGA +AAATGAACTGCAGCTCCCTATCCCTAGCCTCTATACCAGCTGTTCAATGAAAAGTACCAA +GGCTCACTGAATGTTATAACCTAGCAGATTGTTACATAAATGATCTAACATTTTTGAGCA +CCGCTACTGGATGCTAGAAGCTAAGCTAAAGTGTTTCACATGCCCTACTTTGCTTATTCT +ATAAAATAACTGCGTGAAAGAACAGGTTATCCCCATTTTATAGATGAGAAAAGAAAGGTT +TACACAGGTTAGCTTATTTGCCCAAAGTTGTGATTATGGCCTACAAAGTCAAATAAATCC +TACTCTGAGACACATGTTCTTTCCACCATTGCACACTAGAAAGGAAAACACCAAGATTAT +TCATTACTGATCAAGTCAATATTGCTGTATTCAGCTAATTTAGTAATATGTGTCTTGAAA +TTAATTGCTAAAAGGGATTAAACTGACTTAGAATCAGTTTTTTGTTTGATTACATCTACA +TACAAAAGTAGCTTCAAATGTCTCATTCTACTGTCCATAATTTAAGATTTTTGAGTATAA +TACAATTTTAAAGATACTTTGAGGCACTTTGGAAAATCAGACCAAAATCTCTTTTCCACT +CACAGATTCGGCTTAATCAATCTGGAAAGCATTTGTTGAGAGCCTTATGACATCATTTAA +TAACCACGGTTGATTCATTAATTAAAGTACAGACAATTGTTGACTATCCATGTGGGACTT +TTCTATTAGGTTGACGCAAAAATAATTGCGGTTTTTCGCCATTAAAGGTTAACAGCGAAA +ACTGGAATTACTTTTGCACCAGCCTAATACGATGTGGATCATCTGAGATGAATGTTGAAA +TCCAGTATAGCTTCTTCATATTTCTGGCCCATTTTTCCCACCAGAAAGTGCACAAAGTGA +AATGAGCTTATGAAAAGCTTAATTAACTAGAAAAATGTTACTGAAAGAAAAATTACATGG +TACATGACAAGGCTAAATACTAGTAACTCTAAACTTAGTGAATTTTCTAGGCAGCAGCTT +TCCTCTGCTGTCTAGACTGGTAAAGAACAAACTAAGGCCAGGCGCAGTGGCTCATGCCTG +TAATCCCAGCACTTTGGGAGGCTGAGGCGGCCAAATCACCTGAGGTCAGGAGTTCAAGAC +CAGCCTGATCAACATGGTGAAACCCTGTCTACACTAAAAATATAAAAATTAGCTGGGCGT +GGTGGTGCACACCTGTAATCCCAGCTACTTAGGAAGCTGAAGCAGGAGAATTGCTTGAAC +CCAGGAGGCAGAGGTTGCAGTGAGCCAAGATCACGCCACTGTGCTCCAGCCTGGGCTACA +AGAGCAAAACTCCATCTCAAAAAGGAAAAAAAAGAAAAAAACTATAATAAATATGTTAGG +TCCATGTTTTCTTAAGTTTTCTACCGGATTTTTATCTTCGTATAGTGAACGAACTGTTAA +GAACTTTTTTATGAGAAATATTTTAGTATGACTATATTGCATAGAGTTAGGCTGATGGTT +CAGTGTTCAGTAGGTTAGATACCCTCATTGTTTATTTCCATATTGACTGGTTCTAGCTAG +AGCTGAAATTAGGCAAAGAATATCTTGAACTCATTTTGCTATACAGGAAAAAAGTGCTTC +CTTAGCTCATTTGGAAAGAGATTGAGATTAGAAAAGATGGTTAATTTGTATGTATTTATA +GAAATAAATAGAATACAAAATGAGGCTTTTAAATTTTTTCCCACATGAAAATATGATACT +TTAATCATTACGTTTTACATTGTTAGTTTGCAGACAGGCATAATTAGGTCCTCAGTTGCA +GAAATCACAGACATCTGAAGGCCAGCCCTTTAATTTGGCCACCGTCTTAAGATTTCTCTG +CTCCTTCCTTTGCTCCTCCTCCTACTGCACAGTTTGAACTGATGCTGTTCTATATAAGGT +ACTTTTCCACCTACCTCATCTCTGACTACAGTGCTATATTTTTCACACAGTAAGGACAGG +TGTTGTGTTAATCTCACCATGCCAACAATCAGGGCACCACCTAGCAGAGTCAGTGAAGGC +CAAAATAAACAGTGGAAGATAGCCATTTGGTCATACTTTTTTATAAGAATGACATCTTCA +GATTGGCTGGCTGGACTGTAGAAGCATGAAAAGGGGGTTCCATTTTTGTGATCGAAGAAT +TCTTTTATGTCCAGAGCACTGTTGAGCAAATCATTTCTATCTTGGTGGCACTTAGGTGTG +TAAAAGCACTAGGAATATGGAAGAGGGAAAAAGATAAAGGCACTGTCACCAATACCAAAT +ACTTAACAGTTTCTAATTATGAAATAGCTTCAGGCTGAAGTTATTAGTGGGCAGTTTCAA +TCTTAGAAGGTGGTAAAATATTACATAGCTCATGGGAAAGGGTTGATTGGAGGGCCACAG +TGAAATGGCCATTTCCAGTCATTAAGCAAGGATGTGGAAGAGAATTCTTAGTTTATATGA +CATTGCAGGAGAGTCAGTGACCAATTTCATAAGGAATATGACTCCTCCCTACATGCAGGT +TCTTGGACTCTTGGACAGTATGAATCCGTTTGTCCATTGAACAAAAATGTATTGAGCCTT +ACTATGAGCTTTCAACACCTAGTAATGCCTCTGTGGTCTCTGTCTTGATCTCCTGTAGCA +AAATATTACCCTGAAGAAAAGCACGTTGAGGCTTTTGCTCTAGACTCACAGACAGGGAGC +CCCACCTGGACTTTGGTTCCTGGGAGACAGAACCAGTGGAGAAGGGAGCTCTGTCAGCTG +GTGACTTTTTTCAAAAAAGCTTGAGGTTTATTACCATATCCATTAGGTACTTGAGGTACT +GTGCTAAAGGCCTACAAACTGTTTGAAATCTTAAAAATCATTGCATCCAAAATAGAAAAC +AAAAGTCATCAGATTGAAATTGATGCTTAAAGACAATAAAGTGTAACATGTCAACTAATC +TAACACAACTCAACTTTTATAGTTAGGTATAAATATAAATTTTAAATCATATGAAAGACT +ATACTTTCAGGGATCATTTCTATAATTCGTTAAATCATATGAACCCATTGTGTAACTTAT +TAAAATAAAAATAATCTTTACATTTATTTGATAAGAAAAAATTACTCGCTTGATTCAAGG +GAGACTGTGGTACACTGTAGCATATGTTATATGGCGCGGAGTGGAATCTCCAAAAGAAAG +ACTCCCCACAAATGACTACTCATTGGCTCAGCCTATAAATTCCAGACACCAAGTTGTGAA +ATTGGAATAATTTCTCTCCTTTCTATATACCCCATTTCTCCACCAAGAAGAAAGCTTCAT +TTATCCTGATTTGATCACTATAAAAATGTTCACTCCAAAAAAATAGATTTATCCCTAAAG +ACAGCCCTGGGTTATTTATGTACCCTGCTAGGGACAGTCTGGCAGGGAAAGGTTGCTGTC +ATAAGAACTCTTTAAACTTTACAATACCTTGGGATTTATCTGGACAGCCTCTTCATTATA +ATGTAGGAGAGCTTTCTGAGCTGAATGGGTGAGGTTCACAAACACCCGAAGACACGAGTA +CTTCCCGTGACCACGGCAGTGCACACCACAGGTGAAGGCACAGTCCAGCCAGTCGTCCAT +GATATCTGTGTGGATGGCAGTGCAGGTTGATTCTTCTCTCCGAATGCTTCAATTTGAAAA +AAAAAAAAATGTTCTTCACTTACTAGAAAATTTCGTTCTACATTTTGGTGCGGTTATGAG +CTTATGTACACAATTAGCTGGGATTACAGGCGCTCAGCTGCCATGTCCAGCTAATTTTTG +TATTTTTAGTAGAGACAGGGTGTTGGCCAGGCTCGTCTCCAACTCCTGACCTCAAGTGAT +CCACCCACCTTGGCCTCCCAAAGTGCTGGGATTACAGGCATGAGCCACTGCACCTGGCCC +AAATACTATGTTTTATCAATTCTAAAGTGCACTTTAGTATTTACATTTTAATATAACTAA +AATCAATATGTATTTTGCAATCAATGGCATCTTGCTATTATTTGAAAACATTTCTTTAAT +AGTCTGTAAAATAATGGAACATGCCCAGATGCAGTGGCTTATGCCTGTAATCCCAGCACT +TTGAAGGGTCAAGATAGGAGGATCGCTTGAGCCCAGGAGCTGGAGACCAGCCTGGCCAAT +ATAGTGACAGAATAAATAAATAAGTAAATAAAATAATGGAAAATCTCACAAATGGTGATG +TTTTAGGTTCGACAAAATACATTAACTAGCCCATTTAGTTTTCTGAAATTATTTTGATGT +TATTGCTTACAATATTTGTTCTGTGGTACACAACCATAGGATTAATAATATTGATGAAAA +TAATAAAAGAATAATAAGCATGTATTGAGCTCTTCCTGTGTGAAGTTCTGGACAAATCCT +CATAAAGCCTTAAAAGGCAGATACTAGGCTGGGCACGGTGGCTCATGCCTGTAATCCCAG +CACTTTGGGAGGCCGAGGCAGGCAGATCACGCGGTCAGGAGATTGAGACCATCCTGGCTA +ACATGATGAAACACGGTCTCTACTAAAAATACAAAAAATTAGCCAGGCATGGTGGCACGT +GCCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAAAATCGCTTGAACCTGGGAGGCT +GAGGTTGCAGTGAGCCAAGATCGCACCACTGCTCTCCAGCCTGGGCGACAGAGCAAGACT +CTGTCTTAAAAAAAAAAAAAAAAAAAAGAAAGAAACAGGCAGATACTAGCCCAGGCACGG +TGGCTCATGCCTGTAATCCCACACCTTCGAAGGCCCAGGCGGGTGGATTATCTGAGGTCA +GGAGTTTGAGACCAGCCTGACCAACATTGTGAAACCCTGTCTCTACTAAAAATACAAAAA +TATTAGCCAGGTGTGGTGACAGGTGCCTGTAATTCCAGCTACTCAGGAGGCTAAGGCAGG +AGAATCGCTTGAACCCGGGAGGCGGAGGTTGCAGTGAGCTGAGATTGTGCCACTTTACTC +CAGCCTAGGTGACAGAGGAAGACTCTGTCTCAAAAAAAACAAACAAACAACAACAACAAC +ATCAAAAAGAAACCTATAGTAATAAAATTGAAATAGAAGGAGGTTTGCAATCAAAATGAC +TGACTAGGAATGAAATAGGAAACATAATATTTTGCATCTGCATAGGGAAGTCTGAGATTG +GCTGATCTTGTTCTCTTCTGTAGGGGAAATACTAGTCCAGAACTTGGGGTGCCTGCCAAG +AGGGGAGCAGCCACAGTAGGAAAGGGGGACTCTGGAATGCTAGGGTTCTGGGGTCTGTGG +ACACAGGAGGCAGAGGACATGTGTTAAGATGTTTTAAGAAATGAATGTTGAACTGGATAT +GAAAATATTTTTCAGCCGGGCGCAGTGGCTCACGTCTGTAATCCCAGTACTTTGGGAGGC +TGAGGCGGGTGGATCATGAGGTCAGGAGATCGAGACCATCCTGGCTAACACGGTGAAACC +CCGTCCGTCTCTACTGAAAATACAAAAAGTTAGCCAGGCGTGGTGGCGGAGGCCTGTAAT +CCCAGTTACTCTGGCGGCTGAAGCAGGAGAATGGCGTGAACCTGGGAGACGGAGCTTGCA +GTGAGCCGAGATTGCACCAGTGCACTCTAGCCTGGGCGACAGAGGGAGACTCCATCTAAA +AAAAAAAAAAAAAGAAAGAAAATATTTTTCACTATAGAGAGGCATATGTCCCCTGAACTT +GCCGGGATCCACCTTTCCTGCTGGTGCATTCTGTGAGTTAGAAGAAAACTTCCAAAGAGC +CATTTTTTCCACCCTGTCTACTGTATAAAATTGCTTCTCAAACATGTGCTGCATTGCAGA +GGATTACCATTGTTTTGCTAACCAGCGTCTGGTCTTTCTTATGTGGCGCTGCAATTACTA +GTGTCAAACCCTGTTGGTAATACCCAGAGGACGGTGTCTGAAGTCTTTACTCAATATTCA +CATTTGGCCGGGTGTGGTGGCTCACACCTGTAATCCCAGCACTTTCGGAAGCAGAGGCAG +GCGGATCACTTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACTCCATCT +CTACTAAAAATACAAAAATTAGCCGGGTATGGTGGCGGGTGCCTGTAATCTCAGCTACTA +GGGAGGCTGAGACAGGAGAATCACTTGAACCCAGGAGGTGGAGGTTACAGTGAGCCAAGA +TTGTGCCACTGTACTCCAGCCTGGGGGAAAATTCACATTTGTAGAGAGTTTAAATTCTTT +TTTGATACGGAGTCTCGCTCTGTTGCCCGGGCTGGAGTGCAGTGGCAGGGTCTTGACTCA +CTACAACCTCTGCCTCCCAGGCTCAAGGGATTCTCCTGCTTTAGCCTCCTGAGTAGTTGG +GATTACAGGCACCCACCAAAACACCTGGGCAATTTTTGTATTTTTATTAGAGACAGGGTT +TCACCATGTTGTCCAGGCTGATCTGAAACTCCTGACCTCAGGTGATCTGCCTGCCCTGGC +CTCCCAAAGTGCTGGGATTACAGGCATGAGCCACCACGCCCGGCCGAGAGTTTAAATTCT +TAAGTCCTACACTCCAATGTGTGGGAAGTATTCGTGCTATGCTTTTATAACTAAATCATC +TCAGTATTTCTATTTCTAGCCCCCTTTTTCTGCCTGATGGTAAGATACTTAATCTAGTCA +ATTCCAGGTAAACTTTGGCCTTTTATGATTTTTCCTGATCAGGCCAAACCTCAACCAAGT +CCCTTCTTGATCTTCTCCTTCACCTCCTTCTCTCATTCACCCGACAATTAGCCTCCAGTC +CACGGGCTGATGCAGCATCTTGGTGTCCTGTGGTCTGAGGTCATTTTCTGTCTTTCTCAA +GCCTCAGCTAAAGTTTACAATCCTACCTTTTCTCATGACCTTGAAATGCCCTAAGGTTCA +GGGGCTTCATGGTTGCTGCTTCATGGGGGAACCTGGCTGTTCTCTGAGGCTGCTCGGCCG +CGAACACCCCATCAACTACCCGGGGCCCATCTACGCCCGAGGCCTCAGCCATTCCTGCTC +TACAGCTCTGCTGTCCCATTGGCACAGGGAACTTCTTGGGGCCCCAGGGTTCCAGATTGG +AAGCAGAGAATCTCCTCTGTTCTCAGACCCCCAAACTTTGTTGTGGATTCTAATTGTCCT +TTCCCCCATCTCACTCCTTGGAACCCACTGGGAGGTGAGTAGAATCCCTGTCAGAGATTC +TACCACCATCTCCCTCATTCTTACCCTAACTTTCTTCCTCTTCCTCCCTAGTTAGGAAAG +AGGATCTTTAGCCTGCGGCGGGGGGGTGGGGGTGGGGATGCTTGATGTTTCAGGGGAAAA +GGTGACTCAGCTACTTTTGGAATATCTGTCATACCTGTCTACTGGTGCAATGAGCTGGGA +TCACACCACTACACTCCAGCCTGGGTGACAGAGCAAGATTCCATCTCAAAAATAAATAAA +TAAATAAATAAAGACTCTGGAGAAACAACTCAATACACATGAGAAGAGGCTGGCCCATGT +AGGGAAAGGACTGGCAAACTATGACAACTCTTTTCTGTTGTTTTGTTTTCAATAGTCTCT +TCACAGTTCTTTTCACAGTTTGGAATTGATACCTTTTTCTCTTCATCAGAACTCCAATGT +TTTTGTAGATTGAAGTCTTTTTTTTTTTTTTTCTTGAGAAAGGGTCTCACTTTGTCACCC +AGGCTGGAGTGCAGTGGACCAATCACTGCTCACTGCAGCCTCGACTTCCTGGGCTCAAGA +AATCCTTCCACCTCAGCCCCCCAGTAGCTAGGACTACAGGTGTTCACCACCATGCCCAGT +TAATTTTTATTTTTTAATGTATTATTATTATTATTATTATTATTATTATTATTATTATTA +TTATTTTGAGATGGAGTCTTGCTCTGTTGCCCAGGCTGGAGTGCAGCGGCACCATCTCGG +CTCACTGCAACCTCTGCCTCCTGGGTTCAAGAGATTCTCTTGCCTCAGCCTTCCAAGTAA +GTGGGACTACAGGTGCATGCCCCCACACCTGGGTAATTTATTTTTTTGTAGAAAAGGGGT +ATCAGTGTGCTGTCCAGGCTGGTCTCAAACTCCTAACCTCGAGTGATCTGCCTGCCTTGG +CCTTCCAAACTACTGGGATTAGAGGTAATGAGTCACCATGACTGGCCTACGTATAGCCCA +AATGGATGAGCAGTTCCCAAGGCTCATTCCCAGCCTCCACTATCCAAGTCAGCCTCTCAT +CTCCTTCATTTCCCAGGACTTAGTTCTCATTTTCCTCCCCTGTTTTCTCCGGATTGTGGC +TATTGTTCCCTGGTTGCTAGATCAACCTGGAGCACAGTAAAGCAGTGTCACAAAGCTGGA +AGGGGTCTGGGATGAGTCCACCAGCTACAAGTTCTTATAGAAAACGTACTCCGGGGATGG +CCGGGCCCAGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGCGGATC +CCCTGAGGTTGGGAGTTCGAGACCAGCCTGACCAACATGGAGAAACCCCGTCTCTACTAA +AAATACAAAATTAGCTGGGTGTGGTGGCACATGCCTGTAATCCCAGCTACTAGGGAGGCT +GAGGCAGGGGAATCGCTTGAACCTGGGAGGCGGAGGTTGCGGTGAGCCAAGATTATGCCA +TTGCACTCCAGCCTGGGCAACAAGAGTGAAACTCCATCTCAAAAAAAAAAAAAAAAAAGA +AAATGTACTCCAGGAATTGTCATTTCTGAAATTCAACAGCTTCTGGAATTGAAGCAAACA +GCTCATCTTGGAAGAGAAATATGTAGCCAACTCCAAAGCCAAAGCCTTTGAGTATTGAGA +CCTAGCATGCTAGGAGACCTTGATCCTGTAACCTCAGAAGAAGAATCTGGATCTGGCCAA +ATTGAGGTCAAATTCTGCTCAACTTCTCCATAGTCAGTAGGAGAAAAAAACCAACTTGAT +GTTTGAGTCATATGTTTTGACAACTAAAGAGGACACTTATGCTGGGGTCGGTGGTTCATG +CCTGTAATCCCAGCACTTTGGGAGGTCGAGGCGGGTGAATCATTTGAGGTCAGGGGTTCG +AGACCAGCCTGGCCAACATGGTGAAACCCCGTCTCTACAAAAAATTCAAAAAAATTGGCT +GGGGGCAGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCTGAGATGGGTGGATCAC +GAGGTCAGGAGTTCAAGACCAGCCTGGCCATTATGGTGAGACCCTGTCTCTACTAAAAAT +ACAAAAATGATCCGGGCATGGTGGCGCACGCCTGTGGTCCCAGCTACTCAGGAGGCTGAG +ACAGAAGAATCTCTTGAACCTGGGAGGTGGAGGTTGCAGTGAGCCGAGATCACGCCACTG +CACTCCAGGCTGGGTGACAGAGTGAGATGTCATCTCAAAAAATAAATAAATAAATAAATA +AAATTAGTCTGACTTAGTGGCGGGCCCCTGTAATCCCAGCTACTGGGAGGCTGAGGCAGG +AGAATCACTTGAACCCGGGAGGTGGATGCAGTGAGCCAAGATCATGCCACTGCACTCTAG +CCTGGGCGAGTGAGACTCCATCTCAAAAAAAAAAAAAAAAAAAAAAGACACTTAAAGATG +ACATTAAAGAGGATACTTAGATTCTAGACAAAATCAAGATATAGCAAATTGGGGTGGGAC +ACACCTGTAATCTCAGCATTTGGGGAGGCCGAGGCAGGTGGATCACCTGAGGTCCAAAGT +TTGAGACCACCCTGACCAACATGGCGAAACCCCGTCTCTACTAAAAATACAAAAATTAGC +CAGGCATGGTGGTGGACACCTGTAGTCCCAGCTACTCAGGAGGCTGAGGCAGGAGAATCA +TTTGAGCCCAGGAGGCAGAGGTTGCAGTGAGCTGAGACTGCACTGCTGCACTGGTGCCTG +GGCCACACCAGTCACTATGCCTGGGTGACAGAGCAAGACTCTGTCTCAAAATAAATAAAT +AAATAAATAAAATTTTGTTTTGCTGTGTTGCGGCTAATATGCGTGCTATAAGACAATGGT +TTCTTGAGTCTCATTCTCTCTGCATATGCCTAAAGCTTTTTTATTTTTATGATTCTAAAA +GATTGTACCTTCTCATCTCCTAGATTCTGTCCCATAGGTTCTGATTTTTCCTAGAGTAAC +TTGGAAGTTAAAAAAGTGGAAAAAGCTTTGCGTATTAGGTGCCAAACCCACTCAGCTCTG +CTCAAACCCCTTCTTTAATGCCCAAGGTTGTCCAATCCTAGCCCTTCCCCCTACCCTCAG +CTTTCTCCTCACCTACACAGCAACCTTAGTATAGTCCTAAAGTATGTGTTCTTATCTTCT +GTTATCTATGCCAAGGATGTTTGCTGGTTTTGTTTTGTTTTGTTGAGACAGGGTCTTGCT +CTGTCTCTTAGGCTGGAGTGCAGTGGCACAATCACAGCTCACTGCAACCTCGATCTCCTG +GGCTTAAGTGATCCCCCCACTCAGCCTCCTGAGTAGCTGGGACTACAGGTATGCATCACC +ACGCCTGGCTAATTTTTTTTTTTTTTTTTTTTTTTGAGGCAGAGTTTTGCTCTTGTTGCC +CAGGCTGGGGTACAATAGTGTCATCTCAGCTCACCACAACCTCTGCCTCCCAGGTTCAAG +CAATTCTCCTGCCTCAGCCTCTCAAATAGCTGGGATTACAGGCATGTGCCATCACATCCG +GCTACTGTTTTGTATTTTTAGTAGAGATGGGGTTTCTCCACGTTGGCCAGGCTGGTCTTG +AACTCCTGACCTCAGCTGATCCACCCACCTTGGGCTCCCAAAGTGCTGGGATTAAAGGCT +TGAGCCACCATGCCCGGCCCATGCCTGGCTAATTTTTTTTAATTTTTATTTTTGTAGAGA +TAGGGTCTCACTATGTTGTCCAGGCTAGTCTTGAACTCCTGGACTCAAGCGATCTTCCTG +TCTCAGCCTCCCAAAGTGCAGGAATTATAGGCATGAGCCACTTTGCCAGGCAAGGATTTT +TTTCTTTTTAAGTTACATTTCTGCCTGCCACCACAGCAGCTCTTTCTCCTGCTCTCTCTC +TCTCTCTGTGCTTTAAGATGATAGTCCCTTCTTTTTTTTCAAATAACCACAACAGGAAGG +ACTGACCACTCTTGTAAGCTGCAACTGATGTTTTCAGACTCCTAAAGTGACATCTAGACA +TAAGTCCATATATGTCAGAATATCATGCAGGGAATGCTCAAATAGTTGGGAAGAGATTGC +TGCACTGTGTTTTGCACGCCCAAAGCCCACATAGGTACTCAGTTTAAAAATCTTAATAGA +ATTGAATCCTGCTCTTATCATAGGAAAGGAAGAGCATCTGATAGAAACACAAAATGAAAA +GGTCAAGAACTGGCTGGGCACAGTGGCTCTCGCCTGTAATCCCAGCACTTTGGGAGGCTG +AGGCGGGAGGATCATGAGGTCAGGAGTTCGAGACCAGCCTGGTCAATATGGTGAAACCCC +GTCTCTACTAAAAATACAAAAAATAGCTGGGCGTGGTGGCGCGCACCTGTAGTCCCAGCT +ATTCAGGAGGCTGAGGCAGGAAAATCGCTTGAACCTGGGAGGCGGAGGTTGCAGTGAGCC +AAGATCACGCCACTGCACACCAGCCTGGGCAACAGAGCAAGACTCCGTCTCTCAAAAAAA +AAAAACAAAAAAAGTCGAGAACTGGAAAGGAACTAAGCGCATGAAAAGAAATTTTATGTT +CCTTCATGTTTTTATTTAAAGAAAGTGAATCAAGTACCAAACACGGAATAAAGGCAAACA +TTCATTTTTGGGGTGATTGTTCCCTTCTTGGCAATCCCTGTTTTATTGAGGGTATCACTA +GTTATTCAATCCAAGGATTTTTTTTGTTTCCACAGGAGGTGGGTGTTTCTTTGTCTTCTT +AGAGTCAGGATTCCAGATCTCCTGATGTGTGGGACTTTTCTTGGCCACTACGATTTCATC +TACAGTCACGAGCTGTAGCACCACCTCAGCCACTGCTCGAAATCCTTGGGCTTTGACTAT +TAGGGTGTCCCACACCCCTTCCTGGGCCACATTTATTATCCCTTCAGTTCCCACACCCAT +TAGGAGGTTCCCACCTTGGTGCACTCCACTCATTTCTGCCATCACGTCTGAGACAGCTAA +GCCTGCATTCTCTGCCAAAGTTTTAGGAAGATACTTCAGGGCCCAGGCAAATGCTAGGAA +TGCAGGCCCACTGGGCCCTTCCAATCTGCTTCCTTTATCAGAAAGCATTTTTGCCAAAGC +CATTTCTGTGGCCCCAGCTCCTGGAATCAGTCTGGGATCTTGACATAGCTGGAAATAGGC +ATCAATGCCGTGGTAGACGGCCTGCTCTGCACTCCGCAGCCCCTGGGTGGTGGCTCCCCT +GAGAACCACAGTGAGGGCAGGTGTGCCTGTACATTCCCATTCAAATACCACAGCCAAACC +ATCTCCCAGCTCCTGCCTGTAAACCCTCTGGCACTTGCCTGGCCTCTGGGGAGGGAGCAG +ACGAGGCAGCAGAGGTGTGTCCAACACCTCACTCAGGTAAATGATCTCCATCCAAGACCT +AGCTTGAATCACCACGATGCCATACTTGTCCGCCAGTGTGAGGGTCTCCTCGTCGACCTC +CCCCAACACCACTGCCACATTAATTCCTGCAGCTGCTAGCTGGCCTACTTGCTTTTCTAG +TAATTGATCGCTTCCTTTACTAAATTGAGCTAGATCAGCAGGACTAGAAAGACGGGCCGT +TGCTGGTGCATTTGGATGGGCAGGACCAAAGGGGCAAGCAAAGAGAGCCACCCTGGCACC +ACTTAACACTGTGGCCATTTGCCCACAGAGCTTCCCAGATATTGCTAACCCCGGGAGGAG +GCAGGAATCCTCCAGTGTCCCCCCGGGCAGCGCGCACACCCCAACACGCTCAGGCTTGAA +GCTGCCGTCTAGTTCCTTGATAGCCCAGCAGGCGTGGGCCACCAGCTTGGTCAAGTGGTC +CATGGGGGACAGGGTGTGGGTATTCATCACAGAATGGAGGGCCCAGGATGGATCTTCCAA +AGGCCCCAGAGATTGGATGGCCAGGGAGGGCAGTGTGGCCAGGACCTCTGCAGTGGCCGT +GGCGTAGGCCTCCCGGAGCTGCGGGCGAGGCAGGCCAGCCTTCAGCAGCTGCTCTGCCTG +TTCCAGCAAGGCTTCCGTCAGCAGAACCACGAAGGCTGTGCCGTCCCCACTATTCTCTGC +CTGGGTTTGTCCTGCTTCCCGGAGGAGCCATGCTGCTGGGTGCTCCAGCTCCAGGGCCCT +GAGGATGGCAGTGGCACACCCCGTGCACACTGTTTCTCCTTTCATGGTCACCAGGAACTT +CTGCCGGCCGTGGGGGCCATAGCAAGGCCGGATGACACTGGCCAGGGTCTGGACTGCAGC +CAAGCTGCTCAGCAGGTGGGGCTCCTCCTCTTCTGGACTCCTCGGGCTCTCCCTTGGGTT +CAGTGCCAGCCGCTGGGGCAGCTCCAGGGCTGAAGGGACTGTGCTGTCCATGGCCCGCAG +AGAGAGGAGAGGCCACCGTGGGTTGCAGAGATGCTCTAGAAACAGCAGCTGGGGCACTCC +TGACACCGATCGTTGAAAGTACTCAAGAGGTCAGTGGAAGCAAGGAGCCAAATGCCCATT +GATTGGTATCTGAAGACATCAGCACGGACCAGCACTCCACTGTGGGTCCAAGGATGAGCT +CCAAAGAGCCCAGTCCTAAAGCCACCCCAGGGTTGATTCTGTAAAGGAACTGGGTCTTGG +GGCCTCTCAACCTTGGTGGCTGAAATGGGATCTTTAACTGATGAAGTCACAAAGTGGAAA +ATGGAACCAGGATAGAGAATGAGGTCACAGAAGGCTGGTTAGAACTGAGGAGGCCCTACC +AGCAGGCAAAAGTCAGGCCTTGTCCAGCAATGGAGGTACATGCACCTCTGCACCAGGTTT +GAGACTTGTTTAAACGTAAGAGACAATGAGGAGGAGATCAAGTGAAAAACTACCCATTTC +ACCCTATCTGGAGTGCAGGGGCATAACCATGGTTCACTGCAGGCCCAGCTCCCTGGTCTC +AAGCAGTCCTCCTGCTCAGGTTCCCAAGTACCTGGGACTACAGGCACACACCACCACACC +TAGCTAGTTTTTTTATTTTTTGTAGAGACAGTGTTTCTGTCTGTTGTCCAGGCAGGTCTC +GAATTCCTAGCCTCAAGAGAGCCTTCCACCTTGGCCTCCCAAAGTGCTAGGACTACAGGT +GTGAGCCACCACCTCACCCACCCTTTTTTTTTTTTTTTTTTTGAGACAGAGTCACACTCT +GTTGCCCAGGCTGGAGTGCAGTGGTACAATCTTAGCTCACTGCAACCTCCACCTCCCAGG +TTCAAGCAGTTCTCCTGCCTCAGCCTCTCAGTAGCTGGGATTACAGGTGCCAGCCACCAC +GCCCGGCTAATTTTTTATATTTTTAGTAGAGATAGGGGGATTTCACCATGTTGGCCATGG +TTGGCCAGGTTAGTCTCAAACTCCTGGCCTCAAGTGATCCGCCCACCTCGGCCTCCAAAA +GTGCTGGGATTACAGGTGTGAGCCACTGCACCTGGCCTTTTTTTTTTATTTGAGAAGGAA +CTGAGAGATGATGTCTGTGTTTTGTTTTGTTTTGGTGTTACTTTCTCTTGCAGTACTGTG +TAATATTAGCCATGTTTTGCTGTCTGCCTTTGACTTTTTGGGTATCTTATCAGTTTGTGC +TTGTGTATCAGGTTTCTTAGGGTGTCTGTTGGTCTTTCAGGGTGCAGGTGTGGGAGGCTG +CACAGCGTGCATGCCTGTGCCACGACTCCCAACTCTGCCTCCCTGGCAGAGGCAGGGCAA +GACAAGTGGGGAAGGATGCTGACAGCTCACAGACAAATAGAAGTGAACCCAGAGGGGTGA +AAAGCAACCAGCCTCCCAGCGGTCAGGGAGGTAGAAGCCTAAATGGGGTCCTGAGATTTA +AATGCGAATCGCCTTCCCATCCTAACCTTCAATGCTTACAATTTAAGTCTCTTTTTTTCA +TTCTCTCTCCTTTCCTCACTTGTCTCCTCTTTCCTCCTATAGAGCCTACTCGGGTAATGA +TGCTTCTGCTTTAGTTTAACACATATTTAGTCTGGGCGTGGTGGCTCATGCATGTAATCC +CTGCACGTTGGGAGGCTGAGGCGGGAGGATTGCTTAAGCTCAGGAGGTTGAGGCTTCAGT +GAGCCATGATTGCACCACTGCATTCCAGCTAGGGCAACAGAGTGAGACTTGTCTCAAAAA +AAATAGGGGAAAGGTCATTTGGAATCCTAGTCCAGAGATAACCATTGTTTACAACTTGAT +GAACATTACTACTTTGCACATATTATATGCATACATAATTATAGATTTACACCATTTTAC +ATAAGATTATGATACATATATGCTATTCTGTGATCATTTCCCCCTCAACATTATCTTGGC +TCAGAGAAATGTTTCTTTTTTTGTTTGGACATGGAGTTTCGGAGTTTCGCTCTTGTCGCC +CAGGCTGGAGTACAATGGCGCAATCTCGGCTCACCCTCGGCTCACCACAGCCTCTGCCTC +CCGGGTTCAAGCAATTCTCTTGCCTCAGCCTCCTGAGTAGCTGGGACTGAGTAGCCATGT +GCCACCATGCCCGGCTAATTTTGTGTTTTTAGTAGAGACAGGGTTTCTCCATGTTAGTCA +GGCTGGTCTCAAACTCCTGACCTCAGGGGATCCACCCGCCTCGGCCTCCCAAAAGTGCTG +GGATTACAGGCGTGTGCCACTGTGCCTGGTCTGTGAGCCACTGTGCCCGGCCTGAGAAAT +GTTTCTTTTTTTCTTTCTTTTTTTTTTTTTAAGCAGAAACACATTCATTTATTAACCAAA +GGGATGATCCTAATGAATCCAACACACTTTGAAATAGCTGCATGTAAAATGTTTGTGATA +AAGATAATTGAACACAGTAATGAAAAAAAAAAAAGAAAGAAAGAAACGGTATGGAGATTT +GCTCATTGAACTGAGCTTGGTCATTCTCTTAGTTAACTCCTGTCCAAAGTGATGATGGAA +TCTTTATTGTACTTTTTCATAGATCCGAGTACAGGCGACATGGTTCATGACACAGTCCAC +CACTAATTTCCCATCTTTCAATGTTCTTGTTATTGTGCTTTCCTTCCCATCCCACTCCTG +ATGCTGAACCAATGCACCATCTGTAAAGTTGCACACAGTCTGAGTTTTTCTGCCATCAGC +TGTGGTTTCTTCAAACTTCTCTCCCAGGGTACAAGAAAACTGTGTTGTTTTCAAAGTGCT +CTCAGTTTTTATGGTGAGGTTTTTGCCATCACAAGTGATGATACAATCTGGCTTGGCCAT +TGCGCCCATTTTTTGCAAAGCTATTTCCTCCTAGCTCCTTCATGTATTCATCAAAGCCTT +CGCTGTCCACCAGGCGCCATCTTCCTTCCAGCTGCTGAACTGTGGCCATGGTGGGTGCAG +GGGGGCTGGTGTGCAGAGCAGGGTCTGCGTCGGCGTGGCAGCGTGCTGTCGAGAAATGTT +TCTAAGGAGATCTTATTTGGTCTGAGAACCATGAATGATTATTTTGAGCACTTTTGATTC +TGGAGACTCCATTTGGATCAGGCATGGTCCTCCAAATTCAGGCTTCTGAAAGCCTGTACC +TCAGAGTAGGCTTGATGTTCCATAAAAGATGTGGTTATGAGTGCAAAGATGACTTGCCTG +TATTGTTATACAAATGTAAAATGTAACAATCAACAAAAATGTAGCAAAGTATGCATGTAT +ACATTTTCTCTAAAGATACAGTTTCTTTTTTGAAAAAATAAACACATTAGGCAGGTGTGA +TGGCGGGTGCCTGTTATCCCAGCTACTCCGGAGGCTAAGGCACGAGAATCTCTTGAACCT +GGGAGGTGGACAAATTGCAGTGAGCCAAGATTGCGCCACTATACTCCAGCCTGGGCAATA +GAGCGAGACTCAGTCTCAAAAAATAAATAAATAAATAAATAAATAAATAAATAAATAAAA +TAAACACTACCGGCCAGTGGCCATGGCTCGAGCCTATAATCCCAGCACTTTGGGAGGCCT +GAGCCAGGTGGAGTTCAGGCATTCAAGACCAGCTTGGGCAATATGACAAGACCCCTGTCT +CTACTAAAAATACAAAACAATAGCCGGCCGTGGTGGTGTGTGCCTGTAGTCAGCTGCTTG +GGAGGCTGAGGTGGGAGGATTGCTTGAGCCCTGAAGGTGGAAGTTGCAGTGAGCTGAGAT +AGTGCCATTGCACTCCAGCCTGGGTGACAGAGTGAGACCCTGTCTCAAAAAATAAAATAA +AATAAACACTCCTATAAAGGATCCTCTTAGCTCTTTTTCTAACACCTAATCTACATTTTC +ATATTCATTTCAGTTACCCTACAACTGTTCACTGAGCTGCTGTTGAATAGGGGAAATAAG +GCAGATAACTACTGCCATCTCCGCTGGAGGGACGATACAGACATTAATCTGGGCACTTTG +ATTACAGGCAATGAGAGCTGTGAGTGGGGAAAGCACAAGGTTGGCAGAAGCATTTAGGGG +GACACAGCCATTCTCACGGAGGGCAGAGGTCTAAAGCAAGAGCTGAATAAAAAGTAGGAA +CTGGCCTCGTGGAAAGGGGAAGGGTGATGGGACAGCCTGGTGGTTTGTAGCCCACTGGAA +GGAGTTCTGAAAACTGGTGGTCAGGTGAGAAGGAAAGCTGGGGAAGAGATGAGCACGTTC +GCCAGAGGGTAGCAGGGGCTCTCCGGACCTAGTGAGTCAAGCCAAGGAATTAAGGCTTCA +GCCTGCAGGGTGATGAATAGGGCTGTCTATTCCATTTCTTCCTTCTTTCTTTCTTTTCTT +TCTTTTTTTGAGACAGCGTCTCACTCTGTCACCCAGGCTGGAGTGCAGTGGCACGATCCT +GGCTCACTGCAACCTCTGCCTCCCTGATTCAAGCAATTCTCCTGCTTCAGCCTCCAGAAT +AGCCGGGATTACGGGTGCCTGCTACCACGCCTGGCTAATTTTGTATTTTTAGTAGAGGCG +AGGTTTCACCATGTTGGTCAGGCTGGTCTCGAACTCCTGACCTCAAGTGATCTGCCTACC +TCGGCCTCCCAAAGTGCTGGGATTACAGGTGTAAACCACCGTGCCTGGCCTGAAAATTTC +TAGTTTATGATACTTGCCAGCAGAATGTGTTCTGTCACCCTCTTCTGAATAGATATGGTT +GTCTGCTATGACTTCTCCCACTGCTGCCCTTCCCCCTGAATCCACAGATGCATTTCTTTT +AAAACTATGATCTTGTACACAATGGATGTAAATATTTAATCTTTCTATTTGTATGTTTTT +CCATGTTTCTTTTCTTTCTTTCTCTTTTTTTTTTTTTTTTTTTTTTTTTTGGAGGTGGTG +TCTGCCTCTATTGCCCACAGGCTGGAGTGCACTGGTACAATCTCGGCTCACTGCACCCTC +CGCCTCCTAGGTTCAAGGGATTCTGCTGCCTGAGCCTCCTGAGTAGCTGGGACTACAGGT +GTGCACCACCACGCCCGGCTAGTTTTTATATTTTTAACAGAGACAGGGTTTCACCATATT +GGCCAGGCTGGTCTCGAACTCCTGACCTCGTGATCCTCTCACCTCGTCCTCCCAAAGTGC +TGGGATTACAGGCATGAGCCACCGTGCCCGGCCTCCATGTTTATTTTCTAGTTGCTTACT +TGTCCTTTTGTGTTTATCCTTGTTAACTACTACTGCCAGGCTTAAAGTATAGACCCCTAG +AGGGCAAGATTTGTATCTATATAAAATGTACTGCAAAACATCTACTTAAGCCTCACATTC +TTAAACACAAATTACTTTTGAAGATGACTGTTCTGTTTGTTTCCTTCCTGGTTTCTTCCT +TTAACTTTTCCACCAAACAGGTACATGATATACTTTACTGAAATAACTTATATAGCAATA +TGAATTTTTTTTTTGAGGCGGAGTTTCGCTCTTGTTGCCCAGGCTAGAGTGCAATGGCGT +GATCTTGGCTCACTGCAACCTCCGCCTCCTGGGTTCAAACAATTCTCCTGTCTCAGCCTC +CAGAATAGCGGGGATTACAGGCGCACACCACCATGCCAGGCTAATTTTTGTATTTTTAGT +AGAGACGGGGGTTCACCATGTTGGCCACGCTGGTCTCGAACTCCTGACCTCAGGTGATCC +GCCTGCCTTGGCCTCCCAAAGTGCTGGGACTACAGGCATGAGCCACCGTGCCCGGCAAAT +TTGAGGTGGAGGTTGCAGTGAGCTGAGATCGCATCACTGCACTCTAGCCTAGGTGACAGA +GCAAGACTGTCTCCCACTTCAGCCTCCCAAGTAGCTGGGACTACAAGCATGTGCCACCAG +ACCTGGTTAATTTTTTTTTTTTTTTTTTTTGAGACGGAGTCTCGCTCCATCACCCAGGCT +GGAGTGCAGTGGCGCGATCTCAGCTCACTGCAAGCTCCCCCTCCCGGGTACACGCCACTC +TCCTGCCTCAGCCTCCCGAGTAGCTGGGACTACAGGCACCTGCCAGCACGCCCGGCTAAC +TTTTTGCATTTTTAGTAGAGACAGGGTTTCACCGTGTTAGCCAGGATGGTCTCGATCTCC +TGACCTCATGATCCACCTGCCTTGGCCTCTCAAAGTGCTGGGATTATAGGCGTGAGCCAC +CGCGCCCAGCCAGGCCTGGTTAATTTTCTTTGGTATTTTTTTGTAGAGACGGAGGTCTCA +CTATGTTGCCCAGGCTGGTCTCGAACTCCTGAGCTCAAGTGATCCACCTGCCTTGGCCTT +CCAAAGTGCTAGGATTACAGGCATGAGCCACGGTGCCCAGCCTACAGTGCAACTTTAATA +ATAACAATATGAACACAAAAATTCTAAGATCTAAAATTTAAGCTTTCAGTAGTCCTTCTA +TAACTGTGAAAGTTTGGTTCCTAAAAAGCCCTGAGGAATTTATGGGAAAACAAGAGAGAC +AACATTTAGTAGTGAACCTGTGCATTCTAAATAAAGACAATATCAATGACGTGTTATAGG +TCTTCAATTAGTAAGAATGAATATTGGACTATGAATTTTTATTCACTGTCACTTGTTTGC +TAGATGCTTTGAGAATCTTCCTTGCCTATATTTTCCTGAGATGTTGGTTTTTCTTTGTCA +CAGATAACAATGCTCATTCCCTCCCCATTAAAAACTAAATATATATATATATATATATAT +GATTAAACGATTACTACATGTGCTTTGAAATATTCAAATATTTTAGACAGTAAAAGTCCC +TTGTAATTCAACCCTTTGCAGATGATTGGTTAACAGGTTAGTACACATCTACCTAAATTT +AAAATCCCATATTTAACATGTATACTTATTAGAAAGTACACATTCTAATATTTTTCTATT +GTATTTGGTACTATTTTCAGATGCTCCTGCCTTTTTCTTTCGTAATTTTGAAGGACCTCA +GCTCCCTGCCTCCTAGATTTTTGCTACTATGGTCTCAGAGCTGTGTAATTTGGATGACTG +AGATGGAAAAACCTCTGGAAAACCTTTATTTATGTTGAATAAGTATTCCTTGAATCCTTC +CTCAGCATCCTGGGTTATATTTGATTTGCTCTGCTCATGATAACTTCATGCCAAGGAGAC +TGCTATCAGTTCTCTTAAAACAGATCCCAACTCCCTGCTCATAGTGGCCAAAGGAATGGA +GATTTCAGGCTGAGTTTACTTACGTGCATCATCTTCATCTATCCAGAAGCATCCCTGCAC +AAAACCTCTGTTTCTACCCTTCCATTCACTCGGCTCACTTTTCTGCTCTTAGTACCCTTT +GTTTCTTGTGAACTCTCCAGCAGGAGTGACTTGCAATTTGTATCCACTGACACTTAAGTT +CTCGGAAGTGCTGGAGAAGTGTATGGAAGTAAATTATCCTGATGTATAATTTTGTGCATG +TGAAACTCACCGTGGAAGTGCCTATCTAATTTCAGTATGGAACACAGCTAAACATTTGGA +TCAATAATCCAGTTTTGAAACCACACTTCATTTAAAGTACAATGTGCTGAAAAAAATGAA +AAAAGGGTGCTTTCAAATTTGTACTTAGTAAACTTTCACTAGATCACATCATATGTTTAT +CACTAGTCATGTTGTATTTCTATGTGTAATCGCCAGGCACTTTTAATTTCTAGTTTGCAT +TTACCATGCCAGCCTCCTCCTCAATCCCAAATTTCCTTTGGTTATAAATTTAGTAAATTT +GAAAGAGCCAGCAGGGATTAAACCCTGAAGGTATTCAAATGACTATCTGACGTTATTCCT +CATTTCAGCCATTTCGAAAAATTATGCTTTCATTTAGAATAGGCTCTGGGAATCAAAGTG +TGTGTATTTTGCCCAAGTAGAAGACACAGTTTAAAGTTAACATCCTAGCTACTAGAAGGG +AAAGCAAACAACATCGCTGCAAAAGGAGCCTATTTTTTTTTTACCTTACACTAAAACTAC +ATTGTGAAGATCAAACGAAATCAAGATGAGAGTGTGCCTCTTAACGCCAGGTCCAAAGTA +GATGCTTATTAAATGATAGTTTACCCCAATCCTTCACAAATGGTTGATAGGTCTTACTAT +TTCCCCCCTATTCAAATCTAGAATTTTTTCACTCCCATATACTAATCGATAGTTAATGGA +AAGCACAGAATAGATCATCGTCCAAGTGTTAGGTATTAGCCTGAGGAATCCGGAATCCCA +TATTTGTAACTGTCCTTCTTGAGAAAGTGCATTTTTCAGGCGGATTCTAGCCCCATTTTT +CCTTTTACCATTTTTACATGTTATGAGAGGTGGCTTAGAAATACTTCGATTTTTGCCTCT +TCATCACAACACACTGAACGTTAAAATCAAGTGGTTGGGTTTTTATTGGCTTATTTTGTC +TCTAACCGTTTTATTTCTCGAGCTGTCATCGTTCTTTTCGTCTTACATCCTTATGAACCT +TTTCTGGATTAAAAAAATGACGTTATAATAAGGAAACTGTAACTGGCGTTGGATTAGAAC +GAAGTTGACTCCATTCCTTTTCCTCCCCGTAGTGTGGGCGATACGAGGAAAGACCTCGGC +AAGAACCAGCGAAGCCCCGGCTGCCCTCGCCCTGCGGGCGCACACTTGCTCCTCGCGCCG +GGCTGCGCCGGGCGCCCGCGCCGCCTCGGCGTGTGTCCGCGGCTCCCTCCCGCCCTCGCC +CGCAGTCCCCCGATCCCGATCCCGGATCTCTGGGTCCACAGCTTGGCTCCCTCCCGAGCC +GGAGCCGGAGCCGGAGCCGAAGTCGCGGCTGGGCCCGGCCGCCCCGTCACAGGGGGAGGG +AACCCATGGGGAGGGGGAGGGGCGGTGAGGTCAGCGGCGGCGGCGCGTCCGCGGGCGGCG +GGAGCTTCGCATGCGCGGAGCGAGGCCCGTGAGTGGCAGCGGCGGCGCGCGGGGGGCGGG +CGAGGGGCCGAGAGTGGGGGAGCGGGCGGGGGCCGTCGAGGAGGCGTTGTGTGGGCGCGA +CGGCTGCGAGTTGGGGAGGTCTGTGGTGCGGGTCGCCCCGGGGGATCCCCGGCGCGGGCC +TCGCGCGACGGCCACGGTCGCGCGGCGTGTGTGGGGGGTCCACGCACACCCGCAAAACTT +CCTCCTCCCCTGCTCCGGGAGAGCGAGCGAGCGTGTGTGAGAGCGAGTGTGAGGAGCGAG +CCGCGGCCCGACGCCCAGCGCCGCCGCTGGAGCAGCTGTCAAAACTTCGCCGCCGCCCGG +GCCCCGCGGCCCGCCCTCCCCGCGCCGGGCCCCTTTCTCTTCCTGCTGCGGGCGGCCCGG +GGGAGGGGCCGCGGGCGGAGACCCCGGAGGCCGGCGCCCCTCACGCCGCCCGCCCGCCCG +CTCCCCGCCCGGCCCCTGCGCGCGTGCGTGTCCTGCTCGCTCCATGTTGCCGCCTCTCCC +GGTACCTGCTGCTGCTCCCGGGGCTTCGGGAAATGCGAGAGTCTGAGCCGGGGAGGAGGA +ACCCGAGCAGCGGCGGCGGCGGCCGCGGCGGCGGGAGCCCCCCAAGAGGAGGACCGGGAT +CCATGTGTCTTTCCTGGTGACTAGGATGTCGTCGGAGGAGAACAAGTGCGTGGAGCAGCC +GCAGCCACCACCCCCCGAGGAGCCTGGAGCCCCGGCCCCGAGCCCCCCAGCCGCAGACAA +AAGACCTCGGGGCCGGCCTCGCAAGGCGCTTCCCCTTTCCAGAGAGCCAGAAAGAAGTAA +GTTGAGTGCGAGGGAGCCAGGCCGGGAGCCAGCGGCGGCGCCGGGCCGGAGCTGCCACCG +GGCGCCCGCCCCGCGGCCTCCACGCCTTGGCGCCCCCCGGCGGGATGGGGGCGGGGCGGG +CCCGCGGGCGGCGGCAGCTCCCGGCCCCGGCCCCACGCCCCTCGGTAGCCGCCCGCGCCC +GGCCTCCCCCGCTCCGCGCCGCCCGCCCGGGCTCCCGTCGGCGCCCGGCTTCGCACACTT +TACTTTTCAGTCGGGCCTTTTCAGTGGGTCTTCTCCGCGACTCTTCTTTTGGAGAAATTT +CTCGTAGCCGCGTCTTGGCCTAGCTGGATCATTGAGAAAACAAGCCCGGAGCGCGCGCAG +GTAGTCCCCGGACGGACTCCGAGCGAACCGCCGAGCCGTGGGCGCTCGGGAAACTCGGAG +CTGTCAAAACGCCCGGGCCAGGTGGTCTCGGGGCGCGGGCTGGGGGCGAGAAGAAAGCGG +CCGGGCGAGTGCAGCTTTTGTTTGTCAGCGACTCGTTCGTGGAACTTTTCCTGGTCCCAA +ACCTGTGTTTTCTTCTTTTGATGATATATTAGGAAGCCATTTGGCTTCTTCCTTCCCCCT +CCCCCAACACCCAGCACCGCACTCCCGGGCTCCGAAAGCACAAGTCCTGTGGGAACCCCC +AGCTTCGGGGAACGGCCTGCCTAAGTTTTGGAGACGTAGCCAGCGTCCCCTCGTAAGGCA +GAATACCAAGAGCACTTATTCAGAGAGAGTGCAGATGTAAATGTCGTTTCCCTCGTAAGT +CTTAGCTGTAAGGGGCTTGGGAATAGGGTCGCCTGCCTTTGACCGACCGTACTGTAGGGC +TGGACACCGGCTTATTAGAGGACCAGAAATGTCTTCTTACAGAACGGTTATTTGACGGCT +TTGCTTGTAAATTAAGACACCGTTTTAGTGCCAGCGAGCTGCTCGGCTTCTGTGGCTCTC +GCGTGTGCCGTGGAAGAACTGTGAATGTCTTTCGAAGTTGTAGAATGGCGTGTGTGCTTA +CTCATTTCATGAGATGATATTCTCATTGAACTGTCGGGAGTGGAAGGGTGCGCTGGGACG +TGAAGGAAGCCAGCACGTTTATGGATAGGCTGTTTCTTTGGTTCGGGTGCATTCACTTAG +TAATAGTGTTGTTTGGTGATTTGTAGTAAAAATAGTAGCGTGAACTGAGGCATAGCAGAG +CTGGGTTGTGGGAACCCATTAAGCTCTTGACTTGAATGTGCTCTTTTCTTGCCCCGCTGT +CCTTTTACTATGAAAATGATTCAGGGCCTTCAACTTGCCTCCATATTTTATTGCCAGCTC +TTACCTAGCTATGATAATCGTGAGGGAGGCAAGTACAGGATGTGTGTACGTTATTACATT +AGCTTCTTCGTGATACAAAGTTAGGACTTACTTATGCCACTTGCGTTGTAATACAATGGC +AAATATAAAATGCCCTTATTCTATATTAACTGAAATTTGGAGAAGGAAGTGGAGGTTTAA +GTAATTTTTAGACGTCTAAGCCACTTTTTTGCATCCTTTAAAGCAACTCAGGACAAGCCA +TATTGGGGGTTTTACCTTGATTGCCTCCCATTTCACTATTTGCAAAGCATTTCTTCATCT +CTTACTGAACATTAATTTGCAATTTTTTTTTTTAATTTGCATTTGAATTCTTACTCCAGA +AAGATTAGATCTGTGTTGTCACACCCCACACCCCATACTCCTGTAAGGGCGTGCTTGTGC +ACGCGCACACGCTCACACGCACGCGCACACTCGCACACACCCTACTTTTGAAATGAGCTC +ATTTGTATTAGTGCAGCTCCTGAGTGCACTGGACGATTAGGGTATTGCCACTTTATTATT +TTAATTCTTAATCTCATATTATGAAGAAATAGGTAGCCTTTGGAGAAGATAAAAAATTTC +TGCTGAATAACAGTATAATCTAACTATGAAACATCAAAACTTTTGGAAATATTTAGAACA +AATGTAAGTCTGTAGAGAGCTTTTTCTTTTAGATTTGAAAACTAGTACTGCTTTCTTTAT +AGGAAAGTAAAGTCTACTGGTAAATTTCACGGGTCTAAACTTTTTAGAGCTTTTTTTTGA +AATTGTGTCTTTTGAAGGGAGTGGAATCTCCAGTTGTTTTTAGAAACATGTAAATGGAAA +CTAACATATGAATTGGAAAGCAAAGAGAAAGTTTTTCAATTGTGTATCTCTATACTGTAT +AAGAATCCATGCAGAAAAGACCCTGTAGTTGGATAGTAAAGACCCTGAAGGTGAAACTTA +TGTGTAACCAGTGTAAATTAGGTTTGTAACCAGTGAAATTATGTGAAATTGCAAATAATT +CACCTGAGAAATGAAAATTAATCTTCTTTGCTAAATGCCATAGAGATATTTTAAGTTGCT +AATGTTACTTAGATGTTCATTAACTTAGTGAGTTACATTAAGTAGAGAAGATGCCTTTTT +TTTTTTTCTGTACGAAGTCTTGCTCTGTAGCCCAGTGTAGTGGTATGATCTCGGCTCACC +ACAACCTCCGCCTCCTGTATTCAAGCGACTCTCCTGCCTCAGCCTCCAGAGTAGCTGGGA +TTACAGGTGTGCACCATCGCACCTAGCTAATTTTTTGTATTTTTAGCAGAGACAGCATTT +CACCATGTTGGCCAGGCTGTTCTTGAACCCCCGACCTCAGGTAATCCACCCTCCTTAGCC +TCCCAAAGTGCCAGGATTACAGGCGTGAGCCACTGCACCCTGCTGAGAAGATGCCTTTTG +ACAATGAAGTGGATTTGTATATTTATCTTTGGCTTAAAAAAACATGCACCACCAATTACA +CTTTCCTCAAGTTTAAATTTTTAATAATTAGGAAAATAAAGCATTTTCTTGTCTTATAGT +GTTAGCTAGATTGTTTTTGTGTATTTTGTCATGAATAAAAAGCATAGCTATATAGTTACT +GCTTTTACATTAACTATAAATATCTTAAAATTTTACTACCTAAAATCAGGAAACTTGAAC +TGAAGCTACTAATCTTAGAGTTGGAAAAGTAAATACATAGAGGTTTCCTGTTGTACAAAT +GTCAAGTGGCACAGTGAAATTTACATTCATTTGAAAGTTTTCCTTAACTGTAAAAAGTAT +CAAATTACTTGATACTTTGGAGTAGTTCATCATCTTTATCAGAGGCACAGGTCTTAACCA +TTGGCAAGCCTCTGTCAGAATATGCACATATTAAAGATCTGATTATTTTTGTGTTAATGT +TAAAAAATTTTTCTGAAGCTTTTATCTTATTTTTTCCATCCTTACACCGTAAATTCACAT +TACCAAGTTGGGAAGCCAAAGAAACATTCTACTCTACTATGTTTCTTACCAGTTCATGAA +AGTTGATGTTAGAAATGGGTGTGGGTGTGGGGGATGGGGGTGGTTGTACAGAAGCAGCAG +GTGGTAGGGATAGGATTTCTGAAGCACTATCCTTGGCCTTTTTTGAGTAAACTCTTTATA +CCCTGAGCCACTTTCTTTTCAGAGGGCAATTGCTATTATTAGAGAGCCACCTTAAGCATT +ATTGTTGTAGAAAAATTAGGCACAACCAGTGATTGTCATTACAAGGACCAGCAAAAATGG +CTAGGTTGCTACTCTGTATTTGTAACGCCCTTCCCCCAACAAAATTTCTCCTTTTCATAT +CTGTGAATTAGAAATAAGTGATAGAAAACTGTACTGCATTACAATATATACCATTTAATA +AAACAAGTTTATAGTTGAGAGCACTATTCATGCTTTTTGAGATAATGCAAATTTGTAATT +TTTATGATAGCAATTCTTAATAATTTATTGTCCAAGAGATTTGATAAAATTTTTGATAGT +TATTGGTCTCTGGGACTCAATAGGCACTGAAATGTTTTAATTCAGTTGAAAAGTTGGTTC +AGGATTGCTACCCTCTCTTACCTGTTAGGAGGTTGTTGTTTAACCTGACCTGAAATTCCC +ATGAATAAGAACCTGTTTTTTTTTTTTTTTTCTTTGACAGAGTCTTGCTCTGTCGCCCAG +GCTGCAGTGCAGTGGTGCGATCTTGGCTCGCTGCAAGTTCCGCCTCCCAGGTTCAAGCGA +TTCTCCTGTCTCAGCCTCCCAAGTAGCTGGAGTAGCTGGGACTGCAGGCACGTACCACCA +TGCCTGACTAATTTTTGTATTTTTAGTAGAGACGGGGTTTCACCGTGTTAGCCAGGATGG +TCGCAATCTCTTGACCTCATGATCTGCCTGCCTTGGCCTCCCAAAGTGCTGGGATTACAG +GTGTGAGCCACCGCACCTGGCCCAGGGAATTTCTAATATTTGAGAAGATGTTATTTTTAG +TCTATTATACAAATTTATATATTGTTTACTAATATATAAATTTACATATTGGTTACTAAT +ATGTAAACACCAATTTACATATTGGTTACTAATATGTAAACTTGATAAACATGGATTTCC +ATGGAAATTTAAAAGTATCACAACAATTTGTTTTCCCATTCTGAAACTTGTGATTTATTA +CATTTTCCTACTATTTCAGTTAATTCCATAATGCCAGATTTGTTGTCAATTTGCCGAGTG +ACAAGCCACACTGCTTCCTCTCATTCCTCTATTCCGCAAAACTGCAAAGTTTCCCAGACC +ACAGTCAGGTTTCTCTGGGTTGTCCAACTCTGTAAACTTACAGAGTGGTTGTCCAACTCT +GTAAACTTACAGAGTGGTTGTCCAACTCTGTAAACTTACAGAGTGGTTGTCCAACTCTGT +AAACTTAAGTCACTTTAAGTTTATGACGGAGGGGCTTCGTGAAACTTCATTGACCTTCCA +AGGTGAAAATTGGTCAGTTTTCAGTTATAAAGGACATTAAGGATGGGTGTGGTGGCTGAT +ACATGTAATCCCAGCACTTTCGGGAGACTGAGTCAGGAGGATCACTTAATCCTCATTTAA +AAGGAGTTTGAGACCAGCCTGGGCAACAAAGTGAGGCCTTGTCTCTACAAAAAAATTAGC +TGGGTGTGGTGGTAGGCACTTGTAATCCCAACTACTCTGGAGACTGAGCTGAGAGAAGAT +TGTGTGAGGCTTGGAGGTTGAGGCTGCAGTGAACGGACATCACACCACTACACTCTAGTC +AGGTGACAGAGCAAGACTCTAAATAAATAGGAACATTAGATGGTCTCTCTGCACTCTTGC +CTGGTGGGGACGTGTTAGATACCCTCGTTAGGTTGTGATTTAGTTTTTAATCTGTGAGAT +GTTTGGGTCAAACAATTTTTAGCTGCCATGGAATAAACTTTCCAGTCAGCGTGTGAGTTT +GTGTTTGCCTTTACTTTTTTTTTTCTATATTGTTTTGGTCTATTTTTATCTTTTAATTTC +AGAAAGCTGATTAATCTCTTCCTTTTCTCTTTAAAAATTTTCTTTATCATGTTTGTGCTA +CAGTGGTTATTTTGAGAACTTGTTGGCAGGATAAGTTGCAAAAGTTATGAAGTAGAATAG +GGATGATTTCTGTTTTTGTTTTTTTTTTTTTCAGACAGAGTCTCACTCTCTTGCCTAGGC +TGGAGTGCAGTGGCGTGATCCTGGCTCACTGCAGCCGCCGCCCTCCGGATTCAAGTGATT +TGCCTGGCTCAGCCTCCCAAAAAGCTGGGATTACAGGTGCATGCCACCACACCCAGCTAA +TTTTTGTGTTTTTAGTAGAGATGGGTGTTCACCATGTTGGCCAGGCTGGTCTCAAACTCC +TGACCTCAGGTGATCTGCCTGCCTCCGCACTCCCAAAGTGCTGGGATTACAGACGTGAGC +CACCATGCCTGGCTGAGATTATTTCTTTTTTTATTATAGCCATTGCTTGTAGATATATGC +TGGTGGTTATCTGTAAAAATGTAATAGAAAGGCCGGGCACGGTGGCTCACACCGGTAATC +CCAGCACTTTGGGAGGCTGAGGTGGGCGGATCACAAGGTCAGGAGTGGGAGACCAGCCTG +GCCAATATGGTGAAACCCCGTCTCTACCAAAAATACAAAAATTAGCTGGGCATAGTGGCG +GGCACCTATAGTCCCAGTGACTCGGGAAGCTGAGGCAGGACAATCGCTTGAACCCAGGAG +GCAGAGGTTGCAGTGAGCTGAGATCGTGCTATTATTGCACACCAGCCTGGGCGACAGAGT +GAGACTCCGTCTCAAAAAGAAAAAAGTAATAGACCAATCTTGAATTTATAATTGGAAGTG +TTGATCCCTTTATTTGCAGAATTTATTTATTTGTGACGCAGCTGTTGCTACCTCGCCTTT +TCTTTTGTTGAGCTTAATCTCATGTCAAGTCATTCAACCAACTCAAAAGCGATGAAGACA +TTATTGAATCAACCTGAACTAAATCAGACCTAGGCTTCTTAAAATATACAGCTTAATGCT +TCCAAATGATTTAGAAAACTAAAAAACCTAGCTACGCTGTAGGACACACAGTGGCCAATA +ATACAGGACCCCCAAACTGGCCAGTGGACCACTGCAACCACTATTTACTTCCTCCGTGTT +TAGGAATGTTCAACGCTCCAAGCCCCATAGGCTGATTCAAGAAGATAAAGTGAGACTCAA +GGAATTTCGAAGTGGAACAATACACCAAAGCCTTAAACCTGAAATGACTCTCCTTTTCTG +GGGGGTGAGGGGGAAAGAAAAAGAAAAAGTTTCTAGGGCTCTCGGGGTGGCCTGGATGCC +AGGGTCCCAGAAGTGGCCTTTTCTAGCTCCTGTAACTAAACCTGGCGGAAAACTCCCCGC +CTGCTCACTCCACCCCCACCCGCCCAAGAATGCGTCTTCCCGTCTTCGGTGGCCCTACCC +AGAATCCCAAAATGTGGGTTCCAACCCGGGCCCTGAATGTCTTCTCAAATCCCCGGGACC +CAGGTTCCGGTGCGTGCCTTGCGTGCCGGGTCTTGCCCCTCGGGCGGTACCACCCAGGCA +GCCCTAAATCCAGCCTCCCGGGCCCCCAGCAGCGCCCTCCGCCCCTCCACTATCCGGTCC +GGCTCGAAGTCGGGGCCAAATCCAGAGACAAGAGGGCTGTGCCTGAAACTGAGCAGTTTC +ACCACTCGGCACTCCTGGCGGAAACTTCCCTTTAAAAAAAAGAAAAGAAAAGAAAAGCAA +CAGCACTTTTGGGCTAGCATTTCAATCCTTCCTGCCCTTTAGAGTTCCCAGTTCTGCTTC +CAGCTGGCTTTGGGTGTTCCACTAGAATTGAGTTGTAAAGATATTCTTTAAGTGTTTATA +GAACATTAAGACTTAAAAAAAATCTTTAAAATTAGAGGAGGGAAAAAGCCACCTTATCGC +ACACATCCAGGAAATGCAGCCCCGTGCATCCCTGCTCAGGGATGAGCAGGCGCCCCAGGA +CTCCCGGAGACAGATTTTTGGGCACCCGAGGGAGTCACCGGGCGCGTGTCGGGGTCCGCG +GTGAGGCCCAGCCCCTCCGGCGGTCCCTTAGACGCGCCCTCTGCCCGGCCGGTGTGGACC +GTCCCGGCCATTGTTTACGGGGGATGCCCGTCCAGACGCATTGTTTTGGCCGTTTCCAAC +TTGCCCCGGCCCTTTCCGGGGCATCGCGGGGGACCCTACACCGACGTCCCCCCTCCGCCC +GCGCCCCAAGGGCTGACTGGGCAAATTGGCAGATCCGCCCCGCGGGGCGACCCAACTTTT +CGGAACAGCCCCCCACCGCCCACCCCTGCAGATCCCCGGACCCCCGCTCCCGGCGGAGAT +TCAGGGAACCCCGCATCCCAAGCCCTTCTAAATCGTGCGGCCTGAGTGTGACGGCCAAGA +GCGGATGCAGCCCGGGATCGCCCGCACCTTCCCGTGGGCGG diff --git a/genome_22/genome_22.fasta.fai b/genome_22/genome_22.fasta.fai new file mode 100755 index 000000000..b542e338a --- /dev/null +++ b/genome_22/genome_22.fasta.fai @@ -0,0 +1 @@ +chr22 40001 7 60 61 diff --git a/samplesheet.csv b/samplesheet.csv new file mode 100644 index 000000000..08f10ad81 --- /dev/null +++ b/samplesheet.csv @@ -0,0 +1,2 @@ +sample_id,bam_path,fastq_dir,aligned_bam,methyl_bam,hpo_terms +test,https://raw.githubusercontent.com/nourmahfel/test-datasets/longraredisease/unmapped_bam/test.bam,,,,HP:0002721;HP:0002110;HP:0500093;HP:0000717;HP:0001263;HP:0001763;HP:0003298;HP:0002857;HP:0001382 diff --git a/spectre/mosdepth.regions.bed.gz b/spectre/mosdepth.regions.bed.gz new file mode 100755 index 000000000..4915e2d42 Binary files /dev/null and b/spectre/mosdepth.regions.bed.gz differ diff --git a/spectre/test_clair3_merge_output.vcf.gz b/spectre/test_clair3_merge_output.vcf.gz new file mode 100644 index 000000000..3310db082 Binary files /dev/null and b/spectre/test_clair3_merge_output.vcf.gz differ diff --git a/straglr/str.test.bed b/straglr/str.test.bed new file mode 100644 index 000000000..604e87338 --- /dev/null +++ b/straglr/str.test.bed @@ -0,0 +1 @@ +chr22 45795354 45795424 ATTCT diff --git a/test.exclude.bed b/test.exclude.bed new file mode 100644 index 000000000..e69de29bb diff --git a/unmapped_bam/test.bam b/unmapped_bam/test.bam new file mode 100644 index 000000000..af7240cbc Binary files /dev/null and b/unmapped_bam/test.bam differ