|
6 | 6 |
|
7 | 7 | ## Introduction |
8 | 8 |
|
9 | | -nf-core/marsseq is a pre-processing pipeline for MARS-seq experiments. We additionally introduce RNA velocity workflow that can be used to study |
10 | | -cell dynamics along differentiation. |
| 9 | +nf-core/marsseq is a pre-processing pipeline for MARS-seq experiments. We additionally introduce RNA velocity workflow that can be used to study cell dynamics along differentiation. |
| 10 | + |
| 11 | +## Metadata information |
| 12 | + |
| 13 | +The pipeline requires 3 additional files for experiment. |
| 14 | + |
| 15 | +- [amp_batches.xlsx](../assets/amp_batches.xlsx) |
| 16 | + - **Amp_batch_ID**: Amplification batch unique identifier |
| 17 | + - **Seq_batch_ID**: The ID of the sequencing batch associated with this amplification batch |
| 18 | + - **Protocol_version_ID**: `Mars_2` (do not change, will be deprecated in next release) |
| 19 | + - **Pool_barcode**: Pool barcode sequence |
| 20 | + - **R2_design**: `7W.8R` (do not change, used by `demultiplex.pl`) |
| 21 | + - asdsa |
| 22 | + - **Experiment_ID**: An experiment ID |
| 23 | + - **Owner**: The person conducted the experiment |
| 24 | + - **Description**: Description of amplification batch |
| 25 | +- [wells_cells.xlsx](../assets/wells_cells.xlsx) |
| 26 | + - **Well_ID**: Well/Cell unique identifier |
| 27 | + - **Well_coordinates**: The position of the well on the place (row & column, e.g. L23) |
| 28 | + - **plate_ID**: The ID of the plate associated with this well |
| 29 | + - **Subject_ID**: The ID of the subject that donated the cell for this well (e.g. mouse ID) |
| 30 | + - **Amp_batch_ID**: The amplification batch associated with this well |
| 31 | + - **Cell_barcode**: The well barcode sequence |
| 32 | + - **Spike_type**: `ERCC_mix1` (do not change) |
| 33 | + - **Spike_dilution**: `0.000025` (do not change unless required) |
| 34 | + - **Spike_volume_ul**: `0.01` (do not change unless required) |
| 35 | + - **Number_of_cells**: `1` (do not change unless required) |
| 36 | + - **is_primer_added**: `1` (do not change unless required) |
| 37 | +- [seq_batches.xslx](../assets/seq_batches.xlsx) |
| 38 | + - **Seq_batch_ID**: Sequencing batch Unique identifier |
| 39 | + - **Run_name**: Short description |
| 40 | + - **Date**: Date of sequencing |
| 41 | + - **Genome_assembly**: Genome (not used in the pipeline, will be deprecated in next release) |
| 42 | + - **Spike_type**: Usually "ERCC_mix1" (do not change) |
| 43 | + - **R1_design**: `5I.4P.51M` (do not change, used by `demultiplex.pl`) |
| 44 | + - Explanation: 5bps Ignore, 4bps Pool barcode, 51bps mRNA |
| 45 | + - **Notes**: Additional notes (ignored in pipeline) |
| 46 | + |
| 47 | +For more examples please see [SB26](https://raw.githubusercontent.com/nf-core/test-datasets/marsseq/SB26.csv). The original documentation for MARS-seq2.0 can be found [here](https://tanaylab.github.io/old_resources/pages/672.html). |
11 | 48 |
|
12 | 49 | ## Samplesheet input |
13 | 50 |
|
|
0 commit comments