Added nf-core/taxprofiler documentation and DSP About by apalleja · Pull Request #1 · biosustain/dsp_nfcore_documentation

apalleja · 2026-03-07T09:51:06Z

I added the taxprofiler documentation. It will need some fine-tunning (perhaps an update from the next time we use it), but at least now is there. I updated the index accordingly.

Also, I added some info about the Data Science paltform on the About.md.

github-actions · 2026-03-07T09:51:17Z

See the preview of the website at:

https://biosustain.github.io/dsp_nfcore_documentation/pr-1/

Copilot

Pull request overview

Adds new documentation pages to the BRIGHT Nextflow pipeline docs site, covering nf-core/taxprofiler usage (including Azure/Seqera notes) and an “About” page describing the Data Science Platform.

Changes:

Added a new taxprofiler/README.md documentation page with input formats, example commands, and captured run/config details.
Updated index.md to include the new taxprofiler page in the docs navigation.
Replaced the placeholder about.md with DSP background text and pillar descriptions.

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 11 comments.

File	Description
taxprofiler/README.md	New taxprofiler documentation page with setup/run instructions and configuration excerpts.
index.md	Adds taxprofiler docs to the Sphinx toctree navigation.
about.md	Adds DSP overview content intended for the “About” section.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-03-24T09:32:49Z

+You can find a more exhaustive description and running instructions in here:
+https://nf-co.re/taxprofiler/1.2.6
+
+Here we provide with a small manual to how to prepare, for running the pipeline and running it in the Microsoft Azure environment.


This introductory sentence is grammatically incorrect/awkward (“provide with a small manual to how to prepare, for running…”). Please rephrase to a clear instruction-focused sentence (e.g., that this page explains how to prepare inputs and run on Azure).

Suggested change

Here we provide with a small manual to how to prepare, for running the pipeline and running it in the Microsoft Azure environment.

This page provides step-by-step instructions for preparing inputs and running the pipeline in the Microsoft Azure environment.

Copilot · 2026-03-24T09:32:49Z

+The databases will be store in the corresponding data lake folder called databases. Until then you have to download and prepare the databases yourself.
+
+Files for Metaphlan were download from:


Minor grammar issues: “will be store” should be “will be stored”, and “were download” should be “were downloaded”.

Suggested change

The databases will be store in the corresponding data lake folder called databases. Until then you have to download and prepare the databases yourself.

Files for Metaphlan were download from:

The databases will be stored in the corresponding data lake folder called databases. Until then you have to download and prepare the databases yourself.

Files for Metaphlan were downloaded from:

Copilot · 2026-03-24T09:32:50Z

+conda install -c bioconda motus
+motus downloadDB
+```
+It got copied the database locally in here: /Users/apca/anaconda3/envs/motus/lib/python3.9/site-packages/motus/db_mOTU and I passed this dir to databases.csv


This line contains a user-specific absolute path and first-person phrasing (“/Users/... and I passed…”). For shared documentation, please replace with a generic placeholder path and instructions on how to locate the mOTUs DB directory in the environment, so it’s reproducible for other users.

Copilot · 2026-03-24T09:32:50Z

+      beforeText = '
+-[2m----------------------------------------------------[0m-
+                                        [0;32m,--.[0;30m/[0;32m,-.[0m
+[0;34m        ___     __   __   __   ___     [0;32m/,-._.--~'[0m
+[0;34m  |\ | |__  __ /  ` /  \ |__) |__         [0;33m}  {[0m
+[0;34m  | \| |       \__, \__/ |  \ |___     [0;32m\`-._,-`-,[0m
+                                        [0;32m`._,._,'[0m
+[0;35m  nf-core/taxprofiler 1.2.3[0m
+-[2m----------------------------------------------------[0m-


The embedded beforeText includes ANSI escape sequences (e.g., \x1b[0;32m...) which will render as garbage characters in the generated docs. Please strip ANSI codes (or capture the output with --monochrome_logs/equivalent) before committing this snippet.

Copilot · 2026-03-24T09:32:50Z

+# Worth noticing
+You need to add --shortread_qc_minlength 100 to require a minimum length for read after quality control
+You need to add --save_analysis_ready_fastqs to save the qc filetered reads before they go into classification or profiling
+You need to add --perform_runmerging to merge different lanes of the same sample
+You need to add --run_profile_standardisation so that all metaphlan profile of each sample get combined in a single report (This may be changed soon, follow issue: https://github.com/nf-core/taxprofiler/issues/494)


“# Worth noticing” introduces a second H1 in a page that already has an H1 at the top. Please demote this to an H2/H3 (and consider using a list) to keep the document structure consistent for Sphinx/Markdown rendering.

Suggested change

# Worth noticing

You need to add --shortread_qc_minlength 100 to require a minimum length for read after quality control

You need to add --save_analysis_ready_fastqs to save the qc filetered reads before they go into classification or profiling

You need to add --perform_runmerging to merge different lanes of the same sample

You need to add --run_profile_standardisation so that all metaphlan profile of each sample get combined in a single report (This may be changed soon, follow issue: https://github.com/nf-core/taxprofiler/issues/494)

## Worth noticing

- You need to add `--shortread_qc_minlength 100` to require a minimum length for reads after quality control.

- You need to add `--save_analysis_ready_fastqs` to save the QC-filtered reads before they go into classification or profiling.

- You need to add `--perform_runmerging` to merge different lanes of the same sample.

- You need to add `--run_profile_standardisation` so that all Metaphlan profiles of each sample get combined in a single report (this may be changed soon, follow issue: https://github.com/nf-core/taxprofiler/issues/494).

Copilot · 2026-03-24T09:32:52Z

@@ -0,0 +1,995 @@
+# nf-core/taxprofiler
+
+## Taxonomical profiling using nf-core/taxprofiler pipeline


The heading uses “Taxonomical”, which is nonstandard in this context; “Taxonomic profiling …” is the usual phrasing in bioinformatics/pipeline docs.

Suggested change

## Taxonomical profiling using nf-core/taxprofiler pipeline

## Taxonomic profiling using nf-core/taxprofiler pipeline

- shorten long lines - shorten long commands to be multiline - highlight filenames, repos, paths... using `foo.csv`

enryH

I updated links, paths and filenames. Shorted some lines and command to be not too long for displaying.

Added nf-core/taxprofiler documentation and DSP About

5ad9817

apalleja requested a review from enryH March 7, 2026 09:51

apalleja mentioned this pull request Mar 23, 2026

🎨 update links to BRIGHT biosustain/dsp_notes#28

Merged

enryH requested a review from Copilot March 24, 2026 09:19

Copilot started reviewing on behalf of enryH March 24, 2026 09:29 View session

Copilot AI reviewed Mar 24, 2026

View reviewed changes

enryH added 2 commits March 24, 2026 10:36

🎨 format for readability

b9035c8

- shorten long lines - shorten long commands to be multiline - highlight filenames, repos, paths... using `foo.csv`

📝 links and remove JSON comment

8670420

enryH approved these changes Mar 24, 2026

View reviewed changes

enryH added 5 commits March 24, 2026 10:56

🎨 format and change to BRIGHT everywhere

b477320

🐛 do not have twice the same entry to the toc

ec012ac

🐛 taxprofiler, not quantms

6462f36

🎨 align headings

4d64290

🎨 format

a17b80a

apalleja merged commit 1fb400d into main Mar 25, 2026
1 check passed

apalleja deleted the taxprofiler branch March 25, 2026 15:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added nf-core/taxprofiler documentation and DSP About#1

Added nf-core/taxprofiler documentation and DSP About#1
apalleja merged 8 commits intomainfrom
taxprofiler

apalleja commented Mar 7, 2026

Uh oh!

github-actions bot commented Mar 7, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Mar 24, 2026

Uh oh!

Copilot AI Mar 24, 2026

Uh oh!

Copilot AI Mar 24, 2026

Uh oh!

Uh oh!

Copilot AI Mar 24, 2026

Uh oh!

Copilot AI Mar 24, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI Mar 24, 2026

Uh oh!

enryH left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	Here we provide with a small manual to how to prepare, for running the pipeline and running it in the Microsoft Azure environment.
	This page provides step-by-step instructions for preparing inputs and running the pipeline in the Microsoft Azure environment.

		The databases will be store in the corresponding data lake folder called databases. Until then you have to download and prepare the databases yourself.

		Files for Metaphlan were download from:

		@@ -0,0 +1,995 @@
		# nf-core/taxprofiler

		## Taxonomical profiling using nf-core/taxprofiler pipeline

	## Taxonomical profiling using nf-core/taxprofiler pipeline
	## Taxonomic profiling using nf-core/taxprofiler pipeline

Conversation

apalleja commented Mar 7, 2026

Uh oh!

github-actions bot commented Mar 7, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Mar 24, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 24, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 24, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Copilot AI Mar 24, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 24, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI Mar 24, 2026

Choose a reason for hiding this comment

Uh oh!

enryH left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants