Description
Name: Data Analysis Workflow
Acronym: DataAnalysisWorkflow
Proponent(s): Christian Meesters
This pull request introduces a new ontology for defining "Data Analysis Workflows", with initial support for distinguishing between workflow management systems. The aim is to provide a structured framework for representing and relating different types of data analysis workflow, facilitating integration and interoperability in bioinformatics and scientific workflow management. The proposed ontology will let users define a nanopub description for a workflow and, ultimately, let other users create nanopubs whose workflow metadata cites such a workflow nanopub.
I propose a URL like http://purl.org/spar/.../dataanalysisworkflow
Eventually, the subcategories should be:
| Subcategory | Class Name | Label | Definition | Rationale |
|---|---|---|---|---|
| Snakemake Workflow | SnakemakeWorkflow | "Snakemake Workflow" | "A data analysis workflow defined using the Snakemake workflow management system, emphasizing Python-driven, declarative workflow definitions." | Supports workflows built with Snakemake, a popular tool in bioinformatics. |
| Nextflow Workflow | NextflowWorkflow | "Nextflow Workflow" | "A data analysis workflow designed for the Nextflow workflow management system, focusing on cloud-native, portable pipelines." | Represents workflows created with Nextflow, known for scalability and portability. |
| CWL Workflow | CWLWorkflow | "CWL Workflow" | "A data analysis workflow specified using the Common Workflow Language (CWL), enabling platform-independent workflow execution." | Includes workflows defined in CWL, a standardized, community-driven format. |
| Galaxy Workflow | GalaxyWorkflow | "Galaxy Workflow" | "A data analysis workflow developed within the Galaxy platform, emphasizing accessibility, reproducibility, and user-friendly workflow design." | Covers workflows designed in the Galaxy ecosystem, which prioritizes usability and transparency. |
| WDL Workflow | WDLWorkflow | "WDL Workflow" | "A data analysis workflow written in the Workflow Description Language (WDL), commonly used for genomic data analysis and the Broad Institute's Cromwell engine." | Accommodates workflows authored in WDL, particularly relevant to genomic research. |
| Knime Workflow | KnimeWorkflow | "KNIME Workflow" | "A data analysis workflow created using the KNIME Analytics Platform, focusing on graphical, node-based workflow construction for data analytics." | Encompasses workflows built with KNIME, which offers an intuitive, visual workflow development environment. |
| Taverna Workflow | TavernaWorkflow | "Taverna Workflow" | "A data analysis workflow designed in Taverna, emphasizing the composition of web services and automated workflow execution for scientific research." | Includes workflows from Taverna, known for integrating heterogeneous web services. |
| Cromwell Workflow | CromwellWorkflow | "Cromwell Workflow" | "A data analysis workflow optimized for execution on the Cromwell workflow engine, typically leveraging WDL for workflow description." | Represents workflows tailored for Cromwell, highlighting its execution capabilities. |
| Airflow Workflow | AirflowWorkflow | "Airflow Workflow" | "A data analysis workflow managed by Apache Airflow, focusing on programmatically defined workflows for scheduling and monitoring tasks." | Covers workflows designed for Airflow, which excels in scheduling and monitoring data pipelines. |
| Prefect Workflow | PrefectWorkflow | "Prefect Workflow" | "A data analysis workflow orchestrated by Prefect, emphasizing dynamic, scalable workflow execution with a focus on data-intensive tasks." | Includes workflows built with Prefect, a modern platform for scalable data workflow management. |
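
To make the intended hierarchy concrete, here is a minimal sketch of how the table might translate into OWL subclass declarations, written as a small Python script that emits Turtle. The namespace `http://example.org/dataanalysisworkflow#`, the `daw:` prefix, and the helper `to_turtle` are placeholders I made up for illustration, since the final PURL is not decided yet. Definitions are left out to keep it short.

```python
# Hypothetical namespace: the real PURL is not fixed yet, so example.org is a placeholder.
BASE = "http://example.org/dataanalysisworkflow#"

# (class name, label) pairs copied from the table above.
SUBCLASSES = [
    ("SnakemakeWorkflow", "Snakemake Workflow"),
    ("NextflowWorkflow", "Nextflow Workflow"),
    ("CWLWorkflow", "CWL Workflow"),
    ("GalaxyWorkflow", "Galaxy Workflow"),
    ("WDLWorkflow", "WDL Workflow"),
    ("KnimeWorkflow", "KNIME Workflow"),
    ("TavernaWorkflow", "Taverna Workflow"),
    ("CromwellWorkflow", "Cromwell Workflow"),
    ("AirflowWorkflow", "Airflow Workflow"),
    ("PrefectWorkflow", "Prefect Workflow"),
]


def to_turtle(parent="DataAnalysisWorkflow", subclasses=SUBCLASSES):
    """Return a Turtle document declaring the parent class and its subclasses."""
    lines = [
        f"@prefix daw: <{BASE}> .",
        "@prefix owl: <http://www.w3.org/2002/07/owl#> .",
        "@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .",
        "",
        f"daw:{parent} a owl:Class ;",
        '    rdfs:label "Data Analysis Workflow" .',
    ]
    for name, label in subclasses:
        # Each workflow type becomes an owl:Class that is a subclass of the parent.
        lines += [
            "",
            f"daw:{name} a owl:Class ;",
            f"    rdfs:subClassOf daw:{parent} ;",
            f'    rdfs:label "{label}" .',
        ]
    return "\n".join(lines)


if __name__ == "__main__":
    print(to_turtle())
```

Whether the subcategories should be modelled as `owl:Class` subclasses or in some other way is exactly the kind of thing I would like feedback on.
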
I am fairly new to ontologies. If anything is insufficient, incomprehensible, or unacceptable for this standard, please let me know.