Skip to content

BiG-SCAPE db location error and GCF_id issue #332

@ramanujamr

Description

@ramanujamr

Hi

I'm running into some issues with nplinker usage and outputs.
I have 2 issues, first one I've tried to fix, but not sure if it lead to errors downstream.

Background:

Running NPLinker 2.0.0-beta.1 on Ubuntu 18.04.6.

This is the NPLinker toml file I used (renamed to .txt to be able to upload here).
nplinker.txt

And this is the python script I used:
run_nplinker.txt

Errors:

  1. I get the following file not found error for big scape:
    FileNotFoundError: [Errno 2] No such file or directory: '/data/sdc/ram/20250715_DFIUM038/neg_mode/bigscape/bigscape_running_output/data_sqlite.db'
    I noticed that the bigscape output created was bigscape_running_output.db.
    To fix this manually, I renamed it to `data_sqlite.db'.
    When I reran the script, it worked and produced outputs (however, I'm not sure if it's the right way).
    bigscape_error.txt

  2. When I look at the genomics_data.tsv from npl.to_tsv(link_graph) output, I have multiple comma separated GCF_ids in a single row linked to one product in a given strain_id. As I understand, it should be a unique GCF_id per row right?
    I'm not sure if something went wrong or if I'm not understanding this correctly.
    genomics_data.tsv

-Ram

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    Status

    Backlog

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions