BiG-SCAPE db location error and GCF_id issue #332

Hi,

I'm running into two issues with NPLinker usage and outputs.
I tried to fix the first one myself, but I'm not sure whether my fix led to errors downstream.

**Background:** 

Running NPLinker 2.0.0-beta.1 on Ubuntu 18.04.6.

This is the NPLinker toml file I used (renamed to .txt to be able to upload here).
[nplinker.txt](https://github.com/user-attachments/files/21798933/nplinker.txt)

And this is the python script I used:
[run_nplinker.txt](https://github.com/user-attachments/files/21798960/run_nplinker.txt)

**Errors:**

1. I get the following file-not-found error for BiG-SCAPE:
`FileNotFoundError: [Errno 2] No such file or directory: '/data/sdc/ram/20250715_DFIUM038/neg_mode/bigscape/bigscape_running_output/data_sqlite.db'`
I noticed that the bigscape output created was `bigscape_running_output.db`.
To fix this manually, I renamed it to `data_sqlite.db`.
When I reran the script, it worked and produced outputs (however, I'm not sure if this is the right fix).
[bigscape_error.txt](https://github.com/user-attachments/files/21798970/bigscape_error.txt)


2. When I look at the `genomics_data.tsv` output from `npl.to_tsv(link_graph)`, I see multiple comma-separated GCF_ids in a single row, linked to one product in a given strain_id. As I understand it, there should be one unique GCF_id per row, right?
I'm not sure if something went wrong or if I'm misunderstanding the output.
[genomics_data.tsv](https://github.com/user-attachments/files/21799032/genomics_data.tsv)
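
In case it helps with reproducing, here is a rough sketch of the two manual checks described above, using only the Python standard library. It is not NPLinker code. The helper names `copy_db_to_expected` and `rows_with_multiple_ids`, the column name `gcf_id`, and the paths in the demo are all placeholders I made up for illustration; the real header of `genomics_data.tsv` may differ. Note that the sketch copies the database instead of renaming it, so the file BiG-SCAPE wrote stays untouched.

```python
import csv
import io
import shutil
import tempfile
from pathlib import Path
from typing import Dict, List


def copy_db_to_expected(produced: Path, expected: Path) -> Path:
    """Copy the database that was actually written to the path that was looked up.

    Both paths are supplied by the caller; nothing here knows where
    BiG-SCAPE or NPLinker really put or expect their files.
    """
    if expected.exists():
        return expected  # nothing to do
    if not produced.is_file():
        raise FileNotFoundError(produced)
    expected.parent.mkdir(parents=True, exist_ok=True)
    shutil.copy2(produced, expected)  # copy, keep the original in place
    return expected


def rows_with_multiple_ids(tsv_text: str, column: str) -> List[Dict[str, str]]:
    """Return the rows whose `column` cell holds a comma-separated list."""
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    return [row for row in reader if "," in (row.get(column) or "")]


if __name__ == "__main__":
    # Demo on throwaway files; the names below are placeholders.
    with tempfile.TemporaryDirectory() as tmp:
        produced = Path(tmp) / "bigscape_running_output.db"
        produced.write_bytes(b"dummy")
        expected = Path(tmp) / "bigscape_running_output" / "data_sqlite.db"
        print(copy_db_to_expected(produced, expected).name)

    # Hypothetical two-row table; "gcf_id" is an assumed column name.
    sample = "strain_id\tgcf_id\nS1\t1,2\nS2\t3\n"
    for row in rows_with_multiple_ids(sample, "gcf_id"):
        print(row["strain_id"], row["gcf_id"])
```

Running the second helper on the attached TSV (with the correct column name) would list exactly the rows I'm asking about.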


-Ram
