This repository provides a series of Jupyter notebooks designed for performing federated queries across Wikidata and other linked open data (LOD) endpoints, with a focus on querying bibliographic and author-related data.
These notebooks support our paper From Linked Open Data to Collections as Data: A Reproducible Framework Using Federated Queries and aim to present reproducible examples of query design strategies other than structured real-world use in library and information studies, digital humanities and other domains.
We recommend you to cite the paper the notebooks are supporting, as well as the repository as a whole. The citation for the paper is:
Meltem Dişli, Giulia Osti, Gustavo Candela & Richard L. Zijdeman (2025). From Linked Open Data to Collections as Data: A Reproducible Framework Using Federated Queries. Journal, Volume(Issue), Page range. DOI
You can cite this repository by using the . This DOI represents all releases, and will always resolve to the latest one.
The repository is organized into the following sections:
-
Query Building:
- 4 examples showcasing a query-building workflow for three different endpoints:
- Biblioteca Nacional de España (BNE) Query Building
- Bibliothèque nationale de France (BNF) Query Building
- Biblioteca Virtual Miguel de Cervantes (BVMC) Query Building
- Wikidata_map_viz, an exploratory query visualizing authors' places of birth from the Spanish Golden Age (wd:Q530936).
- 4 examples showcasing a query-building workflow for three different endpoints:
-
Single Author, Multiple Works:
- 2 examples focusing on retrieving multiple works by a single author:
-
Multiple Authors, Multiple Works:
- 1 example demonstrating queries involving multiple authors and their works,
This project includes example Jupyter notebooks that you can run directly in your browser using Binder. Click the Launch Binder button below to start a live session:
Once Binder finishes loading you will see a file browser on the left-hand side that will allow you to navigate all the notebooks contained in this repository.
- To run all cells at once: click “Cell” > “Run All” from the top menu.
- To step through cell-by-cell: use the
▶️ Run button at the top or next to each cell. - Feel free to edit code or markdown cells and re-run them — your changes are temporary and will not be saved once the session ends or times out (will not affect the repository structure).
The SPARQL queries leverage data from the following endpoints:
- Wikidata: https://query.wikidata.org/
- Biblioteca Nacional de España (BNE): http://datos.bne.es/sparql
- Bibliothèque nationale de France (BNF): https://data.bnf.fr/sparql
- Biblioteca Virtual Miguel de Cervantes (BVMC): https://data.cervantesvirtual.com/sparql
Content is licensed under a Creative Commons Attribution 4.0 International license.
