Skip to content

rokkian/spacy_for_data_integration

Repository files navigation

SpaCy for data integration

As part of my university course Informative Systems and Semantic Web I developed this project with the aim to test the potentialities of advanced NLP tools (SpaCy, https://spacy.io/) in the field of data integration, especially for solving issues related to:

  • Schema Matching (SM): the columns are seen as unified string texts and the similarity of tables schemas is measured through the similarity of such strings using NLP tecniques.

  • Entity Matching (EM): similarly the rows are seen as unified string texts and the similarity between string-tuples is measured through and NLP similarity measure.

About

Project for testing SpaCy NLP library as a tool for data integration problems

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published