(Optionally) don't index bookids #137

@bmschmidt

@organisciak

The bookid indices take a long time to build on sources like Hathi. They could just be dropped to reduce index creation time; AFAIK, that only requires eliminating this line of code.

https://github.com/Bookworm-project/BookwormDB/blob/master/bookwormDB/CreateDatabase.py#L295
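
One way to make this optional rather than a hard deletion would be a flag on whatever builds the table definition. The sketch below is only an illustration: the function name `bookcounts_ddl`, the table name, the column types, and the key layout are all assumptions, not the actual contents of `CreateDatabase.py`.

```python
def bookcounts_ddl(table="master_bookcounts", index_bookid=True):
    """Return a CREATE TABLE statement, optionally without the bookid-first key.

    Hypothetical schema: the real column types and keys in BookwormDB may differ.
    """
    columns = [
        "bookid MEDIUMINT UNSIGNED NOT NULL",
        "wordid MEDIUMINT UNSIGNED NOT NULL",
        "count MEDIUMINT UNSIGNED NOT NULL",
    ]
    # The wordid-first key is always kept (assumed to serve word lookups).
    keys = ["INDEX (wordid, bookid, count)"]
    if index_bookid:
        # The expensive bookid-first key is only added on request.
        keys.insert(0, "PRIMARY KEY (bookid, wordid, count)")
    body = ",\n  ".join(columns + keys)
    return f"CREATE TABLE IF NOT EXISTS {table} (\n  {body}\n)"


if __name__ == "__main__":
    # Large sources such as Hathi could skip the bookid key entirely.
    print(bookcounts_ddl(index_bookid=False))
```

Defaulting `index_bookid` to `True` would keep existing builds unchanged while letting large sources opt out.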

The only problem I can see is that the creation of the 'nwords' table works from that index, I believe, so 'nwords' might have to be built from the flat files instead. That's not a problem, but it is a little more work.
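
A rough sketch of what building 'nwords' from the flat files could look like, without touching the bookid index. It assumes each flat-file line is tab-separated as `bookid`, `wordid`, `count`; that layout and the function name `nwords_from_flat_file` are assumptions for illustration, not the confirmed BookwormDB format.

```python
from collections import Counter


def nwords_from_flat_file(lines):
    """Sum token counts per bookid from an iterable of flat-file lines.

    Assumed line format (hypothetical): "bookid<TAB>wordid<TAB>count".
    """
    totals = Counter()
    for line in lines:
        line = line.rstrip("\n")
        if not line:
            continue  # skip blank lines
        bookid, _wordid, count = line.split("\t")
        totals[int(bookid)] += int(count)
    return totals


if __name__ == "__main__":
    sample = ["1\t10\t3\n", "1\t11\t2\n", "2\t10\t7\n"]
    for bookid, nwords in sorted(nwords_from_flat_file(sample).items()):
        print(bookid, nwords)
```

Because the function accepts any iterable of lines, an open file handle can be passed directly, so the file is streamed and only the per-book totals are held in memory. The resulting totals could then be bulk-loaded into the 'nwords' table.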
