Sentence 4556 has more than 256 words. Can not handle such long sentence. Please cut it short first! #4

@ajesujoba

I want to create a suffix array index of the source and target sides of my training bitext, but sentences with more than 256 words are rejected with the error above. Is there a way to increase the maximum number of words per sentence to 512 or 1024?
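
Until the limit can be raised, the error message's own suggestion (cut long sentences first) can be applied as a preprocessing step. The following is only a minimal sketch, not part of the tool: it assumes words are whitespace-separated, which may not match how the indexer counts them, and `filter_bitext` and `MAX_WORDS` are hypothetical names. It drops a sentence pair whenever either side exceeds the limit, so source and target stay line-aligned.

```python
from typing import Iterable, Iterator, Tuple

# Limit taken from the error message ("more than 256 words").
# Assumption: the indexer counts whitespace-separated tokens.
MAX_WORDS = 256


def filter_bitext(
    pairs: Iterable[Tuple[str, str]], max_words: int = MAX_WORDS
) -> Iterator[Tuple[str, str]]:
    """Yield only the (source, target) pairs where both sides fit the limit.

    Dropping the whole pair keeps the two sides of the bitext aligned.
    """
    for src, tgt in pairs:
        if len(src.split()) <= max_words and len(tgt.split()) <= max_words:
            yield src, tgt


if __name__ == "__main__":
    sample = [
        ("a short source sentence", "a short target sentence"),
        (" ".join(["w"] * 300), "target for an overlong source"),
    ]
    for src, tgt in filter_bitext(sample):
        print(src, "|||", tgt)
```

Filtering discards training data; splitting long sentences instead would keep it, but needs a way to split both sides consistently.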
