Skip to content

InvalidArgumentError: indices[11,21] = 243838 is not in [0, 243245) #29

@marc88

Description

@marc88

Hello,

I could run training and evaluation portions of the script.
Next, I'd converted Ontonotes 5.0 files into conll format as below (manually):

SOCCER NN I-NP O
- : O O
JAPAN NNP I-NP I-LOC
GET VB I-VP O
LUCKY NNP I-NP O
WIN NNP I-NP O
, , O O
CHINA NNP I-NP I-PER
IN IN I-PP O
SURPRISE DT I-NP O
DEFEAT NN I-NP O

And then I've tried to run it as I would for a CONLL file (since I couldn't successfully extract the Ontonotes file into the CONLL format using skel2conll.sh).
The file was divided into test and train files and then Preprocessed but, I still get an issue here:
InvalidArgumentError (see above for traceback): indices[2,34] = 243461 is not in [0, 243245)

The preprocessing should prepare embeddings on the entire corpus. But, why does it still complain of the words embedding beyond range?

Regards

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions