Terms within multiple contexts may not be found #345

@martindholmes

With @sydb, we're trying to debug a problem in the DHQ build, where there are many labelled contexts, including contexts for many different languages. When indexing takes place, the "English" context is caught repeatedly and stored in the stem file items multiple times, but lower-level contexts (e.g. div[@id='abstract']) do not get noticed, so searching in them fails.

When the language context is redefined as text()[lang('en')], the immediate problem goes away and the abstract context works OK, but the language context itself no longer works. That is only because the XPath meant to find the language context is no longer matching, so the language context is simply not getting in the way.

The root question is: what should happen in the case of nested contexts, where a term appears in a structure which has defined contexts at multiple levels? It should "just work", but I don't know if we have given much attention to testing this, so that will be the first thing to do. Another question is why we get multiple instances of the same language context in the stem file; surely those should be unique values.
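
To make the expected behaviour concrete, here is a minimal Python sketch of what "just work" might mean for nested contexts. It is not the staticSearch implementation: the rule table, the function name index_contexts, and the sample document are all hypothetical. It assumes a term belongs to every context whose element encloses it, and that each context label is recorded once per term.

```python
import re
import xml.etree.ElementTree as ET

# ElementTree exposes xml:lang under the XML namespace URI.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"

# Hypothetical context rules: (label, test applied to an element).
CONTEXT_RULES = [
    ("English", lambda el: el.get(XML_LANG) == "en"),
    ("Abstract", lambda el: el.tag == "div" and el.get("id") == "abstract"),
]


def index_contexts(root):
    """Map each lower-cased token to the set of context labels enclosing it."""
    index = {}

    def add_tokens(text, labels):
        for token in re.findall(r"\w+", (text or "").lower()):
            index.setdefault(token, set()).update(labels)

    def walk(el, active):
        # Labels live in a set, so a label inherited from an ancestor and
        # matched again on this element is still stored only once.
        here = active | {label for label, test in CONTEXT_RULES if test(el)}
        add_tokens(el.text, here)
        for child in el:
            walk(child, here)
            # Tail text belongs to the parent element, not to the child.
            add_tokens(child.tail, here)

    walk(root, frozenset())
    return index


# Hypothetical sample: the language context is declared at two levels and
# the abstract context sits between them.
doc = ET.fromstring(
    '<article xml:lang="en">'
    '<div id="abstract"><p xml:lang="en">Nested contexts</p></div>'
    '<div id="body"><p>Other text</p></div>'
    '</article>'
)
index = index_contexts(doc)
print(sorted(index["nested"]))  # ['Abstract', 'English']
print(sorted(index["other"]))   # ['English']
```

Under these assumptions, a term inside the abstract is findable in both contexts, and the repeated xml:lang declaration does not produce a repeated "English" entry. A test document shaped like the sample might be a reasonable starting point for the nested-context tests.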

Metadata

Labels

bug: Something isn't working
release-blocker: The dev branch must not be merged into master while a release-blocker bug is open.
