-
Notifications
You must be signed in to change notification settings - Fork 23
Description
With @sydb, we're trying to debug a problem in the DHQ build where there are many labelled contexts, including contexts for many different languages. When indexing takes place, the "English" context is caught repeatedly and stored in the stem file items multiple times, but the lower-level context (e.g. div[@id='abstract']) do not get noticed, so searching in them fails.
When the language context is redefined as text()[lang('en')], the immediate problem goes away -- the abstract context works OK -- but now the language context no longer works; but this is because the XPath to find that context is not working, so it's not getting in the way.
The root question is: what should happen in the case of nested contexts (a term appears in a structure which has defined contexts at multiple levels). It should "just work", but I don't know if we have given much attention to testing this, so that will be the first thing to do. Another question would be why do we get multiple iterations of the same language context in the stem file; surely those should be unique values.