Incremental encoding/decoding slows down string value parsing for fast field. #2806

@ChangRui-Ryan

Is your feature request related to a problem? Please describe.
As I mentioned in my PR #2795, the str-type fast fields in Tantivy use incremental encoding, so retrieving an original string means decoding it from scratch instead of reading it directly, which is very slow. This also hurts any processing of str-type values on top of TopDocs results, especially when the limit is relatively large.
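
To illustrate the cost, here is a minimal sketch of generic incremental (front) encoding. It is not Tantivy's actual on-disk format: `Entry`, `encode`, and `decode` are hypothetical names made up for this example, and real implementations usually restart the encoding at block boundaries so that only part of the dictionary has to be replayed. The point is only that each entry is a delta against its predecessor, so a lookup by ordinal cannot jump straight to the term.

```rust
/// One front-coded entry: how many leading bytes are shared with the
/// previous term, plus the bytes that differ. (Hypothetical layout.)
struct Entry {
    shared: usize,
    suffix: Vec<u8>,
}

/// Front-code a sorted list of terms.
fn encode(terms: &[&str]) -> Vec<Entry> {
    let mut prev: &[u8] = &[];
    let mut out = Vec::with_capacity(terms.len());
    for term in terms {
        let cur = term.as_bytes();
        // Length of the common prefix with the previous term.
        let shared = prev.iter().zip(cur).take_while(|(a, b)| a == b).count();
        out.push(Entry { shared, suffix: cur[shared..].to_vec() });
        prev = cur;
    }
    out
}

/// Recover the term at `ord`. Every entry before it is replayed,
/// because each entry only stores a delta against its predecessor.
fn decode(entries: &[Entry], ord: usize) -> Option<String> {
    if ord >= entries.len() {
        return None;
    }
    let mut buf: Vec<u8> = Vec::new();
    for e in &entries[..=ord] {
        buf.truncate(e.shared); // keep the shared prefix
        buf.extend_from_slice(&e.suffix); // append the new suffix
    }
    String::from_utf8(buf).ok()
}
```

In this sketch, `decode(&entries, ord)` does work proportional to `ord`, and resolving the strings for a large TopDocs result repeats that work for every hit.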

Describe the solution you'd like
Would it be possible to add a configuration parameter that disables incremental encoding and uses only basic dictionary encoding, so that the original string value can be read directly from the dictionary of the str-type fast field? This would be more user-friendly in many scenarios.
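
As a rough sketch of what I mean by basic dictionary encoding: the terms are stored uncompressed with an offset table, so a term ordinal maps to its bytes with two offset reads. `PlainDictionary` and its methods are hypothetical names for illustration only. They are not an existing Tantivy API or a proposed on-disk format.

```rust
/// Plain dictionary: all terms concatenated, plus an offset table with
/// one extra trailing entry so that term `i` is bytes[offsets[i]..offsets[i + 1]].
/// (Hypothetical type, not part of Tantivy.)
struct PlainDictionary {
    bytes: Vec<u8>,
    offsets: Vec<usize>,
}

impl PlainDictionary {
    fn new(terms: &[&str]) -> Self {
        let mut bytes = Vec::new();
        let mut offsets = vec![0];
        for term in terms {
            bytes.extend_from_slice(term.as_bytes());
            offsets.push(bytes.len());
        }
        Self { bytes, offsets }
    }

    /// Direct lookup by ordinal: no earlier term has to be decoded.
    fn get(&self, ord: usize) -> Option<&str> {
        let start = *self.offsets.get(ord)?;
        let end = *self.offsets.get(ord + 1)?;
        std::str::from_utf8(&self.bytes[start..end]).ok()
    }
}
```

The trade-off is size: shared prefixes are stored in full, so the dictionary is larger than a front-coded one, which is why I am asking for this as an opt-in setting rather than a new default.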
