Skip to content

Evaluating on a new benchmark #10

@andimarafioti

Description

@andimarafioti

Hi guys 👋

We are working on a new long-video benchmark and would love to evaluate Long-VITA! We implemented our eval on a fork of lmms-eval, so we can evaluate any model present in the repo, but Long-VITA isn't there. Would you be willing to add it so that we can run the benchmark?
Thank you!

Andi from Hugging face

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions