Question regarding JEPA loss from LeJEPA and original JEPA

Hi, @RandallBalestriero 

Thanks for the great work. I'm wondering if there are any specific reasons on why LeJEPA used DINO style transformation for representation consistency. (e.g. I-JEPA, V-JEPA have masked images/videos as target, but LeJEPA doesn't seem to have them according to https://github.com/rbalestr-lab/lejepa/blob/main/MINIMAL.md?) Did you experiment on both and found DINO style (global/local views) augmentation give better performance?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question regarding JEPA loss from LeJEPA and original JEPA #31

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Question regarding JEPA loss from LeJEPA and original JEPA #31

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions