    Repositories list

    • RoboInter

      Public
      [ICLR 2026] RoboInter: A Holistic Intermediate Representation Suite Towards Robotic Manipulation
      Python
      29100 Updated Feb 14, 2026
    • InternVLA-A1

      Public
      InternVLA-A1: Unifying Understanding, Generation, and Action for Robotic Manipulation
      Jupyter Notebook
      2233750 Updated Feb 14, 2026
    • The webpage of InternVLA-A1
      HTML
      0100 Updated Feb 13, 2026
    • InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy
      Python
      1937150 Updated Feb 11, 2026
    • InternNav

      Public
      InternRobotics' open platform for building generalized navigation foundation models.
      Jupyter Notebook
      77682111 Updated Feb 11, 2026
    • Robo3R

      Public
      Robo3R: Enhancing Robotic Manipulation with Accurate Feed-Forward 3D Reconstruction
      02010 Updated Feb 11, 2026
    • MMSI-Video-Bench

      Public
      MMSI-Video-Bench: A Holistic Benchmark for Video-Based Spatial Intelligence
      Python
      05500 Updated Feb 10, 2026
    • MMSI-Bench

      Public
      [ICLR 2026] MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence
      Python
      17700 Updated Feb 10, 2026
    • PLANING

      Public
      0200 Updated Jan 30, 2026
    • Documentation of Intern Robotics Platform & Toolkits
      Python
      6201 Updated Jan 30, 2026
    • [ICLR 2026] InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation
      Python
      410010 Updated Jan 27, 2026
    • ARTDECO

      Public
      [ICLR 2026] ARTDECO: Towards Efficient and High-Fidelity On-the-Fly 3D Reconstruction with Structured Scene Representation
      914910 Updated Jan 26, 2026
    • VLAC

      Public
      VLAC: A Vision-Language-Action-Critic Model for Robotic Real-World Reinforcement Learning
      Python
      1027450 Updated Jan 23, 2026
    • GenManip

      Public
      [CVPR 2025] Official implementation of "GenManip: LLM-driven Simulation for Generalizable Instruction-Following Manipulation"
      Python
      314330 Updated Jan 15, 2026
    • G2VLM

      Public
      G2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning
      Python
      726290 Updated Jan 15, 2026
    • NavDP

      Public
      Official implementation of the paper: "NavDP: Learning Sim-to-Real Navigation Diffusion Policy with Privileged Information Guidance"
      Python
      3653050 Updated Jan 12, 2026
    • CronusVLA

      Public
      [AAAI26 oral] CronusVLA: Towards Efficient and Robust Manipulation via Multi-Frame Vision-Language-Action Modeling
      Python
      38800 Updated Jan 11, 2026
    • VL-LN

      Public
      VL-LN Bench: Towards Long-horizon Goal-oriented Navigation with Active Dialogs
      Python
      14800 Updated Jan 5, 2026
    • F1-VLA

      Public
      F1: A Vision Language Action Model Bridging Understanding and Generation to Actions
      Python
      1016040 Updated Jan 2, 2026
    • AnySplat

      Public
      [SIGGRAPH Asia 2025 (ACM TOG)] AnySplat: Feed-forward 3D Gaussian Splatting from Unconstrained Views
      Python
      39729391 Updated Dec 22, 2025
    • JavaScript
      0000 Updated Dec 10, 2025
    • MeshCoder

      Public
      Jupyter Notebook
      2243580 Updated Dec 8, 2025
    • Official implementation of EgoThinker at NIPS 2025
      Python
      02430 Updated Nov 25, 2025
    • EgoHOD

      Public
      Official implementation of EgoHOD at ICLR 2025; 14 EgoVis Challenge Winners in CVPR 2024
      Python
      13210 Updated Nov 25, 2025
    • [NIPS 2025] MV-CoLight: Efficient Object Compositing with Consistent Lighting and Shadow Generation
      Python
      21520 Updated Nov 21, 2025
    • HTML
      0100 Updated Nov 20, 2025
    • StreamVLN

      Public
      [ICRA 2026] Official implementation of the paper: "StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling"
      Python
      27403191 Updated Nov 2, 2025
    • Aether

      Public
      [ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling
      Python
      657100 Updated Oct 26, 2025
    • Astro
      0100 Updated Oct 23, 2025
    • [arxiv 2025] Official implementation of "Humanoid Goalkeeper: Learning from Position Conditioned Task-Motion Constraints"
      Python
      714300 Updated Oct 22, 2025