Intern Robotics

63 repositories

RoboInter
Public
[ICLR 2026] RoboInter: A Holistic Intermediate Representation Suite Towards Robotic Manipulation
Python • MIT License • 2 • 91 • 0 • 0 • Updated Feb 14, 2026
InternVLA-A1
Public
InternVLA-A1: Unifying Understanding, Generation, and Action for Robotic Manipulation
robotics manipulation vision-language-action-model
Jupyter Notebook • 22 • 337 • 5 • 0 • Updated Feb 14, 2026
internvla-a1.github.io
Public
The webpage of InternVLA-A1
HTML • 0 • 1 • 0 • 0 • Updated Feb 13, 2026
InternVLA-M1
Public
InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy
robotics vision-language-model vision-language-action-model
Python • MIT License • 19 • 371 • 5 • 0 • Updated Feb 11, 2026
InternNav
Public
InternRobotics' open platform for building generalized navigation foundation models.
robotics navigation vla vlm visual-navigation spatial-ai vision-language-navigation mllms spatial-intelligence vision-language-action-model
Jupyter Notebook • MIT License • 77 • 682 • 11 • 1 • Updated Feb 11, 2026
Robo3R
Public
Robo3R: Enhancing Robotic Manipulation with Accurate Feed-Forward 3D Reconstruction
0 • 20 • 1 • 0 • Updated Feb 11, 2026
MMSI-Video-Bench
Public
MMSI-Video-Bench: A Holistic Benchmark for Video-Based Spatial Intelligence
Python • 0 • 55 • 0 • 0 • Updated Feb 10, 2026
MMSI-Bench
Public
[ICLR 2026] MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence
Python • 1 • 77 • 0 • 0 • Updated Feb 10, 2026
PLANING
Public
0 • 2 • 0 • 0 • Updated Jan 30, 2026
internrobotics.github.io
Public
Documentation of Intern Robotics Platform & Toolkits
Python • 6 • 2 • 0 • 1 • Updated Jan 30, 2026
InstructVLA
Public
[ICLR 2026] InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation
Python • 4 • 100 • 1 • 0 • Updated Jan 27, 2026
ARTDECO
Public
[ICLR 2026] ARTDECO: Towards Efficient and High-Fidelity On-the-Fly 3D Reconstruction with Structured Scene Representation
9 • 149 • 1 • 0 • Updated Jan 26, 2026
VLAC
Public
VLAC: A Vision-Language-Action-Critic Model for Robotic Real-World Reinforcement Learning
Python • MIT License • 10 • 274 • 5 • 0 • Updated Jan 23, 2026
GenManip
Public
[CVPR 2025] Official implementation of "GenManip: LLM-driven Simulation for Generalizable Instruction-Following Manipulation"
robotics simulation manipulation isaac-sim
Python • 3 • 143 • 3 • 0 • Updated Jan 15, 2026
G2VLM
Public
G2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning
3d-reconstruction spatial-reasoning mllms spatial-intelligence 3d-llms spatial-understanding
Python • Apache License 2.0 • 7 • 262 • 9 • 0 • Updated Jan 15, 2026
NavDP
Public
Official implementation of the paper: "NavDP: Learning Sim-to-Real Navigation Diffusion Policy with Privileged Information Guidance"
Python • 36 • 530 • 5 • 0 • Updated Jan 12, 2026
CronusVLA
Public
[AAAI26 oral] CronusVLA: Towards Efficient and Robust Manipulation via Multi-Frame Vision-Language-Action Modeling
Python • MIT License • 3 • 88 • 0 • 0 • Updated Jan 11, 2026
VL-LN
Public
VL-LN Bench: Towards Long-horizon Goal-oriented Navigation with Active Dialogs
Python • MIT License • 1 • 48 • 0 • 0 • Updated Jan 5, 2026
F1-VLA
Public
F1: A Vision Language Action Model Bridging Understanding and Generation to Actions
Python • 10 • 160 • 4 • 0 • Updated Jan 2, 2026
AnySplat
Public
[SIGGRAPH Asia 2025 (ACM TOG)] AnySplat: Feed-forward 3D Gaussian Splatting from Unconstrained Views
Python • MIT License • 39 • 729 • 39 • 1 • Updated Dec 22, 2025
internvla-n1-dualvln.github.io
Public
JavaScript • 0 • 0 • 0 • 0 • Updated Dec 10, 2025
MeshCoder
Public
Jupyter Notebook • MIT License • 22 • 435 • 8 • 0 • Updated Dec 8, 2025
EgoThinker
Public
Official implementation of EgoThinker at NIPS 2025
Python • 0 • 24 • 3 • 0 • Updated Nov 25, 2025
EgoHOD
Public
Official implementation of EgoHOD at ICLR 2025; 14 EgoVis Challenge Winners in CVPR 2024
Python • Apache License 2.0 • 1 • 32 • 1 • 0 • Updated Nov 25, 2025
MV-CoLight
Public
[NIPS 2025] MV-CoLight: Efficient Object Compositing with Consistent Lighting and Shadow Generation
Python • MIT License • 2 • 15 • 2 • 0 • Updated Nov 21, 2025
interndata-a1.github.io
Public
HTML • 0 • 1 • 0 • 0 • Updated Nov 20, 2025
StreamVLN
Public
[ICRA 2026] Official implementation of the paper: "StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling"
Python • 27 • 403 • 19 • 1 • Updated Nov 2, 2025
Aether
Public
[ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling
navigation multi-modal video-generation video-prediction embodied-ai visual-planning 4d-reconstruction foundation-models world-model 4d-generation
Python • MIT License • 6 • 571 • 0 • 0 • Updated Oct 26, 2025
internvla-m1.github.io
Public
Astro • 0 • 1 • 0 • 0 • Updated Oct 23, 2025
Humanoid-Goalkeeper
Public
[arxiv 2025] Official implementation of "Humanoid Goalkeeper: Learning from Position Conditioned Task-Motion Constraints"
Python • Other • 7 • 143 • 0 • 0 • Updated Oct 22, 2025