← Models

Octo

Open-Source Generalist Robot Policy — transformer-based diffusion policy for manipulation.

Overview

Octo is a transformer-based diffusion policy trained on 800K robot trajectories from Open X-Embodiment. It supports language commands or goal images, multiple RGB cameras, and various robot arms. Octo-Small (27M) and Octo-Base (93M) can be finetuned in hours on consumer GPUs.

Architecture & Scale

  • Octo-Small 27M params, Octo-Base 93M params
  • 800K trajectories from Open X-Embodiment
  • Language or goal-image conditioning
  • Multi-camera, multi-robot out-of-the-box

Official Links

Citation

See the project site for BibTeX and paper references.