Robot Learning Benchmarks

Standardized evaluation for robot manipulation — RLBench, LIBERO, CALVIN, and more. Success rates, task completion, evaluation metrics.

Suggested Models & Datasets

Comparable Metrics

Benchmarks are grouped for apples-to-apples performance checks.

Real vs Sim Coverage

Evaluate both controlled and deployment-oriented settings.

Model Mapping

Each benchmark path links to compatible model families.

Execution Support

Support for data capture and evaluation operations when needed.

Need Evaluation or Data for Your Benchmark?

We provide data collection and real-world evaluation support.