A library for building agentic benchmarks.
Meta
Requires Python: >=3.13
benchmarks