mirror of
https://github.com/All-Hands-AI/OpenHands.git
synced 2026-01-09 06:48:02 -05:00
Tau-Bench Evaluation
This directory contains the evaluation scripts for Tau-Bench.
Setup
First, make sure you have installed the tau-bench package:
pip install tau-bench
Running Evaluation
To run the evaluation, use the following command:
python evaluation/benchmarks/tau_bench/run_infer.py \
--agent-cls CodeActAgent \
--llm-config <your_llm_config> \
--env retail