AutoGPT/benchmark/agbenchmark/challenges/deprecated/d2.1_guided/data.json
merwanehamadi ff4c76ba00 Make agbenchmark a proxy of the evaluated agent (#5279)
Signed-off-by: Merwane Hamadi <merwanehamadi@gmail.com>
2023-09-20 16:06:00 -07:00

{
  "name": "DebugSimpleTypoWithGuidance",
  "category": [
    "code",
    "iterate"
  ],
  "task": "1- Run test.py.\n2- Read sample_code.py.\n3- Modify sample_code.py.\nRepeat step 1, 2 and 3 until test.py runs without errors.\n",
  "dependencies": [
    "ReadFile"
  ],
  "cutoff": 75,
  "ground": {
    "answer": "[0, 1] [2, 5] [0, 3]",
    "should_contain": [
      "[0, 1]",
      "[2, 5]",
      "[0, 3]"
    ],
    "should_not_contain": [],
    "files": [
      "test.py"
    ],
    "eval": {
      "type": "python"
    }
  },
  "info": {
    "difficulty": "novice",
    "description": "Tests the ability of the agent to debug Python code with a simple typo in it.",
    "side_effects": []
  },
  "eval_id": "1ce0ccdd-cbe3-4000-a2a4-86d9c147fcfe"
}
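
The `ground` block above reads as a substring check on the output of `test.py`. The sketch below is one plausible way to apply it. It assumes the output is captured as a single string, that every `should_contain` entry must appear in it, and that no `should_not_contain` entry may appear. The helper name `passes_ground` and the sample outputs are hypothetical and are not taken from agbenchmark, whose actual scoring may differ.

```python
import json


def passes_ground(output: str, ground: dict) -> bool:
    """Hypothetical check: all required substrings present, no forbidden ones."""
    has_required = all(s in output for s in ground.get("should_contain", []))
    has_forbidden = any(s in output for s in ground.get("should_not_contain", []))
    return has_required and not has_forbidden


# The relevant fields of the "ground" block from the challenge definition.
ground = json.loads("""
{
  "should_contain": ["[0, 1]", "[2, 5]", "[0, 3]"],
  "should_not_contain": []
}
""")

# Illustrative outputs only (assumed, not real benchmark logs).
good_output = "[0, 1]\n[2, 5]\n[0, 3]\n"
bad_output = "[0, 1]\n[2, 5]\n"

print(passes_ground(good_output, ground))  # True
print(passes_ground(bad_output, ground))   # False
```

With an empty `should_not_contain`, as in this challenge, only the three required strings decide the result.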