Files
OpenHands/evaluation/mint/config_variables.py
Ryan H. Tran 01296ff79d Add remaining subsets for MINT benchmark (#2142)
* add MMLU subset

* add theoremqa subset

* remove redundant packages from requirements.txt, adjust prompts, handle gpt3.5 propose a wrong answer after a correct answer

* add MBPP subset

* add humaneval subset

* update README

* exit actively after the agent finishes the task
2024-05-31 20:04:13 +00:00

10 lines
433 B
Python

TASK_INFO_MAP = {
# === Reasoning ===
'gsm8k': {'class': 'ReasoningTask', 'type': 'reasoning'},
'math': {'class': 'ReasoningTask', 'type': 'reasoning'},
'mmlu': {'class': 'MultipleChoiceTask', 'type': 'reasoning'},
'theoremqa': {'class': 'TheoremqaTask', 'type': 'reasoning'},
'mbpp': {'class': 'MBPPTask', 'type': 'code_generation'},
'humaneval': {'class': 'HumanEvalTask', 'type': 'code_generation'},
}