OpenHands/evaluation/discoverybench/eval_utils/README.md at feature/signed-github-auth-cookie

mirror of https://github.com/All-Hands-AI/OpenHands.git synced 2026-04-29 03:00:45 -04:00

Files

Abhijeetsingh Meena 8857f02083 [Eval] DiscoveryBench OpenHands Integration (#4627 )

Signed-off-by: Abhijeetsingh Meena <abhijeet040403@gmail.com>
Co-authored-by: Harshit Surana <surana.h@gmail.com>

2024-11-02 07:24:34 -04:00

DiscoveryBench Evaluation Utils

eval_w_subhypo_gen.py: Implements the DiscoveryBench logic for evaluating agent-generated hypotheses.
lm_utils.py: Provides utility functions necessary for the evaluation process.
openai_helpers.py: Includes helper functions for OpenAI-related tasks.
openai_semantic_gen_prompts.py: Contains prompts used for semantic generation.
response_parser.py: Handles the parsing of agent-generated hypotheses.