Files
OpenHands/evaluation/discoverybench/eval_utils/README.md
Abhijeetsingh Meena 8857f02083 [Eval] DiscoveryBench OpenHands Integration (#4627)
Signed-off-by: Abhijeetsingh Meena <abhijeet040403@gmail.com>
Co-authored-by: Harshit Surana <surana.h@gmail.com>
2024-11-02 07:24:34 -04:00

477 B

DiscoveryBench Evaluation Utils

  • eval_w_subhypo_gen.py: Implements the DiscoveryBench logic for evaluating agent-generated hypotheses.
  • lm_utils.py: Provides utility functions necessary for the evaluation process.
  • openai_helpers.py: Includes helper functions for OpenAI-related tasks.
  • openai_semantic_gen_prompts.py: Contains prompts used for semantic generation.
  • response_parser.py: Handles the parsing of agent-generated hypotheses.