2 Commits

Author SHA1 Message Date
afourney
af5dcc7fdf
Significant updates to agbench. (#5313)
- Updated HumanEval template to use AgentChat
- Update templates to use config.yaml for model and other configuration
- Read environment from ENV.yaml (ENV.json still supported but
deprecated)
- Temporarily removed WebArena and AssistantBench. Neither had viable
Templates after `autogen_magentic_one` was removed. Templates need to be
update to AgentChat (in a future PR, but this PR is getting big enough
already)
2025-02-07 18:01:44 +00:00
Hussein Mozannar
e11d84b996
Adding Benchmarks to agbench (#3803)
* Move from tomllib to tomli

* added example code for magentic-one + code comments

* adding benchmarks temporarily

* add license for datasets

* revert changes to magentic-one

* change license location

---------

Co-authored-by: Ryan Sweet <rysweet@microsoft.com>
2024-10-18 06:33:33 +02:00