17 Commits

Author SHA1 Message Date
afourney
7b99b84dab Moved the benchmarks to another repo. (#349) 2024-08-08 11:43:55 -07:00
afourney
365d21b55e Fixed numerous issues with orchestrator, related to 342 (#344)
* Fixed numerous issues with orchestrator, related to 342'

* Refined prompts, and added max wall-clock time to orchestrators.

* Fixed formatting.
2024-08-08 11:15:57 -07:00
afourney
c7f5931dc8 Reverts TeamOne prompts to those used in Autogen experiments. (#331)
* Revised prompts to match autogen experiments.

* Handle sh code blocks

* Move executor prompt into coder.py

* Fixed formatting.
2024-08-07 13:52:45 -07:00
Jack Gerrits
39489ba395 Rename fields in agent metadata (#336)
* Rename fields in agent metadata

* team one fixes

* another fix
2024-08-07 13:08:13 -07:00
afourney
ecde17a255 Re-added multimodality to mdconvert. (#330)
* Re-added multimodality to mdconvert.

* Fixed formatting.
2024-08-07 07:14:47 -07:00
afourney
8b13d59b59 Initial work porting WebArena to async (#325)
* Initial work porting webarena to async

* Perhaps resolved some of the eval() issues.
2024-08-06 16:25:18 -07:00
peterychang
136af65b74 Code execute cancellation (#299)
* Hook cancelation token into code execution

* Add unit test for code cancellation

* actually save the merge
2024-07-29 23:46:04 -07:00
peterychang
070a97ceaa make code execution async (#219)
* make code execution async

* python 3.10 does not support asyncio.timeout()

* make code execution cancellable

* make code execution async

* python 3.10 does not support asyncio.timeout()

* make code execution cancellable

* make entire callstack for code_executor async

* Update python/src/agnext/components/code_executor/_impl/local_commandline_code_executor.py

Co-authored-by: Jack Gerrits <jackgerrits@users.noreply.github.com>

* fix variable description

* remove unnecessary code

* fix usage of execute_code_blocks

* fix usage of execute_code_blocks

---------

Co-authored-by: Jack Gerrits <jackgerrits@users.noreply.github.com>
Co-authored-by: Eric Zhu <ekzhu@users.noreply.github.com>
2024-07-26 22:37:34 +00:00
afourney
61dec0a3ca Added token counting to TeamOne (#264)
* Added token counting to TeamOne
2024-07-26 09:45:41 -07:00
afourney
25b9a91328 Teamone webarena agents fully implemented. (#276)
* Initial onboarding of WebArena
* Migrate to try_get_underlying_agent_instance
* Fixed some race conditions and implemented Deactivate messages.
2024-07-26 09:42:12 -07:00
afourney
b33c9602b2 Migrates GAIA to try_get_underlying_agent_instance (#250) 2024-07-23 17:07:04 -07:00
Jack Gerrits
a52d3bab53 Agent factory can be async (#247) 2024-07-23 14:49:38 -04:00
afourney
211bfa01c3 TeamOne implementation of GAIA (#221)
Port of GAIA benchmark
2024-07-17 09:51:19 -07:00
gagb
9df928b73e Improve printing further (#172)
* Improve pretty printing

* Improve printing further
2024-07-02 18:09:45 -07:00
gagb
e0065cab42 Improve pretty printing (#171) 2024-07-03 00:50:08 +00:00
gagb
bcec0502d7 Numerous fixes for agbench (#170)
* Shift to new runtime API

* Add pretty printing

* Reformat

* Fix linting errors
2024-07-02 23:18:48 +00:00
afourney
ca42b560e9 Initial work porting AutoGenBench to AGNext (#141)
* Initial check-in of agbench

* Moved agbench to intended directory.

* Removed pyautogen dependency

* moved to using process_until_idle

* Added TeamOne template.

* User TeamOne agent classes.

* migrate to hatch, move benchmarks out, add CI (#166)

* Resolve type issues (#168)

Thanks for fixing types.

* Fixed import.

---------

Co-authored-by: Jack Gerrits <jackgerrits@users.noreply.github.com>
2024-07-02 10:58:49 -07:00