autogen

mirror of https://github.com/microsoft/autogen.git synced 2025-07-28 11:20:14 +00:00

Author	SHA1	Message	Date
Li Jiang	42b27b9a9d	Add isort (#2265 ) * Add isort * Apply isort on py files * Fix circular import * Fix format for notebooks * Fix format --------- Co-authored-by: Chi Wang <wang.chi@microsoft.com>	2024-04-05 02:26:06 +00:00
Beibin Li	2f109f5f94	Add vision capability (#2025 ) * Add vision capability * Configurate: description_prompt * Print warning instead of raising issues for type * Skip vision capability test if dependencies not installed * Append "vision" to agent's system message when enabled VisionCapability * GPT-4V notebook update with ConversableAgent * Clean GPT-4V notebook * Add vision capability test to workflow * Lint import * Update system message for vision capability * Add a `custom_caption_func` to VisionCapability * Add custom function example for vision capability * Skip test Vision capability custom func * GPT-4V notebook metadata to website * Remove redundant files * The custom caption function takes more inputs now * Add a more complex example of custom caption func * Remove trailing space --------- Co-authored-by: Chi Wang <wang.chi@microsoft.com>	2024-03-24 19:46:55 +00:00
Yiran Wu	c37227bd04	Allow user to pass in a customized speaker selection method (#1791 ) * init PR * update * update code check * update * update * update * update * Test the ability to have agents a,u,t,o,g,e,n speak in turn. * update * update * update * Evidence that groupchat not terminating because of the TERMINATE substring. * Raising NoEligibleSpeakerException allows graceful exit before max turns * update * To confirm with author that custom function is meant to override graph constraints * Confirmed the expected test behaviour with author * Update autogen/agentchat/groupchat.py * update * update --------- Co-authored-by: Joshua Kim <Joshua@spectdata.com> Co-authored-by: Qingyun Wu <qingyun0327@gmail.com>	2024-03-07 02:28:22 +00:00
Beibin Li	9de374a495	Use PIL Image internally for the Multimodal Agent (#1124 ) * Change defualt model for `lmm` * Try to use PIL image for LMM's _oai_messages * Update test cases and llava * Remove redundant files * Update the imports for lmm tests * Test case fix * Docstring update * LMM notebook lint * Typo correction for img_utils and its test * Update test_llava.py debug, reformat --------- Co-authored-by: Chi Wang <wang.chi@microsoft.com> Co-authored-by: Shaokun Zhang <shaokunzhang529@gmail.com> Co-authored-by: Shaokun Zhang <shaokun.zhang@psu.edu>	2024-02-18 15:08:55 +00:00
olgavrou	a911d1c2ec	set use_docker to default to True (#1147 ) * set use_docker to default to true * black formatting * centralize checking and add env variable option * set docker env flag for contrib tests * set docker env flag for contrib tests * better error message and cleanup * disable explicit docker tests * docker is installed so can't check for that in test * pr comments and fix test * rename and fix function descriptions * documentation * update notebooks so that they can be run with change in default * add unit tests for new code * cache and restore env var * skip on windows because docker is running in the CI but there are problems connecting the volume * update documentation * move header * update contrib tests	2024-01-18 17:03:49 +00:00
Davor Runje	1c4ae3d303	nbqa adedd to pre-commit, added black and ruff for notebooks (#1171 ) * nbqa adedd to pre-commit, added black and ruff for notebooks * polishing * polishing * polishing	2024-01-08 03:47:01 +00:00
Davor Runje	8f065e06e4	Add codespell to pre-commit hooks and fix spelling of existing files (#1161 ) * fixed spelling, minor errors and reformatted using black * polishing * added codespell to pre-commit hooks, fixed a number of spelling errors and a few minor bugs in the code * update autogen library version in notebooks * update autogen library version in notebooks * update autogen library version in notebooks * update autogen library version in notebooks * update autogen library version in notebooks	2024-01-07 01:41:33 +00:00
Beibin Li	c19f234149	Message "content" now supports both `str` and `List` in Agents (#713 ) * Change "content" type in Conversable Agent * content and system_message support str and List Update for all other agents * Content_str now also takes None as input * Group Chat now works with LMM too * Style: newline for import in Conversable Agentt * Add test for gourpchat + lmm * Resolve comments 1. Undo AssistantAgent changes 2. Modify the asserts and raises in `content_str` function and update test accordingly. * Undo AssistantAgent * Update comments and add assertion for LMM * Typo fix in docstring for content_str * Remove “None” out conversable_agent.py * Lint message to dict in multimodal_conversable_agent.py * Address lint issues * linting * Move lmm test into contrib test * Resolve 2 comments * Move img_utils into contrib folder * Resolve img_utils path issues	2023-12-03 01:40:50 +00:00
Chi Wang	dfcbea9777	seed -> cache_seed (#600 )	2023-11-08 23:39:02 +00:00
Beibin Li	b41b366549	Large Multimodal Models in AgentChat (#554 ) * LMM Code added * LLaVA notebook update * Test cases and Notebook modified for OpenAI v1 * Move LMM into contrib To resolve test issues and deploy issues In the future, we can install pillow by default, and then move back LMM agents into agentchat * LMM test setup update * try...except... clause for LMM tests * disable patch for llava agent test To resolve dependencies issue for build * Add LMM Blog * Change docstring for LMM agents * Docstring update patch * llava: insert reply at position 1 now So, it can still handle human_input_mode and max_consecutive_reply * Resolve comments Fixing: typos, blogs, yml, and add OpenAIWrapper * Signature typo fix for LMM agent: system_message * Update LMM "content" from latest OpenAI release Reference https://platform.openai.com/docs/guides/vision * update LMM test according to latest OpenAI release * Fully support GPT-4V now 1. Add a notebook for GPT-4V. LLava notebook also updated. 2. img_utils updated 3. GPT-4V formatter now return base64 image with mime type 4. Infer mime type directly from b64 image content (while loading without suffix) 5. Test cases modified according to all the related changes. * GPT-4V link updated in blog --------- Co-authored-by: Chi Wang <wang.chi@microsoft.com>	2023-11-06 21:33:51 +00:00

10 Commits