autogen

mirror of https://github.com/microsoft/autogen.git synced 2025-08-04 06:42:35 +00:00

Author	SHA1	Message	Date
Chi Wang	dfcbea9777	seed -> cache_seed (#600 )	2023-11-08 23:39:02 +00:00
Beibin Li	b41b366549	Large Multimodal Models in AgentChat (#554 ) * LMM Code added * LLaVA notebook update * Test cases and Notebook modified for OpenAI v1 * Move LMM into contrib To resolve test issues and deploy issues In the future, we can install pillow by default, and then move back LMM agents into agentchat * LMM test setup update * try...except... clause for LMM tests * disable patch for llava agent test To resolve dependencies issue for build * Add LMM Blog * Change docstring for LMM agents * Docstring update patch * llava: insert reply at position 1 now So, it can still handle human_input_mode and max_consecutive_reply * Resolve comments Fixing: typos, blogs, yml, and add OpenAIWrapper * Signature typo fix for LMM agent: system_message * Update LMM "content" from latest OpenAI release Reference https://platform.openai.com/docs/guides/vision * update LMM test according to latest OpenAI release * Fully support GPT-4V now 1. Add a notebook for GPT-4V. LLava notebook also updated. 2. img_utils updated 3. GPT-4V formatter now return base64 image with mime type 4. Infer mime type directly from b64 image content (while loading without suffix) 5. Test cases modified according to all the related changes. * GPT-4V link updated in blog --------- Co-authored-by: Chi Wang <wang.chi@microsoft.com>	2023-11-06 21:33:51 +00:00
Chi Wang	c4f8b1c761	Dev/v0.2 (#393 ) * api_base -> base_url (#383) * InvalidRequestError -> BadRequestError (#389) * remove api_key_path; close #388 * close #402 (#403) * openai client (#419) * openai client * client test * _client -> client * _client -> client * extra kwargs * Completion -> client (#426) * Completion -> client * Completion -> client * Completion -> client * Completion -> client * support aoai * fix test error * remove commented code * support aoai * annotations * import * reduce test * skip test * skip test * skip test * debug test * rename test * update workflow * update workflow * env * py version * doc improvement * docstr update * openai<1 * add tiktoken to dependency * filter_func * async test * dependency * migration guide (#477) * migration guide * change in kwargs * simplify header * update optigude description * deal with azure gpt-3.5 * add back test_eval_math_responses * timeout * Add back tests for RetrieveChat (#480) * Add back tests for RetrieveChat * Fix format * Update dependencies order * Fix path * Fix path * Fix path * Fix tests * Add not run openai on MacOS or Win * Update skip openai tests * Remove unnecessary dependencies, improve format * Add py3.8 for testing qdrant * Fix multiline error of windows * Add openai tests * Add dependency mathchat, remove unused envs * retrieve chat is tested * bump version to 0.2.0b1 --------- Co-authored-by: Li Jiang <bnujli@gmail.com>	2023-11-04 04:01:49 +00:00
Beibin Li	9932945765	Supporting MultiModal Models: an example with LLaVA Notebook (#286 ) * LMM notebook * Use "register_reply" instead. * Loop check LLaVA non-empty response * Run notebook * Make the llava_call function more flexible * Include API for LLaVA from Replicate * LLaVA data format update x2 1. prompt formater function 2. conversation format with SEP * Coding example added * Rename "ImageAgent" -> "LLaVAAgent" * Docstring and comments updates * Debug notebook: Remote LLaVA tested * Example 1: remove system message * MultimodalConversableAgent added * Add gpt4v_formatter * LLaVA: update example 1 * LLaVA: Add link to "Table of Content"	2023-10-24 05:26:41 +00:00

4 Commits