152 Commits

Author SHA1 Message Date
Copilot
5f1c69d049
Add include_name_in_message parameter to make name field optional in OpenAI messages (#6845)
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
Co-authored-by: ekzhu <320302+ekzhu@users.noreply.github.com>
Co-authored-by: Eric Zhu <ekzhu@users.noreply.github.com>
2025-07-27 00:39:39 -07:00
Copilot
7c536a8c95
Fix OpenAI UnprocessableEntityError when AssistantAgent makes multiple tool calls (#6799)
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
Co-authored-by: ekzhu <320302+ekzhu@users.noreply.github.com>
Co-authored-by: Eric Zhu <ekzhu@users.noreply.github.com>
2025-07-20 00:30:42 -07:00
Ryan Sousa
203323fb3e
feat: add qwen2.5vl support (#6650)
Co-authored-by: Victor Dibia <victordibia@microsoft.com>
Co-authored-by: Eric Zhu <ekzhu@users.noreply.github.com>
2025-07-07 23:13:02 +00:00
Sean Goedecke
02e6574566
Update GitHub Models url to the new url (#6759)
Co-authored-by: Eric Zhu <ekzhu@users.noreply.github.com>
2025-07-07 15:49:41 -07:00
Eric Zhu
89841b6aaf
Add script to automatically generate API documentation and remove hard-coded RST files; fix API docs (#6755) 2025-07-06 18:38:39 -07:00
Z1m4-blu3
aa0d835f4d
Fix function calling support for Llama3.3 (#6750)
## Why are these changes needed?
This PR fixes incorrect model metadata for llama3.3.
The function_calling capability was previously set to False, but
[llama3.3 supports function calling](https://www.llama.com/docs/model-cards-and-prompt-formats/llama3_3/)
with [Ollama](https://ollama.com/library/llama3.3).

## Checks

- [x] I've included any doc changes needed for
<https://microsoft.github.io/autogen/>. See
<https://github.com/microsoft/autogen/blob/main/CONTRIBUTING.md> to
build and test documentation locally.
- [x] I've added tests (if relevant) corresponding to the changes
introduced in this PR.
- [x] I've made sure all auto checks have passed.

Co-authored-by: mh <>
2025-07-06 03:42:59 +00:00
Copilot
c150f85044
Add tool_choice parameter to ChatCompletionClient create and create_stream methods (#6697)
## Summary

Implements the `tool_choice` parameter for `ChatCompletionClient`
interface as requested in #6696. This allows users to restrict which
tools the model can choose from when multiple tools are available.

## Changes

### Core Interface
- Core Interface: Added `tool_choice: Tool | Literal["auto", "required",
"none"] = "auto"` parameter to `ChatCompletionClient.create()` and
`create_stream()` methods
- Model Implementations: Updated client implementations to support the
new parameter; for now, only the following model clients are supported:
  - OpenAI
  - Anthropic
  - Azure AI
  - Ollama
- `LlamaCppChatCompletionClient` is currently not supported

### Features

- `"auto"` (default): Let the model choose whether to use tools; when no
tools are provided, it has no effect.
- `"required"`: Force the model to use at least one tool.
- `"none"`: Disable tool usage completely.
- `Tool` object: Force the model to use a specific tool.
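
A minimal usage sketch of the new parameter (the model name, tool, and API key setup are illustrative placeholders, not part of this PR):

```python
import asyncio

from autogen_core.models import UserMessage
from autogen_core.tools import FunctionTool
from autogen_ext.models.openai import OpenAIChatCompletionClient


async def get_weather(city: str) -> str:
    """Toy tool for illustration only."""
    return f"The weather in {city} is sunny."


async def main() -> None:
    client = OpenAIChatCompletionClient(model="gpt-4o")  # assumes OPENAI_API_KEY is set
    weather_tool = FunctionTool(get_weather, description="Look up the weather for a city.")
    # Force the model to call at least one of the provided tools.
    result = await client.create(
        [UserMessage(content="What's the weather in Paris?", source="user")],
        tools=[weather_tool],
        tool_choice="required",  # or "auto", "none", or weather_tool itself
    )
    print(result.content)


asyncio.run(main())
```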

---------

Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
Co-authored-by: ekzhu <320302+ekzhu@users.noreply.github.com>
Co-authored-by: Eric Zhu <ekzhu@users.noreply.github.com>
2025-06-30 14:15:28 +09:00
David Schmidt
3436ec2ca1
Add support for Gemini 2.5 flash stable (#6692)
As Gemini 2.5 Flash has been released as stable, its model info should be
updated accordingly.

See https://ai.google.dev/gemini-api/docs/models?hl=de#gemini-2.5-flash

## Related issue number

No issue

## Checks

- [x] I've included any doc changes needed for
<https://microsoft.github.io/autogen/>. See
<https://github.com/microsoft/autogen/blob/main/CONTRIBUTING.md> to
build and test documentation locally.
- [x] I've added tests (if relevant) corresponding to the changes
introduced in this PR.
- [x] I've made sure all auto checks have passed.

---------

Co-authored-by: Eric Zhu <ekzhu@users.noreply.github.com>
2025-06-28 01:57:36 +00:00
jeongsu-an
b9c01d0bc1
fix: enable function_calling for o1-2024-12-17 (#6725) 2025-06-27 19:56:27 +00:00
Tejas Dharani
11b7743b7d
Fix completion tokens none issue 6352 (#6665) 2025-06-26 23:26:27 +00:00
peterychang
8a2582c541
SK KernelFunction from ToolSchemas (#6637)
## Why are these changes needed?

Only a subset of the available tools was being sent to SK.

## Related issue number

resolves https://github.com/microsoft/autogen/issues/6582

## Checks

- [ ] I've included any doc changes needed for
<https://microsoft.github.io/autogen/>. See
<https://github.com/microsoft/autogen/blob/main/CONTRIBUTING.md> to
build and test documentation locally.
- [ ] I've added tests (if relevant) corresponding to the changes
introduced in this PR.
- [ ] I've made sure all auto checks have passed.
2025-06-06 10:15:56 -04:00
peterychang
1858799fa6
Parse backtick-enclosed json (#6607)
## Why are these changes needed?

Some models enclose JSON in markdown code blocks.
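
As an illustration, a minimal sketch of the idea (the helper name is hypothetical; the PR's actual parsing logic may differ):

```python
import json
import re
from typing import Any


def parse_possibly_fenced_json(text: str) -> Any:
    """Strip a markdown code fence, if present, before parsing."""
    match = re.search(r"`{3}(?:json)?\s*(.*?)\s*`{3}", text, re.DOTALL)
    if match:
        text = match.group(1)
    return json.loads(text)


print(parse_possibly_fenced_json('```json\n{"answer": 42}\n```'))  # {'answer': 42}
```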

## Related issue number

resolves https://github.com/microsoft/autogen/issues/6599, #6547

## Checks

- [ ] I've included any doc changes needed for
<https://microsoft.github.io/autogen/>. See
<https://github.com/microsoft/autogen/blob/main/CONTRIBUTING.md> to
build and test documentation locally.
- [ ] I've added tests (if relevant) corresponding to the changes
introduced in this PR.
- [ ] I've made sure all auto checks have passed.

---------

Co-authored-by: Victor Dibia <victordibia@microsoft.com>
2025-06-03 18:22:01 +00:00
peterychang
03394a42c0
Default usage statistics for streaming responses (#6578)
## Why are these changes needed?

Enables usage statistics for streaming responses by default.

There is a similar bug in the AzureAI client. Theoretically adding the
parameter
```
model_extras={"stream_options": {"include_usage": True}}
```
should fix the problem, but I'm currently unable to test that workflow.
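
For reference, a hedged sketch of reading usage from a streamed response once this is enabled (model name is a placeholder):

```python
import asyncio

from autogen_core.models import CreateResult, UserMessage
from autogen_ext.models.openai import OpenAIChatCompletionClient


async def main() -> None:
    client = OpenAIChatCompletionClient(model="gpt-4o")  # assumes OPENAI_API_KEY is set
    # The final item yielded by create_stream is a CreateResult; with this
    # change its usage field carries token counts by default.
    async for item in client.create_stream(
        [UserMessage(content="Hello!", source="user")]
    ):
        if isinstance(item, CreateResult):
            print(item.usage)


asyncio.run(main())
```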

## Related issue number

closes https://github.com/microsoft/autogen/issues/6548

## Checks

- [ ] I've included any doc changes needed for
<https://microsoft.github.io/autogen/>. See
<https://github.com/microsoft/autogen/blob/main/CONTRIBUTING.md> to
build and test documentation locally.
- [ ] I've added tests (if relevant) corresponding to the changes
introduced in this PR.
- [ ] I've made sure all auto checks have passed.
2025-05-28 14:32:04 -04:00
Sungjun.Kim
b8d02c9a20
feat: Add missing Anthropic models (Claude Sonnet 4, Claude Opus 4) (#6585)
## Related issue number

Resolves https://github.com/microsoft/autogen/issues/6584

## Checks

- [x] I've included any doc changes needed for
<https://microsoft.github.io/autogen/>. See
<https://github.com/microsoft/autogen/blob/main/CONTRIBUTING.md> to
build and test documentation locally.
- [x] I've added tests (if relevant) corresponding to the changes
introduced in this PR.
- [x] I've made sure all auto checks have passed.
2025-05-23 12:15:12 -07:00
Damien Menigaux
c6c8693b2c
Add gemini 2.5 flash compatibility (#6574)
## Checks

- [ ] I've included any doc changes needed for
<https://microsoft.github.io/autogen/>. See
<https://github.com/microsoft/autogen/blob/main/CONTRIBUTING.md> to
build and test documentation locally.
- [ ] I've added tests (if relevant) corresponding to the changes
introduced in this PR.
- [ ] I've made sure all auto checks have passed.
2025-05-21 19:38:03 +00:00
GeorgiosEfstathiadis
113aca0b81
Allow implicit aws credential setting for AnthropicBedrockChatCompletionClient (#6561)
## Why are these changes needed?

Allows implicit AWS credential setting when using
AnthropicBedrockChatCompletionClient in an instance where you have
already logged into AWS with SSO and credentials are set as environment
variables.

## Related issue number

Closes #6560 

## Checks

- [ ] I've included any doc changes needed for
<https://microsoft.github.io/autogen/>. See
<https://github.com/microsoft/autogen/blob/main/CONTRIBUTING.md> to
build and test documentation locally.
- [ ] I've added tests (if relevant) corresponding to the changes
introduced in this PR.
- [ ] I've made sure all auto checks have passed.

Co-authored-by: Jack Gerrits <jackgerrits@users.noreply.github.com>
2025-05-21 12:49:03 -04:00
Afzal Pawaskar
446da624ac
Fix missing tools in logs (#6532)
Fix for LLMCallEvent failing to log the "tools" passed to
`autogen_ext.models.openai._openai_client.BaseOpenAIChatCompletionClient`.

This bug made it hard to inspect why a certain tool was or was not
selected by the LLM, since the list of tools available to the LLM was not
present in the logs.

## Why are these changes needed?

Added "tools" to the LLMCallEvent to log tools available to the LLM as
these were being missed causing difficulties during debugging LLM tool
calls.

## Related issue number

https://github.com/microsoft/autogen/issues/6531

## Checks

- [x] I've included any doc changes needed for
<https://microsoft.github.io/autogen/>. See
<https://github.com/microsoft/autogen/blob/main/CONTRIBUTING.md> to
build and test documentation locally.
- [x] I've added tests (if relevant) corresponding to the changes
introduced in this PR.
- [x] I've made sure all auto checks have passed.

---------

Co-authored-by: Eric Zhu <ekzhu@users.noreply.github.com>
2025-05-15 17:12:19 -07:00
Chester Hu
9d297318b5
Add Llama API OAI compatible endpoint support (#6442)
## Why are these changes needed?

To add the latest support for using Llama API offerings with AutoGen


## Checks

- [ ] I've included any doc changes needed for
<https://microsoft.github.io/autogen/>. See
<https://github.com/microsoft/autogen/blob/main/CONTRIBUTING.md> to
build and test documentation locally.
- [ ] I've added tests (if relevant) corresponding to the changes
introduced in this PR.
- [ ] I've made sure all auto checks have passed.

---------

Co-authored-by: Eric Zhu <ekzhu@users.noreply.github.com>
2025-05-15 16:57:27 -07:00
Miroslav Pokrovskii
aa22b622d0
feat: add qwen3 support (#6528)
## Why are these changes needed?

Add ollama qwen 3 support
2025-05-14 09:52:13 -07:00
EeS
978cbd2e89
FIX/mistral could not receive name field (#6503)
## Why are these changes needed?
Mistral could not receive the name field, so this adds a model
transformer for Mistral.

## Related issue number
Closes #6147

## Checks

- [ ] I've included any doc changes needed for
<https://microsoft.github.io/autogen/>. See
<https://github.com/microsoft/autogen/blob/main/CONTRIBUTING.md> to
build and test documentation locally.
- [x] I've added tests (if relevant) corresponding to the changes
introduced in this PR.
- [x] I've made sure all auto checks have passed.

Co-authored-by: Eric Zhu <ekzhu@users.noreply.github.com>
2025-05-12 19:32:14 -07:00
Victor Dibia
6427c07f5c
Fix AnthropicBedrockChatCompletionClient import error (#6489)

Some fixes with the AnthropicBedrockChatCompletionClient 

- Ensure `AnthropicBedrockChatCompletionClient` is exported and can be
imported.
- Update the BedrockInfo keys serialization: the client argument can be a
string (similar to the API key) but the exported config should be a
Secret.
- Replace `AnthropicBedrock` with `AsyncAnthropicBedrock`: the client
should be async to work with the AG stack and the BaseAnthropicClient it
inherits from.
- Improve the `AnthropicBedrockChatCompletionClient` docstring to use the
correct client arguments rather than the serialized dict format.

Expected usage:

```python
import asyncio

from autogen_ext.models.anthropic import AnthropicBedrockChatCompletionClient, BedrockInfo
from autogen_core.models import UserMessage, ModelInfo


async def main():
    anthropic_client = AnthropicBedrockChatCompletionClient(
        model="anthropic.claude-3-5-sonnet-20240620-v1:0",
        temperature=0.1,
        model_info=ModelInfo(vision=False, function_calling=True,
                             json_output=False, family="unknown", structured_output=True),
        bedrock_info=BedrockInfo(
            aws_access_key="<aws_access_key>",
            aws_secret_key="<aws_secret_key>",
            aws_session_token="<aws_session_token>",
            aws_region="<aws_region>",
        ),
    )
    # type: ignore
    result = await anthropic_client.create([UserMessage(content="What is the capital of France?", source="user")])
    print(result)

asyncio.run(main())
```

## Related issue number

Closes #6483 

## Checks

- [ ] I've included any doc changes needed for
<https://microsoft.github.io/autogen/>. See
<https://github.com/microsoft/autogen/blob/main/CONTRIBUTING.md> to
build and test documentation locally.
- [ ] I've added tests (if relevant) corresponding to the changes
introduced in this PR.
- [ ] I've made sure all auto checks have passed.

---------

Co-authored-by: Eric Zhu <ekzhu@users.noreply.github.com>
2025-05-10 09:12:02 -07:00
Victor Dibia
1ab24417b8
Add gpt 4o search (#6492)

Add gpt-4o search models to the list of default models:

```python
"gpt-4o-mini-search-preview-2025-03-11": {
        "vision": False,
        "function_calling": True,
        "json_output": True,
        "family": ModelFamily.GPT_4O,
        "structured_output": True,
        "multiple_system_messages": True,
    },
"gpt-4o-search-preview-2025-03-11": {
        "vision": False,
        "function_calling": True,
        "json_output": True,
        "family": ModelFamily.GPT_4O,
        "structured_output": True,
        "multiple_system_messages": True,
    },
```

## Related issue number

Closes #6491

## Checks

- [ ] I've included any doc changes needed for
<https://microsoft.github.io/autogen/>. See
<https://github.com/microsoft/autogen/blob/main/CONTRIBUTING.md> to
build and test documentation locally.
- [ ] I've added tests (if relevant) corresponding to the changes
introduced in this PR.
- [ ] I've made sure all auto checks have passed.
2025-05-08 18:01:38 -07:00
EeS
6fc4f53212
FIX: MultiModalMessage in gemini with openai sdk error occurred (#6440)
## Why are these changes needed?

Multimodal messages fill their content through a separate routine, but
the current `_set_empty_to_whitespace` transformer was still being
applied to them, causing an error.

I checked `multimodal_user_transformer_funcs` and found that in this
routine the content must never be empty, so `_set_empty_to_whitespace` is
now removed for multimodal messages.

## Related issue number

Closes #6439 

## Checks

- [ ] I've included any doc changes needed for
<https://microsoft.github.io/autogen/>. See
<https://github.com/microsoft/autogen/blob/main/CONTRIBUTING.md> to
build and test documentation locally.
- [x] I've added tests (if relevant) corresponding to the changes
introduced in this PR.
- [x] I've made sure all auto checks have passed.

Co-authored-by: Eric Zhu <ekzhu@users.noreply.github.com>
2025-05-01 09:27:31 -07:00
Harini N
a91006cdc2
Adding bedrock chat completion for anthropic models (#6170)
## Why are these changes needed?

Anthropic models are supported by AWS Bedrock. With these changes, a
ChatCompletionClient can be created for Anthropic Bedrock models. This
enables the user to do the following:
- Add any Anthropic model and version available on AWS Bedrock
- Use ChatCompletionClient for Bedrock Anthropic models

## Related issue number
Closes #5226

---------

Co-authored-by: harini.narasimhan <harini.narasimhan@eagleview.com>
Co-authored-by: Eric Zhu <ekzhu@users.noreply.github.com>
2025-04-28 11:56:46 -07:00
Eric Zhu
f059262b6d
Remove name field from OpenAI Assistant Message (#6388)
Resolves #3247
2025-04-24 13:11:40 -07:00
Peter Jausovec
d051da52c3
fix: ollama fails when tools use optional args (#6343)
## Why are these changes needed?
`convert_tools` failed if Optional args were used in tools (the `type`
field doesn't exist in that case and `anyOf` must be used).

This uses the `anyOf` field to pick the first non-null type to use.  
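
A minimal sketch of the fallback, assuming a JSON-schema property dict (the real `convert_tools` logic in the Ollama client may differ): when a tool parameter is Optional, the schema emits `anyOf` instead of `type`, so we pick the first non-null member type.

```python
from typing import Any, Dict


def resolve_type(prop: Dict[str, Any]) -> str:
    """Pick a concrete type for a tool parameter's JSON-schema property."""
    if "type" in prop:
        return prop["type"]
    for option in prop.get("anyOf", []):
        if option.get("type") != "null":
            return option["type"]
    return "string"  # illustrative fallback default


# Optional[int] renders as {"anyOf": [{"type": "integer"}, {"type": "null"}]}
print(resolve_type({"anyOf": [{"type": "integer"}, {"type": "null"}]}))  # integer
```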

## Related issue number

Fixes #6323

## Checks

- [ ] I've included any doc changes needed for
<https://microsoft.github.io/autogen/>. See
<https://github.com/microsoft/autogen/blob/main/CONTRIBUTING.md> to
build and test documentation locally.
- [x] I've added tests (if relevant) corresponding to the changes
introduced in this PR.
- [x] I've made sure all auto checks have passed.

---------

Signed-off-by: Peter Jausovec <peter.jausovec@solo.io>
Co-authored-by: Eric Zhu <ekzhu@users.noreply.github.com>
2025-04-22 00:06:46 +00:00
Peter Jausovec
4d3e47a0f1
fix: ensure serialized messages are passed to LLMStreamStartEvent (#6344)
## Why are these changes needed?


I was getting the following exception when doing tool calls with
Anthropic; the exception was coming from the `__str__` in
`LLMStreamStartEvent`.

```
('Object of type ToolUseBlock is not JSON serializable',)
```

The issue is that when creating the `LLMStreamStartEvent` in
`create_stream`, the messages weren't being serialized first.
## Related issue number

Signed-off-by: Peter Jausovec <peter.jausovec@solo.io>
Co-authored-by: Eric Zhu <ekzhu@users.noreply.github.com>
2025-04-21 11:43:21 -07:00
EeS
1de07ab293
Generalize consecutive SystemMessage merging via model_info["multiple_system_messages"] instead of startswith("gemini-") (#6345)
The current implementation of consecutive `SystemMessage` merging
applies only to models where `model_info.family` starts with
`"gemini-"`.

Since PR #6327 introduced the `multiple_system_messages` field in
`model_info`, we can now generalize this logic by checking whether the
field is explicitly set to `False`.

This change replaces the hardcoded family check with a conditional that
merges consecutive `SystemMessage` blocks whenever
`multiple_system_messages` is set to `False`.
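
A hedged sketch of the merging rule, assuming plain string contents (the real implementation lives inside the model clients):

```python
from typing import List

from autogen_core.models import LLMMessage, SystemMessage


def merge_system_messages(messages: List[LLMMessage]) -> List[LLMMessage]:
    """Merge consecutive SystemMessages when the model only supports one."""
    merged: List[LLMMessage] = []
    for msg in messages:
        if isinstance(msg, SystemMessage) and merged and isinstance(merged[-1], SystemMessage):
            merged[-1] = SystemMessage(content=merged[-1].content + "\n" + msg.content)
        else:
            merged.append(msg)
    return merged
```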

Test cases that previously depended on the `"gemini"` model family have
been updated to reflect this configuration flag, and renamed accordingly
for clarity.

In addition, for consistency across conditional logic, a follow-up PR is
planned to refactor the Claude-specific transformation condition
(currently implemented via `create_args.get("model",
"unknown").startswith("claude-")`)
to instead use the existing `is_claude()`.

Co-authored-by: Eric Zhu <ekzhu@users.noreply.github.com>
2025-04-21 11:30:35 -07:00
EeS
b13264ac60
FEAT: adding multiple_system_message on model_info (#6327)
## Why are these changes needed?
`SocietyOfMindAgent` produces multiple system messages; however, many
clients/models do not support this.

## Related issue number
Related #6290

---------

Co-authored-by: Eric Zhu <ekzhu@users.noreply.github.com>
2025-04-17 22:39:47 -07:00
Eric Zhu
629fb86e96
Add GPT4.1, o4-mini and o3 (#6314) 2025-04-17 01:10:14 +00:00
EeS
b24df29ad0
Fix/transformer aware any modelfamily (#6213)
This PR improves fallback safety when an invalid `model_family` is
supplied to `get_transformer()`. Previously, if a user passed an
arbitrary or incorrect `family` string in `model_info`, the lookup could
fail without falling back to `ModelFamily.UNKNOWN`.

Now, we explicitly check whether `model_family` is a valid value in
`ModelFamily.ANY`. If not, we fallback to `_find_model_family()` as
intended.


## Related issue number

Related: https://github.com/microsoft/autogen/issues/6011#issuecomment-2779957730

## Checks

- [ ] I've included any doc changes needed for
<https://microsoft.github.io/autogen/>. See
<https://github.com/microsoft/autogen/blob/main/CONTRIBUTING.md> to
build and test documentation locally.
- [x] I've added tests (if relevant) corresponding to the changes
introduced in this PR.
- [x] I've made sure all auto checks have passed.

---------

Co-authored-by: Eric Zhu <ekzhu@users.noreply.github.com>
2025-04-05 19:58:16 -07:00
Ardent Illumina
b1ae4ac79e
added: gemini 2.5 pro preview (#6226) 2025-04-06 00:27:56 +00:00
Eric Zhu
d4ac2ca6de
Fix streaming + tool bug in Ollama (#6193)
Fix a bug that caused tool calls to be truncated in
OllamaChatCompletionClient when streaming is on.
2025-04-03 14:56:01 -07:00
Victor Dibia
bd572cc112
Ensure message sent to LLMCallEvent for Anthropic is serializable (#6135)
Messages sent as part of `LLMCallEvent` for Anthropic were not fully serializable.
The example below shows TextBlock and ToolUseBlock objects inside the content of messages; these throw downstream errors in apps like AGS (or event sinks) that expect serializable dicts inside the LLMCallEvent.
```
[
{'role': 'user', 'content': 'What is the weather in New York?'}, 
{'role': 'assistant', 'content': [TextBlock(citations=None, text='I can help you find the weather in New York. Let me check that for you.', type='text'), ToolUseBlock(id='toolu_016W8g55GejYGBzRRrcsnt7M', input={'city': 'New York'}, name='get_weather', type='tool_use')]}, 
{'role': 'user', 'content': [{'type': 'tool_result', 'tool_use_id': 'toolu_016W8g55GejYGBzRRrcsnt7M', 'content': 'The weather in New York is 73 degrees and Sunny.'}]}
]
```
This PR serializes the content of Anthropic messages before they are passed to `LLMCallEvent`:

```
[
{'role': 'user', 'content': 'What is the weather in New York?'}, 
{'role': 'assistant', 'content': [{'citations': None, 'text': 'I can help you find the weather in New York. Let me check that for you.', 'type': 'text'}, {'id': 'toolu_016W8g55GejYGBzRRrcsnt7M', 'input': {'city': 'New York'}, 'name': 'get_weather', 'type': 'tool_use'}]}, 
{'role': 'user', 'content': [{'type': 'tool_result', 'tool_use_id': 'toolu_016W8g55GejYGBzRRrcsnt7M', 'content': 'The weather in New York is 73 degrees and Sunny.'}]}
]
```
2025-04-02 18:01:42 -07:00
EeS
27da37efc0
[Refactor] model family resolution to support non-prefixed names like Mistral (#6158)
This PR improves how model_family is resolved when selecting a
transformer from the registry.
Previously, model families were inferred using a simple prefix-based
match like:
```python
if model.startswith(family): ...
```
This works for cleanly prefixed models (e.g., `gpt-4o`, `claude-3`) but
fails for models like `mistral-large-latest`, `codestral-latest`, etc.,
where prefix-based matching is ambiguous or misleading.

To address this:
- `model_family` can now be passed explicitly (e.g., via `ModelInfo`)
- `_find_model_family()` is only used as a fallback when the value is
`"unknown"`
- Transformer lookup is now more robust and predictable
- Example integration in `to_oai_type()` demonstrates this pattern using
`self._model_info["family"]`

This change is required for safe support of models like Mistral and
other future models that do not follow standard naming conventions.
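
For illustration, a hedged sketch of passing the family explicitly so transformer lookup does not rely on prefix matching (the endpoint, key, and family value are placeholders):

```python
from autogen_core.models import ModelInfo
from autogen_ext.models.openai import OpenAIChatCompletionClient

client = OpenAIChatCompletionClient(
    model="mistral-large-latest",
    base_url="https://api.mistral.ai/v1",  # placeholder OpenAI-compatible endpoint
    api_key="sk-placeholder",
    model_info=ModelInfo(
        vision=False,
        function_calling=True,
        json_output=True,
        family="mistral",  # explicit family; no prefix inference needed
        structured_output=True,
    ),
)
```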

Linked to discussion in
[#6151](https://github.com/microsoft/autogen/issues/6151)
Related : #6011

---------

Co-authored-by: Eric Zhu <ekzhu@users.noreply.github.com>
2025-04-02 22:08:17 +00:00
EeS
9de16d5f70
Fix/anthropic could not end with trailing whitespace at assistant content (#6168)
## Why are these changes needed?

This PR fixes a `400 - invalid_request_error` that occurs when using
Anthropic models and the **final message is from the assistant and ends
with trailing whitespace**.

Example error:

```
Error code: 400 - {'error': {'code': 'invalid_request_error', 'message': 'messages: final assistant content cannot end with trailing whitespace', ...}}
```

To unblock ongoing internal usage, this patch introduces an **ad-hoc
fix** that strips trailing whitespace if the model is Anthropic and the
last message is from the assistant.
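
A minimal sketch of the ad-hoc rule, assuming string content (the actual patch lives in the Anthropic client):

```python
from typing import List

from autogen_core.models import AssistantMessage, LLMMessage


def strip_final_assistant_whitespace(messages: List[LLMMessage]) -> List[LLMMessage]:
    """Strip trailing whitespace from a final assistant message."""
    if messages and isinstance(messages[-1], AssistantMessage):
        last = messages[-1]
        if isinstance(last.content, str):
            messages = messages[:-1] + [
                AssistantMessage(content=last.content.rstrip(), source=last.source)
            ]
    return messages
```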

## Related issue number

Ad-hoc fix for issue discussed here:  
https://github.com/microsoft/autogen/issues/6167

Follow-up structural proposal here:
https://github.com/microsoft/autogen/issues/6167#issuecomment-2768592840
2025-04-02 00:56:08 +00:00
EeS
61ba153614
Doc/modular transform oai (#6149)
This PR adds a module-level docstring to `_message_transform.py`, as
requested in the review for [PR
#6063](https://github.com/microsoft/autogen/pull/6063).

The documentation includes:
- Background and motivation behind the modular transformer design
- Key concepts such as transformer functions, pipelines, and maps
- Examples of how to define, register, and use transformers
- Design principles to guide future contributions and extensions

By embedding this explanation directly into the module, contributors and
maintainers can more easily understand the structure, purpose, and usage
of the transformer pipeline without needing to refer to external
documents.

## Related issue number

Follow-up to [PR #6063](https://github.com/microsoft/autogen/pull/6063)
2025-03-31 06:39:27 +00:00
EeS
fbdd89b46b
[BugFix][Refactor] Modular Transformer Pipeline and Fix Gemini/Anthropic Empty Content Handling (#6063)
## Why are these changes needed?
This change addresses a compatibility issue when using Google Gemini
models with AutoGen. Specifically, Gemini returns a 400 INVALID_ARGUMENT
error when receiving a response with an empty "text" parameter.

The root cause is that Gemini does not accept empty string values (e.g.,
"") as valid inputs in the history of the conversation.

To fix this, if the content field is falsy (e.g., None, "", etc.), it is
explicitly replaced with a single whitespace (" "), which prevents the
Gemini model from rejecting the request.

- **Gemini API compatibility:** Gemini models reject empty assistant
messages (e.g., `""`), causing runtime errors. This PR ensures such
messages are safely replaced with whitespace where appropriate.
- **Avoiding regressions:** Applying the empty content workaround **only
to Gemini**, and **only to valid message types**, avoids breaking OpenAI
or other models.
- **Reducing duplication:** Previously, message transformation logic was
scattered and repeated across different message types and models.
Modularizing this pipeline removes that redundancy.
- **Improved maintainability:** With future model variants likely to
introduce more constraints, this modular structure makes it easier to
adapt transformations without writing ad-hoc code each time.
- **Testing for correctness:** The new structure is verified with tests,
ensuring the bug fix is effective and non-intrusive.

## Summary

This PR introduces a **modular transformer pipeline** for message
conversion and **fixes a Gemini-specific bug** related to empty
assistant message content.

### Key Changes

- **[Refactor]** Extracted message transformation logic into a unified
pipeline to:
  - Reduce code duplication
  - Improve maintainability
  - Simplify debugging and extension for future model-specific logic

- **[BugFix]** Gemini models do not accept empty assistant message
content.
- Introduced `_set_empty_to_whitespace` transformer to replace empty
strings with `" "` only where needed
- Applied it **only** to `"text"` and `"thought"` message types, not to
`"tools"` to avoid serialization errors

- **Improved structure for model-specific handling**
- Transformer functions are now grouped and conditionally applied based
on message type and model family
- This design makes it easier to support future models or combinations
(e.g., Gemini + R1)

- **Test coverage added**
- Added dedicated tests to verify that empty assistant content causes
errors for Gemini
  - Ensured the fix resolves the issue without affecting OpenAI models

---

## Motivation

Originally, Gemini-compatible endpoints would fail when receiving
assistant messages with empty content (`""`).
This issue required special handling without introducing brittle, ad-hoc
patches.

In addressing this, I also saw an opportunity to **modularize** the
message transformation logic across models.
This improves clarity, avoids duplication, and simplifies future
adaptations (e.g., different constraints across model families).

---


## 📘 AutoGen Modular Message Transformer: Design & Usage Guide

This document introduces the **new modular transformer system** used in
AutoGen for converting `LLMMessage` instances to SDK-specific message
formats (e.g., OpenAI-style `ChatCompletionMessageParam`).
The design improves **reusability, extensibility**, and
**maintainability** across different model families.

---

### 🚀 Overview

Instead of scattering model-specific message conversion logic across the
codebase, the new design introduces:

- Modular transformer **functions** for each message type
- Per-model **transformer maps** (e.g., for OpenAI-compatible models)
- Optional **conditional transformers** for multimodal/text hybrid
models
- Clear separation between **message adaptation logic** and
**SDK-specific builder** (e.g., `ChatCompletionUserMessageParam`)

---

### 🧱 1. Define Transform Functions

Each transformer function takes:
- `LLMMessage`: a structured AutoGen message
- `context: dict`: metadata passed through the builder pipeline

And returns:
- A dictionary of keyword arguments for the target message constructor
(e.g., `{"content": ..., "name": ..., "role": ...}`)

```python
def _set_thought_as_content_gemini(message: LLMMessage, context: Dict[str, Any]) -> Dict[str, str | None]:
    assert isinstance(message, AssistantMessage)
    return {"content": message.thought or " "}
```

---

### 🪢 2. Compose Transformer Pipelines

Multiple transformer functions are composed into a pipeline using
`build_transformer_func()`:

```python
base_user_transformer_funcs: List[Callable[[LLMMessage, Dict[str, Any]], Dict[str, Any]]] = [
    _assert_valid_name,
    _set_name,
    _set_role("user"),
]

user_transformer = build_transformer_func(
    funcs=base_user_transformer_funcs,
    message_param_func=ChatCompletionUserMessageParam
)
```

- The `message_param_func` is the actual constructor for the target
message class (usually from the SDK).
- The pipeline is **ordered** — each function adds or overrides keys in
the builder kwargs.

---

### 🗂️ 3. Register Transformer Map

Each model family maintains a `TransformerMap`, which maps `LLMMessage`
types to transformers:

```python
__BASE_TRANSFORMER_MAP: TransformerMap = {
    SystemMessage: system_transformer,
    UserMessage: user_transformer,
    AssistantMessage: assistant_transformer,
}

register_transformer("openai", model_name_or_family, __BASE_TRANSFORMER_MAP)
```

- `"openai"` is currently required (as only OpenAI-compatible format is
supported now).
- Registration ensures AutoGen knows how to transform each message type
for that model.

---

### 🔁 4. Conditional Transformers (Optional)

When message construction depends on runtime conditions (e.g., `"text"`
vs. `"multimodal"`), use:

```python
conditional_transformer = build_conditional_transformer_func(
    funcs_map=user_transformer_funcs_claude,
    message_param_func_map=user_transformer_constructors,
    condition_func=user_condition,
)
```

Where:

- `funcs_map`: maps condition label → list of transformer functions
```python
user_transformer_funcs_claude = {
    "text": text_transformers + [_set_empty_to_whitespace],
    "multimodal": multimodal_transformers + [_set_empty_to_whitespace],
}
```

- `message_param_func_map`: maps condition label → message builder
```python
user_transformer_constructors = {
    "text": ChatCompletionUserMessageParam,
    "multimodal": ChatCompletionUserMessageParam,
}
```

- `condition_func`: determines which transformer to apply at runtime
```python
def user_condition(message: LLMMessage, context: Dict[str, Any]) -> str:
    if isinstance(message.content, str):
        return "text"
    return "multimodal"
```

---

### 🧪 Example Flow

```python
llm_message = AssistantMessage(name="a", thought="let's go")
model_family = "openai"
model_name = "claude-3-opus"

transformer = get_transformer(model_family, model_name, type(llm_message))
sdk_message = transformer(llm_message, context={})
```

---

### 🎯 Design Benefits

| Feature | Benefit |
|--------|---------|
| 🧱 Function-based modular design | Easy to compose and test |
| 🧩 Per-model registry | Clean separation across model families |
| ⚖️ Conditional support | Allows multimodal / dynamic adaptation |
| 🔄 Reuse-friendly | Shared logic (e.g., `_set_name`) is DRY |
| 📦 SDK-specific | Keeps message adaptation aligned to builder interface |

---

### 🔮 Future Direction

- Support more SDKs and formats by introducing new message_param_func
- Global registry integration (currently `"openai"`-scoped)
- Class-based transformer variant if complexity grows



---

## Related issue number
Closes #5762

## Checks

- [ ] I've included any doc changes needed for
<https://microsoft.github.io/autogen/>. See
<https://github.com/microsoft/autogen/blob/main/CONTRIBUTING.md> to
build and test documentation locally.
- [x] I've added tests (if relevant) corresponding to the changes
introduced in this PR.
- [x] I've made sure all auto checks have passed.

---------

Co-authored-by: Eric Zhu <ekzhu@users.noreply.github.com>
2025-03-30 21:09:30 -07:00
EeS
0cd3ff46fa
FIX: Anthropic and Gemini could take multiple system messages (#6118)
The Anthropic SDK cannot take multiple system messages; however, some
AutoGen agents (e.g., SocietyOfMindAgent) produce multiple system
messages.

Gemini with the OpenAI SDK does not raise an error, but multiple system
messages still do not work (only the last one takes effect).

So this change simply merges multiple system messages in these cases.

## Related issue number
Closes #6116
Closes #6117


---------

Co-authored-by: Eric Zhu <ekzhu@users.noreply.github.com>
2025-03-28 09:05:54 -07:00
EeS
2754eda611
FEAT: Add missing OpenAI-compatible models (GPT-4.5, Claude models) (#6120)
This PR adds missing model entries for OpenAI-compatible endpoints,
including gpt-4.5-turbo, gpt-4.5-turbo-preview, and claude-3.5-sonnet.
This improves coverage and avoids potential fallback or mismatch issues
when initializing clients.
2025-03-27 18:39:22 -07:00
Griffin Bassman
7487687cdc
[feat] token-limited message context (#6087) 2025-03-27 13:59:27 -07:00
Eric Zhu
29485ef85b
Fix MCP tool bug by dropping unset parameters from input (#6125)
Resolves #6096

Additionally: make sure MCP errors are formatted correctly, add unit
tests for MCP servers, and upgrade the MCP version.
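
A minimal sketch of the fix's core idea, assuming tool arguments arrive as a plain dict (the real MCP adapter code may differ):

```python
from typing import Any, Dict


def drop_unset_arguments(arguments: Dict[str, Any]) -> Dict[str, Any]:
    """Drop keys left unset (None) so they are not sent to the MCP server."""
    return {key: value for key, value in arguments.items() if value is not None}


print(drop_unset_arguments({"path": "a.txt", "encoding": None}))  # {'path': 'a.txt'}
```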
2025-03-27 13:22:06 -07:00
Jay Prakash Thakur
b5ff7ee355
feat(ollama): Add thought field support and fix LLM control parameters (#6126) 2025-03-26 23:14:26 -07:00
Jack Gerrits
8a5ee3de6a
Add autogen user agent to azure openai requests (#6124) 2025-03-26 16:01:42 -07:00
y26s4824k264
0bec835d59
Emit <think> and </think> around reasoning chunks from model_extras in choices.delta
This makes the behavior of the hosted R1 model the same as the locally hosted R1 model.
Addresses: #5989
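
A hedged sketch of the wrapping rule (the function and its arguments are illustrative; the real change lives in the client's stream handling):

```python
def wrap_reasoning_chunk(chunk: str, is_first: bool, is_last: bool) -> str:
    """Wrap reasoning text from model_extras in <think> tags."""
    out = "<think>" if is_first else ""
    out += chunk
    if is_last:
        out += "</think>"
    return out


print(wrap_reasoning_chunk("check the units", is_first=True, is_last=True))
# <think>check the units</think>
```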
---------

Co-authored-by: Eric Zhu <ekzhu@users.noreply.github.com>
2025-03-25 16:17:53 -07:00
Jay Prakash Thakur
7047fb8b8d
Add support for thought field in AzureAIChatCompletionClient (#6062)
Support was previously added for the thought process in tool calls for
`OpenAIChatCompletionClient`, allowing additional text produced by a
model alongside tool calls to be preserved in the thought field of
`CreateResult`. This PR extends the same functionality to
`AzureAIChatCompletionClient` for consistency across model clients.

#5650
Co-authored-by: Jay Prakash Thakur <jathakur@microsoft.com>
2025-03-24 17:33:10 -07:00
EeS
bca4d7e82f
FIX: Anthropic multimodal(Image) message for Anthropic >= 0.48 aware (#6054)
## Why are these changes needed?
This PR fixes a `TypeError: Cannot instantiate typing.Union` that occurs
when using the `MultimodalWebSurfer_agent` with Anthropic models. The
error was caused by the incorrect usage of `typing.Union` as a class
constructor instead of a type hint within the `_anthropic_client.py`
file. The code was attempting to instantiate `typing.Union`, which is
not allowed. The fix correctly uses `typing.Union` within type hints,
and uses the correct `Base64ImageSourceParam` type. It also updates the
`pyproject.toml` dependency.

## Related issue number
Closes #6035 

## Checks

- [ ] I've included any doc changes needed for
<https://microsoft.github.io/autogen/>. See
<https://github.com/microsoft/autogen/blob/main/CONTRIBUTING.md> to
build and test documentation locally.
- [ ] I've added tests (if relevant) corresponding to the changes
introduced in this PR.
- [x] I've made sure all auto checks have passed.

---------

Co-authored-by: Victor Dibia <victordibia@microsoft.com>
2025-03-22 00:46:55 -07:00
Eric Zhu
a8cef327f1
Support json schema for response format type in OpenAIChatCompletionClient (#5988)
Resolves #5982

This PR adds support for `json_schema` as a `response_format` type in
`OpenAIChatCompletionClient`. This is necessary because it allows the
client to be serialized along with the schema; if the user uses
`response_format=SomeBaseModel`, the client cannot be serialized.

Usage:

```python
# Structured output response, with a pre-defined JSON schema.

OpenAIChatCompletionClient(...,
response_format = {
    "type": "json_schema",
    "json_schema": {
        "name": "name of the schema, must be an identifier.",
        "description": "description for the model.",
        # You can convert a Pydantic (v2) model to JSON schema
        # using the `model_json_schema()` method.
        "schema": "<the JSON schema itself>",
        # Whether to enable strict schema adherence when
        # generating the output. If set to true, the model will
        # always follow the exact schema defined in the
        # `schema` field. Only a subset of JSON Schema is
        # supported when `strict` is `true`.
        # To learn more, read
        # https://platform.openai.com/docs/guides/structured-outputs.
        "strict": False,  # or True
    },
},
)
```
2025-03-18 03:14:42 +00:00
Federico Villa
09d8d344a2
Filter invalid parameters in Ollama client requests (#5983)
Remove unrecognized parameters in Ollama API calls.
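A hedged sketch of the filtering, with an illustrative allow-list (the real client checks against the parameters the Ollama API actually accepts):

```python
import warnings
from typing import Any, Dict

# Illustrative subset; not the client's authoritative list.
VALID_OLLAMA_KEYS = {"model", "messages", "tools", "stream", "format", "options", "keep_alive"}


def filter_create_args(args: Dict[str, Any]) -> Dict[str, Any]:
    """Drop arguments the Ollama API would not recognize, with a warning."""
    dropped = set(args) - VALID_OLLAMA_KEYS
    if dropped:
        warnings.warn(f"Dropping unrecognized Ollama parameters: {sorted(dropped)}")
    return {key: value for key, value in args.items() if key in VALID_OLLAMA_KEYS}
```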
---------

Co-authored-by: Eric Zhu <ekzhu@users.noreply.github.com>
2025-03-17 21:09:26 +00:00
ZakWork
685142cf51
Fix R1 reasoning parser for openai client (#5961)
R1 reasoning tokens from the hosted R1 model were not parsed correctly by the OpenAI client.
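
A minimal sketch of the parsing rule for R1-style output, where reasoning is wrapped in <think> tags (the client's actual parser may differ):

```python
import re
from typing import Optional, Tuple


def split_thought(text: str) -> Tuple[Optional[str], str]:
    """Split R1 output into (reasoning, content)."""
    match = re.match(r"\s*<think>(.*?)</think>\s*(.*)", text, re.DOTALL)
    if match:
        return match.group(1).strip(), match.group(2)
    return None, text


print(split_thought("<think>check units</think>It is 42."))
# ('check units', 'It is 42.')
```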

Resolves #5941

---------

Co-authored-by: Eric Zhu <ekzhu@users.noreply.github.com>
2025-03-17 10:09:41 -07:00