This PR fixes the `LLMCallEventMessage` class to resolve an
instantiation error that occurs when using custom messages (e.g., via
AgentStudio).
> **Discord Report**
> சravanaன் — 6:40 PM
> _“i updated agentchat and agentcore and tried running the config from
> agentstudio and it is now not running the agent and is throwing error
> `TypeError: Can't instantiate abstract class LLMCallEventMessage with
> abstract methods to_model_message, to_model_text, to_text`”_
The issue stems from `LLMCallEventMessage` being an abstract class that
lacks required methods from `BaseChatMessage`.
This PR implements the missing methods.
Since `LLMCallEventMessage` is intended for **logging/UI use only**, and
not to be sent to LLMs, the `to_model_message()` method raises
`NotImplementedError` by design.
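A hedged sketch of the implemented methods based on this description; import paths and exact signatures are illustrative, not the PR's verbatim code:
```python
from autogen_agentchat.messages import BaseChatMessage
from autogen_core.models import UserMessage


class LLMCallEventMessage(BaseChatMessage):
    content: str

    def to_text(self) -> str:
        # Rendered in console/UI logs.
        return self.content

    def to_model_text(self) -> str:
        return self.content

    def to_model_message(self) -> UserMessage:
        # Logging/UI only: this message type is never sent to an LLM.
        raise NotImplementedError("LLMCallEventMessage cannot be sent to models.")
```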
## Related issue number
Reported in Discord
Closes #6206
Messages sent as part of `LLMCallEvent` for Anthropic were not fully serializable.
The example below shows `TextBlock` and `ToolUseBlock` objects inside the content of messages; these throw downstream errors in apps like AGS (or event sinks) that expect serializable dicts inside the `LLMCallEvent`.
```
[
{'role': 'user', 'content': 'What is the weather in New York?'},
{'role': 'assistant', 'content': [TextBlock(citations=None, text='I can help you find the weather in New York. Let me check that for you.', type='text'), ToolUseBlock(id='toolu_016W8g55GejYGBzRRrcsnt7M', input={'city': 'New York'}, name='get_weather', type='tool_use')]},
{'role': 'user', 'content': [{'type': 'tool_result', 'tool_use_id': 'toolu_016W8g55GejYGBzRRrcsnt7M', 'content': 'The weather in New York is 73 degrees and Sunny.'}]}
]
```
This PR serializes the content of Anthropic messages before they are passed to `LLMCallEvent`:
```
[
{'role': 'user', 'content': 'What is the weather in New York?'},
{'role': 'assistant', 'content': [{'citations': None, 'text': 'I can help you find the weather in New York. Let me check that for you.', 'type': 'text'}, {'id': 'toolu_016W8g55GejYGBzRRrcsnt7M', 'input': {'city': 'New York'}, 'name': 'get_weather', 'type': 'tool_use'}]},
{'role': 'user', 'content': [{'type': 'tool_result', 'tool_use_id': 'toolu_016W8g55GejYGBzRRrcsnt7M', 'content': 'The weather in New York is 73 degrees and Sunny.'}]}
]
```
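A minimal sketch of the serialization step, assuming the SDK content blocks are Pydantic models exposing `model_dump()`; the helper name is illustrative, not the PR's actual code:
```python
from typing import Any, Dict, List


def _serialize_message(message: Dict[str, Any]) -> Dict[str, Any]:
    """Convert SDK content blocks (TextBlock, ToolUseBlock, ...) into plain dicts."""
    content = message.get("content")
    if not isinstance(content, list):
        return message
    serialized: List[Any] = [
        block.model_dump() if hasattr(block, "model_dump") else block for block in content
    ]
    return {**message, "content": serialized}
```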
# Azure AI Search Tool Implementation
This PR adds a new tool for Azure AI Search integration to autogen-ext,
enabling agents to search and retrieve information from Azure AI Search
indexes.
## Why Are These Changes Needed?
AutoGen currently lacks native integration with Azure AI Search, which
is a powerful enterprise search service that supports semantic, vector,
and hybrid search capabilities. This integration enables agents to:
1. Retrieve relevant information from large document collections
2. Perform semantic search with AI-powered ranking
3. Execute vector similarity search using embeddings
4. Combine text and vector approaches for optimal results
This tool complements existing retrieval capabilities and provides a
seamless way to integrate with Azure's search infrastructure.
## Features
- **Multiple Search Types**: Support for text, semantic, vector, and
hybrid search
- **Flexible Configuration**: Customizable search parameters and fields
- **Robust Error Handling**: User-friendly error messages with
actionable guidance
- **Performance Optimizations**: Configurable caching and retry
mechanisms
- **Vector Search Support**: Built-in embedding generation with
extensibility
## Usage Example
```python
from autogen_ext.tools.azure import AzureAISearchTool
from azure.core.credentials import AzureKeyCredential
from autogen import AssistantAgent, UserProxyAgent
# Create the search tool
search_tool = AzureAISearchTool.load_component({
    "provider": "autogen_ext.tools.azure.AzureAISearchTool",
    "config": {
        "name": "DocumentSearch",
        "description": "Search for information in the knowledge base",
        "endpoint": "https://your-service.search.windows.net",
        "index_name": "your-index",
        "credential": {"api_key": "your-api-key"},
        "query_type": "semantic",
        "semantic_config_name": "default"
    }
})

# Create an agent with the search tool
assistant = AssistantAgent(
    "assistant",
    llm_config={"tools": [search_tool]}
)

# Create a user proxy agent
user_proxy = UserProxyAgent(
    "user_proxy",
    human_input_mode="TERMINATE",
    max_consecutive_auto_reply=10,
    code_execution_config={"work_dir": "coding"}
)

# Start the conversation
user_proxy.initiate_chat(
    assistant,
    message="What information do we have about quantum computing in our knowledge base?"
)
```
## Testing
- Added unit tests for all search types (text, semantic, vector, hybrid)
- Added tests for error handling and cancellation
- All tests pass locally
## Documentation
- Added comprehensive docstrings with examples
- Included warnings about placeholder embedding implementation
- Added links to Azure AI Search documentation
## Related issue number
Closes #5419
## Checks
- [x] I've included any doc changes needed for
<https://microsoft.github.io/autogen/>. See
<https://github.com/microsoft/autogen/blob/main/CONTRIBUTING.md> to
build and test documentation locally.
- [x] I've added tests (if relevant) corresponding to the changes
introduced in this PR.
- [x] I've made sure all auto checks have passed.
---------
Co-authored-by: Eric Zhu <ekzhu@users.noreply.github.com>
Just a simple change.
```python
messages: List[LLMMessage] = [UserMessage(content="Call the pass tool with input 'task'", source="user")]
```
to
```python
messages: List[LLMMessage] = [UserMessage(content="Call the pass tool with input 'task' and talk result", source="user")]
```
With this change, Claude (Anthropic) models now pass the
`test_model_client_with_function_calling` test case, which they
previously could not. Before this fix, Claude failed to interpret the
intent correctly; now it can infer both tool usage and the follow-up
generation.
This change is backward-compatible with other models (e.g., GPT-4) and
improves cross-model consistency for function-calling tests.
This PR improves how model_family is resolved when selecting a
transformer from the registry.
Previously, model families were inferred using a simple prefix-based
match like:
```python
if model.startswith(family): ...
```
This works for cleanly prefixed models (e.g., `gpt-4o`, `claude-3`) but
fails for models like `mistral-large-latest`, `codestral-latest`, etc.,
where prefix-based matching is ambiguous or misleading.
To address this:
- `model_family` can now be passed explicitly (e.g., via `ModelInfo`)
- `_find_model_family()` is only used as a fallback when the value is
`"unknown"`
- Transformer lookup is now more robust and predictable
- Example integration in `to_oai_type()` demonstrates this pattern using
`self._model_info["family"]`
This change is required for safe support of models like Mistral and
other future models that do not follow standard naming conventions.
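A hedged sketch of the resolution order described above; the wrapper shape and the family list are illustrative, not the PR's actual code:
```python
def resolve_model_family(model: str, model_info: dict) -> str:
    """Prefer an explicitly provided family; fall back to prefix matching."""
    family = model_info.get("family", "unknown")
    if family != "unknown":
        return family  # explicit value from ModelInfo wins
    # Fallback: the old prefix-based inference.
    for known_family in ("gpt-4o", "gpt-4", "claude-3", "gemini-1.5"):
        if model.startswith(known_family):
            return known_family
    return "unknown"
```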
Linked to discussion in
[#6151](https://github.com/microsoft/autogen/issues/6151)
Related : #6011
---------
Co-authored-by: Eric Zhu <ekzhu@users.noreply.github.com>
The initializer for ACADynamicSessionsCodeExecutor creates a new GUID to
use as the session ID for dynamic sessions.
In some scenarios it is desirable to be able to re-create the agent
group chat from saved state. In this case, the
ACADynamicSessionsCodeExecutor needs to be associated with a previous
instance (so that any execution state is still valid)
This PR adds a new argument to the initializer to allow a session ID to
be passed in (defaulting to the current behaviour of creating a GUID if
absent).
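Illustrative usage under the new argument; the `session_id` name follows this description, while the other constructor values and the saved-state dict are assumptions:
```python
from azure.identity import DefaultAzureCredential
from autogen_ext.code_executors.azure import ACADynamicSessionsCodeExecutor

saved_state = {"code_session_id": "9f5b..."}  # hypothetical saved app state
executor = ACADynamicSessionsCodeExecutor(
    pool_management_endpoint="https://<region>.dynamicsessions.io/...",
    credential=DefaultAzureCredential(),
    session_id=saved_state["code_session_id"],  # reuse instead of a fresh GUID
)
```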
Closes #6119
---------
Co-authored-by: Eric Zhu <ekzhu@users.noreply.github.com>
## Why are these changes needed?
This PR fixes a `400 - invalid_request_error` that occurs when using
Anthropic models and the **final message is from the assistant and ends
with trailing whitespace**.
Example error:
```
Error code: 400 - {'error': {'code': 'invalid_request_error', 'message': 'messages: final assistant content cannot end with trailing whitespace', ...}}
```
To unblock ongoing internal usage, this patch introduces an **ad-hoc
fix** that strips trailing whitespace if the model is Anthropic and the
last message is from the assistant.
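A minimal sketch of the strip, assuming messages in Anthropic's dict format; the helper name is illustrative:
```python
from typing import Any, Dict, List


def _strip_trailing_whitespace(messages: List[Dict[str, Any]]) -> List[Dict[str, Any]]:
    if messages and messages[-1].get("role") == "assistant":
        content = messages[-1].get("content")
        if isinstance(content, str):
            messages[-1] = {**messages[-1], "content": content.rstrip()}
    return messages
```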
## Related issue number
Ad-hoc fix for issue discussed here:
https://github.com/microsoft/autogen/issues/6167
Follow-up structural proposal here:
https://github.com/microsoft/autogen/issues/6167#issuecomment-2768592840
Resolves #5851
* Added a `GroupChatError` event type; a run is now terminated when an
error occurs in either a participant or the group chat manager
* Raise a `RuntimeError` from the error message within the group chat run
Resolves #5934
This PR adds ability for `AssistantAgent` to generate a
`StructuredMessage[T]` where `T` is the content type in base model.
How to use?
```python
from typing import Literal

from pydantic import BaseModel

from autogen_agentchat.agents import AssistantAgent
from autogen_agentchat.messages import StructuredMessage
from autogen_agentchat.ui import Console
from autogen_ext.models.openai import OpenAIChatCompletionClient


# The response format for the agent as a Pydantic base model.
class AgentResponse(BaseModel):
    thoughts: str
    response: Literal["happy", "sad", "neutral"]


# Create an agent that uses the OpenAI GPT-4o model which supports structured output.
model_client = OpenAIChatCompletionClient(model="gpt-4o")
agent = AssistantAgent(
    "assistant",
    model_client=model_client,
    system_message="Categorize the input as happy, sad, or neutral following the JSON format.",
    # Setting the output format to AgentResponse to force the agent to produce a JSON string as response.
    output_content_type=AgentResponse,
)

result = await Console(agent.run_stream(task="I am happy."))

# Check the last message in the result, validate its type, and print the thoughts and response.
assert isinstance(result.messages[-1], StructuredMessage)
assert isinstance(result.messages[-1].content, AgentResponse)
print("Thought: ", result.messages[-1].content.thoughts)
print("Response: ", result.messages[-1].content.response)
await model_client.close()
```
```
---------- user ----------
I am happy.
---------- assistant ----------
{
  "thoughts": "The user explicitly states they are happy.",
  "response": "happy"
}
Thought: The user explicitly states they are happy.
Response: happy
```
---------
Co-authored-by: Victor Dibia <victordibia@microsoft.com>
## Why are these changes needed?
Changed default working directory of code executors, from the current
directory `"."` to Python's
[`tempfile`](https://docs.python.org/3/library/tempfile.html#tempfile.TemporaryDirectory).
These changes simplify file cleanup and prevent the model from accessing
code files or other sensitive data that should not be accessible.
Changes made:
- The default `work_dir` parameter in code executors is changed to
`None`; when the `start()` method is invoked, if no `work_dir` was
specified (`None`), a temporary directory is created.
- The `start()` and `stop()` methods of code executors handle the
creation and cleanup of the working directory for the default temporary
directory (see the sketch after this list).
- For maintaining backward compatibility:
- A `DeprecationWarning` is emitted when the current dir, `"."`, is used
as `work_dir`, as it is in the current code executor implementation. The
deprecation warning is tested in `test_deprecated_warning()`.
- For existing implementations that do not call the `start()` method and
do not specify a `work_dir`, the executors will continue using the
current directory `"."` as the working directory, maintaining backward
compatibility.
- Updated test suites:
- Added tests to confirm that by default code executors use a temporary
directory as their working directory: `test_default_work_dir_is_temp()`;
- Implemented test to ensure that a `DeprecationWarning` is raised when
the current directory is used as the default directory:
`test_deprecated_warning()`;
- Added tests to ensure that errors arise when invalid paths (paths
that don't exist or that the user lacks permissions for) are provided:
`test_error_wrong_path()`.
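A minimal usage sketch of the new default behavior, assuming the local command-line executor; other executors follow the same `start()`/`stop()` pattern:
```python
import asyncio

from autogen_ext.code_executors.local import LocalCommandLineCodeExecutor


async def main() -> None:
    executor = LocalCommandLineCodeExecutor()  # work_dir=None by default
    await executor.start()  # a temporary working directory is created here
    # ... execute code blocks ...
    await executor.stop()  # the temporary directory is cleaned up


asyncio.run(main())
```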
Feel free to suggest any additions or improvements!
## Related issue number
Closes #6041
## Checks
- [x] I've included any doc changes needed for
<https://microsoft.github.io/autogen/>. See
<https://github.com/microsoft/autogen/blob/main/CONTRIBUTING.md> to
build and test documentation locally.
- [x] I've added tests (if relevant) corresponding to the changes
introduced in this PR.
- [x] I've made sure all auto checks have passed.
---------
Co-authored-by: Eric Zhu <ekzhu@users.noreply.github.com>
This PR adds a module-level docstring to `_message_transform.py`, as
requested in the review for [PR
#6063](https://github.com/microsoft/autogen/pull/6063).
The documentation includes:
- Background and motivation behind the modular transformer design
- Key concepts such as transformer functions, pipelines, and maps
- Examples of how to define, register, and use transformers
- Design principles to guide future contributions and extensions
By embedding this explanation directly into the module, contributors and
maintainers can more easily understand the structure, purpose, and usage
of the transformer pipeline without needing to refer to external
documents.
## Related issue number
Follow-up to [PR #6063](https://github.com/microsoft/autogen/pull/6063)
## Why are these changes needed?
This change addresses a compatibility issue when using Google Gemini
models with AutoGen. Specifically, Gemini returns a 400 INVALID_ARGUMENT
error when receiving a response with an empty "text" parameter.
The root cause is that Gemini does not accept empty string values (e.g.,
"") as valid inputs in the history of the conversation.
To fix this, if the content field is falsy (e.g., None, "", etc.), it is
explicitly replaced with a single whitespace (" "), which prevents the
Gemini model from rejecting the request.
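A minimal sketch of this replacement, written in the transformer style introduced later in this description; the actual definition in the PR may differ:
```python
from typing import Any, Dict

from autogen_core.models import LLMMessage


def _set_empty_to_whitespace(message: LLMMessage, context: Dict[str, Any]) -> Dict[str, Any]:
    content = getattr(message, "content", None)
    # Gemini rejects empty string content; substitute a single space.
    return {"content": content if content else " "}
```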
- **Gemini API compatibility:** Gemini models reject empty assistant
messages (e.g., `""`), causing runtime errors. This PR ensures such
messages are safely replaced with whitespace where appropriate.
- **Avoiding regressions:** Applying the empty content workaround **only
to Gemini**, and **only to valid message types**, avoids breaking OpenAI
or other models.
- **Reducing duplication:** Previously, message transformation logic was
scattered and repeated across different message types and models.
Modularizing this pipeline removes that redundancy.
- **Improved maintainability:** With future model variants likely to
introduce more constraints, this modular structure makes it easier to
adapt transformations without writing ad-hoc code each time.
- **Testing for correctness:** The new structure is verified with tests,
ensuring the bug fix is effective and non-intrusive.
## Summary
This PR introduces a **modular transformer pipeline** for message
conversion and **fixes a Gemini-specific bug** related to empty
assistant message content.
### Key Changes
- **[Refactor]** Extracted message transformation logic into a unified
pipeline to:
- Reduce code duplication
- Improve maintainability
- Simplify debugging and extension for future model-specific logic
- **[BugFix]** Gemini models do not accept empty assistant message
content.
- Introduced `_set_empty_to_whitespace` transformer to replace empty
strings with `" "` only where needed
- Applied it **only** to `"text"` and `"thought"` message types, not to
`"tools"` to avoid serialization errors
- **Improved structure for model-specific handling**
- Transformer functions are now grouped and conditionally applied based
on message type and model family
- This design makes it easier to support future models or combinations
(e.g., Gemini + R1)
- **Test coverage added**
- Added dedicated tests to verify that empty assistant content causes
errors for Gemini
- Ensured the fix resolves the issue without affecting OpenAI models
---
## Motivation
Originally, Gemini-compatible endpoints would fail when receiving
assistant messages with empty content (`""`).
This issue required special handling without introducing brittle, ad-hoc
patches.
In addressing this, I also saw an opportunity to **modularize** the
message transformation logic across models.
This improves clarity, avoids duplication, and simplifies future
adaptations (e.g., different constraints across model families).
---
## 📘 AutoGen Modular Message Transformer: Design & Usage Guide
This document introduces the **new modular transformer system** used in
AutoGen for converting `LLMMessage` instances to SDK-specific message
formats (e.g., OpenAI-style `ChatCompletionMessageParam`).
The design improves **reusability, extensibility**, and
**maintainability** across different model families.
---
### 🚀 Overview
Instead of scattering model-specific message conversion logic across the
codebase, the new design introduces:
- Modular transformer **functions** for each message type
- Per-model **transformer maps** (e.g., for OpenAI-compatible models)
- Optional **conditional transformers** for multimodal/text hybrid
models
- Clear separation between **message adaptation logic** and
**SDK-specific builder** (e.g., `ChatCompletionUserMessageParam`)
---
### 🧱 1. Define Transform Functions
Each transformer function takes:
- `LLMMessage`: a structured AutoGen message
- `context: dict`: metadata passed through the builder pipeline
And returns:
- A dictionary of keyword arguments for the target message constructor
(e.g., `{"content": ..., "name": ..., "role": ...}`)
```python
def _set_thought_as_content_gemini(message: LLMMessage, context: Dict[str, Any]) -> Dict[str, str | None]:
    assert isinstance(message, AssistantMessage)
    return {"content": message.thought or " "}
```
---
### 🪢 2. Compose Transformer Pipelines
Multiple transformer functions are composed into a pipeline using
`build_transformer_func()`:
```python
base_user_transformer_funcs: List[Callable[[LLMMessage, Dict[str, Any]], Dict[str, Any]]] = [
    _assert_valid_name,
    _set_name,
    _set_role("user"),
]

user_transformer = build_transformer_func(
    funcs=base_user_transformer_funcs,
    message_param_func=ChatCompletionUserMessageParam,
)
```
- The `message_param_func` is the actual constructor for the target
message class (usually from the SDK).
- The pipeline is **ordered** — each function adds or overrides keys in
the builder kwargs.
---
### 🗂️ 3. Register Transformer Map
Each model family maintains a `TransformerMap`, which maps `LLMMessage`
types to transformers:
```python
__BASE_TRANSFORMER_MAP: TransformerMap = {
    SystemMessage: system_transformer,
    UserMessage: user_transformer,
    AssistantMessage: assistant_transformer,
}

register_transformer("openai", model_name_or_family, __BASE_TRANSFORMER_MAP)
```
- `"openai"` is currently required (as only OpenAI-compatible format is
supported now).
- Registration ensures AutoGen knows how to transform each message type
for that model.
---
### 🔁 4. Conditional Transformers (Optional)
When message construction depends on runtime conditions (e.g., `"text"`
vs. `"multimodal"`), use:
```python
conditional_transformer = build_conditional_transformer_func(
    funcs_map=user_transformer_funcs_claude,
    message_param_func_map=user_transformer_constructors,
    condition_func=user_condition,
)
```
Where:
- `funcs_map`: maps condition label → list of transformer functions
```python
user_transformer_funcs_claude = {
    "text": text_transformers + [_set_empty_to_whitespace],
    "multimodal": multimodal_transformers + [_set_empty_to_whitespace],
}
```
- `message_param_func_map`: maps condition label → message builder
```python
user_transformer_constructors = {
    "text": ChatCompletionUserMessageParam,
    "multimodal": ChatCompletionUserMessageParam,
}
```
- `condition_func`: determines which transformer to apply at runtime
```python
def user_condition(message: LLMMessage, context: Dict[str, Any]) -> str:
    if isinstance(message.content, str):
        return "text"
    return "multimodal"
```
---
### 🧪 Example Flow
```python
llm_message = AssistantMessage(name="a", thought="let’s go")
model_family = "openai"
model_name = "claude-3-opus"
transformer = get_transformer(model_family, model_name, type(llm_message))
sdk_message = transformer(llm_message, context={})
```
---
### 🎯 Design Benefits
| Feature | Benefit |
|---------|---------|
| 🧱 Function-based modular design | Easy to compose and test |
| 🧩 Per-model registry | Clean separation across model families |
| ⚖️ Conditional support | Allows multimodal / dynamic adaptation |
| 🔄 Reuse-friendly | Shared logic (e.g., `_set_name`) is DRY |
| 📦 SDK-specific | Keeps message adaptation aligned to builder interface |
---
### 🔮 Future Direction
- Support more SDKs and formats by introducing new message_param_func
- Global registry integration (currently `"openai"`-scoped)
- Class-based transformer variant if complexity grows
---
## Related issue number
Closes #5762
## Checks
- [ ] I've included any doc changes needed for
<https://microsoft.github.io/autogen/>. See
<https://github.com/microsoft/autogen/blob/main/CONTRIBUTING.md> to
build and test documentation locally.
- [x] I've added tests (if relevant) corresponding to the changes
introduced in this PR.
- [x] I've made sure all auto checks have passed.
---------
Co-authored-by: Eric Zhu <ekzhu@users.noreply.github.com>
Rename the `ChatMessage` and `AgentEvent` base classes to `BaseChatMessage` and `BaseAgentEvent`.
Bring back the `ChatMessage` and `AgentEvent` as unions of built-in concrete types to avoid breaking existing applications that depend on Pydantic serialization.
Why?
Much existing code uses containers like this:
```python
class AppMessage(BaseModel):
    name: str
    message: ChatMessage

# Serialization:
m = AppMessage(...)
m.model_dump_json()
# Fields like HandoffMessage.target will be lost because it is now treated as a base class without content or target fields.
```
The assumption that `ChatMessage` or `AgentEvent` is a union of concrete types could be baked into many existing code bases. So this PR brings back the union types, while keeping method type hints, such as those on `on_messages`, on the `BaseChatMessage` and `BaseAgentEvent` base classes for flexibility.
Token-limited model context is currently broken because it imports
from extensions.
This fix removes the imports and updates the model context
implementation to use the model client directly.
In the future, the model client's token counting should cache results
from the model API to provide accurate counting.
The Anthropic SDK cannot accept multiple system messages, but some
AutoGen agents (e.g., `SocietyOfMindAgent`) produce multiple system
messages. Gemini via the OpenAI SDK does not raise an error, but
multiple system messages do not work there either (only the last one
takes effect). So this PR makes a simple change: merge multiple system
messages into one in these cases.
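A hedged sketch of the merge, assuming autogen-core message types; the function name is illustrative:
```python
from typing import List

from autogen_core.models import LLMMessage, SystemMessage


def merge_system_messages(messages: List[LLMMessage]) -> List[LLMMessage]:
    system_texts = [m.content for m in messages if isinstance(m, SystemMessage)]
    others = [m for m in messages if not isinstance(m, SystemMessage)]
    if not system_texts:
        return others
    # Collapse all system messages into one, placed first.
    return [SystemMessage(content="\n".join(system_texts))] + others
```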
## Related issue number
Closes #6116, closes #6117
---------
Co-authored-by: Eric Zhu <ekzhu@users.noreply.github.com>
When using the `ACADynamicSessionsCodeExecutor` it includes the stdout
from the execution but also the `results` property from the call to
dynamic sessions. In some situations, when the executed code results in
a file being saved this is included in the result:
```console
Plot saved as 'results_by_date.png'
{'type': 'image', 'format': 'png', 'base64_data': 'iVBORw0KGgoAAAANSUhEUgAAA90AAAJOCAYAAACqS2TfAAAAOXRFWHRTb2Z0d2FyZQ...
```
In some situations, this additional output is not desirable:
- when displaying the code output to a user - in this case, the stdout
content is dwarfed by the base64 encoded file content
- when an LLM agent is going to evaluate the code output to determine
next steps - in this case, the base64 content will be included in the
message history sent to the LLM increasing the prompt token cost
To handle these cases, this PR adds a new (optional) argument to the
`ACADynamicSessionsCodeExecutor` constructor that allows suppressing
the result content (defaulting to `False` to preserve the current
behaviour).
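Illustrative usage of the new flag; the parameter name shown here is an assumption based on the description, not necessarily the final API:
```python
from azure.identity import DefaultAzureCredential
from autogen_ext.code_executors.azure import ACADynamicSessionsCodeExecutor

executor = ACADynamicSessionsCodeExecutor(
    pool_management_endpoint="https://<region>.dynamicsessions.io/...",
    credential=DefaultAzureCredential(),
    suppress_result_output=True,  # keep stdout, drop the `results` payload (default False)
)
```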
(from #6042)
Closes #6042
Co-authored-by: Eric Zhu <ekzhu@users.noreply.github.com>
This PR adds missing model entries for OpenAI-compatible endpoints,
including gpt-4.5-turbo, gpt-4.5-turbo-preview, and claude-3.5-sonnet.
This improves coverage and avoids potential fallback or mismatch issues
when initializing clients.
This PR refactored `AgentEvent` and `ChatMessage` union types to
abstract base classes. This allows for user-defined message types that
subclass one of the base classes to be used in AgentChat.
To support a unified interface for working with the messages, the base
classes added abstract methods for:
- Convert content to string
- Convert content to a `UserMessage` for model client
- Convert content for rendering in console.
- Dump into a dictionary
- Load and create a new instance from a dictionary
This way, all agents such as `AssistantAgent` and `SocietyOfMindAgent`
can utilize the unified interface to work with any built-in and
user-defined message type.
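A hedged sketch of a user-defined message type under the new base class; method names follow the unified interface described above, and exact signatures may differ:
```python
from autogen_agentchat.messages import BaseChatMessage
from autogen_core.models import UserMessage


class TicketMessage(BaseChatMessage):
    ticket_id: str
    summary: str

    def to_text(self) -> str:
        # Rendering in console.
        return f"[{self.ticket_id}] {self.summary}"

    def to_model_text(self) -> str:
        # String content for the model context.
        return self.summary

    def to_model_message(self) -> UserMessage:
        # Conversion for model clients.
        return UserMessage(content=self.to_model_text(), source=self.source)
```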
This PR also introduces a new message type, `StructuredMessage` for
AgentChat (Resolves #5131), which is a generic type that requires a
user-specified content type.
You can create a `StructuredMessage` as follows:
```python
from typing import List

from pydantic import BaseModel

from autogen_agentchat.messages import StructuredMessage


class MessageType(BaseModel):
    data: str
    references: List[str]


message = StructuredMessage[MessageType](content=MessageType(data="data", references=["a", "b"]), source="user")
# message.content is of type `MessageType`.
```
This PR addresses the receiving side of this message type. To produce
this message type from `AssistantAgent`, the work continues in #5934.
Added unit tests to verify this message type works with agents and
teams.
This makes the behavior of the hosted R1 model the same as that of a locally hosted R1 model.
Addresses: #5989
---------
Co-authored-by: Eric Zhu <ekzhu@users.noreply.github.com>
Takes the output of the tool and uses it to create the `HandoffMessage`.
[discussion is
here](https://github.com/microsoft/autogen/discussions/6067#discussion-8117177)
This allows agents to carry specific instructions when performing
handoff operations.
---------
Co-authored-by: Eric Zhu <ekzhu@users.noreply.github.com>
## Why are these changes needed?
Add UTF-8 encoding to file reading.
Without this, a default system encoding will be used. On Windows
machines this can default to any local encoding, causing errors.
```python
import os

with open(
    os.path.join(os.path.abspath(os.path.dirname(__file__)), "page_script.js"), "rt", encoding="utf-8"
) as fh:
    page_script = fh.read()  # read the script with an explicit encoding
```
## Related issue number
Closes #6093
## Checks
- [ ] I've included any doc changes needed for
<https://microsoft.github.io/autogen/>. See
<https://github.com/microsoft/autogen/blob/main/CONTRIBUTING.md> to
build and test documentation locally.
- [ ] I've added tests (if relevant) corresponding to the changes
introduced in this PR.
- [ ] I've made sure all auto checks have passed.
Added support for the thought process in tool calls for
`OpenAIChatCompletionClient`, allowing additional text produced by a
model alongside tool calls to be preserved in the `thought` field of
`CreateResult`. This PR extends the same functionality to
`AzureAIChatCompletionClient` for consistency across model clients.
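Illustrative check of the preserved text: the `client`, `messages`, and `tools` variables are assumed to exist, and `CreateResult.thought` is the field named above:
```python
result = await client.create(messages=messages, tools=tools)
if isinstance(result.content, list):  # the model chose to call tools
    print("Tool calls:", result.content)
    print("Accompanying text:", result.thought)  # previously dropped for Azure AI
```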
#5650
Co-authored-by: Jay Prakash Thakur <jathakur@microsoft.com>
This PR introduces a metadata field in AssistantAgentConfig, allowing
applications to assign and track identity information for agents.
The metadata field is a Dict[str, str] and is included in the
configuration for proper serialization.
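Illustrative only: attaching identity metadata to an agent; the constructor pass-through is an assumption based on the config field described above, and the values are placeholders:
```python
from autogen_agentchat.agents import AssistantAgent
from autogen_ext.models.openai import OpenAIChatCompletionClient

agent = AssistantAgent(
    "assistant",
    model_client=OpenAIChatCompletionClient(model="gpt-4o"),
    metadata={"team": "support", "created_by": "app-v2"},  # Dict[str, str]
)
```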
---------
Co-authored-by: Eric Zhu <ekzhu@users.noreply.github.com>
## Why are these changes needed?
This PR fixes a `TypeError: Cannot instantiate typing.Union` that occurs
when using the `MultimodalWebSurfer_agent` with Anthropic models. The
error was caused by the incorrect usage of `typing.Union` as a class
constructor instead of a type hint within the `_anthropic_client.py`
file. The code was attempting to instantiate `typing.Union`, which is
not allowed. The fix correctly uses `typing.Union` within type hints,
and uses the correct `Base64ImageSourceParam` type. It also updates the
`pyproject.toml` dependency.
## Related issue number
Closes #6035
## Checks
- [ ] I've included any doc changes needed for
<https://microsoft.github.io/autogen/>. See
<https://github.com/microsoft/autogen/blob/main/CONTRIBUTING.md> to
build and test documentation locally.
- [ ] I've added tests (if relevant) corresponding to the changes
introduced in this PR.
- [x] I've made sure all auto checks have passed.
---------
Co-authored-by: Victor Dibia <victordibia@microsoft.com>
## Why are these changes needed?
- Adds tracing docs page for AgentChat with Jaeger example
- [x] Runtime tracing: Example code where tracing is done with the
SingleThreaded Runtime, logging all events
- [x] Custom event tracing: Example code logging messages returned from
`team.run_stream()`
- [ ] LLM span tracing: depends on
https://github.com/microsoft/autogen/issues/5895
- [ ] [TBD] Distributed tracing
See
[tracing.ipynb](bdb6ac5315/python/packages/autogen-core/docs/src/user-guide/agentchat-user-guide/tracing.ipynb)
here
## Related issue number
#5992
## Open Questions
@ekzhu
- What is the recommended way to directly log custom events like
`LLMCallEvent`s and `ToolCallEvent`s? `LogEvent` handlers in user code
that become traced spans?
- Currently, tool calls and their args are already logged (not sure
where this is done), but LLM call events are not. Should we include
samples on this?
## Checks
- [ ] I've included any doc changes needed for
<https://microsoft.github.io/autogen/>. See
<https://github.com/microsoft/autogen/blob/main/CONTRIBUTING.md> to
build and test documentation locally.
- [ ] I've added tests (if relevant) corresponding to the changes
introduced in this PR.
- [ ] I've made sure all auto checks have passed.
---------
Co-authored-by: Eric Zhu <ekzhu@users.noreply.github.com>
## Why are these changes needed?
Fix the accessibility issue where the screen reader doesn't announce
the theme when it changes.
## Related issue number
#5631 (13) (31) (59)
---------
Co-authored-by: peterychang <49209570+peterychang@users.noreply.github.com>
## Why are these changes needed?
## Related issue number
## Checks
- [x] I've included any doc changes needed for
<https://microsoft.github.io/autogen/>. See
<https://github.com/microsoft/autogen/blob/main/CONTRIBUTING.md> to
build and test documentation locally.
- [x] I've added tests (if relevant) corresponding to the changes
introduced in this PR.
- [x] I've made sure all auto checks have passed.
Signed-off-by: zhanluxianshen <zhanluxianshen@163.com>
Co-authored-by: Victor Dibia <victordibia@microsoft.com>
This PR allows docker-out-of-docker scenarios to be run in agbench
(e.g., agent teams that rely on the DockerCommandLineExecutor)
This is becoming increasingly important for benchmarking and testing,
since the behaviors of running local executors can diverge in important
ways.
## Why are these changes needed?
If a user tabs to a code block's copy button and then hits Enter, the
screen reader doesn't announce "Copied". This PR fixes this bug.
## Related issue number
#5631 (8)
## Checks
- [ ] I've included any doc changes needed for
<https://microsoft.github.io/autogen/>. See
<https://github.com/microsoft/autogen/blob/main/CONTRIBUTING.md> to
build and test documentation locally.
- [ ] I've added tests (if relevant) corresponding to the changes
introduced in this PR.
- [ ] I've made sure all auto checks have passed.
## Why are these changes needed?
Fixes item (53) of the screen reader issues. A special thanks to
@sjay8 for starting the work on this task.
## Related issue number
https://github.com/microsoft/autogen/issues/5631
This pull request introduces a new linting feature to the benchmark
configuration in the `agbench` package. The main changes include adding
a new command to the CLI, implementing the linter functionality, and
integrating it with the existing codebase.
### New Linting Feature:
* [`python/packages/agbench/src/agbench/cli.py`](diffhunk://#diff-0eafed70ad5e99e6f7319927bf92ee3ce4787d156dd2775b10a61baad7ec1799R10):
Added `lint_cli` import and integrated the new "lint" command into the
`main` function.
[[1]](diffhunk://#diff-0eafed70ad5e99e6f7319927bf92ee3ce4787d156dd2775b10a61baad7ec1799R10)
[[2]](diffhunk://#diff-0eafed70ad5e99e6f7319927bf92ee3ce4787d156dd2775b10a61baad7ec1799R37-R41)
### Linter Implementation:
* [`python/packages/agbench/src/agbench/linter/__init__.py`](diffhunk://#diff-45842e728e3daad063b3cf84d5857a4fdfe14e6d977fb2054f284eb9f5bb5272R1-R4):
Added necessary imports to initialize the linter module.
* [`python/packages/agbench/src/agbench/linter/_base.py`](diffhunk://#diff-f7ea2f6706232406b6c727fda6d71f09c568b4573f070af79bb7f3da3514e364R1-R81):
Defined core classes such as `Document`, `Code`, `CodeExample`,
`CodedDocument`, and the `BaseQualitativeCoder` protocol.
* [`python/packages/agbench/src/agbench/linter/cli.py`](diffhunk://#diff-e6ad1e14dc0df2c10fe62fede5a06d83865ad1961f99ec2d78f9052feb4d663bR1-R86):
Implemented the `lint_cli` function, which includes loading log files,
coding them, and printing the results.
* [`python/packages/agbench/src/agbench/linter/coders/oai_coder.py`](diffhunk://#diff-5059129410822c8a214f797a6167cbfcfbe31bd6a3b1efcb65a2dd703ef9b331R1-R212):
Implemented the `OAIQualitativeCoder` class to interact with OpenAI for
coding documents and caching results.
Example usage:
<img width="997" alt="image"
src="https://github.com/user-attachments/assets/6718688e-9917-4a43-a2f1-1105b030528d"
/>
<img width="999" alt="image"
src="https://github.com/user-attachments/assets/7fcb9c43-70f2-4fe7-ae29-5ad6a4ef2a16"
/>
> If you are in VSCode Terminal, you can click on the links in the
terminal output to jump to the exact error.
---------
Co-authored-by: afourney <adamfo@microsoft.com>