GitHub - QwenLM/Qwen-Agent: Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc. (original) (raw)

中文 | English

💜 Qwen Chat | 🤗 Hugging Face | 🤖 ModelScope | 📑 Blog | 📖 Documentation
💬 WeChat (微信) | 🫨 Discord

Qwen-Agent is a framework for developing LLM applications based on the instruction following, tool usage, planning, and memory capabilities of Qwen. It also comes with example applications such as Browser Assistant, Code Interpreter, and Custom Assistant. Now Qwen-Agent plays as the backend of Qwen Chat.

News

Getting Started

Installation

pip install -U "qwen-agent[gui,rag,code_interpreter,mcp]"

Or use pip install -U qwen-agent for the minimal requirements.

The optional requirements, specified in double brackets, are:

[gui] for Gradio-based GUI support;

[rag] for RAG support;

[code_interpreter] for Code Interpreter support;

[mcp] for MCP support.

git clone https://github.com/QwenLM/Qwen-Agent.git cd Qwen-Agent pip install -e ./"[gui,rag,code_interpreter,mcp]"

Or pip install -e ./ for minimal requirements.

Preparation: Model Service

You can either use the model service provided by Alibaba Cloud's DashScope, or deploy and use your own model service using the open-source Qwen models.

Developing Your Own Agent

Qwen-Agent offers atomic components, such as LLMs (which inherit from class BaseChatModel and come with function calling) and Tools (which inherit from class BaseTool), along with high-level components like Agents (derived from class Agent).

The following example illustrates the process of creating an agent capable of reading PDF files and utilizing tools, as well as incorporating a custom tool:

import pprint import urllib.parse import json5 from qwen_agent.agents import Assistant from qwen_agent.tools.base import BaseTool, register_tool from qwen_agent.utils.output_beautify import typewriter_print

Step 1 (Optional): Add a custom tool named my_image_gen.

@register_tool('my_image_gen') class MyImageGen(BaseTool): # The description tells the agent the functionality of this tool. description = 'AI painting (image generation) service, input text description, and return the image URL drawn based on text information.' # The parameters tell the agent what input parameters the tool has. parameters = [{ 'name': 'prompt', 'type': 'string', 'description': 'Detailed description of the desired image content, in English', 'required': True }]

def call(self, params: str, **kwargs) -> str:
    # `params` are the arguments generated by the LLM agent.
    prompt = json5.loads(params)['prompt']
    prompt = urllib.parse.quote(prompt)
    return json5.dumps(
        {'image_url': f'https://image.pollinations.ai/prompt/{prompt}'},
        ensure_ascii=False)

Step 2: Configure the LLM you are using.

llm_cfg = { # Use the model service provided by DashScope: 'model': 'qwen-max-latest', 'model_type': 'qwen_dashscope', # 'api_key': 'YOUR_DASHSCOPE_API_KEY', # It will use the `DASHSCOPE_API_KEY' environment variable if 'api_key' is not set here.

# Use a model service compatible with the OpenAI API, such as vLLM or Ollama:
# 'model': 'Qwen2.5-7B-Instruct',
# 'model_server': 'http://localhost:8000/v1',  # base_url, also known as api_base
# 'api_key': 'EMPTY',

# (Optional) LLM hyperparameters for generation:
'generate_cfg': {
    'top_p': 0.8
}

}

Step 3: Create an agent. Here we use the Assistant agent as an example, which is capable of using tools and reading files.

system_instruction = '''After receiving the user's request, you should:

Step 4: Run the agent as a chatbot.

messages = [] # This stores the chat history. while True: # For example, enter the query "draw a dog and rotate it 90 degrees". query = input('\nuser query: ') # Append the user query to the chat history. messages.append({'role': 'user', 'content': query}) response = [] response_plain_text = '' print('bot response:') for response in bot.run(messages=messages): # Streaming output. response_plain_text = typewriter_print(response, response_plain_text) # Append the bot responses to the chat history. messages.extend(response)

In addition to using built-in agent implementations such as class Assistant, you can also develop your own agent implemetation by inheriting from class Agent.

The framework also provides a convenient GUI interface, supporting the rapid deployment of Gradio Demos for Agents. For example, in the case above, you can quickly launch a Gradio Demo using the following code:

from qwen_agent.gui import WebUI WebUI(bot).run() # bot is the agent defined in the above code, we do not repeat the definition here for saving space.

Now you can chat with the Agent in the web UI. Please refer to the examples directory for more usage examples.

FAQ

How to Use MCP?

You can select the required tools on the open-source MCP server website and configure the relevant environment.

Example of MCP invocation format:

{
    "mcpServers": {
        "memory": {
            "command": "npx",
            "args": ["-y", "@modelcontextprotocol/server-memory"]
        },
        "filesystem": {
            "command": "npx",
            "args": ["-y", "@modelcontextprotocol/server-filesystem", "/path/to/allowed/files"]
        },
        "sqlite" : {
            "command": "uvx",
            "args": [
                "mcp-server-sqlite",
                "--db-path",
                "test.db"
            ]
        }
    }
}

For more details, you can refer to the MCP usage example

The dependencies required to run this example are as follows:

# Node.js (Download and install the latest version from the Node.js official website)
# uv 0.4.18 or higher (Check with uv --version)
# Git (Check with git --version)
# SQLite (Check with sqlite3 --version)

# For macOS users, you can install these components using Homebrew:
brew install uv git sqlite3

# For Windows users, you can install these components using winget:
winget install --id=astral-sh.uv -e
winget install git.git sqlite.sqlite

Do you have function calling (aka tool calling)?

Yes. The LLM classes provide function calling. Additionally, some Agent classes also are built upon the function calling capability, e.g., FnCallAgent and ReActChat.

The current default tool calling template natively supports Parallel Function Calls.

How to pass LLM parameters to the Agent?

llm_cfg = { # The model name being used: 'model': 'qwen3-32b', # The model service being used: 'model_type': 'qwen_dashscope', # If 'api_key' is not set here, it will default to reading the DASHSCOPE_API_KEY environment variable: 'api_key': 'YOUR_DASHSCOPE_API_KEY',

# Using an OpenAI API compatible model service, such as vLLM or Ollama:
# 'model': 'qwen3-32b',
# 'model_server': 'http://localhost:8000/v1',  # base_url, also known as api_base
# 'api_key': 'EMPTY',

# (Optional) LLM hyperparameters:
'generate_cfg': {
    # This parameter will affect the tool-call parsing logic. Default is False:
      # Set to True: when content is `<think>this is the thought</think>this is the answer`
      # Set to False: when response consists of reasoning_content and content
    # 'thought_in_content': True,

    # tool-call template: default is nous (recommended for qwen3):
    # 'fncall_prompt_type': 'nous'

    # Maximum input length, messages will be truncated if they exceed this length, please adjust according to model API:
    # 'max_input_tokens': 58000

    # Parameters that will be passed directly to the model API, such as top_p, enable_thinking, etc., according to the API specifications:
    # 'top_p': 0.8
}

}

How to do question-answering over super-long documents involving 1M tokens?

We have released a fast RAG solution, as well as an expensive but competitive agent, for doing question-answering over super-long documents. They have managed to outperform native long-context models on two challenging benchmarks while being more efficient, and perform perfectly in the single-needle "needle-in-the-haystack" pressure test involving 1M-token contexts. See the blog for technical details.

Application: BrowserQwen

BrowserQwen is a browser assistant built upon Qwen-Agent. Please refer to its documentation for details.

Disclaimer

The code interpreter is not sandboxed, and it executes code in your own environment. Please do not ask Qwen to perform dangerous tasks, and do not directly use the code interpreter for production purposes.