What is Microsoft AutoGen?

AutoGen is Microsoft's multi-agent framework that models AI agents as ConversableAgents that chat with each other. It supports two-agent conversations, GroupChat with multiple agents, and nested chats for sub-tasks. The core mechanic is a messages array passed between agent functions.

How does AutoGen compare to LangChain?

AutoGen focuses on multi-agent conversations where agents debate and collaborate. LangChain focuses on single-agent tool use with broad integrations. AutoGen excels at complex multi-turn agent interactions; LangChain excels at RAG pipelines and provider-agnostic tooling.

Can I build multi-agent systems without AutoGen?

Yes. Multi-agent systems in plain Python are multiple agent functions called in sequence on shared messages. A GroupChat is a for-loop over agent functions. Nested chats are a task queue. AutoGen's value is in dynamic speaker selection and conversation management — patterns you rarely need for straightforward workflows.

What is Rasa and what is it used for?

Rasa is an open-source framework for building conversational AI — chatbots, virtual assistants, and voice bots. It provides NLU (intent classification, entity extraction), dialogue management (conversation flow control), and custom actions (business logic). It's used primarily in enterprise settings for customer service, healthcare, and banking applications.

How does Rasa compare to using LLMs directly?

Rasa provides deterministic conversation flows with enterprise controls (on-premise, audit trails, compliance). LLMs with tool calling are more flexible and faster to prototype but less predictable. Rasa's CALM architecture combines both: LLMs for understanding, deterministic Flows for business logic. Choose Rasa for regulated, high-reliability chatbots; choose LLMs for general-purpose agents.

Do I need Rasa to build a chatbot?

No. A modern chatbot can be built with an LLM API call, a system prompt, and a tools dict for actions — about 60 lines of code. Rasa adds value for enterprise conversational AI with deterministic flows, on-premise deployment, and regulatory requirements. For simple or prototype chatbots, direct LLM usage is faster.

Comparisons / AutoGen vs Rasa

AutoGen vs Rasa: Which Agent Framework to Use?

AutoGen autogen by microsoft models agents as conversableagents that chat with each other. Rasa rasa is an open-source framework for building conversational ai — chatbots and virtual assistants. Here is how they compare — and what the same patterns look like in plain Python.

By the numbers

AutoGen

GitHub Stars

56.7k

Forks

8.5k

Language

Python

License

CC-BY-4.0

Created

2023-08-18

Created by

Microsoft Research

github.com/microsoft/autogen →

Rasa

GitHub Stars

21.1k

Forks

4.9k

Language

Python

License

Apache-2.0

Created

2016-10-14

Created by

Rasa Technologies

Cloud/SaaS

Rasa Pro / Rasa Cloud

Production ready

Yes

github.com/RasaHQ/rasa →

GitHub stats as of April 2026. Stars indicate community interest, not necessarily quality or fit for your use case.

Concept	AutoGen	Rasa	Plain Python
Agent	`ConversableAgent` with `system_message`, `llm_config`	Rasa agent with NLU pipeline, dialogue policies, and action server	A function with a system prompt that POSTs to the LLM API
Tools	`register_for_llm()` and `register_for_execution()`	Custom actions running on a separate action server via HTTP	A dict of callables + JSON schema descriptions
Conversation	Two-agent chat with `initiate_chat()`, message history	—	A `messages` array that grows with each turn
Multi-Agent	`GroupChat` with `GroupChatManager`, speaker selection	—	Multiple agent functions called in sequence on shared `messages`
Nested Chats	`register_nested_chats()` for sub-task handling	—	A task queue (BFS) — agent schedules follow-ups via a tool
Termination	`is_termination_msg` callback, `max_consecutive_auto_reply`	—	The `while` loop exits when no `tool_calls` or `max_turns` reached
NLU	—	NLU pipeline: tokenizer, featurizer, intent classifier, entity extractor	An LLM call with a prompt: `"Classify this message's intent: {message}"`
Dialogue	—	Stories/Rules YAML + dialogue policies for conversation flow	A state machine: `if intent == 'greet': state = 'greeting'; respond()`
Slots	—	Typed slots for tracking entities and state across turns	A dict updated during conversation: `slots = {"order_id": "123"}`
CALM	—	LLM for understanding + deterministic `Flows` for business logic	LLM parses user intent, `if`/`else` routes to the right handler function

What both do in plain Python

Every concept in the table above — agent, tools, loop, memory, state — maps to a handful of Python primitives: a function, a dict, a list, and a while loop. Both AutoGen and Rasa wrap these primitives in their own class hierarchies and APIs. The underlying pattern is the same ~60 lines of code. The difference is how much ceremony each framework adds on top.

When to use AutoGen

AutoGen excels at complex multi-agent workflows where agents need to debate or collaborate. For single-agent use cases or simple tool-calling agents, the plain Python version is significantly simpler.

What AutoGen does

AutoGen's core abstraction is the `ConversableAgent` — an agent that can send and receive messages. Two agents chat by alternating turns on a shared message history. `GroupChat` extends this to N agents, with a `GroupChatManager` that selects the next speaker (round-robin, random, or LLM-based selection). **Nested chats** allow an agent to spin up a sub-conversation to handle a complex subtask before returning to the main thread. AutoGen also provides code execution sandboxes, letting agents write and run code as part of their conversation. The framework thinks in terms of **conversations, not chains or graphs**. This makes it natural for workflows where agents need to debate, critique, or iteratively refine outputs together.

The plain Python equivalent

A `ConversableAgent` is a function that takes a `messages` array, calls the LLM with a system prompt, and returns the assistant message. Two-agent chat is a `while` loop where you alternate between calling `agent_a(messages)` and `agent_b(messages)`, appending each response. `GroupChat` is the same loop but with a **speaker selection step** — either rotate through a list or ask the LLM "who should speak next?" and call that agent function. Nested chats are a function call within the loop: pause the main conversation, run a sub-loop with different agents, and inject the result back. Tool registration is adding functions to a `tools` dict with their JSON schemas. The conversation-as-primitive model is **just `messages` arrays passed between functions**.

Full AutoGen comparison →

When to use Rasa

Rasa is purpose-built for production conversational AI with enterprise requirements — on-premise deployment, regulatory compliance, deterministic business logic. For general-purpose agents or simple chatbots, an LLM with a system prompt and a few tools is faster to build and more flexible.

What Rasa does

Rasa provides a **complete framework for building conversational AI systems**. The traditional stack includes: - an **NLU pipeline** (intent classification and entity extraction) - **dialogue management** (stories and rules that define conversation flows) - an **action server** for custom business logic The newer **CALM architecture** separates language understanding (handled by LLMs) from business logic (handled by deterministic `Flows`), giving you LLM fluency without sacrificing reliability. Rasa focuses on enterprise requirements: on-premise deployment, data privacy, regulatory compliance, and deterministic behavior for critical business flows. You define your domain in YAML — intents, entities, slots, responses, actions — and Rasa trains a model that handles the conversation lifecycle. The framework is **battle-tested in production** across banking, telecom, and healthcare.

The plain Python equivalent

Intent classification is **one LLM call**: send the user's message with a prompt asking for the intent and entities, parse the JSON response. Dialogue management is a state machine — a dict tracking the current state and a series of `if`/`else` branches routing to the next step. Custom actions are functions you call based on the classified intent. Slot filling is updating a dict as entities are extracted. The entire conversational agent — intent handling, state tracking, tool dispatch, response generation — fits in about **60 lines**. The LLM handles the language understanding that Rasa's NLU pipeline was trained for, and your `if`/`else` logic handles the flows that Rasa's dialogue policies managed. **No YAML domain files, no training pipeline, no action server.**

Full Rasa comparison →

Or build your own in 60 lines

Both AutoGen and Rasa implement the same 8 patterns. An agent is a function. Tools are a dict. The loop is a while loop. The whole thing composes in ~60 lines of Python.

No framework. No dependencies. No opinions. Just the code.

Build it from scratch →