Agent stacks

Choose the agent stack that matches the work, not the one with the loudest demo.

This board separates pipelines, retrieval stacks, memory-enabled agents, multi-agent orchestrators, coding agents, browser automation and local-first stacks before implementation begins.

Open recipes Open hardware guide

Families compared

Stack archetypes

High-control lanes

Deterministic or tightly governed

Memory-aware lanes

State or persistence matters

High-complexity lanes

Multi-agent or browser-heavy

Use memory

Choose memory only when task continuity is worth the state burden

If the agent must return to work, preserve context or maintain long threads, memory is justified. Otherwise pipelines stay simpler.

Use multi-agent

Choose multi-agent only when roles are truly separable

If you cannot define ownership, handoffs and validation, a single stateful agent is usually the better design.

Use browser stacks

Choose browser or computer-use only when the interface is the source of truth

If an API exists and is sufficient, browser automation should usually stay out of the critical path.

Decision board

Compare agent stack families by operating reality

Docs-verified snapshot

Stack family	Use when	Control and memory	Budget and complexity	Caution
Simple pipelines Temporal, n8n, Queues + workers	Use it when the real problem is coordinating reliable tasks, not simulating autonomy Repeatable flows, approvals, operational ETL and automation with clear steps	High and deterministic No conversational memory required	Low-medium Predictable	If you force too much autonomy into a fixed pipeline, support gets harder with little upside Official source: Temporal docs
Retrieval + tools stacks LlamaIndex, PydanticAI, LangGraph	Use it when you need tools and retrieval before you need a swarm of agents Grounded copilots, internal assistants and workflows that rely on your own data	High if you define retrieval, tools and guardrails Light or contextual memory	Medium Mid and controllable	Quality falls quickly if retrieval, ranking and tool policy are not defined well Official source: LlamaIndex docs
Stateful single agents LangGraph, PydanticAI, Custom memory store	Use it when one memory-enabled agent is enough and multi-agent would be over-engineering Agents that revisit work, remember context and need task continuity	High with a solid state layer Persistent or session memory	Medium-high Variable	Memory without a clear policy creates context bloat and inconsistent decisions Official source: LangGraph docs
Multi-agent orchestrators CrewAI, AutoGen, LangGraph supervisor	Use it when one loop is not enough and you can define ownership, handoffs and validation Complex problems with distinct roles, real parallelism and coordination needs	Medium-high, depends on ownership and messaging Shared or per-agent memory	High High without strict routing	Without file ownership, traceability and autonomy limits, cost and complexity explode Official source: CrewAI docs
Coding agents Codex CLI, Claude Code, Repo tool runners	Use it when the center of gravity is the repository, not a conversational UI Bounded refactors, test generation, code review and tool-guided execution	High if the repo has checks and ownership Repo and task context	Medium Mid-high depending on model	Without build, tests and small diffs, throughput turns into technical debt Official source: PydanticAI docs
Browser and computer-use stacks Playwright, Browser tool runners, Computer-use loops	Use it when the only source of truth is the interface and no sufficient API exists Web ops, QA, forms, guided scraping and tasks where the UI is part of the work	Medium unless you add asserts and snapshots Short memory and browser state	High Mid-high because of operational fragility	This family is most sensitive to UI changes, auth and non-deterministic flows Official source: Playwright docs
Local-first agent stacks Ollama, Local tool runners, Private vector stores	Use it when data residence, recurring cost or local autonomy matter more than the latest benchmark Privacy, sovereignty, edge and teams willing to trade some frontier quality for control	Very high Memory and data under local control	Medium-high Capex first, low opex	Hardware, observability and model maintenance become part of the product Official source: Ollama docs

Simple pipelines

Temporal, n8n, Queues + workers

Use when: Use it when the real problem is coordinating reliable tasks, not simulating autonomy

Best for: Repeatable flows, approvals, operational ETL and automation with clear steps

Control: High and deterministic

Memory: No conversational memory required

Complexity: Low-medium

Budget: Predictable

Caution: If you force too much autonomy into a fixed pipeline, support gets harder with little upside

Official source

Retrieval + tools stacks

LlamaIndex, PydanticAI, LangGraph

Use when: Use it when you need tools and retrieval before you need a swarm of agents

Best for: Grounded copilots, internal assistants and workflows that rely on your own data

Control: High if you define retrieval, tools and guardrails

Memory: Light or contextual memory

Complexity: Medium

Budget: Mid and controllable

Caution: Quality falls quickly if retrieval, ranking and tool policy are not defined well

Official source

Stateful single agents

LangGraph, PydanticAI, Custom memory store

Use when: Use it when one memory-enabled agent is enough and multi-agent would be over-engineering

Best for: Agents that revisit work, remember context and need task continuity

Control: High with a solid state layer

Memory: Persistent or session memory

Complexity: Medium-high

Budget: Variable

Caution: Memory without a clear policy creates context bloat and inconsistent decisions

Official source

Multi-agent orchestrators

CrewAI, AutoGen, LangGraph supervisor

Use when: Use it when one loop is not enough and you can define ownership, handoffs and validation

Best for: Complex problems with distinct roles, real parallelism and coordination needs

Control: Medium-high, depends on ownership and messaging

Memory: Shared or per-agent memory

Complexity: High

Budget: High without strict routing

Caution: Without file ownership, traceability and autonomy limits, cost and complexity explode

Official source

Coding agents

Codex CLI, Claude Code, Repo tool runners

Use when: Use it when the center of gravity is the repository, not a conversational UI

Best for: Bounded refactors, test generation, code review and tool-guided execution

Control: High if the repo has checks and ownership

Memory: Repo and task context

Complexity: Medium

Budget: Mid-high depending on model

Caution: Without build, tests and small diffs, throughput turns into technical debt

Official source

Browser and computer-use stacks

Playwright, Browser tool runners, Computer-use loops

Use when: Use it when the only source of truth is the interface and no sufficient API exists

Best for: Web ops, QA, forms, guided scraping and tasks where the UI is part of the work

Control: Medium unless you add asserts and snapshots

Memory: Short memory and browser state

Complexity: High

Budget: Mid-high because of operational fragility

Caution: This family is most sensitive to UI changes, auth and non-deterministic flows

Official source

Local-first agent stacks

Ollama, Local tool runners, Private vector stores

Use when: Use it when data residence, recurring cost or local autonomy matter more than the latest benchmark

Best for: Privacy, sovereignty, edge and teams willing to trade some frontier quality for control

Control: Very high

Memory: Memory and data under local control

Complexity: Medium-high

Budget: Capex first, low opex

Caution: Hardware, observability and model maintenance become part of the product

Official source

Route

LLM route

Return to the routing layer if the blocker is still vendor, row-level model detail or workflow scope.

Open LLM route

Route

Model fit radar

Use scenario-first model picks when the stack family is clear but the default model lane is not.

Open radar

Route

Inference guide

Jump to hardware guidance when private serving, VRAM or workstation shape now limits the stack.

Open hardware

Route

Workflow recipes

Use practical operating recipes when the family is chosen and the next question is how to run it.

Open recipes

Route

Agent directory

Return to the framework map once the stack family is decided and you need vendor tooling context.

Open directory