LLM route
Go back to the routing layer if you still need to clarify whether the problem is vendor, model or scenario.
This page turns recurring problems into bounded operating patterns for coding review, research, retrieval, browser agents, multimodal analysis and local-first deployment.
Local snapshot rules
Recipes live
Actionable workflow lanes
Low-cost lanes
Worker or local-first friendly
High-complexity lanes
Need stronger control or tooling
Local-first lanes
Privacy or self-host posture
Problem
Pick the recipe by the real bottleneck: review cost, retrieval quality, multimodal input or control.
Stack
Most useful flows split cheap workers, stronger planners and hard validation gates.
Validation
A workflow without real validation is only a demo, not an operating lane.
Go back to the routing layer if you still need to clarify whether the problem is vendor, model or scenario.
Open the stack layer when the recipe is blocked by architecture, governance or browser orchestration choices.
Use scenario-first model picks when the recipe is stable but the model lane is not.
Open hardware guidance when the recipe is now limited by local serving, VRAM, RAM or power budget.
Search recipes by problem, stack or model lane
Search by topic or category.
6 visible results
Coding review
When you need to review real PRs or diffs without paying a frontier lane on every intermediate pass.
Research
When the task mixes several sources, contradictions and a conclusion that must stay compact and useful.
Retrieval
When the model must operate against documents or external systems and internal knowledge is not enough.
Browser agents
When the task requires navigating real UIs, extracting state and closing actions with some control.
Multimodal
When image, audio, video or long PDFs enter the flow and the decision must land in one useful read.
Local-first
When data residence, fixed cost or stack autonomy matter more than the last point of frontier quality.
| Recipe | Recommended stack | Model and provider | Decision profile | Main caution |
|---|---|---|---|---|
| Coding review Coding review loop with a cheap worker and a strong closeWhen you need to review real PRs or diffs without paying a frontier lane on every intermediate pass. |
| Model: GPT-5.4 mini for workers and GPT-5.4 for the final close or review. Provider: OpenAI fits best when the loop depends on tooling, repo context and a strong close. | Medium Medium Medium |
|
| Research Research and synthesis with a long-context planner and executive closeWhen the task mixes several sources, contradictions and a conclusion that must stay compact and useful. |
| Model: GPT-5.4 as the main planner and Gemini 2.5 Flash-Lite if you need a cheap classification pass. Provider: OpenAI for sustained reasoning; Google as support when reading throughput matters. | Mid-high Moderate Medium |
|
| Retrieval Retrieval plus tools for flows that need real groundingWhen the model must operate against documents or external systems and internal knowledge is not enough. |
| Model: Gemini 2.5 Flash-Lite or GPT-5.4 mini for retrieval workers; GPT-5.4 for delicate synthesis. Provider: Google and OpenAI work well when you separate cheap workers from the final layer; retrieval design matters most. | Medium Medium High |
|
| Browser agents Browser flow with agents and human closesWhen the task requires navigating real UIs, extracting state and closing actions with some control. |
| Model: GPT-5.4 mini for browser workers and GPT-5.4 for the planner or approver. Provider: OpenAI fits best when the flow depends on iterative tool use and controlled outputs. | Medium Mid-high High |
|
| Multimodal Multimodal analysis with one strong lane and a compact outputWhen image, audio, video or long PDFs enter the flow and the decision must land in one useful read. |
| Model: Gemini 2.5 Pro as the first choice; GPT-5.4 mini only when the visual input is light. Provider: Google is usually stronger when the problem is truly multimodal and not just text with a decorative image. | Mid-high Moderate Medium |
|
| Local-first Local-first workflow for privacy, edge and controlWhen data residence, fixed cost or stack autonomy matter more than the last point of frontier quality. |
| Model: Mistral Large 3 or Ministral 3 8B depending on desired quality and operational footprint. Provider: Mistral fits best when open-weight or private-cloud strategy is a real decision, not a slogan. | Competitive Variable High |
|
Coding review
Problem: When you need to review real PRs or diffs without paying a frontier lane on every intermediate pass.
Recommended stack
Flow steps
Model: GPT-5.4 mini for workers and GPT-5.4 for the final close or review.
Provider: OpenAI fits best when the loop depends on tooling, repo context and a strong close.
Cautions
Research
Problem: When the task mixes several sources, contradictions and a conclusion that must stay compact and useful.
Recommended stack
Flow steps
Model: GPT-5.4 as the main planner and Gemini 2.5 Flash-Lite if you need a cheap classification pass.
Provider: OpenAI for sustained reasoning; Google as support when reading throughput matters.
Cautions
Retrieval
Problem: When the model must operate against documents or external systems and internal knowledge is not enough.
Recommended stack
Flow steps
Model: Gemini 2.5 Flash-Lite or GPT-5.4 mini for retrieval workers; GPT-5.4 for delicate synthesis.
Provider: Google and OpenAI work well when you separate cheap workers from the final layer; retrieval design matters most.
Cautions
Browser agents
Problem: When the task requires navigating real UIs, extracting state and closing actions with some control.
Recommended stack
Flow steps
Model: GPT-5.4 mini for browser workers and GPT-5.4 for the planner or approver.
Provider: OpenAI fits best when the flow depends on iterative tool use and controlled outputs.
Cautions
Multimodal
Problem: When image, audio, video or long PDFs enter the flow and the decision must land in one useful read.
Recommended stack
Flow steps
Model: Gemini 2.5 Pro as the first choice; GPT-5.4 mini only when the visual input is light.
Provider: Google is usually stronger when the problem is truly multimodal and not just text with a decorative image.
Cautions
Local-first
Problem: When data residence, fixed cost or stack autonomy matter more than the last point of frontier quality.
Recommended stack
Flow steps
Model: Mistral Large 3 or Ministral 3 8B depending on desired quality and operational footprint.
Provider: Mistral fits best when open-weight or private-cloud strategy is a real decision, not a slogan.
Cautions