Pre-release

LLM-driven program evolution

Evolve agents, without retraining your models.

Find host Already have a beta invite? app.promptpotter.com →

The critique loop

Generate, score against real data, critique, repeat — a measured search, not a grid guess.

Pluggable backends

Any backend that publishes a pipeline definition is optimizable — read-only.

What it does

One prompt in. A population out.

Read the docs

PromptPotter treats your prompt as a program with tunable axes — and breeds, scores and critiques a population of candidates each round until your numbers move.

PromptPotter

Get started

It just runs. Pick who runs it.

It’s just I/O — hand it your prompt and your examples, hit run. It spins up on a machine and, if you’re lucky, hands back a better prompt in about five minutes. Full live monitoring the whole way — only if you want it. No schemas, no special code, nothing to wire up.

We run it

Online, limited 10 campaigns · up to 20 rounds each

Includes:

Runs on our machines — zero setup
Full live monitoring as it runs
Bring your own API key when free runs end
Export or move your data anytime

Open the app How it works

You run it

Secure, Personal, Local Unlimited campaigns & rounds

Includes:

Claude-operated: the /potter-run skill drives it
Runs locally in your editor, on your own keys
Full control over run speed
The full source, nothing held back

Install guide How it works

Your team runs it

Team online Unlimited · multi-user

Everything above, plus:

Multi-user from day one
Behind a Cloudflare tunnel + OIDC allowlist
Whitelabel — your brand, your invite list

Self-host guide View the source

Compare all the ways to run it →

Others tune prompts by hand.
Our edge? An optimizer that evolves them.

PromptPotter evolves your prompts and pipeline params with a critique-guided loop — generate, score against real data, critique, repeat — plus posterior elimination and self-healing rails. It runs until your numbers move.

Find host

Free & open source

The whole thing is on GitHub.

No license keys, no seats, no paywall — the full source is public. Clone it, read it, run it.

View on GitHub Read the docs

The operator console

Watch every round, right in your browser.

When the loop is running, a read-only console streams the live pipeline, the candidate being scored and every round’s verdict — no extra tooling, no setup.

localhost:8001/ui

production · 84% acc

Input

Query

TermNorm

Output

Answer

New Chat

connected

My pipeline is stuck at 73%. Can’t push past it.

Share the eval set you’re scoring against — then flip on Auto-tune.

customer-tickets-eval.csv · 500 rows

Got it. Evolving the prompt now — every round surfaces here as it scores.

Type a message…

Send

Settings

Extended thinking

Web search

Code execution

Optimize prompt while usingBeta

Quietly evolves parameters across your project

Early access

Get in before launch.

PromptPotter isn't public yet. Leave your email and we'll bring you in when it opens — and tell us what you'd point it at, so we build the right thing first.

Trending Agents & Posts

GroqOpenAIAnthropicOpenRouterLangfusePython 3.13

Find host…

Evolve agents, without retraining your models.

One prompt in. A population out.

It just runs. Pick who runs it.

Includes:

Includes:

Everything above, plus:

Others tune prompts by hand. Our edge? An optimizer that evolves them.

The whole thing is on GitHub.

Watch every round, right in your browser.

Get in before launch.

Others tune prompts by hand.
Our edge? An optimizer that evolves them.