why praxis

an agent fleet you inherit on day one.

Twenty-five agents, pre-loaded across leadership, engineering, trust & safety, finance, support, growth, and people. Each agent ships with a typed output envelope, a decision class, and a kill switch. One approval queue lets a single operator supervise the fleet. The brain learns your company week over week.

01 · the lab

three things we built that nobody else has shipped together.

Sierra ships one customer-support agent. Decagon ships one customer-support agent. Cognition ships one software-engineer agent. CrewAI ships a framework for you to wire your own stateless agents together. Praxis ships a fleet plus the three things underneath that make a fleet shippable: a company brain, an autonomy queue, and a loop where the agents fix each other. All three are in code today, not on a roadmap.

01.a

the company brain

Every Praxis agent reads from and writes to a per-tenant, typed memory layer. Not chat memory. Not RAG-over-docs. A structural brain: note types are part of the schema, distillation runs as a worker, conflicts surface in the workspace. Week one the agents are generic. Week twelve they know your customers, your decisions, your projects, the idiosyncratic vocabulary your company runs on. Switching off Praxis means losing that accumulated brain — the moat compounds with every operator interaction.

Shipped
Typed memory schema. Distillation worker. Per-tenant isolation. Live in production.
01.b

the autonomy queue

Every action an agent wants to take with side effects declares a decision class. The runtime rejects the call if the class is missing. The class maps to auto, approval, or forbidden per tenant. The coding agent runs in an isolated sandbox with no credentials, capped at 5 files / 50 lines / 16KB / 10 minutes / 24 reads server-side — not in a prompt. There is a kill switch on every agent. This is what lets one operator supervise twenty-five agents touching money, comms, and the repo.

Shipped
Three-tier autonomy: auto, approval-gated, forbidden. Sandboxed coding agent. Per-tenant kill switch.
01.c

the self-improving loop

The product-pm agent reads the last week of agent runs, decisions, and memory notes; clusters repeat failures; RICE-scores them server-side (the LLM picks impact and effort, the server computes the math); and dispatches the top three to the coding agent. The qa-engineer runs the test suite on every PR inside a sandbox, surfaces the top three failures with suspected root cause, and queues bug-file decisions for operator review. Both leave a traceable improvement record so the next run reads its own scar tissue. Agents fix the agents that fix the company.

Shipped
Weekly RICE-scored proposals. Auditor-driven prompt revisions. Agent-to-agent dispatch.
02 · the receipts

shapes of work — drawn from running a real company.

The founder ran a live P2P marketplace before building Praxis — every Stripe dispute, every Persona inquiry, every payout reconciliation, every late-night ops escalation. That body of operational work is what these agents were designed around. The receipts below are illustrative shapes — the agent slugs and the situations are real; tenant-specific outcomes land as customers go live.

apr 2026
cto-engineer
asymmetric drift in the price-split helper overcharged a host by $493 on one trip. located the bug, drafted a surgical PR, founder shipped + refunded same day.
$493 recovered · 1 PR merged
mar 2026
dispute-response
three Stripe chargebacks landed between 11pm and 2am. assembled the evidence packet for each — GPS, check-in photos, host messages, trip timeline.
founder approved at 7am · funds recovered
may 2026
ads-manager
Memorial Day weekend: SF demand up 40%. drafted a weekly paid-acquisition plan reallocating $480 from underperforming YouTube placements into Meta Advantage+. founder approved.
~$14K added GMV · 1 plan shipped
apr 2026
chief-of-staff
07:14 PT: synthesized 24h of slack signals + agent outputs. ranked must-see by urgency × confidence. surfaced one blocked PR, three pending approvals, two T&S signals worth a glance.
founder cleared the queue in 9 minutes
apr 2026
qa-engineer
PR #1182 had 247 tests. ran the suite inside the sandbox, surfaced 2 failing tests, traced both to a missing null guard in the new price-split helper. queued the bug-file decision.
regression caught before merge
may 2026
product-pm
read the last week of agent runs + decisions. clustered repeat failures. RICE-scored a backlog of 8 items. dispatched the top 3 to cto-engineer.
3 PRs queued · 1 already merged
illustrative · shape-of-work · real receipts replace these as customer-zero lands.
onboard

inherit a working team. shaped by real operations.