why praxis

an agent fleet you inherit on day one.

Twenty-five agents, pre-loaded across leadership, engineering, trust & safety, finance, support, growth, and people. Each agent ships with a typed output envelope, a decision class, and a kill switch. One approval queue lets a single operator supervise the fleet. The brain learns your company week over week.

Onboard →See the fleet

01 · the lab

three things we built that nobody else has shipped together.

Sierra ships one customer-support agent. Decagon ships one customer-support agent. Cognition ships one software-engineer agent. CrewAI ships a framework for you to wire your own stateless agents together. Praxis ships a fleet plus the three things underneath that make a fleet shippable: a company brain, an autonomy queue, and a loop where the agents fix each other. All three are in code today, not on a roadmap.

01.a

the company brain

Every Praxis agent reads from and writes to a per-tenant, typed memory layer. Not chat memory. Not RAG-over-docs. A structural brain: note types are part of the schema, distillation runs as a worker, conflicts surface in the workspace. Week one the agents are generic. Week twelve they know your customers, your decisions, your projects, the idiosyncratic vocabulary your company runs on. Switching off Praxis means losing that accumulated brain — the moat compounds with every operator interaction.

Shipped

Typed memory schema. Distillation worker. Per-tenant isolation. Live in production.

01.b

the autonomy queue

Every action an agent wants to take with side effects declares a decision class. The runtime rejects the call if the class is missing. The class maps to auto, approval, or forbidden per tenant. The coding agent runs in an isolated sandbox with no credentials, capped at 5 files / 50 lines / 16KB / 10 minutes / 24 reads server-side — not in a prompt. There is a kill switch on every agent. This is what lets one operator supervise twenty-five agents touching money, comms, and the repo.

Shipped

Three-tier autonomy: auto, approval-gated, forbidden. Sandboxed coding agent. Per-tenant kill switch.

01.c

the self-improving loop

The product-pm agent reads the last week of agent runs, decisions, and memory notes; clusters repeat failures; RICE-scores them server-side (the LLM picks impact and effort, the server computes the math); and dispatches the top three to the coding agent. The qa-engineer runs the test suite on every PR inside a sandbox, surfaces the top three failures with suspected root cause, and queues bug-file decisions for operator review. Both leave a traceable improvement record so the next run reads its own scar tissue. Agents fix the agents that fix the company.

Shipped

Weekly RICE-scored proposals. Auditor-driven prompt revisions. Agent-to-agent dispatch.

02 · the receipts

shapes of work — drawn from running a real company.

The founder ran a live P2P marketplace before building Praxis — every Stripe dispute, every Persona inquiry, every payout reconciliation, every late-night ops escalation. That body of operational work is what these agents were designed around. The receipts below are illustrative shapes — the agent slugs and the situations are real; tenant-specific outcomes land as customers go live.

apr 2026

cto-engineer

asymmetric drift in the price-split helper overcharged a host by $493 on one trip. located the bug, drafted a surgical PR, founder shipped + refunded same day.

$493 recovered · 1 PR merged

mar 2026

dispute-response

three Stripe chargebacks landed between 11pm and 2am. assembled the evidence packet for each — GPS, check-in photos, host messages, trip timeline.

founder approved at 7am · funds recovered

may 2026

ads-manager

Memorial Day weekend: SF demand up 40%. drafted a weekly paid-acquisition plan reallocating $480 from underperforming YouTube placements into Meta Advantage+. founder approved.

~$14K added GMV · 1 plan shipped

apr 2026

chief-of-staff

07:14 PT: synthesized 24h of slack signals + agent outputs. ranked must-see by urgency × confidence. surfaced one blocked PR, three pending approvals, two T&S signals worth a glance.

founder cleared the queue in 9 minutes

apr 2026

qa-engineer

PR #1182 had 247 tests. ran the suite inside the sandbox, surfaced 2 failing tests, traced both to a missing null guard in the new price-split helper. queued the bug-file decision.

regression caught before merge

may 2026

product-pm

read the last week of agent runs + decisions. clustered repeat failures. RICE-scored a backlog of 8 items. dispatched the top 3 to cto-engineer.

3 PRs queued · 1 already merged

illustrative · shape-of-work · real receipts replace these as customer-zero lands.

onboard

inherit a working team. shaped by real operations.

Onboard your company →20-min intro with founder →