AI Agency Pricing Benchmarks 2026: Rate Cards by Archetype

Ravi Iyer

Ravi IyerJune 27, 202614 min read23 views

AI Agency Pricing Benchmarks 2026: Rate Cards by Archetype

June 2026 rate cards for AI agency builds split by archetype: solo operator, 3-person team, 8-person team, 20-plus-person studio. Discovery, build, retainer, audit, and white-label bands per archetype, plus a worked example on what to charge for a 6-hour AI-agent build.

Updated on June 27, 2026

Flat editorial illustration of a wall-mounted rate-card poster with four stacked archetype tiers, warm white background, charcoal line work, single amber accent.

On this page

Quick answer

In June 2026, AI agency rate cards diverge less by service line than by firm size. A solo operator and a 20-person studio billing the same "AI agent build" land on very different price points because their overhead, billable utilization, and discovery time differ by 4 to 6 times. The bands that hold across June 2026 published data: solo operators charge $4,500 to $18,000 fixed-price for an agent build; 3-person teams $9,000 to $35,000; 8-person teams $18,000 to $75,000; 20-plus-person teams $48,000 to $220,000. The rate cards below show what each tier actually puts on a quote line, drawn from per-archetype unit economics rather than aspirational hourly math.

Why rate cards differ by archetype, not by service line

Most "AI agency pricing in 2026" round-ups collapse the question to a service-line table: discovery sprint price, build price, retainer price. Digitalagencynetwork's January 2026 guide puts basic automation at $99 to $500 per month and enterprise-level at $1,000 to $5,000-plus per month. Monetizely's 2026 pricing-models piece shows the agentic per-task variant: $5,000 per month for the agent, $2 per task above 1,000. Both are correct as far as they go.

They miss the variable that explains the bulk of the variance on real quotes: who is sending the quote.

A solo founder who built and shipped a custom-trained scoring agent in 14 hours bills a different number than a 12-person studio with a partner, a head of delivery, two senior engineers, and a 38 percent billable-utilization floor. Same deliverable. Different cost base. Different acceptable margin. Different discovery cycle.

When SoW pricing came up two weeks ago, we built our 2026 fixed-price SoW guide for AI app builds around archetype-aware pricing structures; agencies that already used that template asked the obvious follow-on: what do the actual numbers look like at each archetype? This piece answers that.

A few definitions before the tables.

Discovery: the paid pre-build phase. Scoping, agent-spec, data audit, integration risk review, output of the SoW.
Build: the fixed-price implementation. Code, prompts, evals, deploy, handoff.
Retainer: monthly recurring. Could be observability + iterations, an "agent ops" SLA, or campaign-style usage.
Audit / one-off: paid teardown of an existing agent stack, output is a written brief plus an executive readout.

The same archetype band applies whether the build is an agent, an internal AI tool, or a small AI app. The driver is firm overhead and discovery time, not the model under the hood.

The four archetypes

Scroll to see more

Archetype	Headcount	Indicative annual revenue	Billable utilization target	Blended bill rate range (USD)
A. Solo operator	1	$90K to $400K	50% to 75%	$140 to $260 per hour
B. Small team	2 to 4	$250K to $900K	50% to 65%	$175 to $300 per hour
C. Mid team	5 to 12	$700K to $3.2M	45% to 60%	$200 to $340 per hour
D. Established studio	13 to 40-plus	$2.4M to $14M	38% to 52%	$235 to $420 per hour

The "blended bill rate" column is the rate the agency behaves like it bills, after weighting partner / senior / mid / junior hours by their actual mix at that headcount. It is not the rate they put on a public price page. The rate you put on a fixed-price quote works backward from this number, target margin, and an estimate of human hours required.

For context on how the blended rate is computed, see our prior piece on the effective hourly rate analysis for AI-native agencies in 2026; the methodology there is the input into every rate card below.

Rate card: solo operator (archetype A)

Solo operators win on discovery speed and on the agent-leverage ratio. The same agent build they could deliver in 14 human hours at the small-team archetype's 4-week SoW pace, they ship in 6 calendar days.

Scroll to see more

Line item	Price range (USD)	Notes
Discovery sprint (1 to 2 weeks)	$1,800 to $4,500	Often credited toward build if booked
Agent build, fixed-price	$4,500 to $18,000	The 6 to 14 human-hour sweet spot
Internal AI tool build (Slack bot, ops dashboard)	$3,500 to $12,000	Includes one integration
Small AI app (single-feature MVP)	$6,000 to $22,000	Single page, single workflow, no auth tiers
Monthly retainer	$1,500 to $4,800 per month	Up to ~10 hours of changes, light observability
Agent audit (one-off)	$1,200 to $3,800	8 to 16 hours, written brief plus 60-min readout

The honest pressure on this archetype is concentration risk. One client at $4,800 per month is 30 to 60 percent of monthly revenue, depending on the operator's other lines. Retainer pricing should reflect that the operator cannot absorb a missed payment without scrambling.

Rate card: small team (archetype B, 2 to 4 people)

Small teams gain bench depth but pay for the second seat. Discovery slows down because a second pair of eyes joins the SoW, and the retainer line gets richer because a small team can credibly offer "we'll watch your agent's evals next week" without it falling on one person's calendar.

Scroll to see more

Line item	Price range (USD)	Notes
Discovery sprint (2 weeks)	$3,500 to $9,500	Two-person review built in
Agent build, fixed-price	$9,000 to $35,000	Allows for a senior plus a builder on the spec
Internal AI tool build	$7,500 to $28,000	Includes two integrations and a basic admin UI
Small AI app	$12,000 to $48,000	Auth, two roles, one integration
Productized monthly retainer	$3,000 to $9,500 per month	Watching evals, weekly iteration
Custom monthly retainer	$5,000 to $14,000 per month	20 to 35 hours, includes a partner
Agent audit	$2,500 to $7,500	16 to 30 hours, includes a working repro

The retainer column splits at this archetype: productized (low-touch, high-margin, capped scope) versus custom (named accounts, partner involvement). Productized retainers should outnumber custom retainers two-to-one for the unit economics to work; see the 2026 productization decision framework that drives the productized-vs-custom split.

Rate card: mid team (archetype C, 5 to 12 people)

The mid team is where overhead becomes a serious line item: a head of delivery, an ops coordinator, software, sales travel. Billable utilization sits in the 45 to 60 percent band, which means the studio bills at $235 to $310 blended even though senior engineer rates are advertised at $280 to $375.

Scroll to see more

Line item	Price range (USD)	Notes
Discovery sprint (2 to 3 weeks)	$8,000 to $22,000	Includes a written technical due-diligence brief
Agent build, fixed-price	$18,000 to $75,000	Senior plus mid plus a partner readout
Internal AI tool build	$20,000 to $80,000	Multi-tenant, role-based access
Multi-feature AI app build	$35,000 to $180,000	Auth, payments, multi-role, three integrations
Productized monthly retainer	$4,500 to $14,000 per month	Tiered SLA
Custom monthly retainer	$9,000 to $32,000 per month	Named account, partner-on-call
Agent audit	$5,500 to $18,000	30 to 60 hours, multi-party readout
White-label tier (per client per month)	$1,500 to $6,500 per month	When the studio resells a builder platform

The white-label line is unique to this archetype and the next one up. Solo operators rarely have the cash-flow base to carry a multi-client SaaS-like cohort; archetype-D studios eventually move it onto the main contract grid. The mid team is the natural home for white-label as a revenue line.

Rate card: established studio (archetype D, 13 to 40-plus people)

Established studios have the discovery rigor and the senior-staff weight to justify the top of every band, but their overhead floor pushes the minimums up too. A studio at this archetype cannot honestly sell a $4,500 agent build, because it costs the studio more than that to scope it, send the contract, kick it off, and close it out.

Scroll to see more

Line item	Price range (USD)	Notes
Discovery sprint (3 to 4 weeks)	$22,000 to $65,000	Multi-track: technical, product, data, security
Agent build, fixed-price	$48,000 to $220,000	Partner-led, multi-engineer
Internal AI tool build	$55,000 to $260,000	Includes security review and change-management
Multi-feature AI app build	$85,000 to $480,000	Full product workstream, two to four month engagement
Productized monthly retainer	$9,000 to $28,000 per month	Tiered SLA, named SRE
Custom monthly retainer	$20,000 to $90,000 per month	Multi-named-account, dedicated cell
Agent audit	$12,000 to $42,000	60 to 140 hours, board-level deliverable
White-label tier (per client per month)	$3,500 to $14,000 per month	Bundled with managed services
Annual platform subscription	$90,000 to $480,000 per year	When the studio sells access to internal tooling

The annual platform line shows up only at this archetype. It is the late-stage move where a studio's accumulated internal tooling becomes a billable product in its own right.

Worked example: what to charge for a $9-15K AI-agent build that takes six human hours

This is the conversation every solo operator and small team has at least once a quarter. The agent is real, the spec is tight, the build is six to ten human hours. The studio's cost to deliver is somewhere between $750 and $1,800. What is the right ask?

The answer depends entirely on which archetype is doing the quoting and which archetype is doing the buying. The same agent has four defensible prices:

Scroll to see more

Seller archetype	Cost to deliver	Defensible quote	Implied margin
A. Solo operator	$750 to $1,800	$4,500 to $9,000	75 to 85 percent
B. Small team	$1,800 to $3,200	$9,000 to $15,000	65 to 80 percent
C. Mid team	$3,200 to $5,400	$14,000 to $24,000	55 to 70 percent
D. Established studio	$5,400 to $9,000	$22,000 to $48,000	40 to 65 percent

A few honest observations. First, the same six-hour build does not "deserve" a higher price at archetype D; the studio's overhead earns the right to charge that price, which is not the same thing. Second, the buyer's archetype matters too: a $480M-revenue insurer pays archetype-D rates for the same agent that a $4M-ARR startup buys from a solo operator at archetype-A rates, and both transactions are fair. Third, the margin math collapses when the agent stack carries hidden recurring cost (per-call LLM tokens, MCP server hosting, third-party data); model it through the unit-economics calculators we keep updated for this before sending the quote.

This is also where the platform-choice question sits. The build hours, and the agent's runtime cost over the next twelve months, are sensitive to which MCP servers and which builder substrate the agency standardizes on. Totalum's June 2026 roundup of MCP servers worth running in production is one honest read on the agency-grade end of that catalog; Taskip's June 2026 AI automation agency pricing range overview covers the retainer-side adjacent question and is worth reading in the same sitting.

Agency-side action checklist

For agencies setting or re-setting a 2026 rate card:

Pick the archetype honestly. The trap is quoting at one archetype above your real overhead; the quotes win, the margins do not survive. Headcount, billable utilization, and partner involvement are the inputs.
Anchor on blended bill rate, not posted hourly rate. Public price pages are advertising; quotes work off the blended.
Set the discovery line at 8 to 18 percent of the expected build. Anything lower turns discovery into free pre-sales; anything higher delays "yes."
Separate productized retainers from custom retainers on the price page. The first is a product; the second is a service. Pricing both as one number costs margin.
Pull the blended bill rate and the target margin through our calculators before any quote above the archetype's median. A 6 percent margin error on a $48,000 build is $2,880; do not eyeball it.
Anchor competitor pricing references with year tags. Digitalapplied's 2026 decision guide on AI agency pricing models is the strongest reference for the model question; combine with the archetype lens above for the actual number.
Re-rate every six months. Token cost is falling; the human hours in a build are falling faster on some workloads and slower on others; the band that fits Q1 2026 will not fit Q3 2026.

FAQ

What is the median fixed-price quote for an AI agent build in June 2026?
Across the four archetypes, the indicative median is around $14,000 to $32,000, depending on agency archetype. Solo operators cluster near $9,500; small teams near $18,000; mid teams near $32,000; established studios near $96,000.

Are AI agency margins really 65 to 75 percent?
Sometimes. The 65 to 75 percent figure that Rahul Gaur cited in his 2026 AI agency business model essay applies to agencies running heavily agentic workflows on top of their existing service contracts. Mixed studios still land in the 40 to 55 percent band, closer to traditional services.

How should an AI agency price a retainer when usage is unpredictable?
Use a base-plus-overage structure. A $5,000 per month base for 25 hours of changes and observability, then $180 to $260 per overage hour. The overage line carries the variance; the base line carries the relationship.

Do AI agency rate cards in 2026 still bill by the hour?
Most agencies bill by the hour only on audits and on custom retainers above the productized tier. Discovery and build are nearly always fixed-price in 2026, because hourly billing on AI work understates the leverage and the buyer reads "by the hour" as a budget risk.

When does white-label tier pricing make sense?
At archetype C and D, when the studio has at least three white-label customers, has standardized on a single builder substrate that supports rebranding cleanly, and has the cash-flow base to carry a multi-tenant cohort. Below three white-label customers the operational overhead does not pay back.

What is the discovery-sprint price band as a percentage of the build?
Eight to eighteen percent of the expected build, across all four archetypes. Lower than 8 percent turns the discovery into uncompensated pre-sales; higher than 18 percent makes the prospect ask why discovery is so expensive.

Should an AI agency raise rates mid-engagement when token cost falls?
No. The fixed-price quote is the contract. The next quote re-rates. Mid-engagement re-rates damage trust and almost never produce more revenue than the deal they jeopardize.

If you take one thing from this: the same six-hour AI agent build has four defensible 2026 prices, and the difference between them is not the build, it is the overhead of the firm doing the billing. Set your archetype honestly before you set your rate card.

By Ravi Iyer. Reviewed by Helena Marsh.

#ai agency #pricing #rate cards #2026 #agency operations #fixed-price

Back to the Library

Share

Written by

Ravi Iyer

Ravi Iyer covers AI-native agency operations for DevShopVault. Prior background in services-firm finance and delivery ops; focus on the unit economics behind fixed-price AI builds.

Frequently asked questions

What is the median fixed-price quote for an AI agent build in June 2026?

Across the four archetypes, the indicative median is around $14,000 to $32,000, depending on agency archetype. Solo operators cluster near $9,500; small teams near $18,000; mid teams near $32,000; established studios near $96,000.

Are AI agency margins really 65 to 75 percent?

Sometimes. The 65 to 75 percent figure that Rahul Gaur cited in his 2026 AI agency business model essay applies to agencies running heavily agentic workflows on top of their existing service contracts. Mixed studios still land in the 40 to 55 percent band, closer to traditional services.

How should an AI agency price a retainer when usage is unpredictable?

Use a base-plus-overage structure. A $5,000 per month base for 25 hours of changes and observability, then $180 to $260 per overage hour. The overage line carries the variance; the base line carries the relationship.

Do AI agency rate cards in 2026 still bill by the hour?

Most agencies bill by the hour only on audits and on custom retainers above the productized tier. Discovery and build are nearly always fixed-price in 2026, because hourly billing on AI work understates the leverage and the buyer reads "by the hour" as a budget risk.

When does white-label tier pricing make sense?

At archetype C and D, when the studio has at least three white-label customers, has standardized on a single builder substrate that supports rebranding cleanly, and has the cash-flow base to carry a multi-tenant cohort. Below three white-label customers the operational overhead does not pay back.

What is the discovery-sprint price band as a percentage of the build?

Eight to eighteen percent of the expected build, across all four archetypes. Lower than 8 percent turns the discovery into uncompensated pre-sales; higher than 18 percent makes the prospect ask why discovery is so expensive.

Should an AI agency raise rates mid-engagement when token cost falls?

No. The fixed-price quote is the contract. The next quote re-rates. Mid-engagement re-rates damage trust and almost never produce more revenue than the deal they jeopardize.