Home / Product / Features and Vision
Updated Jun 25, 2026 · Affirmology_HermesCreation_Brainstorm_v1.md
The vision in one line: a creator talks to Hermes like a brilliant studio partner, the council composes from the real chart, the script unfolds in the chat, the creator shapes it by talking, then says "render it" and it lands in the listening room and the person's app. This is Atlas's centerpiece.
Runs entirely on the existing cloud (Render + R2 + Anthropic for the council + ElevenLabs/Fish for voice). No Mac mini, no new hardware. The only real cost lever is API usage, which the script-first discipline already controls.
Everything below makes this spine stronger, faster, or more magical.
Build the spine plus just enough to make it feel alive, then layer the rest over the air:
Defer to fast-follows: A/B variations, series/journeys, companion audios, deep memory of past feedback, the full chart-visual lightups. All of it lands without a new app release.
Show the full council (it is cool and it builds trust), but lean on HERMES the most. Hermes is the one you actually talk to: the host, the orchestrator, the throughline. The other oracles appear as contributors and cameos when their domain is in play (Sophia synthesizing, Chiron choosing technique, Prometheus on the Human Design, Orpheus drafting, Apollo refining), and Hermes narrates and weaves them. So the creator has ONE relationship (with Hermes) while the council visibly works behind him. Full theater, single host.
Jeff has been creating a superior demo audio with Claude Code and the strategic back-and-forth has been the magic. The patterns worth encoding into Hermes (distilled from how that collaboration works and from the council design; point me at a specific session transcript and I will mine it for the exact moves):
This is the same quality that makes Jeff plus Code feel like co-creation; Hermes should feel like that partner for every creator.
The creator (Jeff now, elevated testers soon) tests on THEMSELVES: making new versions, listening, discovering use-cases ("this is great on a morning walk," "this one is for right before a hard call"). Build for that loop:
You may give creation to the beta v1 group for a window. The gate is API cost, so here is the rough picture (order-of-magnitude, current June 2026 rates, verify against real usage; the levers below cut these meaningfully).
Per-action rough cost: - ORACLE CHAT (one grounded reply): about 1 to 4 cents per turn (Haiku end cheaper, Sonnet end pricier). CHEAP. This is the safe-to-be-generous surface. - CREATION (a full council compose plus a few conversational iterations): about $0.50 to $2.00 in Anthropic, depending on model mix and how many passes. - RENDER (voice for a ~7 minute audio): about $0.70 to $2.00 on ElevenLabs, or roughly $0.10 to $0.15 on Fish. The long Master Reading (~70 min) is the expensive outlier (voice alone in the multiple-dollars-to-low-teens range) and stays a premium/upsell, not a free-for-all. - ALL-IN per made-and-rendered audio: roughly $1.50 to $4.00, less with the levers and Fish.
What that means at beta scale (about 40 people): - Open ORACLE CHAT is affordable: even heavy use (say 40 people x 15 turns/day) lands in the low hundreds of dollars a month, less on Haiku. Be generous here; the data is worth it. - Open, UNMETERED CREATION + RENDER is where it adds up: 400 renders at ~$2.50 is ~$1,000, and enthusiastic creators making dozens each scale that linearly. So METER creation/renders even in the beta preview (for example, a set number of renders per person during the window), exactly as the membership design already intends.
Cost levers (use all of them): - Prompt caching on the static chart + corpus context cuts cached input by about 90%; with the same chart reused across a creation session this is a big saving. - Haiku for the back-office passes (script map, build report), strong model only for the script itself (already the pattern). - Fish for voice where it fits (roughly an order of magnitude cheaper than ElevenLabs), with QC still gating it; ElevenLabs where the voice identity must be exact. - Preview clips (cheap) before full renders, so voice spend only happens on keepers. - Batch processing (50% off) for any non-interactive, can-wait renders.
Recommendation: in the beta preview, keep oracle CHAT generous, METER creation and renders per person, lean on Fish + caching + Haiku to stretch the budget, and watch the per-person cost so the eventual tier pricing is set from real numbers, not guesses.
More specific, per Jeff's ask. Built from the real code: the council makes a few Sonnet calls plus Haiku back-office per creation (sophia_brief caps at 1600 output tokens, the script at 8192, apollo_review at 1400, the Haiku script map at 4096), the script generator injects up to 9 corpus snippets, and renders are billed per character. Rates (June 2026): Sonnet 4.6 $3 in / $15 out per M tokens; Haiku 4.5 $1 / $5; ElevenLabs roughly $0.12 to $0.30 per 1,000 characters by tier (Turbo about half); Fish about $15 per 1M bytes. Assumptions are stated so you can adjust them.
ORACLE CHAT, one grounded reply: - Assume ~10k input tokens (chart facts + a few corpus snippets + short history + system) and ~600 output tokens. - Sonnet: ~$0.030 in + ~$0.009 out = ~$0.04/turn. With prompt caching on the static chart+corpus (about 90% of the input), the input drops to ~$0.005, so ~$0.014/turn. - Haiku: ~$0.005 to $0.013/turn. - Takeaway: roughly 1 to 4 cents per turn, closer to 0.5 to 1.5 cents with caching or Haiku.
CREATION, one full council compose (no render yet): - Calls: Sophia brief (Sonnet), the script draft (Sonnet, the big one), Apollo review (Sonnet), often one re-draft (Sonnet), plus the Haiku script map and build report. Call it 4 Sonnet + 2 Haiku. - Per Sonnet call assume ~8k in / ~1.5k out = ~$0.024 + $0.0225 = ~$0.046. Four of them = ~$0.18. Two Haiku calls ~$0.026. Base compose ~$0.20 to $0.40 without caching; ~$0.10 to $0.20 with chart+corpus caching across the session. - Each conversational REVISION after that: ~1 to 2 Sonnet calls = ~$0.05 to $0.10. - A typical creation session (compose + a few revisions): ~$0.30 to $1.00. A heavy one: $1 to $2.
RENDER, voice for a ~7 minute audio (~5,000 to 7,000 characters): - ElevenLabs: ~$1.50 to $2.10 at Creator-tier rates, ~$0.90 to $1.25 at Scale, ~$0.60 to $0.85 at Business; about half those with Turbo. - Fish: ~$0.08 to $0.11. Roughly 10 to 20x cheaper.
THE LONG MASTER READING (~70 min, ~40,000 to 55,000 characters, 7 to 8 council sections): - Generation (Anthropic): ~$0.50 to $1.00. Voice on ElevenLabs: ~$7 to $11. Voice on Fish: ~$0.55 to $0.85. - All-in ~$8 to $12 on ElevenLabs, ~$1 to $2 on Fish. This is the clear premium/upsell outlier.
ALL-IN per made-and-rendered ~7 min audio: ~$1.50 to $3.50 on ElevenLabs, or under ~$1 if voiced on Fish with caching + Haiku back-office.
Scale snapshots (rough, scale linearly with real usage): - 40 testers, oracle chat 15 turns/day each: ~$12/day Sonnet, ~$4/day Haiku, so ~$120 to $360/month. Affordable; be generous. - 40 testers each making + rendering 10 audios over a preview month: ~400 audios. On ElevenLabs ~$600 to $1,400; on Fish + caching ~$150 to $400. This is why creation/renders get metered while chat stays open.
The levers move these a lot: prompt caching (~90% off the repeated chart+corpus input), Haiku back-office, Fish voice where identity allows, preview clips before full renders, and batch (50% off) for anything that can wait.
Make A/B routine and DISTRIBUTED, not founder-only. Jeff: "we should do a lot of A/Bing all the time, to get data, and spread the polling across multiple beta testers, instead of just trusting me. I could be outvoted. We need to evolve to that level of teamwork and synarchy."
What this means to build: - Variants are first-class: when Hermes makes two takes (softer/bolder, lead-with-HD vs lead-with-astrology, bed A vs bed B), they are tracked as an A/B pair of the same intent. - Push the SAME pair to MULTIPLE testers and collect comparative feedback ("which one landed more, and why"), so the call is made by cohort data, not one person's taste. - The decision surfaces as a result the team can see (the vote, the notes, the usage), and the winner can be promoted. The founder can be outvoted on purpose; that is the point. - This rides the existing feedback + event + voting infra; it needs a "variant group" concept and a comparative prompt. Capture every A/B so the council learns what actually wins across people.
CHARGING (Jeff's musing): for the BETA, do not charge. These are founding members; charging adds friction and costs you goodwill, evangelism, and the very data you are after, and the beta token spend is modest enough to fund. Their payment is feedback and word of mouth. "Better to get funded and just pay these tokens" is the right call for now.
For the PAID product, the credit/markup idea is sound and standard: a subscription covers generous normal use, and the token-heavy features (custom soul-song creation, the Master Reading) are metered with credits priced ABOVE cost, so the margin is built in. The backend already has entitlements + credits, so the rails exist. The thing to avoid is surprise nickel-and-diming that breaks the sacred feel: bundle generous normal use into the tier, and make the heavy extras clearly-priced (a credit pack or a higher tier) so the value is obvious. Let the beta's real per-person cost data set the credit price and what each tier includes.
INCLUDED ALLOTMENT + TOP-UP (Jeff's model, 2026-06-25): each tier includes a set number of custom audios per month; beyond that, the member loads up on extra credits on their own dime. This is the right shape, and the principle behind it matters: never throttle the truly obsessed, committed Affirmologist. Your heaviest creators are your most engaged people, your evangelists, and your future creator-tier members; capping them is capping your best asset. So let them go as deep as they want and capture margin on the enthusiasm instead of rationing it. Design notes:
- Make the included monthly allotment feel ABUNDANT at each tier (not stingy), so most people never hit the wall and the wall only matters to the truly voracious.
- Price top-up credits ABOVE cost so every extra creation funds the compute and adds margin; the obsessed user is then a profit center, not a cost risk.
- Show the remaining balance calmly and make topping up one tap, so it feels like fuel, not a meter running against them.
- Decide roll-over (do unused monthly audios carry over?) and whether the creator/Affirmologist tier gets the biggest allotment and the best top-up rate (it should; they create for others).
- The obsessed user is also the natural ELEVATION candidate: their drive to make more is the on-ramp to the creator tier, where their output becomes content and community for others. Top-ups and the creator path are two answers to the same enthusiasm.
This flows into the product ladder (Affirmology_ProductLadder_Framework_v1.md) when tiers are finalized; the beta's real per-person numbers set the allotment sizes and the credit price.
FISH FOR SOUL SONGS: worth trying now, in the STUDIO sandbox only. The locked demo soul-song baseline stays ElevenLabs Charlotte (do not touch it). But for NEW soul songs, A/B a Fish voice against Charlotte on a real soul-song script: Fish is already wired (FISH_API_KEY, voices locked for readings/daytime), it is roughly 10 to 20x cheaper, and if a Fish voice carries the soul-song feeling through the QC gate, it transforms the render economics above. Start the A/B; keep the QC gate; confirm commercial terms on the paid Fish plan before anything client-facing.