CareerSIM is a direct-to-consumer (B2C) web application that helps individuals master critical career skills through hyper-realistic, AI-powered simulations. Users practice challenging professional situations โ from behavioral interviews to difficult conversations โ in a safe, repeatable environment and receive immediate, data-driven feedback to accelerate their personal and professional growth.
The platform empowers users to build confidence and competence for career-defining moments. By leveraging a LangGraph-based generative AI engine, CareerSIM provides dynamic, conversational practice with a diverse cast of AI personas, moving beyond rote memorization to foster genuine skill development.
Note
Repository is mid-migration. The active runtime is api/ + web/ + agent/ + postgres + redis, plus livekit + agent-voice for the browser-native voice surface, with a standalone static marketing site in landing/. Four earlier services (backend/, frontend/, rag/, transformers/) are still in the tree for reference only and are flagged as deprecated in both their own READMEs and docker-compose.local.yml. Do not build new features against them.
CareerSIM runs as three first-party services plus shared infrastructure. The API owns all persistence; the agent is a pure function of its inputs (full state snapshot in, new messages + updated state out) โ this makes replay, testing, and horizontal scaling straightforward. Voice mode adds two more processes โ a self-hosted LiveKit SFU and a dedicated agent-voice worker โ that reuse the same LangGraph engine over a WebRTC audio path.
โโโโโโโโโโโโโโโโโโโโโโโโ โโโโโโโโโโโโโโโโ
โ Web (Next.js 16) โ โโโโโ WebRTC (mic) โโโโโโบ โ LiveKit SFU โ
โ App Router + RSC โ โโโโโ agent audio โโโโโโโ โ :7880/:7882 โ
โ :3000 โ โโโโโโโโฌโโโโโโโโ
โโโโโโโโโโโโฌโโโโโโโโโโโโ โ PCM
โ REST + SSE (JWT Bearer) โ
โ text chat ยท sessions ยท auth ยท nudges โ
โโโโโโโโโโโโผโโโโโโโโโโโโ โโโโโโโโผโโโโโโโโ
โ API (Fastify 5 + โ โโโ internal + voice โโโโ โ agent-voice โ
โ Drizzle + Zod) โ turns over REST โ worker โ
โ :8000 โ โ STTโLGโTTS โ
โโโโโฌโโโโโโโโโโโโโโโฌโโโโ โโโโโโโโโโโโโโโโ
โ โ
POST /conv/* โ โ SQL
(batch + SSE) โ โ
โโโโโผโโโโโโโโ โโโโโผโโโโโโโโโโโโ
โ Agent โ โ PostgreSQL 17 โ
โ FastAPI + โ โ + Redis 7 โ
โ LangGraph โ โ :5432 / :6379 โ
โ :8001 โ โโโโโโโโโโโโโโโโโ
โ โ
โ embedded โ
โ Chroma โ
โโโโโโโโโโโโโ
The text-chat path (Web โ API โ Agent โ Postgres) is unchanged from the non-voice runtime. Voice adds a parallel audio lane: the browser streams mic audio to the LiveKit SFU, the agent-voice worker runs STT โ the same LangGraph turn โ TTS, and it loops every spoken turn back through the API's existing message/internal routes โ so persistence, goal eval, and quota all run through one engine, not a second copy.
Voice mode has its own operator runbook โ see VOICE_MODE.md for the end-to-end architecture, kill switches, provider selection, daily-budget enforcement, and the pre-merge smoke checklist.
| Service | Stack | Description |
|---|---|---|
| landing | Astro 6 static site, TypeScript, plain CSS | Public careersim.local marketing page generated from the Figma landing design. Includes a manual Figma sync script for reference screenshots and node metadata. |
| web | Next.js 16 (App Router, React 19), TypeScript 6, Tailwind 3 | Client-rendered SPA over the API: auth, simulation picker, session chat with SSE streaming, nudge auto-polling, follow-up bursts. |
| api | Node.js 22, Fastify 5, Drizzle ORM, PostgreSQL, @fastify/jwt, argon2id, Zod 4 |
Owns auth, persistence, and all session state. Proxies agent calls (including SSE) and enforces per-session ownership, nudge cadence, and proactive-trigger policy. |
| agent | Python 3.11+, FastAPI, LangGraph, Chroma (embedded), OpenAI / OpenRouter | Stateless conversation engine. One binary serves a Gradio dev console, a FastAPI production server (--serve api), or the voice worker (--serve voice). Retrieval uses an embedded Chroma store โ no separate RAG service. |
| agent-voice | Python 3.11+, LiveKit Agents SDK, faster-whisper / Piper (self-hosted defaults) | Dedicated voice worker. Joins LiveKit rooms minted by the API, runs STT โ the same LangGraph turn โ TTS, streams captions, and posts transcripts back through the public API on the user's bearer. |
| livekit | LiveKit Server (WebRTC SFU) | Self-hosted SFU routing audio between the browser and agent-voice. Runs in --dev mode locally; needs a real config + TLS in production. |
| postgres | PostgreSQL 17 | Source of truth for users, sessions, messages, state snapshots, and daily voice-minute usage. |
| redis | Redis 7 | Shared rate-limit buckets (@fastify/rate-limit); falls back to per-process LRU when REDIS_URL is unset. |
| Legacy directory | Replaced by | Notes |
|---|---|---|
backend/ (Express + TypeORM + Socket.IO + Stripe) |
api/ |
REST surface, JWT, and SSE streaming fully rewritten on Fastify. |
frontend/ (Vite + React 18 SPA) |
web/ |
Retro theme ported 1:1; admin panel / analytics dashboards dropped โ the new API doesn't back them yet. |
transformers/ (FastAPI + HuggingFace sentiment / emotion / toxicity) |
agent/ |
Evaluation now happens in-process via an LLM eval model (OPENAI_EVAL_MODEL). |
rag/ (FastAPI + ChromaDB service) |
agent/ |
Chroma is embedded directly in the agent; no HTTP boundary. |
Each of those directories carries a > [!WARNING] DEPRECATED banner at the top of its README. They are also commented out in docker-compose.local.yml under a deprecation block and will be removed in a future clean-up pass.
careersim/
โโโ landing/ # Astro static landing page + Figma design sync
โ โโโ src/{pages,styles}
โ โโโ figma/ # source design metadata and sync summary
โ โโโ scripts/sync-figma.mjs
โโโ api/ # Fastify + Drizzle API (active)
โ โโโ src/{agent,config,db,modules,plugins}
โ โโโ tests/ # Vitest + pglite + FakeAgent
โ โโโ drizzle.config.ts
โโโ web/ # Next.js 16 App Router frontend (active)
โ โโโ src/{app,components,contexts,lib}
โโโ agent/ # Python LangGraph agent (active)
โ โโโ src/careersim_agent/
โ โ โโโ voice/ # voice worker, STT/TTS providers, persona voice
โ โโโ scripts/ # voice_smoke.py, voice_perf.py, prefetch_voice_models.py
โ โโโ data/{personas.json,simulations.json,documents/}
โ โโโ tests/
โโโ backend/ # DEPRECATED โ superseded by api/
โโโ frontend/ # DEPRECATED โ superseded by web/
โโโ rag/ # DEPRECATED โ absorbed into agent/
โโโ transformers/ # DEPRECATED โ absorbed into agent/
โโโ infrastructure/ # Targets legacy stack; needs a pass for api/web/agent
โ โโโ aws/ # Terraform: ECS/Fargate, RDS, ElastiCache, ALB
โ โโโ aws-transformers/ # Standalone Transformers deployment
โ โโโ k8s/ # Kustomize (dev + prod overlays)
โโโ docker-compose.local.yml # Local dev stack (api + web + agent + agent-voice + livekit + postgres + redis)
โโโ PERSONAS.md # AI persona definitions
โโโ VOICE_MODE.md # Voice-mode operator guide + pre-merge smoke checklist
โโโ LICENSE.md # MIT
โโโ README.md # (this file)
- Docker + Docker Compose (for the one-shot path)
- Node.js โฅ 22.12 for
api/; Node.js โฅ 20 forlanding/andweb/; pnpm โฅ 10 - Python โฅ 3.11 +
[uv](https://docs.astral.sh/uv/)(for runningagent/outside Docker) - An OpenAI-compatible API key (OpenAI, OpenRouter, โฆ) for the agent
# 1. Configure each service's .env (all three are required)
cp agent/.env.example agent/.env # set OPENAI_API_KEY + model names
cp api/.env.example api/.env # set JWT_SECRET (min 16 chars)
cp web/.env.example web/.env # usually OK as-is; NEXT_PUBLIC_API_URL=http://localhost:8000
# 2. Bring the whole stack up (builds on first run, hot-reloads after)
docker compose -f docker-compose.local.yml up --buildThis starts:
| URL | Service |
|---|---|
| http://localhost:3000 | web โ Next.js app |
| http://localhost:8000 | api โ Fastify API |
| http://localhost:8000/docs | API Swagger UI (zod schemas โ OpenAPI) |
| http://localhost:8001 | agent โ FastAPI (stateless) |
| http://localhost:8001/docs | Agent Swagger UI |
| ws://localhost:7880 | livekit โ WebRTC SFU signalling (voice mode) |
| localhost:5432 | PostgreSQL (careersim / careersim_password) |
| localhost:6379 | Redis |
The compose stack also starts the agent-voice worker (no published port โ it
joins LiveKit rooms outbound). The first --build is slower than the others
because the voice image prefetches the default faster-whisper + Piper models
into the image / a named volume. Set VOICE_ENABLED=false in agent/.env +
api/.env to skip voice entirely (the worker exits cleanly and the Call button
is hidden) โ see VOICE_MODE.md.
The api container runs Drizzle migrations on start; no manual seeding needed. Register a new user from the web UI (there is no default admin account โ that concept belonged to the legacy backend/).
Useful when you want a faster hot-reload loop for a single service. Each subdirectory has its own README with detailed flags.
# Landing
cd landing && pnpm install && pnpm dev # :4321
# API
cd api && pnpm install && pnpm db:migrate && pnpm dev # :8000
# Web
cd web && pnpm install && pnpm dev # :3000
# Agent (FastAPI production server)
cd agent && uv sync && uv run python -m careersim_agent.main --serve api --port 8001
# Agent (Gradio dev console โ stateful, good for prompt iteration)
cd agent && uv run python -m careersim_agent.main # :7860
# Agent voice worker (joins LiveKit rooms; needs a running livekit + api)
cd agent && uv run python -m careersim_agent.main --serve voiceWhen mixing Docker + host, point each service at the others via host.docker.internal / localhost as appropriate โ see docker-compose.local.yml for the canonical wiring.
Nine first-party simulations shipped in agent/data/simulations.json, each bound to a persona with its own hidden goals, difficulty, and success criteria. The web app lists them at /simulations.
The api exposes POST /sessions/:id/messages/stream, which proxies the agent's SSE stream end-to-end. The web client shows an optimistic echo of the user's message, renders a typing indicator until the first AI chunk lands, then streams the reply token-by-token. Persistence happens exactly once when the upstream emits done.
Stateful graph inside agent/:
- Processes user input, fetches embedded-Chroma context, generates a persona response, and runs an LLM-based eval pass (sentiment / emotion / per-goal progress).
- Proactive messages are explicit graph branches:
start(conversation opener, fired during session init),followup(multi-message burst, capped byburstiness.max), andinactivity(guardrailed nudge). - Fully stateless at the API boundary โ the caller-owned
state_snapshotis sent on every turn, mirroring theTestStatelessnesssuite inagent/tests/test_api.py.
The api exposes POST /sessions/:id/nudge; the server decides idempotently whether to fire based on the persona's inactivityNudgeDelaySec window and inactivityNudges.max budget. The web client polls every 5 s while idle and stops automatically when the server returns nudges_disabled or budget_exhausted, re-arming on the next human reply. See api/README.md for the exact contract.
POST /sessions/:id/proactive/stream drives persona-initiated follow-ups capped by burstiness.max - 1 additional messages. The web UI exposes this behind a "Follow up" button and surfaces the cap as a {N} followups max badge alongside typing speed and nudge count.
Real-time spoken practice with the same personas, no install required. The web client mints a short-lived LiveKit token via POST /sessions/:id/voice/start, joins a self-hosted WebRTC SFU, and the agent-voice worker runs STT โ the existing LangGraph turn โ TTS in the room โ captions stream over a data channel and transcripts persist back through the public API on the user's bearer (audio itself is never stored). Defaults are fully self-hosted (faster-whisper + Piper); OpenAI, Deepgram, and ElevenLabs are opt-in per provider. Each persona declares a voice block (speaking rate, barge-in tolerance, filler-word density, per-provider voice IDs) so characters sound distinct. A daily per-user minute budget (VOICE_DAILY_MINUTES_PER_USER) is enforced authoritatively by the worker, and a single VOICE_ENABLED=false flag is a full kill switch. See VOICE_MODE.md for the operator runbook.
Per-simulation and per-persona Markdown under agent/data/documents/ is indexed into a persisted Chroma store (volume agent_chroma_db). No separate HTTP hop.
Sentiment and emotion for both sides of the conversation, plus per-goal progress with confidence scoring, computed by a cheaper eval model (OPENAI_EVAL_MODEL) on every turn. Results are persisted to the session's analysis + goal_progress columns and returned in GET /sessions/:id.
Email + password โ JWT bearer (stored in localStorage on the web side; Authorization: Bearer on every request). Passwords hashed with argon2id. No refresh tokens, email verification, or Stripe billing โ all of that lived in the deprecated backend/ and has not been re-implemented.
Shipped in agent/data/personas.json. Each declares a conversationStyle that the runtime surfaces in GET /sessions/:id.session_config and the web UI badges.
| Persona | Role | Simulation slug | Typing (wpm) | Nudges max | Burst max |
|---|---|---|---|---|---|
| Brenda Vance | By-the-Book HR Manager | behavioral-interview-brenda |
110 | 2 | 3 |
| Alex Chen | Passionate Tech Lead | tech-cultural-interview-alex |
140 | 3 | 3 |
| David Miller | Senior Analyst / Skeptical Veteran | pitching-idea-david |
120 | 2 | 1 |
| Sarah Jenkins | Overwhelmed Project Manager | saying-no-to-extra-work-sarah |
130 | 2 | 2 |
| Michael Reyes | Disengaged High-Performer | reengaging-disengaged-employee-michael |
100 | 1 | 1 |
| Chloe Davis | Eager but Anxious Junior | delegating-task-chloe |
135 | 2 | 2 |
| Priya Patel | Senior Data Analyst | data-analyst-technical-interview-priya |
125 | 3 | 2 |
| Vikram Shah | Pipeline-Pressured Recruiter | recruiter-coldreach-vikram |
150 | 3 | 3 |
| Marcus Whitfield | Time-Boxed VP of Engineering | informational-chat-marcus |
95 | 1 | 1 |
Each persona also declares a voice block (speakingRateWpm, bargeInToleranceMs, fillerWordFrequency, silenceThresholdMs, and per-provider voice settings for piper_local / openai_tts / elevenlabs) that drives how it sounds in voice mode โ so Vikram is rapid-fire and barges in quickly while Marcus is slow, dry, and tolerant of silence.
See PERSONAS.md for the full persona definitions, hidden goals, and success criteria.
agent/test_simulation.py is a CLI harness for running a complete simulation end-to-end against the agent's Gradio dev console โ useful for sanity-checking a new persona, a tweaked goal, or an evaluation-threshold change without clicking through the UI turn-by-turn. Two modes: interactive (you type the user's side) or **--auto** (an OpenAI-driven candidate plays the user side using a per-simulation strategy prompt baked into the script). The full transcript and per-turn goal-progress snapshots can be written to agent/logs/ with --log and exported as JSON with --json.
# Requires the Gradio dev console running on :7860 and OPENAI_API_KEY for --auto.
cd agent && uv run python -m careersim_agent.main &
cd agent && uv run python test_simulation.py --list
cd agent && uv run python test_simulation.py \
--sim recruiter-coldreach-vikram --auto --log --jsonSee agent/README.md for the full flag table and the convention for adding SIMULATION_PROMPTS entries when introducing a new simulation.
| Layer | Technology |
|---|---|
| Landing | Astro 6 static output, TypeScript 5.9, plain CSS, Figma reference sync |
| Web | Next.js 16 (App Router, Turbopack), React 19, TypeScript 6, Tailwind CSS 3, eventsource-parser, livekit-client (lazy) |
| API | Node.js 22.12+, Fastify 5, TypeScript 6 (strict ESM), Drizzle ORM + drizzle-kit, @fastify/jwt, argon2id, Zod 4 + fastify-type-provider-zod, undici, livekit-server-sdk |
| Agent | Python 3.11+, FastAPI, LangGraph, Chroma (embedded), OpenAI SDK, Gradio 5, uv |
| Voice | LiveKit Server (SFU) + LiveKit Agents SDK; faster-whisper + Piper self-hosted defaults; OpenAI Whisper/TTS, Deepgram, ElevenLabs opt-in |
| Data | PostgreSQL 17, Redis 7 |
| LLM / models | OpenAI-compatible chat + embeddings (OpenAI, OpenRouter, โฆ); in-process LLM eval |
| Testing | Vitest 4 + @electric-sql/pglite + FakeAgent (api), pytest + _FakeGraph (agent) |
| Infrastructure | Docker, Docker Compose; Terraform (AWS) and Kustomize (K8s) checked in but targeting the legacy layout |
Production-ready ECS Fargate topology with VPC, ALB, RDS PostgreSQL, ElastiCache Redis, EFS, Cloud Map service discovery, and optional GPU instances for the deprecated transformers service. Currently wired for the legacy backend + frontend + transformers + rag stack and has not been updated for api + web + agent.
Self-hosted deployment with dev and prod overlays, StatefulSets for databases, and GPU scheduling. Same caveat as above โ it targets the legacy layout.
Important
Both infrastructure trees need a pass before a production deploy of the new stack. The local docker-compose.local.yml is the canonical topology in the meantime.
# From the repo root
docker compose -f docker-compose.local.yml up --build # full stack
docker compose -f docker-compose.local.yml logs -f api web # tail two services
docker compose -f docker-compose.local.yml restart api # after a .env change
# Landing
cd landing && pnpm check && pnpm build
cd landing && pnpm sync:figma # requires FIGMA_TOKEN in the shell
# API
cd api && pnpm test # vitest (pglite + FakeAgent โ no OpenAI, no network)
cd api && pnpm e2e # interactive CLI against a live stack
cd api && pnpm typecheck
# Web
cd web && pnpm lint && pnpm typecheck
# Agent โ unit / contract tests (mocked LangGraph, no OpenAI key needed)
cd agent && uv run pytest
# Agent โ end-to-end simulation runs against a live Gradio dev console.
# Useful for sanity-checking new personas, goals, or eval thresholds.
# Requires the dev console running (`uv run python -m careersim_agent.main`)
# and OPENAI_API_KEY for --auto. See agent/README.md for the full flag list.
cd agent && uv run python test_simulation.py --list
cd agent && uv run python test_simulation.py \
--sim recruiter-coldreach-vikram --auto --log --json- Port infrastructure to the new stack โ Terraform + Kustomize for
api/web/agent/agent-voice/livekit. - Remove the deprecated directories once the migration is considered complete and nothing still references them.
- Real phone-call practice with AI personas โ building on the shipped browser-native voice mode, extend to outbound (the persona calls the user at a scheduled time) and inbound (the user dials a number) calls via Twilio Voice / LiveKit SIP, with the same LangGraph engine driving the conversation.
- Community features โ public leaderboards and discussion forums.
- Certification paths โ structured learning programs with shareable certificates.
- AI persona builder โ user-created custom personas for specialized practice.
- Team / B2B version โ enterprise offering with team analytics and management.
This project is licensed under the MIT License โ see the LICENSE.md file for details.
Pavel Vdovenko (reactivecake@gmail.com)