osmTalk Docs

Changelog

Version history and release notes for osmTalk.

v0.5.0 — Shared Credits & Billing (April 2026)

Billing & Payments

  • Usage-based credits — Pay per call, no subscriptions
  • Shared credits with osmAPI — Single balance across both platforms
  • Razorpay integration — Top up via card, UPI, or netbanking
  • Billing dashboard — Balance, transactions, usage with INR/USD toggle
  • Pre-call balance check — Calls rejected if credits below ₹1
  • Cost breakdown per call — See LLM, STT, TTS, and SIP costs individually
  • Chat billing — LLM token costs deducted for chat messages
  • Inbound call billing — SIP charges for incoming phone calls
  • WhatsApp call billing — SIP charges for WhatsApp voice calls
  • Minimum charge — ₹1 minimum per call
  • Phone number setup fee — ₹50 one-time on purchase
  • Monthly phone rental — Automatic billing via background job
  • Low balance alerts — Email when credits drop below ₹10
  • Phone deactivation — Numbers auto-deactivated when rental unpaid
  • Data retention — Free 30-day storage, cleanup after expiry
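
The pre-call balance check, per-call cost breakdown, and ₹1 minimum charge above can be sketched roughly as follows. This is an illustration only, not the osmTalk implementation: the type and function names are hypothetical, and amounts are held as integer paise instead of using Decimal.js so the example has no dependencies.

```typescript
// Illustrative sketch. All names here are hypothetical, not osmTalk APIs.
// Amounts are integer paise (₹1 = 100 paise) to avoid floating-point drift.
type CostBreakdown = { llm: number; stt: number; tts: number; sip: number };

const MIN_BALANCE_PAISE = 100;     // calls are rejected below ₹1 (from the changelog)
const MIN_CALL_CHARGE_PAISE = 100; // ₹1 minimum per call (from the changelog)

// Pre-call check: a call may start only if the balance is at least ₹1.
function canStartCall(balancePaise: number): boolean {
  return balancePaise >= MIN_BALANCE_PAISE;
}

// Sum the per-component costs, then apply the per-call minimum.
function callCharge(costs: CostBreakdown): number {
  const total = costs.llm + costs.stt + costs.tts + costs.sip;
  return Math.max(total, MIN_CALL_CHARGE_PAISE);
}

// Render paise as a rupee string with two decimal places.
function formatInr(paise: number): string {
  const rupees = Math.floor(paise / 100);
  const rem = paise % 100;
  return "₹" + rupees + "." + (rem < 10 ? "0" : "") + rem;
}
```

Under these assumptions, a call whose components add up to 40 paise is still charged ₹1.00, while a call costing 250 paise is charged ₹2.50.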

Auth

  • Shared auth with osmAPI — Single login across both platforms
  • Login and registration redirect to osmAPI (email, Google, or GitHub sign-in)
  • Cross-domain token-based authentication
  • Redis session cache (5-minute TTL)
  • Auth guard on all dashboard pages
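
To illustrate what a session cache with a 5-minute TTL does, here is a minimal sketch that uses an in-memory Map as a stand-in for Redis. The `Session` shape and the `SessionCache` class are assumptions made for this example; a real deployment would presumably rely on Redis key expiry instead of checking timestamps by hand.

```typescript
// Illustrative sketch. An in-memory stand-in for a Redis session cache;
// Session and SessionCache are hypothetical names.
const SESSION_TTL_MS = 5 * 60 * 1000; // 5-minute TTL (from the changelog)

interface Session {
  userId: string;
  orgId: string;
}

class SessionCache {
  private entries = new Map<string, { session: Session; expiresAt: number }>();

  // The clock is injectable so expiry can be exercised without waiting.
  constructor(private now: () => number = () => Date.now()) {}

  set(token: string, session: Session): void {
    this.entries.set(token, { session, expiresAt: this.now() + SESSION_TTL_MS });
  }

  // Returns the cached session, or undefined if it is missing or expired.
  get(token: string): Session | undefined {
    const entry = this.entries.get(token);
    if (!entry) return undefined;
    if (this.now() >= entry.expiresAt) {
      this.entries.delete(token); // evict lazily on read
      return undefined;
    }
    return entry.session;
  }
}
```

On a cache miss, the caller would re-validate the token against the auth service and store the result again.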

Enterprise Features

  • Decimal.js precision — No floating-point billing errors
  • Atomic transactions — All-or-nothing credit deduction
  • Rate limiting — Per IP, user, and API key
  • Structured logging — Pino JSON logs for production
  • Background jobs — Phone billing + data retention cron endpoints
  • Idempotent webhooks — Duplicate payment protection
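
The ideas behind idempotent webhooks and all-or-nothing credit deduction can be sketched with a small in-memory ledger. Everything below is hypothetical (the `PaymentEvent` shape, the `CreditLedger` class, integer paise amounts); in production the same guarantees would more likely come from a database transaction and a unique constraint on the payment ID.

```typescript
// Illustrative sketch. PaymentEvent and CreditLedger are hypothetical names.
type PaymentEvent = { paymentId: string; userId: string; amountPaise: number };

class CreditLedger {
  private balances = new Map<string, number>();
  private processed = new Set<string>(); // payment IDs already credited

  // Credits the user once per payment ID. Returns false for a duplicate delivery.
  applyPayment(event: PaymentEvent): boolean {
    if (this.processed.has(event.paymentId)) return false;
    if (!Number.isInteger(event.amountPaise) || event.amountPaise <= 0) {
      throw new Error("invalid amount");
    }
    this.balances.set(event.userId, this.balance(event.userId) + event.amountPaise);
    this.processed.add(event.paymentId);
    return true;
  }

  // Deducts the full amount or nothing at all; never a partial deduction.
  deduct(userId: string, amountPaise: number): boolean {
    const current = this.balance(userId);
    if (amountPaise > current) return false;
    this.balances.set(userId, current - amountPaise);
    return true;
  }

  balance(userId: string): number {
    return this.balances.get(userId) ?? 0;
  }
}
```

If the payment provider retries a webhook, the second `applyPayment` call with the same `paymentId` returns false and leaves the balance unchanged.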

Infrastructure

  • Dual database (osmTalk + StartFlow shared DB on Neon)
  • Docker PostgreSQL removed (migrated to Neon cloud)
  • Multi-organization support with org switcher
  • Projects table for grouping agents

Docs

  • Billing documentation (overview, credits, pricing, top-up)
  • API reference for all endpoints
  • Phone number guides updated
  • Changelog added

v0.4.0 — MCP Server & Outbound Calls (March 2026)

New Features

  • MCP Server — Make phone calls from Claude Desktop (npm)
  • openingMessage — Agent speaks a pre-written message instantly when call connects (no LLM delay)
  • callerName — Agent identifies who it's calling on behalf of
  • Call history via MCP — View calls, transcripts, and analytics from Claude
  • Dashboard analytics — Total calls, success rate, top agents

Improvements

  • Widget outbound call validation (instruction length, E.164 phone format)
  • MCP tools: list_calls, get_dashboard, get_call_result
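
As a rough illustration of the outbound call validation mentioned above, the sketch below checks a phone number against the commonly used E.164 pattern (a leading +, a non-zero first digit, at most 15 digits) and bounds the instruction length. The function name and the 2000-character limit are assumptions; the changelog does not state the widget's actual limit.

```typescript
// Illustrative sketch. validateOutboundCall and MAX_INSTRUCTION_LENGTH are hypothetical.
const E164_PATTERN = /^\+[1-9]\d{1,14}$/; // "+", then 2 to 15 digits, no leading zero
const MAX_INSTRUCTION_LENGTH = 2000;      // assumed limit, not from the changelog

// Returns a list of validation errors; an empty list means the input is acceptable.
function validateOutboundCall(phone: string, instruction: string): string[] {
  const errors: string[] = [];
  if (!E164_PATTERN.test(phone)) {
    errors.push("phone must be in E.164 format, e.g. +919876543210");
  }
  const trimmed = instruction.trim();
  if (trimmed.length === 0) {
    errors.push("instruction must not be empty");
  } else if (trimmed.length > MAX_INSTRUCTION_LENGTH) {
    errors.push("instruction exceeds " + MAX_INSTRUCTION_LENGTH + " characters");
  }
  return errors;
}
```

Returning a list of messages, and not throwing on the first failure, lets the widget show all problems at once.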

v0.3.0 — WhatsApp & Call Transfer (February 2026)

New Features

  • WhatsApp integration — Connect WhatsApp Business numbers to voice agents
  • WhatsApp calling — Inbound and outbound calls via WhatsApp
  • Call transfer — Transfer active calls to a human agent or another number
  • Context summarization — Automatic conversation summarization for long calls

Improvements

  • ElevenLabs STT/TTS support (Scribe v2, Flash v2.5, Turbo v2.5)
  • Smart turn detection for natural conversations
  • Voicemail detection for outbound calls
  • End-call detection (agent hangs up when user says goodbye)

v0.2.0 — Phone Numbers & SIP (January 2026)

New Features

  • Phone number provisioning — Buy Indian phone numbers and assign to agents
  • Inbound calls — Assign agents to phone numbers for automatic answering
  • Outbound calls — Dial any number with a voice agent
  • SIP integration — LiveKit SIP bridge for PSTN connectivity
  • Call recordings — Automatic recording with MinIO storage
  • Widget embed — Embeddable voice/chat widget for websites

Improvements

  • Background sounds (office, nature, cafe)
  • Advanced VAD settings (confidence, start/stop times)
  • Multiple TTS voices per provider

v0.1.0 — Initial Release (December 2025)

Features

  • Voice agent creation with custom system prompts
  • Multi-provider support: OpenAI, Groq, Anthropic, Deepgram, Sarvam
  • Real-time voice calls via LiveKit WebRTC
  • Chat interface (text-based conversations)
  • Call transcripts and metrics
  • Team management with role-based access
  • Agent tools (HTTP tools, client tools, MCP servers)
  • Dashboard with call analytics