Skip to main content
Frequently asked

Questions that come up.

Browse by category, or scan the rail and jump straight in. Every answer links back to the page where the topic lives in full.

What's in here
01 / Product & how it works

What it is, what it does on a call, how it fits into your stack.

01 What does TransVoix actually do? +

Real-time, two-way speech translation on a live phone call. Your agent speaks; the customer hears it in their own language, sub-second latency, on the same call leg — no third human on the line. Both sides understand each other directly.

The pipeline (STT → translate → TTS) runs at the SIP boundary. See How It Works for the full anatomy of a call.

02 How is this different from a human interpreter? +

A human interpreter is one scheduled person, one call at a time, at roughly $2–4/min — and you wait on availability. TransVoix is real-time and always-on across every call and language pair, in each speaker's own voice, at roughly 1/3 to 1/5 that cost.

It's built for the full range of conversations — not a low-stakes carve-out. Teams that still want a human in the loop for a specific moment can keep one, but that's their choice, not a platform limit. See Pricing.

03 Where does the audio actually flow during a call? +

Audio is intercepted at your SIP boundary, processed in our US region, and returned to the call leg. No audio leaves the US. The audio path itself is stateless — frames are processed in memory and discarded.

The full subprocessor path (which vendors touch audio vs. metadata only) is documented on Compliance.

04 Does it work for inbound, outbound, and conference calls? +

Inbound and outbound today. Multi-party / conference is on the roadmap and works in pilot deployments today with a single target language. If you need it, mention it during your discovery call.

02 / Languages & quality

Coverage, accuracy, voice fidelity, and how we measure it.

01 Which languages do you support? +

Six languages live in production today — English, Spanish, French, Mandarin, Arabic, Hindi — for 30 bidirectional pairs. Tagalog, Vietnamese, Korean, and Portuguese are in beta. New languages ship on a published roadmap.

The full matrix — quality and speaker-count notes per language — lives on Languages.

02 How do you measure translation quality? +

An LLM-judge benchmark across the 30 production language pairs — average 4.72/5, scored 1–5. The methodology and the per-pair scores are on Languages.

03 You clone the agent's voice — is that consented? +

Yes. Voice cloning, recording, and biometric capture all require fresh affirmative consent each session — never carried over from a stored preference. Consumer voice profiles are deleted on call end or consent revocation, whichever comes first; per-pilot agent voices are retained per the pilot agreement and are never reused across tenants.

The consent and provenance posture is detailed on Compliance.

04 How does it handle accents, dialects, and code-switching? +

Translation routes to dialect-aware variants where it matters (e.g. MX vs. ES Spanish, Brazilian vs. European Portuguese, Mandarin vs. Cantonese). Code-switching mid-utterance is detected at the phrase level and translated in place.

03 / Security & privacy

Where data lives, who can see it, what we will and will not do.

01 Where is call audio stored, and for how long? +

By default the audio path is stateless — frames are processed in memory and discarded, not written to disk. Recording is opt-in per tenant; when enabled, recordings are encrypted (AES-256) with per-call signed URLs that expire in 7 days.

The full retention matrix is on Compliance.

02 Is customer call audio used to train your models? +

No. Customer call content is never used to train models. It's a hard line — verified by data-egress tests in CI, not just a policy doc. See the “Lines we don't cross” block on Compliance.

03 How is access to call data controlled? +

Sensitive tables (call recordings, consent audit, call turns) enforce service-role-only Row-Level Security, scoped to the originating tenant. The anonymous database key cannot read or write protected content — verified by a curl probe after every migration.

Consent and access events are written to an append-only audit trail — every grant, revoke, and expiry is a separate row, no UPDATE, no DELETE — so the state of access at any past moment is provable.

04 What encryption is in place? +

TLS 1.2+ on every endpoint, with WebSocket audio frames carried over WSS. AES-256 at rest; recordings encrypted with per-call signed URLs (7-day expiry). Provider API keys are stored as encrypted platform secrets — never committed to source, never logged.

Encryption and key-management posture is on the Compliance page.

04 / Compliance

HIPAA, SOC 2, BAAs, regional posture — the procurement asks.

01 Are you HIPAA-compliant? +

We sign BAAs and operate under HIPAA-aligned controls — encryption, access logging, breach notification, minimum-necessary data handling. BAAs are executed at pilot onboarding for healthcare tenants, before any production traffic flows; subprocessor BAAs are executed (or pre-signed) before that vendor sees any session tagged with a HIPAA compliance profile.

We don't claim “HIPAA certified.” The full posture is on Compliance.

02 What's your SOC 2 status? +

We're partway through a Type I audit with a recognized firm. The controls are designed and operating; the auditor's observation period is in progress. We'll publish the report (and start a Type II observation) the moment it's signed. We don't claim “SOC 2 certified” because we're not, yet.

The report and pen-test summary are available under NDA — request via the receipts list on Compliance.

03 Do you sign BAAs and DPAs? +

Yes to both. The BAA is executed at pilot onboarding for healthcare tenants. The DPA covers GDPR-scoped customers; custom redlines are accepted on Enterprise. Email security@transvoix.ai.

04 Are you available outside the US? +

By default, US data processing for every tenant. EU residency is available as an Enterprise-tier contractual term. Other regions are on the roadmap — if you have a non-US deployment requirement, raise it during discovery.

05 / Data & training

What we keep, what we delete, what crosses our boundary.

01 What customer data do you keep, and why? +

Call metadata (timestamps, duration, language pair, agent ID, tenant ID, error codes) for billing and debugging. Recording is opt-in per tenant. Voice-profile embeddings for any enrolled agents.

We do not keep transcripts, translation outputs, or user PII unless explicitly contracted into a retention plan.

02 Can we delete a specific call's data on demand? +

Yes — per-tenant deletion controls, with the deletion event captured in the append-only audit trail. Full-tenant export and purge on contract termination is documented in the DPA.

03 How do you handle subprocessor changes? +

Advance notice goes to your primary security contact before a new subprocessor that touches audio or PHI. The current named-vendor subprocessor list is available under NDA via the receipts list on Compliance.

04 What happens to our data if we churn? +

On contract termination there's a contractual export window, then a purge across primary, replicas, and backups. The exact mechanics and timelines are in the standard DPA.

06 / Integrations & deployment

SIP, APIs, pilots, and what “agent leg” means.

01 How do we integrate TransVoix with our phone system? +

SIP-native architecture. Live today on Amazon Connect; designed to integrate with Genesys, Five9, NICE CXone, and other SIP-capable contact centers per customer engagement.

02 How long does a pilot take to provision? +

Provisioning timing scales with the agreed-upon scope and size of the pilot. We need SIP credentials, one disclosure script, and a list of pilot agents.

The pilot is free — no fee, no credit card, no commitment. See Pricing.

03 Do you have an API or SDK? +

A REST + WebSocket API for session control, agent enrollment, and audit-pack retrieval. No client SDK is required for the standard SIP deployment; the SDK/SIP surface expands on a published roadmap.

04 What is the “agent leg” of a call? +

The audio segment on your agent's side of the call — distinct from the customer leg. Billing is metered on audio processed end-to-end on the agent leg only; idle time and silence over 500ms aren't billed.

Still have questions?

Talk to a person.

Procurement, security, and clinical teams: a 30-minute discovery call covers what an FAQ can't.

Book a discovery call tobi@transvoix.ai