Use Cases | Spirelight | Speech Data Applications

01 · Cabin scenario

In-car voice across regions, accents, and noise.

Wake word | DA · SV · NO · DE | HVAC on | 60 km/h

Driver: "Hej bil, kør hjem." (Danish: "Hey car, drive home.")

Passenger: "Sätt på musiken." (Swedish: "Turn on the music.")

Cabin noise condition + accent metadata captured per take.

02 · STT transcript

Word-level timestamps, low-confidence flags, dialect coverage.

Domain audio | Word timestamps | IAA scored | Eval split ready

[00:01.420 → 00:01.890] "patient" · spk_02 · conf 0.97

[00:01.910 → 00:02.310] "tachycardia" · spk_02 · conf 0.62 ⚑

Low-resource accents structured into a controlled eval set.
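As a rough illustration of how word-level lines like the two above could be consumed downstream, here is a minimal Python sketch. It assumes the line layout shown in the sample; the `Word` class, `parse_line` function, and the 0.80 flag threshold are hypothetical and not part of any Spirelight delivery format (the sample only shows 0.62 flagged and 0.97 unflagged).

```python
import re
from dataclasses import dataclass

# Assumed cut-off for the low-confidence flag; the real value is not stated in the source.
LOW_CONF_THRESHOLD = 0.80

# Matches: [MM:SS.mmm → MM:SS.mmm] "word" · speaker · conf 0.NN (a trailing flag is ignored)
LINE_RE = re.compile(
    r'\[(\d+):(\d+\.\d+) → (\d+):(\d+\.\d+)\] "([^"]*)" · (\S+) · conf (\d*\.?\d+)'
)


@dataclass
class Word:
    start_s: float   # word start, seconds from the beginning of the take
    end_s: float     # word end, seconds from the beginning of the take
    text: str
    speaker: str
    conf: float

    @property
    def low_confidence(self) -> bool:
        return self.conf < LOW_CONF_THRESHOLD


def parse_line(line: str) -> Word:
    """Parse one transcript line into a Word; raise ValueError if it does not match."""
    m = LINE_RE.match(line.strip())
    if m is None:
        raise ValueError(f"unrecognised transcript line: {line!r}")
    m1, s1, m2, s2, text, speaker, conf = m.groups()
    return Word(
        start_s=int(m1) * 60 + float(s1),
        end_s=int(m2) * 60 + float(s2),
        text=text,
        speaker=speaker,
        conf=float(conf),
    )


sample = [
    '[00:01.420 → 00:01.890] "patient" · spk_02 · conf 0.97',
    '[00:01.910 → 00:02.310] "tachycardia" · spk_02 · conf 0.62 ⚑',
]
words = [parse_line(line) for line in sample]
for_review = [w.text for w in words if w.low_confidence]
```

Here `for_review` collects the words a reviewer would look at first; the threshold would normally be tuned per domain.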

03 · TTS session

Studio-grade capture with linked speaker metadata.

48 kHz / 32-bit | Expressive prompts | Voice profile linked | Take 03 ✓

Script line 04 of 120 · neutral → warm → urgent passes.

Speaker: F · 32 · DK central · trained voice talent.

Same room, same mic, same mouth-distance every session.

04 · Conversation & assistive

Multi-speaker dialogue and accessibility-first capture.

Multi-speaker | Channel-separated | Consent ledger | IRB-compatible

Agent (ch L): "Let's pull up your account first."

Customer (ch R): "It's been three calls about this."

Speaker selection and consent designed around accessibility.

Speech data for the use cases you ship into production

Four use cases where voice data has to match production conditions.

Automotive & mobility: In-car voice across regions, accents, and noise.

Speech-to-text systems: Word-level timestamps, low-confidence flags, dialect coverage.

Text-to-speech & voice apps: Studio-grade capture with linked speaker metadata.

Conversation & assistive: Multi-speaker dialogue and accessibility-first capture.

Three patterns we apply across every use case.

Coverage by region, not by language code.

Crew, kit, and consent in the markets that matter.

Eval splits stratified by accent, condition, and difficulty.
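One way to picture a split stratified by accent, condition, and difficulty is the short Python sketch below. It is an illustration under stated assumptions, not a description of Spirelight's pipeline: the `stratified_split` function, the field names (`accent`, `condition`, `difficulty`), and the 20% eval fraction are all hypothetical.

```python
import random
from collections import defaultdict


def stratified_split(items, key, eval_fraction=0.2, seed=0):
    """Split items into (train, eval) so every stratum contributes to eval.

    `key` maps an item to its stratum, e.g. (accent, condition, difficulty).
    Strata with a single item stay entirely in train.
    """
    rng = random.Random(seed)  # fixed seed keeps the split reproducible
    strata = defaultdict(list)
    for item in items:
        strata[key(item)].append(item)

    train, eval_set = [], []
    for stratum in sorted(strata):
        group = list(strata[stratum])
        rng.shuffle(group)
        if len(group) > 1:
            n_eval = max(1, round(len(group) * eval_fraction))
        else:
            n_eval = 0
        eval_set.extend(group[:n_eval])
        train.extend(group[n_eval:])
    return train, eval_set


# Hypothetical take metadata: 2 accents x 2 noise conditions, 10 takes each.
takes = [
    {"id": f"{accent}-{condition}-{i}", "accent": accent,
     "condition": condition, "difficulty": "easy"}
    for accent in ("DK central", "SE south")
    for condition in ("HVAC on", "HVAC off")
    for i in range(10)
]
train, eval_set = stratified_split(
    takes, key=lambda t: (t["accent"], t["condition"], t["difficulty"])
)
```

Splitting within each stratum, rather than over the whole pool, keeps rare accent and condition combinations from vanishing from the eval set.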

Same workflow, configured per use case.

Map the use case to data requirements.

Collect with crew and conditions to match.

Deliver structured, audited, training-ready.

Tell us what training data you need
