star-notify-white
Communicate fluently in 100+ languages and dialects with native-sounding accents, making global outreach effortless.

Senior Data Scientist – Speech AI

Are you a seasoned Data Scientist with deep expertise in Text-to-Speech (TTS) and Speech-to-Text (STT) technologies? Ready to shape the future of voice AI and build systems that power global real-time communication? WorkForce International is proud to partner with one of the most exciting startups in the conversational AI space to find a passionate and talented Senior Data Scientist – Speech AI to join their growing AI core team.

Our client is a rising global startup transforming human–AI interaction. As the creators of Kalimera.ai — the world’s leading virtual voice assistant for call centers and real-time communication — they are redefining how businesses engage with customers. With offices in Cyprus, Greece, and India, they’re scaling rapidly and using cutting-edge innovations in TTS, STT, and large language models to lead the next wave of voice automation.

Key Responsibilities:

  • Design, train, and optimize TTS models (e.g., Tacotron, FastSpeech, WaveNet) to deliver high-quality, multilingual voice synthesis.
  • Enhance STT pipelines with a focus on real-time accuracy, speaker diarization, and robust transcription for diverse accents and environments.
  • Collaborate closely with engineering and product teams to deploy models at scale in high-availability environments.
  • Lead research and benchmarking of voice models for performance, naturalness, and use-case fit (call centers, voice assistants, etc.).
  • Optionally integrate LLMs to boost STT performance with capabilities like sentiment analysis, summarization, and intent detection.
  • Influence architecture decisions and mentor peers as the team expands.

Your Profile:

  • 5+ years of hands-on experience in AI/ML with a strong focus on TTS and/or STT systems.
  • Proficient in Python and deep learning libraries such as PyTorch, TensorFlow, torchaudio, librosa, etc.
  • Practical experience with speech frameworks like ESPnet, NVIDIA NeMo, Coqui TTS, or similar.
  • Solid knowledge of speech signal processing, neural vocoders, and real-time system optimization.
  • Experience with deploying models in cloud-native or Kubernetes-based environments.
  • Strong collaboration, communication, and documentation skills.
  • Bonus: Experience working with LLMs or NLP tools in speech-related applications.

Why this role stands out:

Shape the voice of tomorrow

Be part of a mission-driven team that’s already powering real-world enterprise deployments through Kalimera.ai.

Innovate at the edge of AI

Work with a cutting-edge stack that combines speech and language models in real-time environments.

Grow your impact

Take ownership in a high-visibility role, with a chance to lead and mentor as the AI team scales.

Work globally, collaborate deeply

Join a distributed, multicultural team spanning Europe and Asia with ambitious plans for global expansion.

This role is exclusively managed by WorkForce International, a premier global talent partner. We specialize in connecting top-tier professionals with innovative tech companies shaping the future. Ready to make your mark? Apply now and let’s talk about your next big move.