Job Description
About the Role:
We are looking for an experienced AI Conversational Engineer with strong expertise in real-time voice systems, Reinforcement Learning (RL)based model alignment, and custom LLM orchestration using PipeCat.
You will architect and optimize end-to-end conversational pipelines from Speech-to-Text and Text-to-Speech systems to SLM reinforcement and multi-agent orchestration ensuring low-latency, high-accuracy interactions at scale.
Core Engineering Responsibilities
1. Real-Time Conversational Systems
A. Design and build low-latency, high-concurrency conversational systems for both voice and text.
B. Integrate STT, TTS, and LLM/SLM components into unified, real-time architectures.
C. Develop and maintain PipeCat-based orchestration pipelines for multi-agent conversational flows.
D. Engineer robust streaming APIs and telephony integrations (VoIP/SIP).
2. Reinforcement Learning and Model Fine-Tuning
A. Fine-tune Small Language Models (SLMs) using Supervised Fine-Tuning (SFT) and RL (DPO, PPO) for alignment and personality control.
B. Design reward models to guide tone, factual accuracy, and conversational flow.
C. Build RL feedback loops for continuous model refinement based on user interactions.
3. Voice Synthesis and Adaptation
A. Develop high-quality ASR and TTS models for expressive, natural-sounding speech generation.
B. Apply speaker adaptation and voice cloning techniques for personalization.
C. Utilize Diffusion- or HiFi-GANbased vocoders for high-fidelity audio generation.
D. Engineer robust handling of sampling frequency, audio fidelity, and streaming performance.
4. Infrastructure, Serving, and Deployment
A. Build containerized inference microservices using Docker and Kubernetes.
B. Deploy Ray Servebased endpoints for distributed, dynamically batched inference.
C. Implement autoscaling, monitoring, and observability for production-grade systems.
D. Optimize serving for latency, throughput, and fault tolerance.
5. Guardrails, Security, and Reliability
A. Implement guardrail frameworks to protect against prompt injection, jailbreaks, and unsafe outputs.
B. Develop input sanitizers, content filters, and boundary-check mechanisms.
C. Maintain secure integrations with authenticated APIs and external toolchains.
D. Enable traceability through conversation logging, replay, and audit pipelines.
What We're Looking For:
3+ years in AI conversational systems or RL-driven model architectures.
Languages & Frameworks: Python, PyTorch, TensorFlow.
Core Expertise:
1) RL-based model alignment (SFT, PPO, DPO)
2) ASR/TTS pipeline design and optimization
3) Transformer architecture and optimization
4) Ray Serve + Kubernetes deployment
5) Secure orchestration using PipeCat
6) Programming & Engineering
Foundational Knowledge: Optimization, Statistics, and Linear Algebra.
Why You Should Join Us:
1) Opportunity to work on cutting-edge LLM infrastructure and automation in fintech.
2) High-impact role shaping the future of intelligent systems at Paytm.
3) Access to large-scale computing resources and datasets.
4) A collaborative, research-driven development environment.
5) Competitive compensation and benefits.
6) Opportunities for professional development and attending leading AI conferences.
Preferred Qualifications:
Bachelor's/Master's Degree in Computer Science or equivalent
Compensation: If you are the right fit, we believe in creating wealth for you. With enviable 500 mn+ registered users, 21 mn+ merchants and depth of data in our ecosystem, we are in a unique position to democratize credit for deserving consumers merchants and we are committed to it. Indias largest digital lending story is brewing here. It is your opportunity to be a part of the story!
Job Classification
Industry: Banking
Functional Area / Department: Data Science & Analytics
Role Category: Data Science & Machine Learning
Role: Data Scientist
Employement Type: Full time
Contact Details:
Company: Paytm
Location(s): Bengaluru
Keyskills:
data scientist
kubernetes
python
ai
llm
fintech
financial services
research
microservices
docker
reinforcement learning
compensation and benefits
tensorflow
automation
data science
voip
pytorch
programming
architecture
statistics