Strategic Deployment Engineer, Chanakya

Job type: Full Time · Department: Engineering · Work type: On-Site

Delhi Division, Delhi, India; Bengaluru, Karnataka, India

About Sarvam

Sarvam is building the bedrock of Sovereign AI for India. The company is developing India's full-stack sovereign AI platform, building across research, models, infrastructure and applications with a singular focus on making AI genuinely work for India. Sarvam works with leading enterprises and public institutions and is backed by Lightspeed, Peak XV, and Khosla Ventures. Sarvam partners with India's leading brands, including Tata Capital, SBI Life, CRED, IDFC, and LIC.

About the Role

Strategic Deployment Engineers are Sarvam's forward-deployed technical assets. Embedded with clients, you own the full lifecycle of AI system deployments — including in air-gapped, classified, and on-prem environments — and in complex enterprise accounts where standard playbooks don't apply. You are the technical SPOC for your assigned accounts. Success is measured by whether the system works, the client trusts us, and the deployment creates durable capability — not by ticket closure. You will operate with autonomy and carry real accountability: for the system, for the relationship, and for outcomes.

What You'll Do

Own end-to-end deployment of Sarvam's full AI stack in client environments — on-prem, air-gapped, classified infrastructure, and complex enterprise accounts
Serve as technical SPOC for assigned accounts, from scoping and PoC through to steady-state operations
Diagnose and resolve integration failures, model drift, inference issues, and infrastructure breakdowns without escalation ladders
Surface field learnings that feed back into the product layer and replicable deployment library
Manage deployment pipelines, model serving, and environment configuration in non-standard, constrained settings
Drive client-side adoption through documentation, training, and operational handover where required
Own client satisfaction (CSAT, time-to-value, uptime) for your accounts; flag risks before they become escalations

What We're Looking For

3–6 years in software or ML engineering, with at least one full-cycle on-prem or enterprise deployment delivered end-to-end
Production-grade experience in Python, Docker, Linux systems administration, REST APIs, and CI/CD pipelines
Hands-on experience with LLM inference stacks — vLLM, TGI, Ollama, or equivalent — and RAG architectures and vector stores
Experience deploying in constrained environments: air-gapped networks, limited connectivity, non-standard hardware, or complex regulatory requirements
Full-stack debugging instinct — comfortable diagnosing across infrastructure, networking, and application layers without a specialist to hand
Demonstrated ability to ship and maintain a working system end-to-end in environments where reliability was non-negotiable
Proven ability to navigate ambiguous client requirements and make the call without explicit guidance

Signals We Look For

You've shipped and maintained a working system end-to-end in environments where the bar for reliability was non-negotiable
You've navigated ambiguous client requirements and made the call without explicit guidance
Open-source projects, side products, or entrepreneurial stints that demonstrate technical craft and sustained follow-through

Who You Are

You treat ambiguity as the baseline — you don't need perfect information to move
You own outcomes, not tasks. If something is broken in your line of sight, you fix it, flag it, or escalate — you don't wait to be asked
You operate in high-pressure, forward-deployed environments without daily oversight, and you thrive there
You are available and responsive when your clients are
You have a strong bias for action and an equal openness to being wrong
You are intellectually restless — the hardest, least-understood problems are the ones you find most energising

Bonus Points

Prior experience with strategic or complex enterprise accounts
MCP server experience or familiarity with agentic frameworks
Open-source projects, side products, or entrepreneurial stints that demonstrate technical craft and sustained follow-through

Why Sarvam?

Sarvam is a fast-moving, high talent-density team building full-stack AI for India, working on problems that push the frontiers of AI with real population-scale impact.

Work alongside researchers, engineers, builders, and business leaders who move fast and hold each other to a very high bar
High ownership and high impact, from day one
Everything we do is AI-first, from the way we build and ship to the way we think about problems
You can work on problems that could change how an entire country learns, works, and communicates

If you want to work on problems at the frontier of AI in India, Sarvam is the place to be.

Made with