Job Summary:
We are seeking an experienced LLM Engineer to design, develop, and deploy AI-powered solutions in the healthcare domain using Retrieval-Augmented Generation (RAG) pipelines, FastAPI, and LangChain. You will play a key role in implementing scalable backend services, document chunking, context injection, vector search, and real-time automation workflows using LLMs served through vLLM or MCP servers.
Tech Stack:
Good to Have:
Key Responsibilities:

Keyskills: server, continuous integration, Kubernetes, production, CI/CD, Apache Tomcat, SQL, Docker, Ansible, cloud, automation, Java, Git, ECS, PostgreSQL, DevOps, Linux, Jenkins, shell scripting, MySQL, deployment, REST, CD, Python, GitHub, workflow, Maven, Microsoft Azure, engineering, Amazon EC2, Terraform, AWS