Your browser does not support javascript! Please enable it, otherwise web will not work for you.

LLM Engineer @ InfoVision Inc

Home > Software Development

 LLM Engineer

Job Description

Job Summary:

We are seeking an experienced LLM Engineer to design, develop, and deploy AI-powered solutions using Retrieval-Augmented Generation (RAG) pipelines, FastAPI, and LangChain within the healthcare domain. You will play a key role in implementing scalable backend services, document chunking, context injection, vector search, and real-time automation workflows using LLMs hosted on vLLM or MCP servers.

Tech Stack:

  • Languages: Python (FastAPI), SQL
  • LLM Orchestration: LangChain, custom RAG modules
  • LLM Hosting: vLLM, MCP servers (Model Context Protocol)
  • Document Processing: Custom chunking, PDF/Word parsers
  • Vector Stores: FAISS, OpenSearch, Qdrant, Chroma
  • Databases: PostgreSQL / MySQL
  • Cloud: AWS (ECS, S3, Lambda optional)
  • Containerization: Docker
  • Task Queues: Celery (optional)
  • Nice to have: Databricks for data enrichment & batch document analytics

Good to Have:

  • Experience with Databricks for scalable pre-processing or analytics
  • Frontend integration with React or Streamlit
  • Hands-on with LLM-based automation and document understanding (OCR, ICD/CPT extraction)

Roles and Responsibilities

Key Responsibilities:

  • Design and implement end-to-end RAG pipelines using LangChain or custom orchestration.
  • Build RESTful APIs using FastAPI to expose LLM-powered features.
  • Develop chunking, embedding, and retrieval logic for clinical documents.
  • Deploy services on AWS ECS using Docker with robust CI/CD.
  • Integrate and query vector stores (FAISS, OpenSearch, or Chroma).
  • Optimize vLLM / MCP server deployments for latency and throughput.
  • Automate repetitive healthcare workflows using LLMs.
  • Create and manage SQL-based databases and schemas.
  • Monitor and log model inference calls, context size, and token cost.
  • Implement authentication, rate limiting, and auditing for production APIs.
  • Collaborate with data scientists, domain experts, and DevOps

Job Classification

Industry: IT Services & Consulting
Functional Area / Department: Engineering - Software & QA
Role Category: Software Development
Role: Search Engineer
Employement Type: Full time

Contact Details:

Company: InfoVision Inc
Location(s): Pune

+ View Contactajax loader


Keyskills:   server continuous integration kubernetes production ci/cd apache tomcat sql docker ansible cloud automation java git ecs postgresql devops linux jenkins shell scripting mysql deployment rest cd python github workflow maven microsoft azure engineering amazon ec2 terraform aws

 Job seems aged, it may have been expired!
 Fraud Alert to job seekers!

₹ Not Disclosed

Similar positions

Engineer /senior Engineer - (mcu Rtos)

  • Einfochips
  • 5 - 10 years
  • Hyderabad
  • 22 hours ago
₹ Not Disclosed

QA Automation & Infrastructure Engineer

  • FCS Software Solutions
  • 10 - 20 years
  • Noida, Gurugram
  • 3 days ago
₹ Not Disclosed

Ai Ml Engineer

  • Accenture
  • 12 - 20 years
  • Noida, Gurugram
  • 3 days ago
₹ Not Disclosed

Senior CPaaS Engineer

  • FCS Software Solutions
  • 8 - 13 years
  • Noida, Gurugram
  • 3 days ago
₹ Not Disclosed

InfoVision Inc

Info vision Software Solutions (India) Pvt. Ltd