Your browser does not support javascript! Please enable it, otherwise web will not work for you.

Machine Learning Engineer - Inference & Fine-Tuning @ Insaito Software

Home > Data Science & Machine Learning

 Machine Learning Engineer - Inference & Fine-Tuning

Job Description

Get hired by a US-based company focused on the US, UK, and European markets. This would include travel to the US office located in California.


What we're looking for

We are looking for a talented Machine Learning Engineer to lead the development of an Inference Service built on open-source technologies. You will design, deploy, and fine-tune machine learning models in the cloud using frameworks like Hugging Face, TensorRT, PyTorch, and other open-source tools. The ideal candidate has hands-on experience building scalable inference services, fine-tuning pre-trained models, and working with cloud-native infrastructure.

In this role, you'll work on exciting machine learning applications spanning NLP, computer vision, and custom tasks all with a focus on efficient deployment and real-time inference. If you're passionate about open-source technologies and cloud-based ML infrastructure, this is the perfect role for you!


Responsibilities

  • Build and optimize a scalable inference pipeline using popular open-source frameworks (e.g., TensorRT).
  • Design real-time API endpoints for model serving and integration using frameworks like Flask.
  • Implement and optimize batch processing and streaming data pipelines to handle large-scale workloads.
  • Fine-tune pre-trained models (e.g., Llama, GPT, YOLO).
  • Deploy inference services on cloud platforms (GCP, AWS, Azure) using a containerized environment with Docker and Kubernetes.
  • Continuously optimize models for latency and throughput, troubleshooting bottlenecks.

Qualifications/ Required Skills and Experiences

  • Educational Background:
    • Bachelor's or Masters degree in Computer Science, Engineering, Data Science, or a related field, or equivalent practical experience.
  • Experience:
    • 3+ years of hands-on experience deploying machine learning models into production, focusing on inference services and fine-tuning.
    • Proficiency in Python or C/CC+.
    • Proven track record in creating high-performance libraries and tools.
    • Proficiency in model serving frameworks such as TensorRT.
    • Strong grasp of low-level OS concepts, including multi-threading, memory management, networking, storage, performance, and scalability.
  • Soft Skills:
    • Excellent communication skills with the ability to explain complex ML concepts to both technical and non-technical stakeholders.
    • Strong problem-solving and debugging skills, with the ability to analyze and resolve performance issues in production environments.
    • Comfortable working in an Agile environment and collaborating across teams to deliver results.

Job Classification

Industry: Software Product
Functional Area / Department: Data Science & Analytics
Role Category: Data Science & Machine Learning
Role: Machine Learning Engineer
Employement Type: Full time

Contact Details:

Company: Insaito Software
Location(s): Delhi, NCR

+ View Contactajax loader


Keyskills:   Artificial Intelligence Computer Science Machine Learning

 Fraud Alert to job seekers!

₹ 11-21 Lacs P.A

Similar positions

GEN AI Data Science Engineer

  • Gainwell Technologies
  • 3 - 8 years
  • Bengaluru
  • 23 hours ago
₹ Not Disclosed

Data scientist / ML Engineer -- US Client (Analytics)

  • US MNC (Analytics)
  • 3 - 8 years
  • Pune
  • 3 days ago
₹ 20-35 Lacs P.A.

Data scientist -- US MNC (analytics)

  • US MNC (Analytics)
  • 3 - 6 years
  • Noida, Gurugram
  • 3 days ago
₹ 15-30 Lacs P.A.

Data scientist / ML Engineer -- US Client (Analytics)

  • US MNC (Analytics)
  • 5 - 9 years
  • Pune
  • 3 days ago
₹ 25-40 Lacs P.A.

Insaito Software

Insaito, based in Silicon Valley, California, combines deep domain expertise with proven experience to deliver cutting-edge silicon solutions. We are driven by the passion for building silicon, software, and platform solutions for Next-Gen technologies such as advanced AI, 5G/6G, and autonomous sys...