Your browser does not support javascript! Please enable it, otherwise web will not work for you.

Senior Software Engineer - Model Inferencing @ Red Hat

Home > Software Development

 Senior Software Engineer - Model Inferencing

Job Description

Red Hat OpenShift AI is a flexible, scalable artificial intelligence (AI) and machine learning (ML) platform that enables enterprises to create and deliver AI-enabled applications at scale across hybrid cloud environments. Built using open-source technologies, OpenShift AI provides trusted, operationally consistent capabilities for teams to experiment, serve models, and deliver innovative apps.


The OpenShift AI team seeks a Software Engineer with Kubernetes and Model Inference Runtimes experience to join our rapidly growing engineering team. Our team focuses on making machine learning model deployment and monitoring seamless and scalable across the hybrid cloud and the edge. This is a fascinating opportunity to build and impact the next generation of hybrid cloud MLOps platforms.


What You Will Do


  • Develop and maintain a high-quality, high-performing ML inference runtime platform for multi-modal and distributed model serving.

  • Contribute directly to upstream inference runtime communities such as vLLM, TGI, PyTorch, OpenVINO, and others.

  • Maintain CI/CD build pipelines for container images that allow faster, more secure, reliable, and frequent releases

  • Coordination and communication with various stakeholders

  • Applying a growth mindset by staying up to date with AI and ML advancements


What You Will Bring


  • Highly experienced with programming in Python and PyTorch

  • Familiarity with model parallelization, quantization, and memory optimization using vLLM, TGI, and other inference libraries.

  • Experience with Python packaging, such as PyPI libraries

  • Solid understanding of the fundamentals of model inference architectures

  • Experience with Jenkins, Git, shell scripting, and related technologies

  • Experience with the development of containerized applications in Kubernetes

  • Experience with Agile development methodologies

  • Experience with Cloud Computing using at least one of the following Cloud infrastructures: AWS, GCP, Azure, or IBM Cloud

  • Ability to work across a large, distributed, hybrid engineering team

  • Following is considered a plus

  • Experience with open-source development is a plus

  • Development experience with C++, especially with the CUDA APIs, is a big plus


 

 

 

 

Job Classification

Industry: IT Services & Consulting
Functional Area / Department: Engineering - Software & QA
Role Category: Software Development
Role: Data Platform Engineer
Employement Type: Full time

Contact Details:

Company: Red Hat
Location(s): Bengaluru

+ View Contactajax loader


Keyskills:   software engineer kubernetes mlops python ibm cloud ai machine learning artificial intelligence git hybrid cloud apis gcp shell jenkins pytorch agile developmen shell scripting agile cloud computing aws red hat programming azure

 Fraud Alert to job seekers!

₹ Not Disclosed

Similar positions

Oracle Service Bus Developer Opportunity - Oracle India Pvt Ltd

  • Oracle
  • 4 - 9 years
  • Pune
  • 5 days ago
₹ Not Disclosed

CTO - Quantum Engineering - Developer

  • Wipro
  • 2 - 7 years
  • Bengaluru
  • 5 days ago
₹ Not Disclosed

Hiring For Hexaware - Oracle Fusion Technical Consultant - Fin & Scm

  • Hexaware Technologies
  • 8 - 13 years
  • Chennai
  • 5 days ago
₹ Not Disclosed

Data Engineer (Azure Purview)

  • Capgemini
  • 6 - 11 years
  • Hyderabad
  • 5 days ago
₹ Not Disclosed

Red Hat

Founded in 1993, Red Hat is the premier Linux and open source provider. The most recognized Linux brand in the world. We serve global enterprises through technology and services made possible by the open source model. Solutions include Red Hat Enterprise Linux operating platforms, sold through a sub...