AI Platform Engineer BFSI Domain @ Expleo

Job Description

Overview
Key Responsibilities
Platform Development and Evangelism:
  • Build scalable, customer-facing AI platforms.
  • Evangelize the platform with customers and internal stakeholders.
  • Ensure platform scalability, reliability, and performance to meet business needs.
Machine Learning Pipeline Design:
  • Design ML pipelines for experiment management, model management, feature management, and model retraining.
  • Implement A/B testing of models.
  • Design APIs for model inferencing at scale.
  • Proven expertise with MLflow, SageMaker, Vertex AI, and Azure AI.
LLM Serving and GPU Architecture:
  • Serve as an SME in LLM serving paradigms.
  • Possess deep knowledge of GPU architectures.
  • Expertise in distributed training and serving of large language models.
  • Proficient in model-parallel and data-parallel training using frameworks like DeepSpeed and serving frameworks like vLLM.
Model Fine-Tuning and Optimization:
  • Demonstrate proven expertise in model fine-tuning and optimization techniques.
  • Improve the latency and accuracy of model results.
  • Reduce training and resource requirements for fine-tuning LLM and LVM models.
LLM Models and Use Cases:
  • Have extensive knowledge of different LLMs.
  • Provide insights on the applicability of each model based on use cases.
  • Proven experience in delivering end-to-end solutions from engineering to production for specific customer use cases.
DevOps and LLMOps Proficiency:
  • Proven expertise in DevOps and LLMOps practices.
  • Knowledgeable in Kubernetes, Docker, and container orchestration.
  • Deep understanding of LLM orchestration frameworks like Flowise, Langflow, and LangGraph.
Skill Matrix
  • LLM: Hugging Face OSS LLMs, GPT, Gemini, Claude, Mixtral, Llama
  • LLM Ops: MLflow, LangChain, LangGraph, Langflow, Flowise, LlamaIndex, SageMaker, AWS Bedrock, Vertex AI, Azure AI
  • Databases/Data Warehouses: DynamoDB, Cosmos DB, MongoDB, RDS, MySQL, PostgreSQL, Aurora, Spanner, Google BigQuery
  • Cloud Knowledge: AWS/Azure/GCP
  • DevOps (Knowledge): Kubernetes, Docker, Fluentd, Kibana, Grafana, Prometheus
  • Cloud Certifications (Bonus): AWS Certified Solutions Architect - Professional, AWS Certified Machine Learning - Specialty, Azure Solutions Architect Expert
  • Proficient in Python, SQL, JavaScript
Qualifications
  • 3-5 years in AI/ML product development.
Experience
  • Up to 6 years

Job Classification

Industry: IT Services & Consulting
Functional Area / Department: Data Science & Analytics
Role Category: Data Science & Machine Learning
Role: Data Science & Machine Learning - Other
Employment Type: Full time

Contact Details:

Company: Expleo
Location(s): Mumbai


Keyskills: GCP, PostgreSQL, MySQL, Machine learning, JavaScript, Cosmos, Data warehousing, AWS, SQL, Python

₹ Not Disclosed
