Fine-tune and optimize Large Language Models (LLMs) for mid-to-large-scale live production-quality applications.
Host and deploy LLMs on custom infrastructure, ensuring high availability and performance.
Conduct LLM evaluation following best practices as outlined in the Hugging Face LLM Evaluation Guide.
Collaborate with cross-functional teams to design, develop, and implement AI-driven solutions tailored to business needs.
Ensure model scalability, security, and compliance with industry standards.
Required Skills and Experience :
Experience: 4 years of hands-on experience with Generative AI and LLMs. (Total IT experience is not a priority.)
Domain Expertise: Prior experience in the fintech or financial services domain is essential.
LLM Fine-Tuning: Demonstrated expertise in fine-tuning LLMs for live production environments (academic or PoC projects are not relevant).
Infrastructure Management: Experience with hosting and deploying LLMs on custom infrastructure.
LLM Evaluation: Proficiency in conducting LLM evaluations using industry-recognized methodologies and frameworks.
Technical Skills :
Proficiency in Python and relevant AI/ML libraries (e.g., PyTorch, TensorFlow, Hugging Face).
Strong understanding of LLM architectures and their optimization techniques.
Familiarity with cloud-based or on-premise infrastructure for AI deployments.
Job Classification
Industry: IT Services & Consulting Functional Area / Department: Engineering - Software & QA Role Category: Software Development Role: Technical Lead Employement Type: Full time