Experience with LLMs (e.g., DeepSeek, LLaMA) and inference frameworks (Ollama, vLLM, llama.cpp).
Proficiency in OpenCV, PyTorch, YOLO or TensorFlow, and model conversion workflows.
Strong experience in Docker, DevOps, and CI/CD pipeline integration.
Programming skills in Python, with solid experience in Linux and shell scripting.
Understanding of edge AI hardware (Jetson/NXP/Qualcomm) and embedded deployment.
Familiarity with Yocto OS and custom Linux builds.
Strong grasp of model optimization and compression techniques.
Experience with LangChain, AI agents, and RAG pipelines.
Good knowledge of inference acceleration using CUDA and GPU-specific kernels.
Excellent communication and collaboration skills.
RESPONSIBILITIES:
Understanding functional requirements thoroughly and analyzing the client's needs in the context of the project
Envisioning the overall solution for defined functional and non-functional requirements, and being able to define technologies, patterns and frameworks to realize it
Determining and implementing design methodologies and tool sets
Enabling application development by coordinating requirements, schedules, and activities.
Being able to lead/support UAT and production rollouts
Creating, understanding and validating the WBS and estimated effort for a given module/task, and being able to justify it
Addressing issues promptly, responding positively to setbacks and challenges with a mindset of continuous improvement
Giving constructive feedback to the team members and setting clear expectations.
Bachelor's or master's degree in Computer Science, Information Technology, or a related field.
Job Classification
Industry: IT Services & Consulting
Functional Area / Department: Data Science & Analytics
Role Category: Data Science & Machine Learning
Role: Data Science & Machine Learning - Other
Employment Type: Full time