Job Code - DEVMLL2
Mid-level AI/ML Engineer
We are seeking a Senior AI / LLM Developer with 6–8 years of experience in AI/ML and backend systems to design, build, fine-tune, and deploy production-grade LLM-powered applications. The ideal candidate will have deep expertise in Large Language Models, RAG systems, prompt engineering, fine-tuning, and cloud-scale deployment.
You will lead end-to-end development of AI assistants, copilots, and enterprise automation platforms while mentoring junior engineers.
Responsibilities
- Architect and develop LLM-powered applications using OpenAI, Claude, Gemini, and open-source LLMs (LLaMA, Mistral).
- Design and optimize enterprise-grade RAG pipelines using vector databases (Pinecone, FAISS, Weaviate, Chroma).
- Implement and evaluate advanced prompt engineering, tool calling, and agentic workflows.
- Fine-tune models using parameter-efficient methods (PEFT) such as LoRA and QLoRA, as well as supervised instruction tuning.
- Lead LLM integration with backend systems using Python / Node.js.
- Optimize inference performance, latency, and token costs, including through caching.
- Build and maintain secure, scalable AI APIs using FastAPI / Flask / NestJS.
- Deploy AI services using Docker, Kubernetes, and cloud platforms (AWS/Azure/GCP).
- Implement AI observability, monitoring, and evaluation metrics.
- Ensure data security, governance, and compliance (HIPAA, GDPR, SOC 2).
- Conduct code reviews, mentor engineers, and provide technical leadership.
- Collaborate with product, security, and UX teams to deliver production AI systems.
Required Experience & Qualifications
- Bachelor’s or Master’s degree in Computer Science, AI, ML, or a related field
- 6–8 years of experience in AI/ML, NLP, or Backend Engineering
- 3+ years of hands-on experience with LLM-based systems
Nice to Have
- Experience with multi-agent systems (AutoGen, CrewAI, AutoGPT)
- Knowledge of MLOps tools (MLflow, Weights & Biases, Kubeflow)
- Hands-on experience with enterprise security, IAM, and data encryption
- Domain experience in Healthcare, BFSI, or SaaS