Lead Data Scientist - PythonModel & Pipeline Development Build and deploy multimodal ML models spanning NLP, CV, ASR, OCR, and video analytics. Develop robust pipelines for: o Video frame extraction, shot detection o Speech-to-text, speaker diarization o Image tagging, content moderation o Multimodal embeddings (CLIP, SigLIP, VideoCLIP) Implement RAG (Retrieval Augmented Generation) with multimodal indexing. Optimization & Performance Engineering Optimize model latency for real-time content tagging or streaming workflows. Implement batching, quantization, distillation, or GPU optimizations. MLOps and Deployment Build CI/CD pipelines for ML using GitHub Actions, Jenkins, or Azure DevOps. Deploy models as microservices using Docker, Kubernetes, KServe, or FastAPI. Integrate observability tools (Prometheus, Grafana) for model monitoring. Media Engineering Integration Integrate AI into CMS platforms, OTT apps, media asset management (MAM) systems.

Keyskills: model monitoring kubernetes continuous integration cd python github modeling natural language processing workflow ci/cd machine learning azure devops docker microservices analytics grafana optimization predictive modeling jenkins prometheus ml ocr deployment