Red Hat OpenShift AI is a flexible, scalable artificial intelligence (AI) and machine learning (ML) platform that enables enterprises to create and deliver AI-enabled applications at scale across hybrid cloud environments. Built using open-source technologies, OpenShift AI provides trusted, operationally consistent capabilities for teams to experiment, serve models, and deliver innovative apps.
The OpenShift AI team seeks a Software Engineer with Kubernetes and Model Inference Runtimes experience to join our rapidly growing engineering team. Our team focuses on making machine learning model deployment and monitoring seamless and scalable across the hybrid cloud and the edge. This is a fascinating opportunity to build and impact the next generation of hybrid cloud MLOps platforms.
What You Will Do
Develop and maintain a high-quality, high-performing ML inference runtime platform for multi-modal and distributed model serving.
Contribute directly to upstream inference runtime communities such as vLLM, TGI, PyTorch, OpenVINO, and others.
Maintain CI/CD build pipelines for container images that allow faster, more secure, more reliable, and more frequent releases.
Coordinate and communicate with various stakeholders.
Apply a growth mindset by staying up to date with AI and ML advancements.
What You Will Bring
Extensive experience programming in Python and working with PyTorch
Familiarity with model parallelization, quantization, and memory optimization using vLLM, TGI, and other inference libraries.
Experience with Python packaging, such as PyPI libraries
Solid understanding of the fundamentals of model inference architectures
Experience with Jenkins, Git, shell scripting, and related technologies
Experience with the development of containerized applications in Kubernetes
Experience with Agile development methodologies
Experience with Cloud Computing using at least one of the following Cloud infrastructures: AWS, GCP, Azure, or IBM Cloud
Ability to work across a large, distributed, hybrid engineering team
The following are considered a plus
Experience with open-source development
Development experience with C++, especially with the CUDA APIs, is a big plus
Job Classification
Industry: IT Services & Consulting
Functional Area / Department: Engineering - Software & QA
Role Category: Software Development
Role: Data Platform Engineer
Employment Type: Full time