Hirevector is seeking a highly skilled Machine Learning Engineer/SRE to join their team remotely. The role involves managing Azure infrastructure for AI model development and deployment, monitoring model performance, and responding to incidents. This is a 12-month contract position with a focus on MLOps and site reliability.
Responsibilities
Manage Azure infrastructure for AI model development and deployment, ensuring scalability and performance
Implement and maintain monitoring systems to track model performance, proactively identifying issues
Collaborate with SRE team to respond promptly to outages and incidents related to model operations
Requirements
Proficiency in managing Azure infrastructure components (VMs, storage, networking)
Experience with CI/CD pipelines and automating model deployment processes
Strong knowledge of containerization technologies like Docker and Kubernetes
Proficient in building and optimizing machine learning models
Programming skills in Python and libraries such as TensorFlow and PyTorch
Experience in data preprocessing, feature engineering, and data pipeline development
Nice to Have
Experience with cloud-based ML platforms (e.g., Azure Machine Learning)
Hirevector is a professional services and recruitment firm that provides an AI-powered technical interview intelligence platform. The company focuses on standardizing the hiring process through conversational, AI-driven assessments to ensure fairness, reduce bias, and improve candidate evaluation.