Senior Machine Learning Engineer for a global newswire and media organization. Build and optimize production inference systems for large-scale text, image, and video processing.
Responsibilities
Build and optimize inference systems at scale; profile transformers and improve serving latency; tune HNSW indices; manage AWS infrastructure for ML workloads; partner with MLOps and data science teams.
Requirements
5+ years building production ML inference systems; Python, PyTorch, TensorFlow; deep experience with Transformer models; inference optimization (quantization, distillation, etc.); AWS (EC2, Batch, Lambda, SageMaker); hybrid search (BM25 + vector search + cross-encoder); async processing systems; data pipeline orchestration (Airflow); video frameworks (FFmpeg); media industry experience; large-scale data with text, images, video.