Prolific is seeking AI and Machine Learning Engineers to join their Expert Network to train and evaluate LLMs using deep technical expertise. The role involves evaluating model architectures, auditing code, and providing human feedback for RLHF. Candidates should have experience with deep learning, LLMs, and relevant technologies like PyTorch and Hugging Face.
Responsibilities
Evaluate LLM Architecture Logic: review AI-generated explanations of model architectures, loss functions, and backpropagation for technical accuracy.
Audit Code & Notebooks: validate ML-specific code (e.g., training loops, data preprocessing scripts, or model evaluations) for efficiency and correctness.
Refine RLHF Frameworks: provide the high-quality human feedback necessary to align models with human intent, safety, and helpfulness.
Analyze Model Reasoning: critically assess how an AI model navigates complex chain-of-thought (CoT) prompts and identify where the reasoning breaks down.
Benchmark Performance: conduct comparative testing between different model outputs based on specific technical taxonomies and performance metrics.
Requirements
Education: a BS, MS, or PhD in Computer Science, Artificial Intelligence, Robotics, or a related quantitative field with a focus on Machine Learning.
Professional Experience: experience building, deploying, or fine-tuning ML models in a production environment.