AI Engineer/ML Engineer - Senior Developers - AI Training

PAProlific Academic Ltd

REMOTE · BOSTON · US

Up to $166K

Full-time

SENIOR

Apply Now

About the Role

Prolific is seeking AI and Machine Learning Engineers to join their Expert Network to train and evaluate LLMs using deep technical expertise. The role involves evaluating model architectures, auditing code, and providing human feedback for RLHF. Candidates should have experience with deep learning, LLMs, and relevant technologies like PyTorch and Hugging Face.

Responsibilities

Evaluate LLM Architecture Logic: review AI-generated explanations of model architectures, loss functions, and backpropagation for technical accuracy.
Audit Code & Notebooks: validate ML-specific code (e.g., training loops, data preprocessing scripts, or model evaluations) for efficiency and correctness.
Refine RLHF Frameworks: provide the high-quality human feedback necessary to align models with human intent, safety, and helpfulness.
Analyze Model Reasoning: critically assess how an AI model navigates complex chain-of-thought (CoT) prompts and identify where the reasoning breaks down.
Benchmark Performance: conduct comparative testing between different model outputs based on specific technical taxonomies and performance metrics.

Requirements

Education: a BS, MS, or PhD in Computer Science, Artificial Intelligence, Robotics, or a related quantitative field with a focus on Machine Learning.
Professional Experience: experience building, deploying, or fine-tuning ML models in a production environment.

PA

Prolific Academic Ltd

Austin · US

Related Jobs

AI Engineer/ML Engineer - Senior Developers - AI Training - Atlanta, US

Prolific

REMOTE · WORLDWIDE

AI/ML Engineer

Nexstar Broadcasting

Deep Learning Mastery: professional-level understanding of neural network architectures (Transformers, CNNs, RNNs) and optimization techniques.

LLM Specialization: hands-on experience with Prompt Engineering, RLHF, or RAG workflows.

Technical Rigor: ability to audit complex model logic, identify training data contamination, and evaluate mathematical proofs behind ML algorithms.

Analytical Critique: high attention to detail in spotting hallucinations, biased outputs, or logical failures in AI-generated technical content.

Tech Stack

PythonPyTorchTensorFlowWeights & BiasesPandasScikit-learnLangChainHugging Face TransformersNumPyKerasAWS SageMakerGoogle Cloud Vertex AIPineconeMilvusWeaviate