Astera Labs is hiring an AI/ML Engineer to build production AI systems for technical users. The role focuses on developing agentic workflows, LLM-based applications, and retrieval systems for engineering productivity. Responsibilities include designing systems that combine LLMs with tool use, retrieval, and evaluation loops. The team works on coding agents, diagnostic tools, and internal assistants.
Responsibilities
Build AI applications and agentic workflows for engineering productivity, diagnostics, search, documentation, and workflow automation.
Design systems that combine LLMs with retrieval, tool use, structured outputs, and evaluation loops.
Integrate models with internal tools, APIs, CLIs, MCP interfaces, and operational workflows.
Improve system quality through eval design, prompt and context iteration, model selection, failure analysis, and human feedback.
Build reusable skills, workflows, and abstractions for sharing across agents and teams.
Work closely with infrastructure and domain teams to deploy, monitor, and continuously improve AI systems in production.
Requirements
1–5 years of experience in software engineering, applied AI, ML engineering, or related backend/platform roles.
Experience with AWS or GCP and production AI services.
Strong Python skills and production engineering fundamentals.
Astera Labs is a global leader in purpose-built semiconductor connectivity solutions for rack-scale AI and cloud infrastructure. The company pioneered a software-defined architecture to solve data, memory, and networking bottlenecks in modern data centers.