This engagement is focused on building an internal AI platform that enables developers to ship AI-powered services efficiently. Scope includes model connectivity, prompt testing and evaluation, monitoring/observability, and the underlying AI infrastructure layer. The objective is to improve DevEx and reduce time-to-market for AI features.

Location: Serbia (relocation support available), Croatia, Poland, Portugal

Tasks
- Build and operate the AI platform infrastructure enabling developers to ship LLM-based services faster.
- Implement and maintain Kubernetes-based runtime environments (incl. AKS) for AI workloads.
- Manage infrastructure as code with Terraform (modules, environments, CI/CD automation).
- Support LLM workflows: RAG, agents, prompt experimentation, evaluations, and deployment patterns.
- Integrate and operate tooling such as Azure AI Foundry, LiteLLM, Langfuse, MLflow.
- Orchestrate pipelines using Kubeflow Pipelines and/or Argo Workflows (build, deploy, evaluate).
- Improve platform reliability and observability (monitoring, logging, tracing, cost/perf signals).
- Collaborate closely with developers to streamline DX (APIs, templates, docs, golden paths, automation).

Requirements
- Strong hands-on experience with Kubernetes in production (preferably AKS).
- Solid Terraform expertise (IaC best practices, multi-env setups).
- Practical experience supporting ML/LLM workloads in a platform or DevOps/MLOps context.
- Proficiency in Python for automation, scripting, and supporting APIs/evaluation tooling.
- Understanding of CI/CD, release processes, and production-grade operations.
- Ability to work under tight timelines and deliver pragmatically.

Nice to Have
- Experience building internal developer platforms or “paved roads” for engineering teams.
- Familiarity with LLM evaluation frameworks, prompt testing workflows, and LLM observability.
- Exposure to RAG architectures, vector databases, and agentic patterns.
- Experience with Kubeflow, Argo, and ML lifecycle tooling.

Engagement Type
Long-term B2B contract.

Team
You will join a team of 5, with 3 AI Platform Engineers being added.

Location / Timezone
- Remote work from Croatia, Poland, Portugal, and Serbia.
- European working hours.
- Occasionally available for meetings up to 10:00 AM PST (US overlap).
San Francisco · US · 567+ employees
Iterable is an AI-powered customer engagement platform that enables enterprise-level brands to create personalized, cross-channel customer experiences. It transforms real-time user data into actionable marketing campaigns across email, SMS, push, and in-app channels.