Senior Platform Engineer

Alembic San Francisco Full-time
$150,000 per year

Job Description

About Alembic

Alembic is a fast-growing Series A software startup focused on building cutting-edge solutions that transform how businesses harness and leverage data. We are a team of innovators, engineers, and product leaders passionate about solving complex problems with scalable, data-driven technology.

We apply cutting-edge algorithms and composite AI solutions to provide a new approach to marketing data analytics. We’re backed by leading tech luminaries and innovators, including WndrCo (founded by DreamWorks founder Jeffrey Katzenberg), Jensen Huang, Joe Montana, and many more.

About the Role

We’re looking for a Senior Platform Engineer to help evolve and scale the systems that power Alembic. This is a high-impact, foundational role where you’ll drive platform scalability from the ground up. This role is particularly well-suited for seasoned platform, cloud, or DevOps engineers who are ready to dive into AI infrastructure. You'll leverage your proven expertise in scalable systems while learning to deploy and manage cutting-edge ML workloads—making this an ideal transition role for infrastructure veterans looking to specialize in the AI space.

What You’ll Do

  • Design, build, integrate, and operate the foundational infrastructure that powers Alembic’s platform—including core services, data pipelines, and distributed AI/ML workloads—across both cloud (primarily AWS) and on-prem environments.

  • Leverage Infrastructure as Code (IaC) tools such as Terraform for cloud resource provisioning and Ansible for configuration management, enabling repeatable, auditable, and environment-agnostic infrastructure deployments.

  • Develop and maintain CI/CD pipelines that enable reliable, low-risk, and rapid deployments using modern tools like GitHub Actions, ArgoCD, Bazel, or equivalent, with automated testing, rollback, and deployment workflows.

  • Establish and operate robust observability systems, including metrics, logging, and distributed tracing, using tools like Prometheus, Grafana, Datadog, and OpenTelemetry to ensure proactive incident detection and diagnosis.

  • Collaborate closely with the AI Research team to deploy and manage novel ML algorithms and drive next-generation work on GPU-based development efforts.

  • Serve as a technical mentor and thought leader, promoting best practices in system design, infrastructure reliability, and code quality across the engineering organization.

What Will Help You Succeed

  • 15–20 years of engineering experience, including significant time spent on platform, infrastructure, or DevOps/SRE teams.

  • Deep experience with AWS (or GCP/Azure), container orchestration with Kubernetes, and service discovery at scale.

  • Strong grasp of DevOps principles, infrastructure as code (Terraform, Ansible), and immutable infrastructure.

  • Experience deploying and operating production systems in fast-paced environments, ideally early- or growth-stage startups.

  • Proficiency in a systems or scripting language (e.g., Python, Bash).

  • Experience with secure networking, secrets management, and managing systems in compliance-heavy environments.

  • A bias for simplicity, automation, and building tools that empower developers.

  • A hands-on, in-the-weeds approach and a collaborative mindset. You’re as comfortable fixing a broken pipeline as designing the future of our platform.

Why You Might Be Excited About Alembic

  • You're an experienced platform/DevOps engineer ready to apply your infrastructure expertise to the cutting edge of AI. This role offers the perfect bridge between traditional platform engineering and the emerging world of ML/AI systems at scale.

  • You want to build something that is both technologically challenging and solves a real customer need. You want a role with major upside that tackles a massive market opportunity.

Company Information

Location: Sydney, Australia

Type: Hybrid