Skip to Content

Platform / SRE / security engineer (Kubernetes & networking)

Stockholm, Sweden

Berget AI builds a sovereign, GPU-backed AI platform running on Kubernetes. We’re looking for a platform engineer who owns the runtime, reliability, and security of our Kubernetes-based systems and makes the infrastructure feel stable and simple for product and inference teams.

What you’ll work on

You will design, operate, and evolve our Kubernetes platform: cluster lifecycle, upgrades, capacity, and multi-tenant isolation. You’ll own core platform components such as GPU operators, storage integrations (CSI), networking, ingress, and observability.

You’ll build and maintain GitOps-based deployment workflows, CI/CD integrations, and internal tooling that supports product, inference, and ML workloads. You’ll participate in on-call and incident response, drive post-incident improvements, and continuously harden reliability and security across the platform.

You’ll work closely with backend, inference, and infrastructure engineers to translate workload needs into stable, scalable platform capabilities.

What you bring

You have hands-on experience running Kubernetes clusters in production and understand how systems fail in practice. You’re comfortable with Linux, containers, networking fundamentals, and debugging distributed systems.

You’ve worked with Kubernetes storage and networking in real environments and care about observability, automation, and security. Experience with GitOps, infrastructure automation, or GPU workloads is a strong plus.

You enjoy creating platforms that other engineers trust and like using.

Why Berget

You’ll own the backbone of a European AI platform, with real autonomy, short feedback loops, and direct influence on reliability, security, and developer experience. 

Interested?

Drop us a short note about yourself and links to recent projects or contributions:

📬 jobs@berget.ai

Let’s build the future of sovereign AI in Europe—together.

Note: Only EU Citizens can apply