Job Location: 100% remote in Romania
Recruitment process:
- HR call
- Technical screening
- Technical interview
Role description:
Join a small, fast-moving team that keeps a high-throughput Rust-based platform humming. You'll wear several hats, designing features in a compiled language one day, tuning Kubernetes the next, and hunting down performance glitches in production after that.
Demonstrated Experience
- Green-field Infrastructure Built complete AWS foundations (multi-account VPCs, EKS clusters, CI/CD pipelines) exclusively through Terraform
- Brown-field Maintenance & Enhancement Took ownership of long-lived, business-critical stacks, stabilized flaky deployments, added automated tests, and incrementally refactored infrastructure codeall without disrupting 24 × 7 production traffic.
- Python Automation Authored tested Go or Python tooling (CLI utilities, Lambda workers, custom controllers) that integrated cloud APIs, enforced policy-as-code, and removed repetitive ops tasks.
- Scale & Reliability Designed autoscaling, multi-AZ / multi-region fail-over patterns, and implemented SLO-based alerting to keep core services at 99.9 % availability during peak loads.
- Security & Governance Embedded least-privilege IAM, CICD, secrets management (AWS KMS/Vault), and automated compliance scans into every pipeline
- Knowledge Transfer Produced architectural diagrams, runbooks, and led workshops enabling staff teams to own and evolve delivered platforms post-engagement.
Must-Haves
- Terraform (HCL) Module design, remote state, policy-as-code, multi-account orchestration
- Kubernetes / EKS Controllers, CNI, upgrades, Helm/Kustomize, service mesh exposure
- AWS IAM, VPC, EC2, S3, EKS, RDS, Lambda, CloudWatch, Organizations
- CI/CD GitHub Actions / GitLab CI, zero-downtime rollouts, automated gatekeeping
- Observability Prometheus, OpenTelemetry, Grafana/Loki or analogous modern o11y systems
- Operating Systems Linux internals, container runtimes, networking, security hardening
Nice-to-Haves
- Hands-on Rust or Go in production
- Observability stacks (OpenTelemetry, Prometheus, Grafana)
- Multi-cloud or on-prem hybrid experience
- Massive scale, edge, or high-performance computing exposure
- Formal methods, distributed-systems research, or academic publications
- Security-first mindset (threat modeling, policy-as-code)
- Community contributions (open-source maintainer, conference speaker)
Consulting Attributes
- Works autonomously with minimal direction; drives clarity through written proposals and weekly status reports.
- Communicates complex infra topics to both engineers and non-technical stakeholders; documentation is artifact-quality.
- Bias toward automation: deletes runbooks that require human clicks.
- Pragmatic: optimizes for business risk, performance, and costnot novelty.
Why You'll Love It Here
Small team, huge impact; autonomy to choose the right tool; influence to improve the culture, continuous learning, and pragmatic engineering over dogma.
Ready to bring your battle scars and systems mindset to bear? Lets talk.