The Role
You’ll design, build, and operate the systems that power Gensyn’s decentralised compute network. This role ensures our mainnet and supporting infrastructure are reliable, observable, and ready to scale.
Responsibilities
- Own reliability, observability, and automation for Gensyn’s decentralised compute network
- Manage infrastructure provisioning and lifecycle automation using Terraform and configuration management tools (Ansible or Puppet)
- Operate and scale Kubernetes clusters across multiple regions and environments
- Maintain and improve CI/CD pipelines and deployment workflows for production services
- Participate in on-call rotations and lead incident response and postmortems
Competencies
Must have
- Strong background in Linux systems, networking, and distributed system operations
- Kubernetes production experience (multi-cluster or multi-region preferred)
- Expertise in Infrastructure as Code and GitOps (Terraform preferred)
- Experience monitoring and supporting live production systems, including on-call response
- Strong Python and Bash scripting skills for automation and tooling development
Preferred
- Experience managing or extending observability systems (Prometheus, OpenTelemetry, Grafana, ELK)
- Experience deploying large-scale or high-throughput systems across cloud or hybrid environments
- Familiarity with Ansible, Puppet, or similar configuration management frameworks
- Strong understanding of CI/CD pipelines, version control, and deployment safety patterns
- Exposure to fast paced, high-intensity environments
Nice to have
- Exposure to decentralised systems, blockchain, or machine learning infrastructure
- Experience provisioning and managing on-prem or bare-metal Linux servers
- Broader software engineering experience beyond traditional SRE or DevOps work