We are looking for a highly skilled Engineer to join our team as our first dedicated SRE/DevOps hire.
This role offers an exciting opportunity to design, implement, and manage our infrastructure, CI/CD pipelines, and production operations from the ground up.
Youll have autonomy in shaping our tech stack, defining best practices, and building scalable systems that will set the foundation for future engineering growth.
If you thrive in startup environments and enjoy the blend of software engineering, operations, and infrastructure, wed love to hear from you.
Youll get a chance to: ? ? Set Up and Manage Infrastructure: Design, build, and maintain a robust, cloud-based infrastructure on Azure Develop and maintain infrastructure as code (IaC) using tools like Terraform Have ownership of our systems reliability and scalability, laying a strong foundation for our engineering environment Deploy and Orchestrate Containers: Use k8s and Docker to manage containerised applications, ensuring high availability, scaling, and resource optimisation Set up and manage k8s clusters to support reliable and scalable infrastructure Develop CI/CD Pipelines: Design and implement CI/CD pipelines to automate our build, test, and deployment processes Collaborate with development teams to streamline code integration and ensure high-quality releases across the board Implement Monitoring and Incident Management: Set up proactive monitoring, logging, and alerting systems to detect and resolve issues before they impact users Develop and refine incident response protocols and conduct root cause analyses to continuously improve system reliability Foster Collaboration and Knowledge Sharing: Work closely with cross-functional teams, especially software engineering, to instil and grow a DevOps culture Document processes, systems, and configurations to ensure scalability and facilitate knowledge sharing as the team grows You should apply if: ? You have the experience and technical skills to build a solid foundation: 4 years in a DevOps, SRE, or related role with hands-on experience building and maintaining infrastructure Expertise with a major cloud provider (preferably Azure) Proficiency with Infrastructure as Code (IaC) tools and Kubernetes ecosystem tools, such as Terraform, Kubernetes, and FluxCD Solid experience with Docker and Kubernetes for container management Knowledge of CI/CD tools (e.g., Jenkins, GitLab CI, CircleCI, GitHub Actions) and experience setting up automated workflows Familiarity with monitoring and logging tools like Prometheus, Grafana, ELK stack, or DataDog Strong scripting skills (Python, Bash, or similar) for automation and tooling Youre motivated by ownership and impact: Youre ready to take ownership of critical systems and be the go-to person for system reliability You enjoy solving complex technical challenges and proactively seek solutions You have strong collaboration and communication skills: You can work effectively with stakeholders, document processes clearly, and explain technical concepts to non-technical team members