Skip to main content
Posted 03 July, 2026

Site Reliability Engineer

DOCOsoft
Dublin, Dublin D04V2N9, Ireland Full Time
Reference: 1348903905

As a Senior Azure Site Reliability Engineer, you will play a critical role in ensuring the reliability, availability, and performance of our Vew SaaS platform hosted on Microsoft Azure.

You will work closely with Engineering, DevOps, and Infrastructure teams to design and operate scalable, resilient systems. This role is focused on automation, observability, incident response, and continuous improvement, ensuring our platform remains stable, secure, and operationally mature as it scales.

Responsibilities:
  • Design, implement, and operate highly available, scalable, and fault-tolerant systems on Microsoft Azure.
  • Define, track, and improve reliability metrics, including service health indicators and operational performance reporting.
  • Develop and maintain Infrastructure as Code using Bicep, ARM, Terraform, or similar tooling to ensure consistent and reproducible environments.
  • Build automation for provisioning, deployment, scaling, and operational workflows, reducing manual intervention and operational toil.
  • Enhance CI/CD pipelines in collaboration with DevOps to improve deployment safety, reliability, and efficiency.
  • Implement and maintain monitoring, logging, tracing, and alerting solutions to ensure real-time visibility and rapid issue detection.
  • Define meaningful alerting strategies that reduce noise and improve response effectiveness.
  • Lead incident response activities, including structured troubleshooting, stakeholder communication, root cause analysis, and post-incident reviews.
  • Strengthen incident and problem management processes to improve SLA adherence and customer impact mitigation.
  • Implement systemic improvements to prevent repeat incidents rather than applying short-term workarounds.
  • Embed security and compliance best practices across infrastructure, including access control, encryption, and policy enforcement.
  • Drive continuous service improvement initiatives to enhance performance, reliability, efficiency, and operational maturity.
  • Collaborate closely with Development and QA teams to improve application resilience and supportability.


Key Requirements:
  • Proven experience as a Site Reliability Engineer or in a similar reliability-focused role within a SaaS or cloud-native environment.
  • Strong hands-on experience operating production workloads on Microsoft Azure across compute, networking, storage, and monitoring services.
  • Infrastructure as Code expertise using Bicep, ARM, Terraform, or similar tools.
  • Strong automation and scripting capability (PowerShell essential; additional scripting languages advantageous).
  • Experience working with containerised environments (Docker) and orchestration concepts such as Kubernetes.
  • Practical experience with observability tooling such as Azure Monitor, Grafana, Prometheus, Datadog, or OpenTelemetry.
  • Strong understanding of structured incident response, root cause analysis, SLA/SLO concepts, and reliability engineering practices.
  • Knowledge of security best practices and compliance standards such as ISO27001, SOC 2, and GDPR.
  • Strong problem-solving capability with the ability to troubleshoot complex, distributed systems.
  • Effective communication skills and ability to collaborate across engineering, operations, and business stakeholders.
  • Azure certifications (e.g., Azure Administrator Associate, Azure Solutions Architect Expert) are desirable.


Who we are:

DOCOsoft is a leading software and services provider to Lloyd's of London and the broader London insurance market. Since our foundation, we have grown to become one of the leading insurance software specialists in the London Insurance Market. We are a growing team of over 105 colleagues based in Dublin, London, Tokyo, Portugal, Spain, India and Poland.

Here's what we have to offer:

DOCOsoft aspires to be a market leader in the technology sector and we are always looking for new ways to improve how we deliver value. We hire people who bring hard work, enthusiasm and their own ideas.

We offer:
  • 25 days Annual Leave
  • Private pension
  • Bonus scheme
  • Private health
  • Life assurance


Equal Opportunity Employer:

DOCOsoft is committed to building an inclusive and diverse team that represents a variety of backgrounds, experiences and perspectives. We welcome applications from all suitably qualified candidates and do not discriminate on any legally protected grounds. If you require reasonable accommodation during any stage of the recruitment process, please let us know.

Sign up for Job Alerts