Site Reliability Engineer

Digital Health Technology team powers digital experiences and engagement to enhance the lives of millions of people every day through connected care. We build, deliver and manage a portfolio of data management platforms and mobile offerings in support of our core businesses. We thrive on simple and elegant architecture and agility. You’ll be immersed in a dynamic high-growth environment and empowered to excel, take informed risks, and drive ingenuity across the enterprise.

Job Description

Digital Health Technology team powers digital experiences and engagement to enhance the lives of millions of people every day through connected care. We build, deliver and manage a portfolio of advanced analytics, web and mobile application products in support of our core businesses. We thrive on simple and elegant architecture and agility. You’ll be immersed in a dynamic high-growth environment and empowered to excel, take informed risks, and drive ingenuity across the enterprise.

Let’s talk about the team and you:

We are looking for a highly technical Site Reliability Engineer to join our growing team of Observability Engineering. You will be detail-oriented and possess expertise in both qualitative and quantitative analysis with a passion for automation and delivering actionable insights to key stakeholders.

The ideal candidate will configure, tune, and troubleshoot multi-tiered systems to achieve optimal application performance, stability and availability. You will be working alongside other engineering and support teams at ResMed, and your experience in logging, metrics and synthetic tracing will be critical in this role.

Let’s talk about Responsibilities:

As an SRE Engineer, you’ll work collaboratively with other engineering team members to deploy software and maintain and operate our systems; assist in automating and streamlining our operations and processes; maintain tools for operations and monitoring of critical systems; troubleshoot and resolve issues in our production environments; and maintain uptime for our sites, apps, and content. You will do this by:

  • Advocate automation and remove operation load with software
  • Apply in-depth troubleshooting and debugging skills and technical knowledge of systems, databases, and applications to get to the root cause of the customer’s issue. Apply testing methodology and debugging skills to narrow down the problem as needed.
  • Own the design, development, and maintenance of ongoing and ad-hoc metrics, reports, dashboards, and analyses, to drive key business decisions
  • Enable effective decision making by retrieving and aggregating data from multiple sources and compiling it into a tangible and actionable format
  • Design and influence operational best practices for reporting and analytics to enable the team to scale as we grow

Let’s talk about Qualifications and Experience:

  • Bachelor in engineering, computer science, or other technical disciplines
  • At least 3 years of experience running high availability systems and supporting distributed infrastructure
  • Proven track record to use languages like Go, Python, or similar to simplify complex problems
  • Expertise in managing distributed system observability, including health, telemetry, and logs
  • Advanced knowledge of SQL and NoSQL databases and analytical platforms (be able to efficiently extract data from multi-source datasets)
  • Experience operating solutions built on top of AWS
  • Experience in automation based on Terraform
  • Knowledge of Linux and containerization systems
  • Advanced knowledge of application, data, and infrastructure architecture disciplines
  • Strong sense of ownership, customer service, and integrity demonstrated through clear communication

Let’s talk about what you can expect:

  • A supportive environment that focuses on people development and best practices
  • Opportunity to design, influence and be innovative
  • Work with global teams and share new ideas
  • Be supported both inside and outside of the work environment
  • The opportunity to build something meaningful and see a direct positive impact on people’s lives

Joining us is more than saying “yes” to making the world a healthier place. It’s discovering a career that’s challenging, supportive and inspiring. Where a culture driven by excellence helps you not only meet your goals, but also create new ones. We focus on creating a diverse and inclusive culture, encouraging individual expression in the workplace and thrive on the innovative ideas this generates. If this sounds like the workplace for you, apply now!