Engineering Operations

Site Reliability Engineer (SRE)

About the Role

As a Site Reliability Engineer, you will be responsible for ensuring TurboVets’ platform remains available, reliable, and performs optimally. You will work closely with DevOps and engineering teams to monitor systems, manage incidents, and develop strategies for resilience and scaling. This role is ideal for someone passionate about uptime, reliability, and infrastructure management.

Responsibilities:

  • Monitor system performance, detect incidents, and ensure uptime.
  • Develop tools and practices to automate reliability and performance checks.
  • Collaborate with teams to identify and resolve potential issues proactively.
  • Manage incidents, troubleshoot issues, and implement preventive measures.

Qualifications:

  • Bachelor’s degree in Computer Science, Engineering, or related field.
  • 3+ years of experience in site reliability or infrastructure engineering.
  • Proficient in monitoring tools and incident management practices.
  • Knowledge of scripting languages (Python, Bash) and cloud environments.

Who You Are:

  • An analytical thinker with a passion for maintaining high system reliability.
  • Organized, proactive, and quick to act in high-pressure situations.
  • Committed to building a stable, resilient platform that meets user expectations.

Equal Opportunity Statement

Apply

Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.