Jobgether

Senior Site Reliability Engineer

Jobgether

Overview

Key role in scaling and securing cloud infrastructure while ensuring system reliability.

Ideal candidate should have 5+ years of experience in SRE or DevOps with strong problem-solving skills.

remoteseniorfull-timeKubernetesAWSPostgreSQLGitHub ActionsArgoCDTerraformDatadogPythonBash

Locations

  • United States

Requirements

  • Minimum 5 years experience in SRE or DevOps
  • Proficiency in Kubernetes and networking security
  • In-depth experience with AWS services
  • Expertise in PostgreSQL administration
  • Familiarity with CI/CD tools like GitHub Actions
  • Strong understanding of Infrastructure as Code using Terraform
  • Experience in observability with Datadog
  • Proficiency in Python and Bash scripting

Responsibilities

  • Ensure system reliability and scalability
  • Participate in on-call rotations
  • Design and manage Kubernetes clusters
  • Architect and maintain AWS infrastructure
  • Automate infrastructure provisioning
  • Enhance observability with monitoring systems
  • Conduct post-incident reviews
  • Document lessons learned

Benefits

  • Competitive salary and equity options
  • Comprehensive health, dental, and vision coverage
  • Life insurance and mental wellness coverage
  • Unlimited Flex Time Off
  • Paid family and medical leave
  • Retirement saving plans
  • Home office setup allowance
  • Annual professional development stipend