Senior Site Reliability Engineer
GitLab
Location: 🌍 United States
Job Type: 💼 Full‑time
Posted: 📅 4 days ago
Tags: Engineering, sre, site-reliability, devops, cloud-engineering, public-sector, scalability
About the Role
Join our team as a Senior Site Reliability Engineer, where you will be instrumental in ensuring the reliability, scalability, and efficiency of critical user-facing services and production systems for our US Public Sector clients. This is an exciting opportunity to combine a pragmatic operations mindset with strong software engineering principles in a remote, US-based role.
Responsibilities
- Design, implement, and maintain highly available and scalable production systems.
- Develop and improve monitoring, alerting, and incident response systems to ensure optimal performance and uptime.
- Automate operational tasks and infrastructure provisioning using modern SRE practices.
- Collaborate with development teams to define and implement best practices for system architecture and operational readiness.
- Participate in on-call rotations to provide timely support for critical infrastructure and services.
- Troubleshoot complex production issues across various layers of the stack.
Requirements
- Proven experience as a Site Reliability Engineer or DevOps Engineer, or in a similar role, preferably in a senior capacity.
- Strong background in managing and operating large-scale distributed systems.
- Proficiency in at least one programming language (e.g., Python, Go, Ruby) for automation and tooling.
- Hands-on experience with cloud platforms (e.g., AWS, GCP, Azure) and infrastructure-as-code tools (e.g., Terraform).
- Expertise with containerization and orchestration technologies (e.g., Docker, Kubernetes).
- Deep understanding of Linux operating systems, networking, and security best practices.
- Excellent problem-solving skills and a proactive approach to system reliability.
Nice-to-Haves
- Experience working with US Public Sector clients or understanding of relevant compliance standards.
- Familiarity with DevSecOps principles and practices.
- Contributions to open-source projects or community involvement.
About GitLab
GitLab is an open core software company that develops the most comprehensive DevSecOps Platform used by more than 100,000 organizations.
Similar Jobs
Intermediate Site Reliability Engineer at GitLab (1 day ago)
🌍 Europe, 💼 Full‑time
Tags: sre, devops, gitlab, release-management

Intermediate Site Reliability Engineer, Database Operations at GitLab (1 day ago)
🌍 EMEA, 💼 Full‑time
Tags: sre, database-operations, reliability, postgresql

Site Reliability Engineer at Zapier (1 week ago)
🌍 Worldwide, 💼 Full‑time
Tags: site-reliability, devops, cloud-engineering, automation

Principal Engineer, Production Engineering at GitLab (1 week ago)
🌍 Worldwide, 💼 Full‑time
Tags: production-engineeri, site-reliability, devsecops, scalability

Senior DevOps Engineer at Udacity (3 weeks ago)
🌍 United States, 💵 50k-100k, 💼 Full‑time
Tags: devops, aws, kubernetes, terraform

Intermediate Site Reliability Engineer, Networking and Incident Management at GitLab (4 weeks ago)
🌍 Worldwide, 💵 50k-100k, 💼 Full‑time
Tags: sre, networking, incident-management, devops