Site Reliability Engineer III
Company
Vimeo
Date Posted
24-08-2025
Location
Bengaluru, Karnataka, India
As a Site Reliability Engineer, you’ll work closely with other SREs and developers to ensure Vimeo remains available, fast, and secure. We own the core infrastructure on which most Vimeo apps sit, including system configuration, basic network services, container orchestration, metrics collection, and load balancing. We're building tools that are used by all of engineering to manage production, with the goal of providing a consistent and powerful platform. You'll be instrumental in making Vimeo the best toolset for video creators.
What you'll do:
- With the rest of the SRE team, own the base production infrastructure: cluster provisioning, configuration management, load balancing, access control, container orchestration, and the tools we use to interact with all that stuff.
- Build tools to make our infrastructure more consistent, more reliable, more observable, and require less manual intervention.
- Consult with other engineering teams at Vimeo to improve their reliability and operational processes.
- Manage changes to our production environment.
- Take part in a 12x7 on-call rotation (split with our team in Bangalore), to triage and troubleshoot production issues.
Skills and knowledge you should possess:
- 5+ years of experience with Linux system internals, networking fundamentals (TCP/IP, HTTP, TLS, DNS), application optimization, and/or similar SRE-adjacent fields.
- Experience building applications, beyond basic scripting, using at least one language Vimeo uses (Python, Go, Ruby, or PHP) and a desire to learn more. The SRE team uses primarily Python and Go, with occasional shell scripts.
- An understanding of configuration management tools (we use Chef, Atlantis, Terraform), and containerization toolsets.
- Working knowledge of large scale system design, monitoring, and operational practices.
- A bachelor's degree in Computer Science or a related technical field, or equivalent practical system administration and programming experience.
Bonus points:
- Experience in high-throughput environments (>100k requests/second).
- Experience with at least one major cloud provider (Google Cloud, Amazon Web Services, or Microsoft Azure)
- A track record of automating away your job.
- An appreciation of and enthusiasm for software engineering best practices, such as code review, testing, and continuous delivery.
About Us: