Jr./Sr. Site Reliability Engineer Jr./Sr. 網站可靠性工程師
Company
17LIVE
Date Posted
16-07-2025
Location
Taipei, Taipei, Taiwan
17 LIVE 歡迎對以下工作內容有興趣的 網站可靠性工程師 加入我們的大家庭!
如果您具備以下工作技能及工作經驗,請不要猶豫立即手刀提出申請:
- 具備 Container 以及 Kubernetes 的基礎知識。
- 具備 CI/CD 流程的基礎概念,有撰寫或維護 CI/CD pipeline 的經驗者佳 (我們主要的使用到的有: CircleCI, Jenkins, ArgoCD, Helm)。
- 具備建置或維護監控系統的相關經驗 (例如:Prometheus, Thanos, Grafana, Elasticsearch, Fluentd, Kibana 等)。
- 了解 Linux 作業系統,並能在 Linux 環境下進行操作與問題排查。
- 熟悉使用 Infrastructure as Code (IaC) 工具管理雲端資源,特別是 Terraform。
- 具備基礎的 Shell Script 撰寫能力。
- 提高可用性:知道如何部署HA架構以及DR架構。
- 參與輪班值班,提供 24/7 支援
- 具備至少一種程式語言的開發經驗 (例如:Go, Python, etc.)
加分條件:
- 具備快速學習新技術與解決問題的能力。
- 曾對開源軟體專案做出貢獻。
- 具備後端服務開發經驗。
我們希望您具備的特質:
- 反應迅速,能快速理解問題並採取行動。
- 做事謹慎,習慣在進行變更或部署前,先進行小範圍的測試或驗證。
- 熱愛學習,對 SRE 領域有高度熱情。
- 良好的溝通能力與團隊合作精神。
If you have the following skills and experience, don’t hesitate to apply — we’d love to hear from you!Required Skills & Experience
- Basic knowledge of Containers and Kubernetes
- Familiar with CI/CD workflows; experience in writing or maintaining CI/CD pipelines is a plus (tools we use include CircleCI, Jenkins, ArgoCD, and Helm)
- Experience in building or maintaining monitoring systems such as Prometheus, Thanos, Grafana, Elasticsearch, Fluentd, Kibana, etc.
- Solid understanding of the Linux operating system, including the ability to operate and troubleshoot in a Linux environment
- Familiar with using Infrastructure as Code (IaC) tools to manage cloud resources, especially Terraform
- Basic scripting skills with Shell scripts
- Knowledge of how to design and deploy high availability (HA) and disaster recovery (DR) architectures to improve system reliability
- Willingness to participate in an on-call rotation and provide 24/7 support
- Experience with at least one programming language (e.g., Go, Python, etc.)
Bonus Points
- Ability to quickly learn new technologies and solve complex problems
- Contributions to open-source projects
- Experience in backend service development
What We Look for in You
- Fast response: Able to quickly understand and act on issues
- Cautious and detail-oriented: Prefer to test and validate changes in a limited scope before deploying widely
- Curious and passionate: Eager to learn and enthusiastic about the SRE field
- Great communicator and team player: Strong collaboration and communication skills