1. 系統維護與監控:負責設計、構建和維護高可靠性的生產系統。持續監控系統性能,並確保系統達到既定的服務水平目標(Service Level Objectives, SLOs)。
2. 故障分析與解決:快速響應系統中斷和性能問題,進行根本原因分析(Root Cause Analysis, RCA),並實施長期解決方案以防止問題再次發生。
3. 自動化與工具開發:開發和部署自動化工具來提高系統效率和減少人為錯誤。這包括自動化部署、故障恢復和其他常規維護任務。
4. 跨部門協作:與開發、運營和產品管理團隊緊密合作,以確保技術解決方案滿足功能和性能要求。積極參與產品的設計和改進過程,提供可靠性和可維護性的反饋。
5. 性能優化:分析現有系統的性能,識別瓶頸並實施優化策略,以提高效率和降低成本。
6. 持續學習與技術更新:保持對業界發展的敏感性,學習和實施新技術以不斷提升系統的可靠性和性能。
7. 文件編制與維護:編制詳細的系統架構、配置文檔和操作手冊,以支持團隊成員的瞭解和操作。
---
1. System Maintenance and Monitoring: Responsible for designing, building, and maintaining highly reliable production systems. Continuously monitor system performance to ensure compliance with established Service Level Objectives (SLOs).
2. Incident Analysis and Resolution: Respond quickly to system outages and performance issues, conduct Root Cause Analysis (RCA), and implement long-term solutions to prevent recurrence of problems.
3. Automation and Tool Development: Develop and deploy automation tools to improve system efficiency and reduce human errors. This includes automating deployment, failure recovery, and other routine maintenance tasks.
4. Cross-Departmental Collaboration: Work closely with development, operations, and product management teams to ensure technical solutions meet functional and performance requirements. Actively participate in the design and improvement process of products, providing feedback on reliability and maintainability.
5. Performance Optimization: Analyze the performance of existing systems, identify bottlenecks, and implement optimization strategies to enhance efficiency and reduce costs.
6. Continuous Learning and Technology Upkeep: Stay current with industry developments, learn and implement new technologies to continuously improve system reliability and performance.
7. Documentation and Maintenance: Prepare detailed system architecture, configuration documents, and operational manuals to support the understanding and operations of team members.
1. 精通 AWS 雲端服務,包括但不限於 EC2、Load Balancers、Cloud Watch、RDS 和 IAM。
2. 精通容器技術,尤其是 Docker。
3. 具備 Kubernetes 叢集管理技術,包括但不限於 AWS EKS。
4. 熟悉 Prometheus 和 Grafana 監控工具。
5. 具備至少一種 CI/CD 工具的實戰經驗,如 Ansible 或 Jenkins。
6. 具備至少一種日誌收集工具的實用經驗,如 Elasticsearch。
7. 精通至少一種程式語言及 Shell 腳本編寫。
8. 精通至少一種關聯式資料庫的使用與管理。
9. 精通至少一種基礎設施即代碼(IaC)版本控制工具,如 Terraform 或 Pulumi。
---
1. Expertise in AWS cloud services, including but not limited to EC2, Load Balancers, Cloud Watch, RDS, and IAM.
2. Proficient in container technologies, particularly Docker.
3. Skilled in Kubernetes cluster management, including but not limited to AWS EKS.
4. Familiar with monitoring tools such as Prometheus and Grafana.
5. Experienced with at least one CI/CD tool, such as Ansible or Jenkins.
6. Experienced in using at least one log collection tool, such as Elasticsearch.
7. Proficient in at least one programming language and Shell scripting.
8. Proficient in managing and using at least one relational database.
9. Proficient in at least one Infrastructure as Code (IaC) version control tool, such as Terraform or Pulumi.
出差外派
不需出差
上班時段
日班
遠端工作
現場
上班地點
台北市中山區建國北路三段92號2樓
休假制度
符合勞基法
可到職日
依面談為主
招募人數
1人
股票與獎金
績效獎金
工作經歷
不拘
學歷要求
專科或同等學歷以上
1. 負責業務的需求承接、設計、研發工作 2. 持續改進網站的業務邏輯、系統架構、核心技術等 3. 保證系統高性能、高可用性和高可擴展性 4. 負責相關模塊性能分析及改進,保證系統性能和穩定性...
1. 優化線上專案的效能 2. 完成日常開發工作(包含egret及cocos creator) 3. 參加產品評審,針對設計不合理之處能夠提出最佳化方案 4. 依照產品要求實現功能 5. 與設計溝通資源輸出規範 6. 與項管溝通任務排期...