Singapore, SG-Singapore
Posted 6 months ago
About The Company This company pioneers short-form video creation and social engagement, boasting a vast, engaged user base. Its platform empowers users with creative tools, filters, and effects. With a diverse content ecosystem, it’s a hub of creativity and expression. The proprietary algorithm ensures personalized content feeds, enhancing user engagement and satisfaction. This company wields significant influence on digital media, making it an invaluable partner for innovative collaborations and marketing endeavors. Responsibilities – Building and managing the Global SRE team, including team recruitment, new talent training, system operation/maintenance/coordination and team culture building. – Improve the cross-team/time zone/regional cooperation mechanism, and provide SRE solutions in line with actual business scenarios based on business orientation. – Responsible for SRE team arrangement and project management, guiding basic SRE work to be more effective, and improving the overall SRE efficiency. – Develop process specifications and plans for compliant access, configuration, disaster recovery and fault handling of critical paths of overseas SRE services. – Responsible for continuously improving the core SRE capabilities of Global E-commerce SRE in efficiency, cost, quality, security, etc. – Develop automation, data visualization and automated monitoring processes to facilitate the optimization of the TikTok e-Commerce platform infrastructure. – Drive the design and engineering of tools, as well as platform solutions, to optimize product engineering and operation efficiencies. – Manage oncall processes to respond to performance and reliability issues, and establish best practices for coordinating escalation to resolve issues and minimize downtime. – Oversee the acquisition and development of software systems in organisational units. – Monitor the results and quality of the different software solutions and projects implemented in the organisation. – Oversee the development of Proof-of-Concept/ solutions and provide technical expertise on the development of software and platform features, ensuring that appropriate security and risk factors are considered. Qualifications – Bachelor’s or higher degree in Computer Science, Information Technology, Programming & System Analysis, Science (Computer Studies) or related discipline and good English communication skills. – Familiar with SRE related processes, understand the development trend of SRE technology in the industry, and have a good ability to build an SRE system, 5 years+ experience in E-commerce industry. – Familiar with cloud computing technologies of Amazon Web Services, Google Cloud Platform and other suppliers. – Demonstrable experience in one or more programming languages such as Java, C++, Go, or scripting experience in Shell and Python. – Expertise in operations, deployment, high availability and quality assurance of large-scale distributed systems, with a strong focus on stability and performance. – Possesses a strong sense of responsibility, a proactive team spirit, and a strong ability to comprehensively analyze and solve problems. Ideal Candidate – Agile, quick self-learner, highly self-motivated with strong sense of product ownership and creative problem solver. – Operational experience running a 24×7 production infrastructure at scale. – Ability to lead independent research to solve complex technical problems. – Empathetic and results-oriented leader and mentor. – Good collaborator and team player, comfortable working in a fast moving, culturally diverse and globally distributed team environment. |
Job Features
Job Category | DevOps & SRE |
Seniority | Manager / Senior Manager |
Recruiter | jack.cheng@ocbridge.ai |