Job Description

Site Reliability Engineer, Traffic Platform

About the Team

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed infrastructure. Our SREs are tasked with ensuring that traffic services are reliable, fault‑tolerant, efficiently scalable and cost‑effective. You will have the opportunity to manage a variety of complex systems at scale, including traffic systems that serve hyperscale data centers and public cloud, and a global load balancer that handles Tbps of traffic.

Responsibilities

  • Build, expand and operate ByteDance’s global traffic platform, including large‑scale systems in public and private clouds and edge data centers.
  • Build tools, automation, visualizations and monitors to support the operation and optimization of the global traffic platform.
  • Work in a fast‑paced environment. Participate in technical operations and rotations in response to performance and ...
