The Role
Tesla's Traffic Engineering team is seeking a Site Reliability Engineer (SRE) to help design, build, and operate systems for managing Global Application Traffic (GTM). Our Traffic Engineering team builds global and local load balancers, application acceleration proxies, content delivery networks, data ingestion pipelines, and DNS, along with the automation required to operate a terabit-scale edge network that handles all of Tesla's DNS and HTTP traffic across POPs, data centers, and multiple cloud providers.
Responsibilities:
· Work with the team to design, build, and maintain systems for caching, load balancing, and DNS.
· Diagnose and troubleshoot complex distributed systems handling large volumes of data, and develop solutions that have a significant impact at scale.
· Participate in building advanced tooling, primarily in Python, for testing, monitoring, administration, and operations of multiple clusters across geographically distributed data centers.
· Collaborate across teams such as Application Services, Linux Kernel and Capacity Planning, Hardware, Network, and Datacenter Operations.
· Troubleshoot issues across the entire stack: hardware, software, application, and network.
· Perform day-to-day administration of multi-site distributed BIG-IP LTM, Avi, and HAProxy instances.
· Author technical documentation for workflows, processes, and best practices.
· Take part in a 24x7 on-call rotation.
Minimum Qualifications:
· 3-5+ years of experience managing services in a distributed, internet-scale *nix environment.
· Familiarity with systems management tools (Puppet, Chef, Ansible, etc.).
· Ability to prioritize tasks and work independently.
· Track record of practical problem solving under pressure.
· Excellent communication and documentation skills.
· BS or MS degree in Computer Science or Engineering, or equivalent experience.
Preferred Experience:
· Demonstrable knowledge of TCP/IP, Linux operating system internals, filesystems, disk/storage technologies, and storage protocols.
· Strong fundamentals in HTTP, including Cache-Control headers.
· Hands-on operational experience managing cache services (Memcached, Redis).
· Familiarity with data onboarding procedures, CIM compliance, and data normalization techniques.
· Proficiency in Git and repository management.
· Expert-level Python and shell scripting experience.