
Sunday, 11 January 2026

Site Reliability Engineer, Cloud Infrastructure - USDS | TikTok | Seattle


Location:


Seattle


Employment Type:


Regular


Job Code:


A18401


Responsibilities


The Systems and Networking team is committed to ensuring the seamless operation of TikTok's US physical infrastructure. We handle the provisioning of physical servers and maintain the TikTok US physical network. Additionally, we engage in collaborative efforts with vendors such as OCI and Akamai to manage physical hardware, networks, and uphold assurance and compliance objectives.

We also work closely with our colleagues around the world to build and support various platforms within our US region, including internal platforms that support our daily operations. Our primary goal is to ensure the uninterrupted functionality of TikTok's US physical infrastructure, thereby enabling other internal middleware teams to deliver essential intermediary services to internal business units such as Product, e-Commerce, and Ads/Monetization, all while strictly adhering to compliance standards.


Drive infrastructure automation and tooling: Design, develop, and maintain solutions for efficient operation, optimization, and comprehensive monitoring of global infrastructure, minimizing manual intervention.

Collaborate on service lifecycle management: Partner with engineering teams to design, deploy, operate, and continuously improve robust and scalable systems and services, from inception to refinement.

Ensure service reliability and performance: Proactively monitor system health, conduct performance testing, and manage incidents to maximize uptime, availability, and adherence to defined SLAs/SLOs.

Execute core SRE practices: Perform on-call duties and production operations, including change management, capacity planning, and disaster recovery, while contributing to documentation and process improvements across teams.


Qualifications


Minimum Qualifications

- Proficient in one or more programming languages (e.g., Python, Go, Java, C++).

- Strong understanding of Linux operating systems and open-source technologies.

- Experience in network architecture and troubleshooting, database modeling, cloud systems, and large-scale distributed systems.

- Knowledge of monitoring tools and methodologies (such as Prometheus and Grafana), AIOps, APM, and disaster recovery.

- Experience in designing, analyzing, and building automation and tools for large-scale systems.

- Experience in building solutions with AWS, GCP, Azure, and other cloud services.


Preferred qualifications

- Expertise in any of these tech stacks: Kubernetes, ElasticSearch, ClickHouse, Message Queue, OpenTSDB, Service Mesh, MySQL, Redis, etc.

- Master's degree in Computer Science, Engineering, or a related field.


As a condition of employment, all successful candidates must be able to establish authorization to work in the United States. For this position, the Company does not provide sponsorship for any immigration-related benefits.


Job Information


【For Pay Transparency】Compensation Description (Annually)


The base salary range for this position in the selected city is $129,960 - $246,240 annually.


Compensation may vary outside of this range depending on a number of factors, including a candidate’s qualifications, skills, competencies and experience, and location. Base pay is one part of the Total Package that is provided to compensate and recognize employees for their work, and this role may be eligible for additional discretionary bonuses/incentives, and restricted stock units.


Benefits may vary depending on the nature of employment and the country work location. Employees have day one access to medical, dental, and vision insurance, a 401(k) savings plan with company match, paid parental leave, short-term and long-term disability coverage, life insurance, wellbeing benefits, among others. Employees also receive 10 paid holidays per year, 10 paid sick days per year and 17 days of Paid Personal Time (prorated upon hire with increasing accruals by tenure).


The Company reserves the right to modify or change these benefits programs at any time, with or without notice.


For Los Angeles County (unincorporated) Candidates:


Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state, and local laws including the Los Angeles County Fair Chance Ordinance for Employers and the California Fair Chance Act. Our company believes that criminal history may have a direct, adverse and negative relationship on the following job duties, potentially resulting in the withdrawal of the conditional offer of employment:


1. Interacting and occasionally having unsupervised contact with internal/external clients and/or colleagues;


2. Appropriately handling and managing confidential information including proprietary and trade secret information and access to information technology systems; and


3. Exercising sound judgment.


About USDS


TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. U.S. Data Security (“USDS”) is a subsidiary of TikTok in the U.S. This new, security-first division was created to bring heightened focus and governance to our data protection policies and content assurance protocols to keep U.S. users safe. Our focus is on providing oversight and protection of the TikTok platform and U.S. user data, so millions of Americans can continue turning to TikTok to learn something new, earn a living, express themselves creatively, or be entertained. The teams within USDS that deliver on this commitment daily span across Trust & Safety, Security & Privacy, Engineering, User & Product Ops, Corporate Functions and more.


On-site presence across teams allows the company to operate with greater speed, alignment, and agility, especially in areas like real-time decision-making, team development, and integrated execution. As such, the company is shifting from a hybrid work model to a fully in-person schedule up to 5 days a week.


Why Join Us


Inspiring creativity is at the core of TikTok's mission. Our innovative product is built to help people authentically express themselves, discover and connect – and our global, diverse teams make that possible. Together, we create value for our communities, inspire creativity and bring joy - a mission we work towards every day.


We strive to do great things with great people. We lead with curiosity, humility, and a desire to make impact in a rapidly growing tech company. Every challenge is an opportunity to learn and innovate as one team. We're resilient and embrace challenges as they come. By constantly iterating and fostering an "Always Day 1" mindset, we achieve meaningful breakthroughs for ourselves, our company, and our users. When we create and grow together, the possibilities are limitless. Join us.


Diversity & Inclusion


TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At TikTok, our mission is to inspire creativity and bring joy. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.


USDS Reasonable Accommodation


USDS is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws. If you need assistance or a reasonable accommodation, please reach out to us at https://tinyurl.com/USDS-RA


https://lifeattiktok.com/search/7535215880826538247

Site Reliability Engineer, Recommendation Infrastructure - USDS | TikTok | Seattle


Location:


Seattle


Employment Type:


Regular


Job Code:


LWR2


Responsibilities


About the team

The USDS TikTok Recommendations Infra SRE team works with engineering and product teams to build and run large-scale, globally distributed, observable, fault-tolerant systems. SREs on this team will deliver on production ownership and be responsible for observability and automation across complex, large-scale service mesh architectures.


Responsibilities:

• Engage in and improve the whole lifecycle of Recommendation systems — from system design consulting through to launch reviews, deployment, operation and refinement

• Deliver tools/software to improve the reliability and scalability of services, automate operations and improve R&D efficiency

• Build availability of large-scale services deployed across global data centers

• Plan, manage and optimize cloud resources utilization, ensuring SLA of large-scale clusters

• Measure and monitor availability, latency and overall service health

• Practice sustainable incident response and postmortems.


Qualifications


Minimum Qualifications

• Bachelor's degree or above in Computer Science or a related field, with 2+ years of related work experience

• SRE experience deploying large-scale systems with high reliability and scalability

• Familiarity with Linux and network system operations

• Experience programming in at least one of the following languages: Python, Perl, Go, or C/C++

• Experience in designing, analyzing and troubleshooting large-scale distributed systems

• Familiar with popular CI/CD procedures and environments

• Effective communication skills and a sense of ownership and drive


As a condition of employment, all successful candidates must be able to establish authorization to work in the United States. For this position, the Company does not provide sponsorship or any immigration-related benefits.


Job Information


【For Pay Transparency】Compensation Description (Annually)


The base salary range for this position in the selected city is $129,960 - $246,240 annually.






https://lifeattiktok.com/search/7002060678023022885

Software Engineer Graduate (Site Reliability Engineering) - 2026 Start (BS/MS) | TikTok | San Jose


Location:


San Jose


Employment Type:


Regular


Job Code:


A215974A


Responsibilities


TikTok’s Generalized Architecture US Tech and Operations team is dedicated to ensuring that TikTok’s core services run stably, efficiently, and cost-effectively at global scale. We focus on enhancing the observability and operability of our infrastructure and services, using data-driven insights to safeguard business stability 24/7.


We are looking for talented individuals to join our team in 2026. As a graduate, you will get opportunities to pursue bold ideas, tackle complex challenges, and unlock limitless growth. Launch your career where inspiration is infinite at TikTok.


Successful candidates must be able to commit to an onboarding date by end of year 2026. Please state your availability and graduation date clearly in your resume.


Candidates can apply to a maximum of two positions and will be considered for jobs in the order they apply. The application limit applies to jobs at TikTok and its affiliates globally. Applications will be reviewed on a rolling basis. We encourage you to apply as early as possible.


Candidates who pass resume screening will be invited to participate in TikTok's technical online assessment.


Responsibilities:

- Ensure the stability and reliability of TikTok’s core services; respond quickly to production incidents and build mechanisms and platforms to continuously improve incident handling efficiency.

- Define and maintain system quality SLAs through continuous, comprehensive data operations; identify and manage system risks to improve reliability, scalability, and performance.

- Participate in TikTok’s disaster recovery initiatives, including risk assessment, disaster recovery design, capacity planning, and contingency plan development, to strengthen system resilience and fault tolerance.

- Develop and accumulate best practices, tools, and frameworks for operations and maintenance; provide guidance on system architecture design and component selection; produce high-quality technical and operational documentation.


Qualifications


Minimum Qualifications:

- BS/MS degree in Computer Science or equivalent majors/experience

- Foundation in computer science and software engineering, with understanding of operating systems (especially Linux), storage systems, and network I/O principles

- Proficiency in one or more programming languages, such as Python, Go, Java, PHP, C, or C++

- Strong problem-solving skills with a systematic approach, effective communication abilities, and a strong sense of ownership and responsibility


Preferred Qualifications:

- Experience building AI-powered tools to improve SRE/operations efficiency (e.g., intelligent runbooks, automated incident response, anomaly detection, or self-healing systems)


Job Information


【For Pay Transparency】Compensation Description (Annually)


The base salary range for this position in the selected city is $118,657 - $187,200 annually.




About TikTok


TikTok is the leading destination for short-form mobile video. At TikTok, our mission is to inspire creativity and bring joy. TikTok's global headquarters are in Los Angeles and Singapore, and we also have offices in New York City, London, Dublin, Paris, Berlin, Dubai, Jakarta, Seoul, and Tokyo.




TikTok Accommodation


TikTok is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws. If you need assistance or a reasonable accommodation, please reach out to us at https://tinyurl.com/RA-request


https://lifeattiktok.com/search/7591947799615260933

Senior Machine Learning Ops Engineer, Global SRE | TikTok | San Jose


Location:


San Jose


Employment Type:


Regular


Job Code:


A04380


Responsibilities


The MLOps - Global SRE team is responsible for the stability of machine learning systems under the Global Monetization Products and Technology organization, ensuring the stable and efficient operation of machine learning models across data preparation, development, training, deployment, and serving.


Responsibilities

1) Responsible for setting SLOs of online machine learning serving systems, maintaining the stability of the online serving systems.

2) Responsible for maintaining stability of offline machine learning training tasks, improving the success rate of the training tasks.

3) Responsible for rolling out GPU model training in non-China regions.

4) Responsible for stability of AIGC related machine learning tasks.

5) Responsible for resource management and planning of machine learning resources, including cost and budget, resource efficiency improvements, and offline and online resource tides.


Qualifications


Minimum Qualifications

1) Bachelor's degree in Computer Science, Software Engineering, or a similar technical field of study, or equivalent practical experience.

2) Expertise in Linux operating systems, networking, storage.

3) Experience programming in at least one of the following programming languages: Python, Go, C, C++, or Java.

4) Experience in troubleshooting application issues, or production operations.

5) Effective communication skills and a sense of ownership and drive.


Preferred qualifications:

1) Experience in SRE of machine learning systems.

2) Experience in SRE of ads/recommendation/search systems.


Job Information


【For Pay Transparency】Compensation Description (Annually)


The base salary range for this position in the selected city is $187,040 - $359,720 annually.






https://lifeattiktok.com/search/7320146691230845211

Tech Lead Site Reliability Engineer, TikTok Generalized Arch USTO | TikTok | San Jose


Location:


San Jose


Employment Type:


Regular


Job Code:


A156234


Responsibilities


TikTok’s Generalized Architecture US Tech and Operations team is dedicated to ensuring that TikTok’s core services run stably, efficiently, and cost-effectively at global scale. We focus on enhancing the observability and operability of our infrastructure and services, using data-driven insights to safeguard business stability 24/7.


- Ensure the stability and reliability of TikTok’s core services; respond quickly to production incidents and build mechanisms and platforms to continuously improve incident handling efficiency.

- Define and maintain system quality SLAs through continuous, comprehensive data operations; identify and manage system risks to improve reliability, scalability, and performance.

- Participate in TikTok’s disaster recovery initiatives, including risk assessment, disaster recovery design, capacity planning, and contingency plan development, to strengthen system resilience and fault tolerance.

- Develop and accumulate best practices, tools, and frameworks for operations and maintenance; provide guidance on system architecture design and component selection; produce high-quality technical and operational documentation.


Qualifications


Minimum Qualifications

- Bachelor’s degree or above in Computer Science or a related field.

- Solid foundation in computer science and software engineering, with understanding of operating systems (especially Linux), storage systems, and network I/O principles.

- Proficiency in one or more programming languages, such as Python, Go, Java, PHP, C, or C++.

- Strong problem-solving skills with a systematic approach, effective communication abilities, and a strong sense of ownership and responsibility.


Preferred Qualifications

- 8+ years of relevant experience in a large-scale internet or cloud-based business environment.

- Hands-on experience building AI-powered tools to improve SRE/operations efficiency (e.g., intelligent runbooks, automated incident response, anomaly detection, or self-healing systems).


Job Information


【For Pay Transparency】Compensation Description (Annually)


The base salary range for this position in the selected city is $208,800 - $438,000 annually.




About TikTok


TikTok is the leading destination for short-form mobile video. At TikTok, our mission is to inspire creativity and bring joy. TikTok's global headquarters are in Los Angeles and Singapore, and we also have offices in New York City, London, Dublin, Paris, Berlin, Dubai, Jakarta, Seoul, and Tokyo.​


Why Join Us


Inspiring creativity is at the core of TikTok's mission. Our innovative product is built to help people authentically express themselves, discover and connect – and our global, diverse teams make that possible. Together, we create value for our communities, inspire creativity and bring joy - a mission we work towards every day.​


We strive to do great things with great people. We lead with curiosity, humility, and a desire to make impact in a rapidly growing tech company. Every challenge is an opportunity to learn and innovate as one team. We're resilient and embrace challenges as they come. By constantly iterating and fostering an "Always Day 1" mindset, we achieve meaningful breakthroughs for ourselves, our company, and our users. When we create and grow together, the possibilities are limitless. Join us.​



Diversity & Inclusion​


TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At TikTok, our mission is to inspire creativity and bring joy. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.​


TikTok Accommodation


TikTok is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws. If you need assistance or a reasonable accommodation, please reach out to us at https://tinyurl.com/RA-request


https://lifeattiktok.com/search/7537751924243892498

Infrastructure Site Reliability Engineer (Entry Level) - USDS | Tiktok | Seattle



Infrastructure Site Reliability Engineer (Entry Level) - USDS

Location:


Seattle


Employment Type:


Regular


Job Code:


PNH2


Responsibilities


Site Reliability Engineering (SRE) at TikTok combines software and systems engineering to build and run large-scale, massively distributed, and fault-tolerant systems. In our team, you’ll have the opportunity to manage the complex challenges of scale, while using expertise in coding, algorithms, complexity analysis, and large-scale system design. We embrace a culture of diversity, intellectual curiosity, openness, and problem-solving. We encourage close collaboration while promoting self-direction.


Responsibilities

- Engage in and improve the whole lifecycle of services, from inception and design, through development, capacity planning, and launch reviews, to deployment, operation, and automation

- Design and implement various dashboards and monitoring frameworks for efficient, automated, and intelligent service-oriented architecture (SOA) governance

- Scale systems elastically through mechanisms such as automation; evolve systems reliability, efficiency, and velocity by pushing for changes

- Practice efficient customer support, incident response, and blameless postmortems.


Qualifications


Minimum Qualifications:

- Bachelor's degree in Computer Science or a related technical field

- Industrial or internship experience in accredited internet or cloud companies

- Proficient in one of the following programming languages: Python, GoLang, Java, Shell

- Familiar with Linux system internals, networking, and distributed systems

- Strong interpersonal and communication skills


Preferred Qualifications:

- Experience in MySQL, Redis, Kubernetes, Docker, Hadoop, Spark, Flink, HDFS, etc.

- Experience in designing and analyzing large-scale distributed systems


As a condition of employment, all successful candidates must be able to establish authorization to work in the United States. For this position, the Company does not provide sponsorship or any immigration-related benefits.


Job Information


【For Pay Transparency】Compensation Description (Annually)


The base salary range for this position in the selected city is $112,725 - $177,840 annually.


Compensation may vary outside of this range depending on a number of factors, including a candidate’s qualifications, skills, competencies and experience, and location. Base pay is one part of the Total Package that is provided to compensate and recognize employees for their work, and this role may be eligible for additional discretionary bonuses/incentives, and restricted stock units.​


Benefits may vary depending on the nature of employment and the country work location. Employees have day one access to medical, dental, and vision insurance, a 401(k) savings plan with company match, paid parental leave, short-term and long-term disability coverage, life insurance, wellbeing benefits, among others. Employees also receive 10 paid holidays per year, 10 paid sick days per year and 17 days of Paid Personal Time (prorated upon hire with increasing accruals by tenure).​


The Company reserves the right to modify or change these benefits programs at any time, with or without notice.​


For Los Angeles County (unincorporated) Candidates:​


Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state, and local laws including the Los Angeles County Fair Chance Ordinance for Employers and the California Fair Chance Act. Our company believes that criminal history may have a direct, adverse and negative relationship on the following job duties, potentially resulting in the withdrawal of the conditional offer of employment:​


1. Interacting and occasionally having unsupervised contact with internal/external clients and/or colleagues;​


2. Appropriately handling and managing confidential information including proprietary and trade secret information and access to information technology systems; and​


3. Exercising sound judgment.​


About USDS


TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. U.S. Data Security (“USDS”) is a subsidiary of TikTok in the U.S. This new, security-first division was created to bring heightened focus and governance to our data protection policies and content assurance protocols to keep U.S. users safe. Our focus is on providing oversight and protection of the TikTok platform and U.S. user data, so millions of Americans can continue turning to TikTok to learn something new, earn a living, express themselves creatively, or be entertained. The teams within USDS that deliver on this commitment daily span across Trust & Safety, Security & Privacy, Engineering, User & Product Ops, Corporate Functions and more.​



On-site presence across teams allows the company to operate with greater speed, alignment, and agility — especially in areas like real-time decision-making, team development, and integrated execution. As such, the company is shifting from a hybrid work model to a fully in-person schedule up to 5 days a week.​


Why Join Us


Inspiring creativity is at the core of TikTok's mission. Our innovative product is built to help people authentically express themselves, discover and connect – and our global, diverse teams make that possible. Together, we create value for our communities, inspire creativity and bring joy - a mission we work towards every day.​


We strive to do great things with great people. We lead with curiosity, humility, and a desire to make impact in a rapidly growing tech company. Every challenge is an opportunity to learn and innovate as one team. We're resilient and embrace challenges as they come. By constantly iterating and fostering an "Always Day 1" mindset, we achieve meaningful breakthroughs for ourselves, our company, and our users. When we create and grow together, the possibilities are limitless. Join us.​



Diversity & Inclusion​


TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At TikTok, our mission is to inspire creativity and bring joy. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.​


USDS Reasonable Accommodation


USDS is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws. If you need assistance or a reasonable accommodation, please reach out to us at https://tinyurl.com/USDS-RA


https://lifeattiktok.com/search/7003009067103734030

Production System Engineer - San Jose | Tiktok | San Jose



Production System Engineer - San Jose

Location:


San Jose


Employment Type:


Regular


Job Code:


R7TF1


Responsibilities


The Data Systems Infrastructure (DSI) team stands as the unseen architects behind the scenes. In a thrilling dance of technology and innovation, we propel the company's meteoric rise by constructing and orchestrating colossal data fortresses, taming the life cycle of server fleets, conjuring Cloud solutions, and crafting a symphony of infrastructure services. Our mission is to ensure scalability and unwavering reliability, making sure ByteDance's digital footprint leaves an indelible mark on the world.


Embark on an exciting expedition to explore the rapidly expanding ByteDance domain in the United States, Europe, and Asia. Here, the Data Systems Infrastructure (DSI) team is crafting monumental data citadels that encircle the planet, sheltering legions of hundreds of thousands of servers. As the maestro of our production systems, you will embark on a captivating odyssey, taming the life cycles of these servers. Your adventure will begin with the orchestration of their initial deployment, navigating the intricate terrain of OS installation, summoning services like a digital magician, and maintaining vigilant watch over our inventory. But, like any epic tale, there will be times of challenge when you become a troubleshooter extraordinaire, mending and restoring with unwavering dedication. Eventually, you'll guide them into the sunset, orchestrating their decommissioning and ensuring their rebirth through recycling, all while contributing to the pulsating rhythm of ByteDance's technological evolution.


Responsibilities:

- Operation: As a Production Systems Engineer, your mission is to contribute to enhancing the stability, efficiency, effectiveness, and scalability of our data center and server operations, platform, and service on a worldwide scale.

- Lifecycle Enhancement: Participate in and enhance the entire lifecycle of the server fleet - from system design/introduction consultation to launch reviews, deployment, operation, and retirement.

- Automation: Develop and deploy tools and solutions to enhance the automation, reliability, scalability, and operability of servers in the datacenter.

- Monitoring: Develop and deploy tools and solutions for improving the availability, latency, and overall service of the datacenter infrastructure, server, and network health.

- Disaster Recovery: Troubleshoot and resolve complex technical issues in a high-pressure, fast-paced environment. Conduct high-level root-cause analysis for service interruption and establish preventive measures. Practice sustainable incident response and postmortem.

- Cross-team Collaboration: Collaborate with stakeholders such as infrastructure architects, project managers, data center operations engineers, platform developers, supply chain teams, and our internal customers to comprehend overarching business objectives. Additionally, you will have the chance to design and implement innovative solutions for our Core IDCs and CDN/Edge.

- On-call: Engage in our on-call support spanning across regions and incident response teams to address critical issues in the production environment.


Qualifications


Minimum Qualifications:

- Education: Bachelor's degree in Computer Science, Electronic Engineering, relevant technical field, or equivalent practical experience.

- Experience in at least one of the areas below:

- Server Operations: Demonstrated proficiency in Linux system administration tasks. In-depth understanding of Linux kernels, drivers, and modules. Able to script in Bash and Python to automate routine system operations, including system configuration, performance tuning, and security management within the Linux environment. In-depth understanding of server hardware, with the ability to troubleshoot and run diagnostics. Experience participating in the planning, delivery, and operation of large-scale data centers in different countries.

- Tooling Adaptation, Deployment, and Maintenance: Proficient in customizing operation and maintenance tools to satisfy specific demands for new server hardware. Competent in managing the entire software tool lifecycle, ranging from deployment to continuous maintenance. This encompasses tasks associated with facilitating the monitoring of server performance, effectively provisioning resources, timely handling of fault management, and conducting repairs to guarantee the smooth operation of new server hardware. Experience in developing and maintaining hardware, network, or service monitoring software for more than 10,000 servers.

- Communication: Experience in managing and coordinating teams in a global context.


Preferred Qualifications:

- 3 years of work experience in a related field.

- Data Center: An intermediate level of expertise is preferred. We are looking for individuals who are proficient in areas ranging from OS installations and break-fix operations to significant projects such as planning and operations (encompassing the entire infrastructure lifecycle), as well as new design-build or retrofit activities for existing systems.

- Proficiency in the operation and maintenance of GPU servers is strongly preferred.

- Full Stack Software Development: We are actively seeking individuals proficient in full-stack software development. The ideal candidates are expected to possess the following preferred skills:

- Be capable of creating and integrating RESTful APIs. This encompasses expertise in using Flask for Python-based back-end development to establish robust API endpoints.

- Have a profound understanding of JavaScript and be capable of leveraging it, along with Node.js, for both front-end and back-end development tasks.

- Demonstrate proficiency in SQL for efficient database management, including designing database schemas, composing queries, and ensuring data integrity; be familiar with Redis.

- Possess experience in Ansible Configuration Management, Application Deployment, and Task Execution.


Job Information


【For Pay Transparency】Compensation Description (Annually)


The base salary range for this position in the selected city is $87,480 - $228,000 annually.


Compensation may vary outside of this range depending on a number of factors, including a candidate’s qualifications, skills, competencies and experience, and location. Base pay is one part of the Total Package that is provided to compensate and recognize employees for their work, and this role may be eligible for additional discretionary bonuses/incentives, and restricted stock units.​


Benefits may vary depending on the nature of employment and the country work location. Employees have day one access to medical, dental, and vision insurance, a 401(k) savings plan with company match, paid parental leave, short-term and long-term disability coverage, life insurance, wellbeing benefits, among others. Employees also receive 10 paid holidays per year, 10 paid sick days per year and 17 days of Paid Personal Time (prorated upon hire with increasing accruals by tenure).​


The Company reserves the right to modify or change these benefits programs at any time, with or without notice.​


For Los Angeles County (unincorporated) Candidates:​


Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state, and local laws including the Los Angeles County Fair Chance Ordinance for Employers and the California Fair Chance Act. Our company believes that criminal history may have a direct, adverse and negative relationship on the following job duties, potentially resulting in the withdrawal of the conditional offer of employment:​


1. Interacting and occasionally having unsupervised contact with internal/external clients and/or colleagues;​


2. Appropriately handling and managing confidential information including proprietary and trade secret information and access to information technology systems; and​


3. Exercising sound judgment.​


About TikTok


TikTok is the leading destination for short-form mobile video. At TikTok, our mission is to inspire creativity and bring joy. TikTok's global headquarters are in Los Angeles and Singapore, and we also have offices in New York City, London, Dublin, Paris, Berlin, Dubai, Jakarta, Seoul, and Tokyo.​


Why Join Us


Inspiring creativity is at the core of TikTok's mission. Our innovative product is built to help people authentically express themselves, discover and connect – and our global, diverse teams make that possible. Together, we create value for our communities, inspire creativity and bring joy - a mission we work towards every day.​


We strive to do great things with great people. We lead with curiosity, humility, and a desire to make impact in a rapidly growing tech company. Every challenge is an opportunity to learn and innovate as one team. We're resilient and embrace challenges as they come. By constantly iterating and fostering an "Always Day 1" mindset, we achieve meaningful breakthroughs for ourselves, our company, and our users. When we create and grow together, the possibilities are limitless. Join us.​



Diversity & Inclusion​


TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At TikTok, our mission is to inspire creativity and bring joy. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.​


TikTok Accommodation


TikTok is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws. If you need assistance or a reasonable accommodation, please reach out to us at https://tinyurl.com/RA-request


https://lifeattiktok.com/search/6776750658629404942

Site Reliability Engineer (Global) - TikTok Server Arch | Tiktok | Singapore



Site Reliability Engineer (Global) - TikTok Server Arch

Location:


Singapore


Employment Type:


Regular


Job Code:


A197417

Responsibilities


This position is with TikTok's Stability Assurance Team. The team is responsible for ensuring that the services provided by TikTok are highly reliable with low latency. Reliability assurance for any massive application system is complex, systematic work; the team focuses on optimizing the application architecture from end to end, driven by data analysis and supported by automatic, intelligent failure recovery.


Job Responsibilities:

1. Ensure the online stability of TikTok and improve product SLA through systematic disaster recovery abilities, standardized emergency mechanisms, and intelligent analysis.

2. Identify system risks and promote governance through comprehensive and multi-perspective quality data.

3. Establish TikTok's unified standards and specifications, design and develop a one-stop operation platform, and enhance efficiency across multiple fields.

4. Collaborate closely with developers to implement best practices in SRE.


Qualifications


Minimum Qualifications:

1. Bachelor's degree or above in a computer-related field.

2. Solid foundational knowledge of computer software; understanding of Linux operating systems, storage, network IO, and related principles.

3. Ability to solve problems systematically, strong communication skills, and a sense of ownership.


Preferred Qualifications:

1. A minimum of 3-5 years of relevant work experience at a large-scale internet business.


Job Information


About TikTok


TikTok is the leading destination for short-form mobile video. At TikTok, our mission is to inspire creativity and bring joy. TikTok's global headquarters are in Los Angeles and Singapore, and we also have offices in New York City, London, Dublin, Paris, Berlin, Dubai, Jakarta, Seoul, and Tokyo.​


Why Join Us


Inspiring creativity is at the core of TikTok's mission. Our innovative product is built to help people authentically express themselves, discover and connect – and our global, diverse teams make that possible. Together, we create value for our communities, inspire creativity and bring joy - a mission we work towards every day.​


We strive to do great things with great people. We lead with curiosity, humility, and a desire to make impact in a rapidly growing tech company. Every challenge is an opportunity to learn and innovate as one team. We're resilient and embrace challenges as they come. By constantly iterating and fostering an "Always Day 1" mindset, we achieve meaningful breakthroughs for ourselves, our company, and our users. When we create and grow together, the possibilities are limitless. Join us.​



Diversity & Inclusion​


TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At TikTok, our mission is to inspire creativity and bring joy. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.


https://lifeattiktok.com/search/7548400703959173383

Friday, 9 January 2026

Staff Software Engineer I, Experimentation Orchestration | Etsy | Brooklyn, New York, United States


           


Staff Software Engineer I, Experimentation Orchestration

 Hybrid 

 Brooklyn 

 Full-time 


Company Description

Etsy is the global marketplace for unique and creative goods. We build, power, and evolve the tools and technologies that connect millions of entrepreneurs with millions of buyers around the world. As an Etsy Inc. employee, whether a team member of Etsy or Depop, you will tackle unique, meaningful, and large-scale problems alongside passionate coworkers, all the while making a rewarding impact and Keeping Commerce Human.


Salary Range:


$204,000.00 - $240,000.00

What’s the role?


Etsy is looking for a passionate Staff Software Engineer to help us enable data-driven learning and decision-making for our Experimentation initiative. Our initiative empowers teams across Etsy to run fast, trustworthy, and scalable experiments — by laying the foundation for how we test ideas, measure impact, and continuously improve our customer experience. 


This is a high-impact opportunity to influence the technical direction of experimentation at Etsy, which affects feature development across the company. We’re looking for someone who thrives in collaborative environments, challenges the status quo, and is a creative problem solver. Additionally, we are looking for someone who embodies Etsy’s blameless engineering culture, as we value clear communication, honest feedback, and empathy as the foundation for how we work together and get things done.


This is a full-time position reporting to the Engineering Manager, Experimentation. In addition to salary, you will also be eligible for an equity package, an annual performance bonus, and our competitive benefits that support you and your family as part of your total rewards package at Etsy.


For this role, we are considering candidates based in the United States. Candidates living within commutable distance of Etsy’s Brooklyn Office Hub or in the San Francisco Bay Area may be the first to be considered. For candidates within commutable distance, Etsy requires in-office attendance once or twice per week depending on your proximity to the office. Etsy offers different work modes to meet the variety of needs and preferences of our team. Learn more details about our work modes and workplace safety policies here.


What’s this team like at Etsy

You would join the Experimentation Orchestration team, which focuses on making experiment setup simple, intuitive, and reliable.


We build the core services, APIs, and tools that help teams configure and launch experiments with confidence. Whether it’s feature flagging, observability, traffic allocation, or ensuring a consistent user experience, our work sits at the heart of a seamless experimentation workflow.


Our mission is to make testing new ideas effortless — so teams can move faster, learn more, and build better products for our buyers and sellers.


What does the day-to-day look like?


Partner with engineers, analysts, and product managers — both within and beyond the Experimentation initiative — to improve Etsy’s experimentation platform. Your work will focus on experiment configuration, orchestration, and event logging.


Take initiative to lead, design, and implement solutions that keep our platform at the forefront of industry standards — and ultimately a reference point for experimentation excellence.


Mentor and uplevel others within the team and across Etsy to help them grow as software engineers.


Advocate for experimentation best practices across engineering teams and contribute to a strong culture of learning and continuous improvement.


Play a key role in driving technical and product direction through thoughtful decision-making and collaboration.


Of course, this is just a sample of the kinds of work this role will require! You should assume that your role will encompass other tasks, too, and that your job duties and responsibilities may change from time to time at Etsy's discretion, or as otherwise applicable under local law.


Qualities that will help you thrive in this role are

You have hands-on experience with A/B testing frameworks and experimentation workflows.


You’re excited about full-stack development, particularly in the context of building experimentation platforms.


You have experience providing technical leadership and mentoring others to grow in their craft.


You’ve built reliable, scalable systems and know how to balance long-term vision with practical delivery.


You write clean, testable code — and help others do the same — in one or more of the following languages: PHP, Python, or Java.


You’re proficient with MySQL (or similar relational databases) and experienced in designing data models that support robust, scalable systems.


You have experience with cloud platforms such as Google Cloud Platform, AWS, or Azure.


You’re familiar with CI/CD tools and practices that support high-quality, rapid software delivery.


You’re familiar with containerization and orchestration tools like Docker and Kubernetes.


You’re familiar with monitoring and alerting systems, such as Prometheus and Grafana.


You’re comfortable using Git and understand different branching strategies.


You’re comfortable working in Linux-based environments and know your way around the command line.


Helpful, but Not Required

Familiarity with front-end technologies, such as React.


Experience with data warehousing (e.g. BigQuery) and ETL processes, using big data frameworks such as Kafka and Spark and orchestration tools such as Airflow.


Additional Information

 


What's Next

If you're interested in joining the team at Etsy, please share your resume with us and feel free to include a cover letter if you'd like. As we hope you've seen already, Etsy is a place that values individuality and variety. We don't want you to be like everyone else -- we want you to be like you! So tell us what you're all about.

 

Our Promise

At Etsy, we believe that a diverse, equitable and inclusive workplace furthers relevance, resilience, and longevity. We encourage people from all backgrounds, ages, abilities, and experiences to apply. Etsy is proud to be an equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or veteran status, or any other characteristic protected by applicable law. If, due to a disability, you need an accommodation during any part of the application or interview process, please let your recruiter know. While Etsy supports visa sponsorship, sponsorship opportunities may be limited to certain roles and skills.


https://careers.etsy.com/jobs/staff-software-engineer-i-experimentation-orchestration-brooklyn-new-york-united-states

Software Engineer II, SRE | Etsy | Brooklyn, New York, United States


           


Software Engineer II, SRE

 Hybrid 

 Brooklyn 

 Full-time 

Company Description

Etsy is the global marketplace for unique and creative goods. We build, power, and evolve the tools and technologies that connect millions of entrepreneurs with millions of buyers around the world. As an Etsy Inc. employee, whether a team member of Etsy or Depop, you will tackle unique, meaningful, and large-scale problems alongside passionate coworkers, all the while making a rewarding impact and Keeping Commerce Human.


Salary Range:


$135,000.00 - $175,000.00

What’s the role?


Etsy’s Services Infrastructure group is looking for a Site Reliability Engineer II to join us in our mission of building and supporting reliable, large-scale Kubernetes infrastructure. The SRE team owns several aspects of the infrastructure for business-critical services (search retrieval and ranking) and Machine Learning models (Kubernetes hosted on Google Cloud), which enables engineers to build and release efficiently and supports the uptime of critical systems behind etsy.com. You will play an instrumental role in crafting the future architecture of how we run our systems in the cloud while being part of a dynamic international team.


You’ll get exposure to a variety of technologies, including Kubernetes, Golang, LLMs, Model Serving, Search Retrieval & Ranking, and more, as you build systems to support the services that serve our 86M active buyers and 5.5M sellers! As the Software Engineer II, SRE, you will drive the adoption of containers and Kubernetes, improve reliability, automate operations, provide a self-service runtime platform to accelerate Etsy’s product & ML engineering, and contribute to the design and implementation of Observability & CI/CD on top of Kubernetes.


Do you find joy in improving developer velocity and have the itch to work on complex large-scale distributed systems? If so, this could be the perfect match.


This is a full-time position reporting to the Senior Engineering Manager. In addition to salary, you will also be eligible for an equity package, an annual performance bonus, and our competitive benefits that support you and your family as part of your total rewards package at Etsy. 


For this role, we are considering candidates based in the United States. Candidates living within commutable distance of Etsy’s Brooklyn Office Hub or in the San Francisco Bay Area may be the first to be considered. For candidates within commutable distance, Etsy requires in-office attendance once or twice per week depending on your proximity to the office. Etsy offers different work modes to meet the variety of needs and preferences of our team. Learn more details about our work modes and workplace safety policies here.


What’s this team like at Etsy?


This team improves the developer experience around building, deploying, releasing, and observing services and ML models transparently on Google Kubernetes Engine. They work on 20+ Kubernetes clusters with hundreds of nodes running services with low latency requirements. This team also standardizes cluster and application security with common admission policies and container vulnerability scanning, and establishes standard SLIs/SLOs for all services running on Kubernetes.


This team works closely with many product and enablement teams across Etsy. This team handles:


20+ Kubernetes clusters with hundreds of nodes running services with low latency requirements.


Build and support the CI/CD platform (Buildkite) used by more than a few hundred engineers to deploy their workloads to GKE.


Maintain and upgrade GKE add-ons (CertManager, Gatekeeper), ingress controllers (Contour, Envoy), various telemetry components (kube-prometheus, AlertManager, Karma), and container security tooling.


Here’s a sneak peek into our Roadmap for the next year


Support multiple Search, ML & Gen AI teams to efficiently utilise GPUs across different zones and regions. Evaluate Build vs Buy decisions within LLM space.


Enable service mesh across GKE and enable a native way of accessing services across the stack.


Standardize cluster and application security and container vulnerability scanning (both during build and run time).


What does the day-to-day look like?


Administer GKE clusters and automate operations like provisioning and service observability. Support the partner teams running their workloads on the Kubernetes Platform.


Provide guidance and collaborate with multi-functional engineering teams to streamline and improve the adoption of Kubernetes


Build paved paths for wider product engineering with codelabs, documentation, automation and self-service portals to develop, deploy and operate services on GKE.


Participate in an on-call rotation and seek opportunities for reducing toil and avoiding technical debt to reduce support and operations load on the team.


Of course, this is just a sample of the kinds of work this role will require! You should assume that your role will encompass other tasks, too, and that your job duties and responsibilities may change from time to time at Etsy's discretion, or otherwise applicable with local law.


Qualities that will help you thrive in this role are:


You have strong software engineering and coding skills and the ability to write high-performance, production-quality code. You have 2+ years of experience in systems/infrastructure engineering, SRE, or DevOps roles, preferably in a cloud environment.


Exposure to container orchestration systems like Kubernetes (traffic ingresses, cluster networking/administration, pod security policies).


Experience iterating on multiple projects on a collaborative team, each of which may have taken months or longer to complete.


Proficiency in one programming language like PHP, Python, or Go.


Hands-on experience with Infrastructure As Code tooling like Terraform and configuration management tooling like Chef/Ansible.


Hands-on debugging experience with Linux-based operating systems.


Working knowledge of ML Operations (MLOps) is nice to have.


Willing to work with and improve on code you did not originally write.


You understand that being an effective software engineer is as much about communicating with people as it is about writing code.


Additional Information

 


What's Next

If you're interested in joining the team at Etsy, please share your resume with us and feel free to include a cover letter if you'd like. As we hope you've seen already, Etsy is a place that values individuality and variety. We don't want you to be like everyone else -- we want you to be like you! So tell us what you're all about.

 

Our Promise

At Etsy, we believe that a diverse, equitable and inclusive workplace furthers relevance, resilience, and longevity. We encourage people from all backgrounds, ages, abilities, and experiences to apply. Etsy is proud to be an equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or veteran status, or any other characteristic protected by applicable law. If, due to a disability, you need an accommodation during any part of the application or interview process, please let your recruiter know. While Etsy supports visa sponsorship, sponsorship opportunities may be limited to certain roles and skills.


https://careers.etsy.com/jobs/software-engineer-ii-sre-brooklyn-new-york-united-states

Staff Software Engineer I, Service Platform Team | Etsy | Brooklyn, New York, United States



Staff Software Engineer I, Service Platform Team

 Hybrid 

 Brooklyn 

 Full-time 

Company Description

Etsy is the global marketplace for unique and creative goods. We build, power, and evolve the tools and technologies that connect millions of entrepreneurs with millions of buyers around the world. As an Etsy Inc. employee, whether a team member of Etsy or Depop, you will tackle unique, meaningful, and large-scale problems alongside passionate coworkers, all the while making a rewarding impact and Keeping Commerce Human.


Salary Range:


$204,000.00 - $240,000.00

What’s the role?


Our Foundation Engineering teams are responsible for innovating and scaling our systems and infrastructure to align with Etsy's rapid growth and future plans. More specifically, our teams build, maintain, and support many of the platforms and services that power Etsy’s technical stack to connect tens of millions of Etsy sellers and customers. We also partner with Etsy engineers to help them achieve paved path development and deployment experiences.


This is a full-time position reporting to the Senior Engineering Manager, Service Platform. In addition to salary, you will also be eligible for an equity package, an annual performance bonus, and our competitive benefits that support you and your family as part of your total rewards package at Etsy.


For this role, we are considering candidates based in the United States. Candidates living within commutable distance of Etsy’s Brooklyn Office Hub or in the San Francisco Bay Area may be the first to be considered. For candidates within commutable distance, Etsy requires in-office attendance once or twice per week depending on your proximity to the office. Etsy offers different work modes to meet the variety of needs and preferences of our team. Learn more details about our work modes and workplace safety policies here.


What’s this team like at Etsy?


You will be joining a mission-critical team responsible for the architecture, implementation, and support of Etsy’s primary containerized Services Platform built on Cloud Run. This new platform supports the service-based architecture initiatives of many Etsy teams, including both product and infrastructure engineering.


We're looking for people who are excellent at working with others, are creative problem-solvers, and are passionate about software as craft. We value clear communication, honest feedback, and empathy for others. This is a truly exciting opportunity to join Etsy as we move into the next stage of our service platform’s evolution, building on its rapid adoption and scaling it to a higher level of maturity for the growing needs of our engineering teams.


What does the day-to-day look like?


Etsy is seeking a Staff Engineer with SRE/Service/DevOps experience to join its growing Foundation Platform and SRE organization.


As a Staff Engineer you will be the technical and career role model on the direct team, in the Platform Initiative, and within the entire infrastructure organization.


You will be responsible for architecting our systems in GCP as well as leading and delivering on multiple critical and complex technical initiatives in GCP.


You will work with stakeholders and internal customers to align their requirements with project roadmaps and technical design. 


At important junctures, you will evaluate pros and cons and provide guidance and recommendations on which design decisions or technical direction to take.


You will help determine project priorities and quarterly deliverables and help drive the team and initiative roadmaps.


You will produce stellar technical specs and documentation that can be read and understood by team members and other technical leads and engineering leaders.


You will mentor and coach junior engineers on the team and in the engineering organization.


You will participate in hiring efforts.


Of course, this is just a sample of the kinds of work this role will require! You should assume that your role will encompass other tasks, too, and that your job duties and responsibilities may change from time to time at Etsy's discretion, or otherwise applicable with local law.


Qualities that will help you thrive in this role are:


You have 7+ years of experience in an SRE or DevOps role in a cloud environment coupled with strong software and/or platform engineering experience.


You have experience as a technical lead or architect for large-scale, distributed, consumer-facing applications.


Bash/Python/Go proficiency is essential. PHP/Node proficiency is a bonus.


You have in-depth knowledge of Linux operating systems and have experience with hypervisors, Linux containers and orchestration managers.


You have design experience with Service Oriented Architecture and hands-on experience with container management tooling (like Cloud Run or Kubernetes).


You are comfortable providing estimates or project ideas that will influence your team’s roadmap.


You are a strong collaborator and communicator and can mentor other engineers.


Your written communication is concise and clear.


You are able to step up to lead the team when necessary, or dive deep to help with the most challenging technical details.


Additional Information

 


What's Next

If you're interested in joining the team at Etsy, please share your resume with us and feel free to include a cover letter if you'd like. As we hope you've seen already, Etsy is a place that values individuality and variety. We don't want you to be like everyone else -- we want you to be like you! So tell us what you're all about.

 

Our Promise

At Etsy, we believe that a diverse, equitable and inclusive workplace furthers relevance, resilience, and longevity. We encourage people from all backgrounds, ages, abilities, and experiences to apply. Etsy is proud to be an equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or veteran status, or any other characteristic protected by applicable law. If, due to a disability, you need an accommodation during any part of the application or interview process, please let your recruiter know. While Etsy supports visa sponsorship, sponsorship opportunities may be limited to certain roles and skills.


https://careers.etsy.com/jobs/staff-software-engineer-i-service-platform-team-brooklyn-new-york-united-states

Senior Software Engineer I, SRE | Etsy | Dublin, Ireland



Senior Software Engineer I, SRE

 Hybrid 

 Dublin 

 Full-time 

Company Description

Etsy is the global marketplace for unique and creative goods. We build, power, and evolve the tools and technologies that connect millions of entrepreneurs with millions of buyers around the world. As an Etsy Inc. employee, whether a team member of Etsy or Depop, you will tackle unique, meaningful, and large-scale problems alongside passionate coworkers, all the while making a rewarding impact and Keeping Commerce Human.


What’s the role?


Etsy’s Services Infrastructure group is looking for a Senior Site Reliability Engineer I to join us in our mission of building and supporting reliable, large-scale Kubernetes infrastructure. The SRE team owns several aspects of business-critical services (search retrieval and ranking) and Machine Learning Models infrastructure (Kubernetes hosted on Google Cloud) that enable engineers to efficiently build and release, as well as support the uptime of critical systems behind etsy.com. You will play an instrumental role in crafting the future architecture of how we run our systems in the cloud while being part of a dynamic international team.


You’ll get exposure to a variety of technologies, including Kubernetes, Golang, LLMs, Model Serving, Search Retrieval & Ranking, and more, as you build systems to support the services that support our 86M active buyers and 5.5M sellers! As the Senior Software Engineer I, SRE, you will drive the adoption of containers and Kubernetes, improve reliability, automate operations, provide a self-service runtime platform to accelerate Etsy’s product & ML engineering, and contribute to the design and implementation of Observability & CI/CD on top of Kubernetes.


Do you find joy in improving developer velocity and have the itch to work on complex large-scale distributed systems? If so, this could be the perfect match.


This is a full-time position reporting to the Senior Engineering Manager. In addition to salary, you will also be eligible for an equity package, an annual performance bonus, and our competitive benefits that support you and your family as part of your total rewards package at Etsy. 


This role requires your presence in Etsy’s Dublin office once or twice per week depending on your proximity to the office. Candidates living within commutable distance of our Dublin office may be the first to be considered. Learn more details about our work modes and workplace safety policies here.


What’s this team like at Etsy? 


This team improves the developer experience around building, deploying, releasing, and observing services and ML models transparently on Google Kubernetes Engine. They work on 20+ Kubernetes clusters with hundreds of nodes running services with low latency requirements. This team also standardizes cluster and application security with common admission policies and container vulnerability scanning, and establishes standard SLIs/SLOs for all services running on Kubernetes.


This team works closely with many product and enablement teams across Etsy. This team handles:


20+ Kubernetes clusters with hundreds of nodes running services with low latency requirements.


Build and support the CI/CD platform (Buildkite) used by more than a few hundred engineers to deploy their workloads to GKE.


Maintain and upgrade GKE add-ons (CertManager, Gatekeeper), ingress controllers (Contour, Envoy), various telemetry components (kube-prometheus, AlertManager, Karma), and container security tooling.


Here’s a sneak peek into our Roadmap for the next year


Support multiple Search, ML & Gen AI teams to efficiently utilise GPUs across different zones and regions. Evaluate Build vs Buy decisions within LLM space.


Enable service mesh across GKE and enable a native way of accessing services across the stack.


Standardize cluster and application security and container vulnerability scanning (both during build and run time).


Standardize SLI/SLO creation for all services across the Kubernetes Platform.


What does the day-to-day look like?


Administer GKE clusters and automate operations like provisioning and service observability. Support the partner teams running their workloads on the Kubernetes Platform.


Provide guidance and collaborate with multi-functional engineering teams to streamline and improve the adoption of Kubernetes


Build paved paths for wider product engineering with codelabs, documentation, automation and self-service portals to develop, deploy and operate services on GKE.


Participate in an on-call rotation and seek opportunities for reducing toil and avoiding technical debt to reduce support and operations load on the team.


Of course, this is just a sample of the kinds of work this role will require! You should assume that your role will encompass other tasks, too, and that your job duties and responsibilities may change from time to time at Etsy's discretion, or otherwise applicable with local law.


Qualities that will help you thrive in this role are:


You have strong software engineering and coding skills and the ability to write high-performance, production-quality code. You have 6+ years of experience in software engineering, where the last 2 years are in systems/infrastructure engineering, SRE, or DevOps roles, preferably in a cloud environment.


Experience with orchestration systems like Kubernetes (traffic ingresses, cluster networking/administration, pod security policies) is essential.


Experience iterating on multiple projects on a collaborative team, each of which may have taken months or longer to complete.


Hands-on experience with Infrastructure As Code tooling like Terraform and configuration management tooling like Chef/Ansible.


Hands-on debugging experience with Linux-based operating systems.


Willing to work with and improve on code you did not originally write.


Proficiency in one programming language (like Go, Python, PHP) or working knowledge of ML Operations (MLOps) is nice to have.


You understand that being an effective software engineer is as much about communicating with people as it is about writing code.


Additional Information

 


What's Next

If you're interested in joining the team at Etsy, please share your resume with us and feel free to include a cover letter if you'd like. As we hope you've seen already, Etsy is a place that values individuality and variety. We don't want you to be like everyone else -- we want you to be like you! So tell us what you're all about.

 

Our Promise

At Etsy, we believe that a diverse, equitable and inclusive workplace furthers relevance, resilience, and longevity. We encourage people from all backgrounds, ages, abilities, and experiences to apply. Etsy is proud to be an equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or veteran status, or any other characteristic protected by applicable law. If, due to a disability, you need an accommodation during any part of the application or interview process, please let your recruiter know. While Etsy supports visa sponsorship, sponsorship opportunities may be limited to certain roles and skills.


https://careers.etsy.com/jobs/senior-software-engineer-i-sre-dublin-ireland

Tuesday, 6 January 2026

Software Engineer, Developer Relations (EAC) (R26414) | Epic Games | Cary, United States


Department

Engineering


Location

Cary, United States


Product

Unreal Engine


Company

Epic Games


Requisition ID

R26414


WHAT MAKES US EPIC?

At the core of Epic’s success are talented, passionate people. Epic prides itself on creating a collaborative, welcoming, and creative environment. Whether it’s building award-winning games or crafting engine technology that enables others to make visually stunning interactive experiences, we’re always innovating.


Being Epic means being a part of a team that continually strives to do right by our community and users. We’re constantly innovating to raise the bar of engine and game development.


ENGINEERING - UNREAL ENGINE

What We Do

Unreal-powered projects have been on the bleeding edge of real-time entertainment for over 20 years. Our team of engineering experts are always innovating to improve the tools and technology that empower content developers worldwide.


What You'll Do

We are looking for an experienced Developer Relations Engineer to join our team and support EOS Anti-Cheat (also known as "Easy Anti-Cheat"). You will serve as a crucial technical liaison between our internal engineering teams and external partners, assisting them in integrating, debugging, and optimizing Anti-Cheat in their projects. Your role involves deep technical troubleshooting of issues, analyzing crash dumps, debugging low-level C/C++ code, and providing effective solutions and technical insights. You will help guide design decisions for Anti-Cheat, contributing to technical documentation and maintaining active communication internally and externally. The ideal candidate is proactive, detail-oriented, tactful, and empathetic, with strong problem-solving skills and the ability to communicate complex technical concepts clearly to stakeholders of varying expertise. You should be comfortable working independently and collaboratively, with excellent time management and multitasking capabilities.


In this role, you will

Troubleshoot complex integration and operational issues involving Anti-Cheat, analyzing crash dumps, logs, and call stacks to identify root causes

Collaborate directly with external game developers and internal teams to resolve technical issues promptly and effectively

Debug and reproduce customer issues, clearly documenting and communicating findings internally and externally

Represent Epic Games through asynchronous and live support, presence at trade shows such as UEFest, and customer visits

Develop and maintain clear, comprehensive technical documentation, tutorials, and guides to support partner integration

Advocate for partners' successful integration and continued use of Anti-Cheat and related Epic technologies, and influence product improvements through customer insights

Research and identify opportunities to enhance Anti-Cheat technologies and developer experience

What we're looking for

Highly proficient in C and C++, particularly low-level or kernel-level debugging and development

Strong ability to analyze crash dumps and debug complex, obfuscated code at the assembly level

Familiarity with cross-platform development (Windows, Linux, macOS), understanding differences and limitations across these platforms

Exceptional problem-solving abilities, proactively tackling issues independently

Excellent verbal and written communication skills to effectively collaborate with internal teams and external partners

Ability to manage multiple tasks simultaneously, work well under pressure, and prioritize to meet SLA targets

Prior experience with SDK/API integration and understanding of software engineering principles, including legacy support

Understanding of online multiplayer video game architectures and associated security concerns

EPIC JOB + EPIC BENEFITS = EPIC LIFE

Our intent is to cover all things that are medically necessary and improve the quality of life. We pay 100% of the premiums for both you and your dependents. Our coverage includes Medical, Dental, a Vision HRA, Long Term Disability, Life Insurance & a 401k with competitive match. We also offer a robust mental well-being program through Modern Health, which provides free therapy and coaching for employees & dependents. Throughout the year we celebrate our employees with events and company-wide paid breaks. We offer unlimited PTO and sick time and recognize individuals for 7 years of employment with a paid sabbatical.


ABOUT US

Epic Games spans across 25 countries with 46 studios and 4,500+ employees globally. For over 25 years, we've been making award-winning games and engine technology that empowers others to make visually stunning games and 3D content that bring environments to life like never before. Epic's award-winning Unreal Engine technology not only provides game developers the ability to build high-fidelity, interactive experiences for PC, console, mobile, and VR, it is also a tool being embraced by content creators across a variety of industries such as media and entertainment, automotive, and architectural design. As we continue to build our Engine technology and develop remarkable games, we strive to build teams of world-class talent.


Like what you hear? Come be a part of something Epic!

Epic Games deeply values diverse teams and an inclusive work culture, and we are proud to be an Equal Opportunity employer. Learn more about our Equal Employment Opportunity (EEO) Policy here.


Note to Recruitment Agencies: Epic does not accept any unsolicited resumes or approaches from any unauthorized third party (including recruitment or placement agencies) (i.e., a third party with whom we do not have a negotiated and validly executed agreement). We will not pay any fees to any unauthorized third party. Further details on these matters can be found here.


https://www.epicgames.com/site/en-US/careers/jobs/5472586004

Senior Software Engineer - Devop & Perforce (Game Creation) (R27171) | Epic Games | Austin, United States


Department

Engineering


Location

Austin, United States


Product

Corporate


Company

Epic Games


Requisition ID

R27171


WHAT MAKES US EPIC?

At the core of Epic’s success are talented, passionate people. Epic prides itself on creating a collaborative, welcoming, and creative environment. Whether it’s building award-winning games or crafting engine technology that enables others to make visually stunning interactive experiences, we’re always innovating.


Being Epic means being a part of a team that continually strives to do right by our community and users. We’re constantly innovating to raise the bar of engine and game development.


ONLINE INFRASTRUCTURE

What We Do

We enable Epic’s online services teams to build, deploy, and manage services that are used by more than half a billion players around the world. Our mission is to provide world class tools and platforms to improve the experience of our developers and make it easier, faster, and safer to build, operate, and scale their applications. We operate at massive scale as one of the largest cloud computing users in the world.


What You'll Do

You’ll work on one of the largest and most demanding Perforce Helix Core deployments in the industry, supporting thousands of users and millions of files at scale. You’ll tackle complex engineering challenges that come with operating source control at this level, while partnering closely with teams across Epic Games and with external studios and partners around the world. As one of Perforce’s largest customers, Epic collaborates directly with the Perforce engineering team, giving you a unique opportunity to influence their product roadmap and help shape the future of source control tooling used across the industry.


In this role, you will

Architect and administer large, high-performance Perforce Helix deployments and other SCM tools at a global scale, with thousands of consumers worldwide

Perform deep troubleshooting of Perforce Helix server performance and database contention issues

Implement and maintain automation tools, user facing tools, documentation, and workflows to assist with system management

Manage and support permissions and identity integrations

Assist in disaster recovery and business continuity initiatives to ensure a protected and highly available implementation

Devise, test, and deploy integrations between Perforce products and other internal systems

Promote adoption and best practices among the system user community and conduct user training and help sessions for new features or implementations

What we're looking for

Experience deploying and maintaining environments with infrastructure as code approaches and tools (SCM/Git, Packer, Terraform, Ansible, Chef, or Salt, and leveraging CI/CD systems to get work done)

Experience with cloud providers like AWS, Azure, GCP (Google Cloud Platform)

Understanding of Linux operating systems, OS performance tuning, troubleshooting, patching and patch management best practices

Expertise writing tools, scripting and automation in Bash/UNIX shell, Python

Excellent communicator with the ability to convey complex ideas clearly and collaborate effectively across teams

Strong learning skills, able to quickly absorb new concepts, tools, and processes in a fast-paced environment

Experience with Perforce Helix, Git, and their related server and client components at large scale, and understanding of version control workflows is a big plus

Experience with other version control systems such as Git, and related code collaboration tools such as GitHub and GitLab would be a plus

Highly organized, with the ability to manage priorities, track details, and deliver work reliably on time would be a plus

EPIC JOB + EPIC BENEFITS = EPIC LIFE

Our intent is to cover all things that are medically necessary and improve the quality of life. We pay 100% of the premiums for both you and your dependents. Our coverage includes Medical, Dental, a Vision HRA, Long Term Disability, Life Insurance & a 401k with competitive match. We also offer a robust mental well-being program through Modern Health, which provides free therapy and coaching for employees & dependents. Throughout the year we celebrate our employees with events and company-wide paid breaks. We offer unlimited PTO and sick time and recognize individuals for 7 years of employment with a paid sabbatical.


ABOUT US

Epic Games spans across 25 countries with 46 studios and 4,500+ employees globally. For over 25 years, we've been making award-winning games and engine technology that empowers others to make visually stunning games and 3D content that bring environments to life like never before. Epic's award-winning Unreal Engine technology not only provides game developers the ability to build high-fidelity, interactive experiences for PC, console, mobile, and VR, it is also a tool being embraced by content creators across a variety of industries such as media and entertainment, automotive, and architectural design. As we continue to build our Engine technology and develop remarkable games, we strive to build teams of world-class talent.


Like what you hear? Come be a part of something Epic!

Epic Games deeply values diverse teams and an inclusive work culture, and we are proud to be an Equal Opportunity employer. Learn more about our Equal Employment Opportunity (EEO) Policy here.


Note to Recruitment Agencies: Epic does not accept any unsolicited resumes or approaches from any unauthorized third party (including recruitment or placement agencies) (i.e., a third party with whom we do not have a negotiated and validly executed agreement). We will not pay any fees to any unauthorized third party. Further details on these matters can be found here.



https://www.epicgames.com/site/en-US/careers/jobs/5739578004