Site Reliability Engineer, Systems
By Anthropic At , San Francisco, Ca
Automate operations and infrastructure management
Have significant experience with Kubernetes and cloud-native infrastructure
Have strong communication skills to work with a range of technical and non-technical colleagues
Python and Linux SysAdmin skills
Significant experience with Kubernetes architecture and administration
Strong Linux skills and cloud infrastructure expertise
Site Reliability Engineer Jobs
By Lawrence Berkeley National Laboratory At , San Francisco Bay Area, Ca $9,739 - $11,905 a month
Minimum of three years of experience in UNIX or Linux, Networking, IT infrastructure environment and management experience in a distributed-computing environment.
Knowledge of the processes for standard operating procedures, and best practices for implementation and change management.
Past experience with Incident Management and a good understanding of IT service management.
Experience with network security: configuring/maintaining ACLs, knowledge of firewalls
Bachelor’s Degree in a Computer Science or similar discipline or equivalent years of experience.
Strong hands-on knowledge of the Linux shell and working in a command-line (e.g. SSH) environment.
Site Reliability Engineer, Product - Usds
By TikTok At , Los Angeles $119,000 - $289,000 a year
Gain a solid understanding of the various components and services that power the TikTok experience
Maintain services to meet service-level-agreements (SLAs) and service-level-objectives (SLOs) by measuring and monitoring availability, performance, and overall system health
Scale systems sustainability through mechanisms such as automation; evolve systems reliability, efficiency, and velocity by pushing for changes
Provide user support, incident responses and postmortems
In this role, you will:
Our time off and leave plans are:
Site Reliability Engineer (L4/5) - Core
By Netflix At , Los Gatos, Ca
Experience in risk management and/or analysis
Improve our incident management lifecycle to identify, mitigate, and learn from reliability risks
Read signals and metrics to develop deeper insights into our customers’ quality of experience to help inform business decisions
Strong writing and presentation skills
Development experience with Java, JavaScript/Node.js, Python, Go
Knowledge of cloud platforms (i.e. AWS, GCP, etc.) and microservices architecture
Site Reliability Engineer Jobs
By Sohum Inc At San Francisco Bay Area, United States
Full time opportunity that offers excellent benefits.
• Configuration Management and IAC - Salt, Pulumi (Terraform will work)
• Bachelor’s degree in CS / other highly technical discipline, or equivalent experience
• 5+ years of experience and 3+ years experience as Site Reliability Engg
• Strong networking and firewall knowledge
• Exceptional problem solving and troubleshooting skills
Site Reliability Engineer Jobs
By WalkWater Technologies At Cupertino, CA, United States
Experience with SSL/mTLS and certificate management
Hands-on experience with cloud orchestration platforms such as Kubernetes or Nomad
Setting up CD/CD pipelines using GitHub hooks, TeamCity, Docker, and Artifactory
Familiarity with load balancers, traffic-envoys, and proxies
Familiarity with Java runtime / JVM
Familiarity with observability systems such as Prometheus or Open Metrics
Staff Site Reliability Engineer
By Netskope At , Santa Clara, Ca
You will be part of a high caliber engineering team in the exciting space of cloud tools and infrastructure management.
Drive efficiencies in systems and processes: capacity planning, configuration management, performance tuning, monitoring and root cause analysis.
You will solve complex, exciting challenges and improve the depth and breadth of your technical and analytical skills
Partner closely with our development teams and product managers to architect and build features that are highly available, performant and secure
Gain deep knowledge of our application stack
Experience improving the performance of micro-services and solve scaling/performance issues
Principal Site Reliability Engineer
By Oracle At , Redwood City, 94065, Ca
Develop and implement various database life-cycle management flows.
Certification of Database products for cloud integration
Participate in Product Feature Review, Certification experiments and User Document reviews.
Research and acquire skills on new technologies as needed from time to time
6-14 years of Oracle database administration experience on large production environments
Database hands on skills especially around database and system troubleshooting and administration
Sr. Site Reliability Engineer
By rockset At , San Mateo, Ca $140,000 - $185,000 a year
Experience with Terraform, Salt, Chef, Packer, or similar configuration management tools
Willing to learn new skills and technologies
Bachelor's or Master's degree in Computer Science or a related field, or relevant work experience
Experience as an SRE for 3+ years
Experience building and operating public-facing 24x7 web applications at scale
Experience working with cloud infrastructure and patterns (AWS preferred)
Staff Site Reliability Engineer
By Collective Health At , San Mateo, 94401, Ca $140,000 - $210,000 a year
Expertise in management and use of relational databases including.
10+ years of work experience in DevOps, Site Reliability Engineering, or Software Engineering.
Experience creating and monitoring SLIs and SLOs in order to set and remain within error budgets.
Experience in supporting customer-facing production systems and responding to incidents as part of an oncall rotation.
Knowledge of data structures, algorithms, distributed systems, and information retrieval.
Experience in solving diagnosing and resolving incidents that involve application, OS, network, infrastructure, partners, people, and process.
Aws Site Reliability Engineer
By Zeektek At United States
Help set up and manage our AWS EKS environment.
Help set up and manage our GitLab CI/CD pipeline.
Can engage and manage the heterogenous CI/CD and deployment environments of the teams we collaborate with
Site Reliability Engineer, DevOps manager
1.5+ years experience in SRE/DevOps or equivalent role
Work with other teams to assist in deploying our microservices and code into their environments (on prem and AWS)
Staff Site Reliability Engineer, Multi-Cloud
By Okta At ,
Extensive experience with configuration management tools like Chef, Ansible, or Puppet and infrastructure-as-code tools such as Terraform
Experience with multi-cloud infrastructure is desired
Proficiency in distributed systems design, with a comprehensive understanding of failure modes, benefits, and potential drawbacks
In-depth knowledge of various types of data stores, including both SQL and NoSQL
Core contributor driving Okta’s multi-cloud initiatives
Design, build, and operate Okta's global production infrastructure
Site Reliability Engineer Jobs
By Adobe At , Lehi, 84043 $92,100 - $161,000 a year

What you need to succeed:

An understanding of SRE standard methodologies:

Site Reliability Engineer (Sre) - Evening Shift
By Brightspot At , Chicago $100,000 - $115,000 a year
Automate manual tasks and build tools for system monitoring, deployment, and configuration management.
2+ years of relevant experience in Cloud Operations
Proven troubleshooting and problem-solving skills in a cloud-based application environment
Outstanding communication skills with the ability to work in a client-facing role
Monitor the availability, performance, and reliability of our systems and applications during the evening shift.
Investigate and resolve incidents, troubleshooting any issues that arise and ensuring prompt resolution to minimize downtime.
Site Reliability Engineer Jobs
By Fisker Inc At , Manhattan Beach $60,900 - $169,650 a year
Experience with artifact management (Artifactory, Nexus)
Experience with strict security requirements and implementation
Design, provision, deploy, and manage Kubernetes clusters and resources
Bachelor’s degree in computer science or related technical field or equivalent experience
5+ years of SRE / DevOps Engineer experience
Experience with cloud infrastructure (AWS, GCP, Azure)
Site Reliability Engineer Jobs
By Zscaler At , San Jose
Strong Centos/UNIX skills, FreeBSD specific experience is a plus.
5 -7 years experience in a SaaS/ Cloud/Distributed environment growing at a rapid scale.
Minimum 3+ years of scripting experience in Python is required.
Hands-on experience with infrastructure as code and automation tools (Ansible, Chef, Puppet, Terraform).
Basic Networking skills (TCP/IP, DNS, LACP, CARP) for testing and troubleshooting are required.
Competitive salary and benefits, including equity
Site Reliability Engineer (Sre)
By Agama Solutions At , San Jose
5+ years of US experience as in a SRE role
Good communication (and listening) skills.
Some experience administering Linux “web” servers, at scale.
Working knowledge of DNS, HTTP, TLS, web security.
Experience with networking troubleshooting using tools such as TCP Dump.
Well versed in *nix Operating Systems (we use CentOS and Ubuntu LTS).
Site Reliability Engineer Jobs
By Ascendion At , Alpharetta
Knowledge of the cloud and managed services such as MS Flex Server or AWS RDS.
Strong experience as a database administrator.
Strong experience in PostgreSQL and/or MySQL.
Automation skill in Bash, Golang, Python a plus.
Knowledge of IaC and CI/CD tools such as Terraform and GitHub Actions a plus.
Experience in query optimization and performance improvement.
Cdn Site Reliability Engineer (L5) - Open Connect
By Netflix At , Remote
Knowledge of and proven experience with CDNs and HTTP cache/proxy technologies
Service Reliability/Operational experience running large scale, high performance systems & internet services with focus on security and reliability
Expert-level knowledge of Unix or Linux system administration at scale. We happen to use FreeBSD
Knowledge of networking concepts and application protocols, especially TCP/IP, BGP, HTTP/S and DNS
Experience with distributed analytic processing technologies (Hive, Presto/Trino, Spark SQL, etc)
Some experience with container and container orchestration technologies (Docker, Kubernetes)
Sr. Site Reliability Engineer
By eHealth At , Remote $113,500 - $141,900 a year
A security certification and/or knowledge of DevSecOps would be a plus
5+ years of experience as System engineer or SRE engineer (DevOps culture)
Strong Linux skills and excellent skills in one major programming language (Python, Java would be great.)
Hands-on experience implementing and maintaining Container stack with all the security and compliance consideration.
Experience managing Hybrid infrastructure and configuration using tools like Terraform, Ansible and Puppet.
Understanding of CI/CD and experience with Jenkins, Pipeline as code
Site Reliability Engineer Jobs
By eBay At , San Jose, 95125, Ca $168,400 - $262,900 a year
Develop automation systems for implementing eBay Traffic management
Manage eBay’s traffic infrastructure including SLB, CDN, etc.
Solid programming experience in languages like Golang, Java, C/C++
Experience with Kubernetes, docker is a must
Experience working with public cloud is a plus
Experience in software load balancer(IPVS, Envoy, Istio, Cilium etc) is a plus
Site Reliability Engineer, Netflix Technology
By Netflix At , Remote
Experience with incident management and response
Improve our incident management lifecycle to identify, mitigate, and learn from reliability risks
Reads signals in aggregate to develop deeper insights into the quality of experience for our users to help inform business decisions
Experience with complex sociotechnical systems and their successful operations at scale
Experience conducting blame-aware incident reviews
Strong analytical and problem-solving skills
Sr. Site Reliability Engineer
By CCC At , Chicago, Il
Experience preparing and presenting operational artifacts to senior management
Gain and disseminate knowledge of our complex applications
2+ years experience working with the Azure tech stack in a production capacity
5+ years operational experience working with Microsoft technologies
Comfort and experience with Ops environment growing at a rapid scale.
Knowledge of Virtualization, Cloud Infrastructure and APIs
Site Reliability Engineer Jobs
By Nike At Beaverton, OR, United States

This overview explains our hiring process for corporate roles. Note there may be different hiring steps involved for non-corporate roles

Site Reliability Engineer (Sre) - $700,000
By Thurn Partners At New York, NY, United States
3+ years' experience in a similar software engineering or site reliability engineering position
Experience with SQL database operations
Experience with Kafka, CICD pipelines and virtualisation a bonus
Beautiful office space with generous overall benefits package
Proficiency with either Python or Golang
Extremely competitive compensation including performance bonuses
Site Reliability Engineer Jobs
By Spotify At Greater Chicago Area, United States
• 4+ years of IT experience needed
• Experience working in a Linux environment
• Good knowledge of Unix
• Basic experience in writing SQL queries
• Good verbal communicative skills
• Ability to manage priorities and deadlines
Site Reliability Engineer (.Net Engineer)
By Suzy At United States
Exposure to a Configuration Management System (Puppet, Chef, Salt, etc)
Optimize: Observe and improve performance, reduce cost, and improve the experience for millions of users
3+ years of experience in Software Engineering, Site Reliability Engineering, or a Development focused DevOps role.
Experience with Kubernetes and Cloud systems
Experience with the development and operation of high-traffic backend systems
Troubleshooting skills that span applications, networking (TCP/IP), and systems
Site Reliability Engineer - Usds
By TikTok At Seattle, WA, United States

Responsibilities TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. TikTok has global offices including Los Angeles, New York, ...