Sr. Site Reliability Engineer - Remote Us
By SitusAMC At , Remote $100,000 - $125,000 a year

SitusAMC is where the best and most passionate people come to transform our client’s businesses and their own careers. Whether you’re a real estate veteran, a passionate technologist, or looking to ...

Site Reliability Engineer Ii - Remote
By Akamai At , Remote $93,656 - $140,803 a year
Defining requirements as part of the product lifecycle to influence the new designs and standards
Have 2 years of relevant experience and a Bachelors degree or its equivalent
Have proven experience as a systems performance/site reliability or DevOps engineer
Have experience of working with NoSQL databases, such as Cassandra or Redis
Have experience with orchestration tools e.g. Chef and/or Ansible
Join our highly skilled Security team
Lead Sre (Site Reliability Engineer)
By Concentrix At , Remote
Team lead experience with offshore resources
Expected experience even if not deep in these areas:
Nice to have experience (not required):
Ability to create structure and process for a greenfield dev team
React.js & responsive web app dev
- DevOps & CI/CD - specific tooling is related to a Full stack Java and automation
Cdn Site Reliability Engineer (L5) - Open Connect
By Netflix At , Remote
Knowledge of and proven experience with CDNs and HTTP cache/proxy technologies
Service Reliability/Operational experience running large scale, high performance systems & internet services with focus on security and reliability
Expert-level knowledge of Unix or Linux system administration at scale. We happen to use FreeBSD
Knowledge of networking concepts and application protocols, especially TCP/IP, BGP, HTTP/S and DNS
Experience with distributed analytic processing technologies (Hive, Presto/Trino, Spark SQL, etc)
Some experience with container and container orchestration technologies (Docker, Kubernetes)
Sr. Site Reliability Engineer
By eHealth At , Remote $113,500 - $141,900 a year
A security certification and/or knowledge of DevSecOps would be a plus
5+ years of experience as System engineer or SRE engineer (DevOps culture)
Strong Linux skills and excellent skills in one major programming language (Python, Java would be great.)
Hands-on experience implementing and maintaining Container stack with all the security and compliance consideration.
Experience managing Hybrid infrastructure and configuration using tools like Terraform, Ansible and Puppet.
Understanding of CI/CD and experience with Jenkins, Pipeline as code
Site Reliability Engineer (Sre) - Mid/Senior
By Vanilla Technologies Inc. At , Remote
Project management tools such as Jira, Git, and Confluence
Accounting for and addressing software vulnerabilities
Securing infrastructure, applications, and code
Ensuring high SLA for uptime & security
Quick, continuous automation and deployment of updates
Preserving infrastructure and stability of code
Site Reliability Engineer, Netflix Technology
By Netflix At , Remote
Experience with incident management and response
Improve our incident management lifecycle to identify, mitigate, and learn from reliability risks
Reads signals in aggregate to develop deeper insights into the quality of experience for our users to help inform business decisions
Experience with complex sociotechnical systems and their successful operations at scale
Experience conducting blame-aware incident reviews
Strong analytical and problem-solving skills
Lead Site Reliability Engineer (Remote)
By IQVIA At , Remote
Bachelor’s Degree in Computer Science, Software Engineering, or equivalent professional experience
Significant (7+ years) experience building, managing, and supporting cloud-based IT infrastructure (IaC)
Thorough knowledge of Unix and/or Linux fundamentals and system administration
Experience with infrastructure-as-code (IaC) tools or technologies (notably Terraform)
Solid foundational knowledge of TCP/IP networking
Knowledge of source control systems and workflow (notably git)
Site Reliability Engineer (Sre)
By Luxoft At , Remote
5+ years of experience with administrating Linux and at least 2 years in supporting production environments;
Fluent developer skills in any popular programming language (C++ / Python / Java / Go. Java is preferred);
Experience with designing large-scale distributed solutions accompanied with it's capacity planning;
Experience with monitoring and alerting tools like Grafana, Datadog, Prometheus etc;
Strong knowledge of virtualization and containerization principles including orchestration tools;
Experience with relational and NoSQL DBMS
Backend Engineer (Site-Reliability) Jobs
By Terraform Labs At , Remote
In-depth knowledge of database management systems, including relational databases (e.g., MySQL, PostgreSQL) and NoSQL databases (e.g., Cassandra, MongoDB, Redis).
Collaborate with cross-functional teams to understand requirements and translate them into technical designs and implementation plans.
An interest in DeFi, or background in finance / Fintech
3+ years of professional work experience
Proven experience as a Backend Software Engineer, with a focus on site reliability and DevOps.
Experience with containerization and orchestration technologies such as Docker and Kubernetes.
Site Reliability Engineer Ii
By Exact Sciences Corporation At , Remote $82,000 - $130,000 a year
Support and comply with the company’s Quality Management System policies and procedures.
3+ years of experience in systems engineering
3+ years of work and/or formal classroom experience with modern application design and cloud environments
3+ years of work and/or formal classroom experience working with software development and operations teams
1+ years of experience developing highly available systems architecture using modern technologies.
AWS Solutions Architect, AWS SysOps Administrator, or AWS Developer certification.
Site Reliability Engineer - Kubernetes
By Avantage Entertainment At , Remote $115,000 - $130,000 a year
Strong detail orientation, time management skills, dependability, and flexibility required (our team spans at least 12 time zones).
Support our DevOps team with management of application deployments using GitOps tooling in the Kubernetes environment.
Proactively researches new capabilities and trends and reports findings to senior leadership.
Bachelor's degree in computer science or equivalent occupational experience.
Experience in an AWS or other cloud environment.
In-depth experience in Kubernetes (Red Hat OpenShift preferred).
Site Reliability Engineer (Sre) - Evening Shift
By Brightspot At , Chicago $100,000 - $115,000 a year
Automate manual tasks and build tools for system monitoring, deployment, and configuration management.
2+ years of relevant experience in Cloud Operations
Proven troubleshooting and problem-solving skills in a cloud-based application environment
Outstanding communication skills with the ability to work in a client-facing role
Monitor the availability, performance, and reliability of our systems and applications during the evening shift.
Investigate and resolve incidents, troubleshooting any issues that arise and ensuring prompt resolution to minimize downtime.
Site Operations Manager Jobs
By Hewlett Packard Enterprise At , Dallas, 75202 $95,100 - $218,700 a year
Implement and maintain all security and data management protocols as defined by the security architects and system admins
Applies advanced subject matter knowledge to manage staff activities in solving common and complex business/technical issues within established policies
5+ years of team supervision or management
BS in Computer Science, IT Management, or equivalent
Primary regional point of contact and manager for our Supercomputing-as-a-Service system
Manage system startup and shutdown, including hardware replacement as necessary
Senior Site Reliability Engineer
By Adyen At , Chicago
Have a good understanding of Infrastructure as Code and experience with configuration management and automation tools such as Puppet and Ansible;
Strong familiarity with SRE practices and methodologies such as defining SLOs, change management processes and incident response;
Together with the team lead the way in continuously improving our incident management and on-call processes
Have experience with building, operating and troubleshooting large-scale distributed systems spanning multiple data centers across the globe;
Skilled in one or more programming or scripting languages such as Python, Java or bash;
We use SLOs to drive platform stability and innovation
System Reliability Operations Engineer
By Disney At , Lake Buena Vista
2+ years incident recovery with demonstrated experience with Service and Event Management tools
Experience in enterprise IT operations including system administration, application platforms, infrastructure, networking fundamentals, and IT service management
2+ years experience supporting converged infrastructure stacks including application, compute, storage, and networking
Experience within network technologies (WAN/LAN, wireless infrastructure, DNS/DHCP, Load-Balancers, Accelerators)
Demonstrated experience in systems integration, application infrastructure support, and middleware operations.
Experience with hands-on support of cloud operations (AWS, Google Cloud, Azure)
Site Reliability Engineer - Entry Level (Technology Rotational Development Program)
By Equifax At Alpharetta, GA, United States
Bachelor’s Degree in Computer Science, Information Technology, Project Management, or equivalent field; Completion of coursework by May 2024.
Ability to gain experience by cross-training in the various areas within the Technology organization and other key related functions.
Excellent leadership, teamwork and service skills.
Excellent oral and written communication skills.
Experienced working with and developing with Java
Exposure/knowledge of cloud technologies (Google Cloud Platform (GCP), Amazon Web Services (AWS), or Azure)
Aws Site Reliability Engineer
By Zeektek At United States
Help set up and manage our AWS EKS environment.
Help set up and manage our GitLab CI/CD pipeline.
Can engage and manage the heterogenous CI/CD and deployment environments of the teams we collaborate with
Site Reliability Engineer, DevOps manager
1.5+ years experience in SRE/DevOps or equivalent role
Work with other teams to assist in deploying our microservices and code into their environments (on prem and AWS)
Saas Site Reliability Engineer And Automation Developer
By Siemens Digital Industries Software At , Costa Mesa, 92627 $116,900 - $210,400 a year
Develop and maintain automation tools, scripts, and frameworks to streamline deployment, configuration management, and monitoring processes.
Design and implement infrastructure solutions using configuration management tools, such as Ansible, Puppet, or Chef.
Proficiency in automation and configuration management tools (e.g., Ansible, Puppet, Chef).
In-depth knowledge and hands-on experience with cloud platforms such as AWS, Azure, or Google Cloud and their scalability features.
Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent work experience).
Strong programming skills in languages such as Python, Go, or Ruby.
Staff Site Reliability Engineer, Multi-Cloud
By Okta At ,
Extensive experience with configuration management tools like Chef, Ansible, or Puppet and infrastructure-as-code tools such as Terraform
Experience with multi-cloud infrastructure is desired
Proficiency in distributed systems design, with a comprehensive understanding of failure modes, benefits, and potential drawbacks
In-depth knowledge of various types of data stores, including both SQL and NoSQL
Core contributor driving Okta’s multi-cloud initiatives
Design, build, and operate Okta's global production infrastructure
Senior Engineer Ii - Digital Site Reliability
By Lululemon At , Seattle $132,300 - $173,500 a year
Contribute to engineering automation, management or development of pre-prod and production systems
Mentor and guide junior team members, sharing knowledge and expertise to foster a culture of learning and continuous improvement.
Eight+ years of engineering experience
Five+ years experience with CI/CD tools, GitLab preferred
Proficiency in at least one programming language (e.g., Python, Go, Java) and experience with scripting and automation.
Acknowledge the presence of choice in every moment and take personal responsibility for your life.
Senior Site Reliability Engineer, Trello
By Atlassian At , San Francisco
3+ years of hands-on experience with public cloud offerings such as AWS,GCP or Azure
Familiarity with Incident management, post-incident analysis and participation in on-call rotation
3+ years experience operating high-availability, fault-tolerant, scalable, distributed software in production: building monitoring, tweaking dashboards, defining alerts, writing runbooks, etc.
Engineering microservices and tools across one or more programming languages (e.g. Go, Python,Bash)
Automation and Infrastructure-as-Code projects and tooling (e.g. Ansible, Puppet, Terraform)
Build and maintain a continuous integration and delivery pipeline (e.g. Bamboo, Bitbucket Pipelines, Github Actions)
Site Reliability Engineer Jobs
By Adobe At , Lehi, 84043 $92,100 - $161,000 a year

What you need to succeed:

An understanding of SRE standard methodologies:

Infrastructure Site Reliability Engineer
By CVS Health At , Hartford $75,400 - $162,700 a year
A year or more experience with incident management, performance monitoring, and capacity planning tools.
Multiple years’ demonstrated proficiency in at least one configuration management tool such as Ansible, Puppet, or Chef.
Minimum of 5 years of experience in Infrastructure Engineering, System Administration, or related roles.
Multiple years’ experience with cloud platforms (e.g., Amazon Web Services, Microsoft Azure) and infrastructure-as-code tools (e.g., Terraform, CloudFormation).
Multiple years’ experience with containerization technologies such as Docker and container orchestration platforms like Kubernetes.
Multiple years’ demonstrated knowledge of networking principles and protocols, including TCP/IP, DNS, load balancing, and firewalls.