Site Reliability Engineer Jobs
By Ascendion At , Alpharetta
Knowledge of the cloud and managed services such as MS Flex Server or AWS RDS.
Strong experience as a database administrator.
Strong experience in PostgreSQL and/or MySQL.
Automation skill in Bash, Golang, Python a plus.
Knowledge of IaC and CI/CD tools such as Terraform and GitHub Actions a plus.
Experience in query optimization and performance improvement.
Sr. Software Engineer- Site Reliability (Remote)
By Home Depot / THD At , Atlanta, 30301 $160,000 a year
Knowledge of configuration management tools (e.g., Ansible, Puppet, or Chef)
This position typically reports to Software Engineer Manager or Sr. Manager
2-4 years of relevant work experience
Experience with cloud platforms (e.g., AWS, Azure, or GCP)
Experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack)
Knowledge of version control systems (e.g., Git)
Site Reliability Engineer - Remote
By Sheetz At , Claysburg, 16625
(Equivalent combinations of education, licenses, certifications and/or experience may be considered)
Responsible for the availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning for their assigned system(s).
A four year degree in Computer Science, Management Information Systems, Computer Engineering is preferred.
6 years of applicable experience in a technology environment, preferably with time spent in an engineering capacity, is required.
Coding experience beyond simple scripts is required.
A four year degree which includes courses or training in computer programming, systems analysis, system development, or systems engineering, is required.
Site Reliability Engineer Ii - Remote
By Akamai At , Remote $93,656 - $140,803 a year
Defining requirements as part of the product lifecycle to influence the new designs and standards
Have 2 years of relevant experience and a Bachelors degree or its equivalent
Have proven experience as a systems performance/site reliability or DevOps engineer
Have experience of working with NoSQL databases, such as Cassandra or Redis
Have experience with orchestration tools e.g. Chef and/or Ansible
Join our highly skilled Security team
Lead Sre (Site Reliability Engineer)
By Concentrix At , Remote
Team lead experience with offshore resources
Expected experience even if not deep in these areas:
Nice to have experience (not required):
Ability to create structure and process for a greenfield dev team
React.js & responsive web app dev
- DevOps & CI/CD - specific tooling is related to a Full stack Java and automation
Cdn Site Reliability Engineer (L5) - Open Connect
By Netflix At , Remote
Knowledge of and proven experience with CDNs and HTTP cache/proxy technologies
Service Reliability/Operational experience running large scale, high performance systems & internet services with focus on security and reliability
Expert-level knowledge of Unix or Linux system administration at scale. We happen to use FreeBSD
Knowledge of networking concepts and application protocols, especially TCP/IP, BGP, HTTP/S and DNS
Experience with distributed analytic processing technologies (Hive, Presto/Trino, Spark SQL, etc)
Some experience with container and container orchestration technologies (Docker, Kubernetes)
Sr. Site Reliability Engineer
By eHealth At , Remote $113,500 - $141,900 a year
A security certification and/or knowledge of DevSecOps would be a plus
5+ years of experience as System engineer or SRE engineer (DevOps culture)
Strong Linux skills and excellent skills in one major programming language (Python, Java would be great.)
Hands-on experience implementing and maintaining Container stack with all the security and compliance consideration.
Experience managing Hybrid infrastructure and configuration using tools like Terraform, Ansible and Puppet.
Understanding of CI/CD and experience with Jenkins, Pipeline as code
Site Reliability Engineer Jobs
By eBay At , San Jose, 95125, Ca $168,400 - $262,900 a year
Develop automation systems for implementing eBay Traffic management
Manage eBay’s traffic infrastructure including SLB, CDN, etc.
Solid programming experience in languages like Golang, Java, C/C++
Experience with Kubernetes, docker is a must
Experience working with public cloud is a plus
Experience in software load balancer(IPVS, Envoy, Istio, Cilium etc) is a plus
Site Reliability Engineer - Remote
By Regal Rexnord At , Morehead, 40351, Ky
Experience and understanding in DevOps, Cloud Resiliency, Performance Engineering, Release Engineering, Application Performance Management and Capacity Planning, Caching, JavaScripts and .Net
Responsible for Application Performance Monitoring tool administration and management by monitoring availability and taking a holistic view of system health
Reduce organizational ‘toil’ via automation, scripting, and implementation and management of toolsets
Understand business / technical requirements and the overall business objectives of applications
2+ years of experience in software application development or test automation
5+ years of Performance Engineer or related experience with high-traffic, large-scale distributed systems, client-server architectures both on-prem and cloud (Primarily Azure)
Site Reliability Engineer (Sre) - Mid/Senior
By Vanilla Technologies Inc. At , Remote
Project management tools such as Jira, Git, and Confluence
Accounting for and addressing software vulnerabilities
Securing infrastructure, applications, and code
Ensuring high SLA for uptime & security
Quick, continuous automation and deployment of updates
Preserving infrastructure and stability of code
Site Reliability Engineer, Netflix Technology
By Netflix At , Remote
Experience with incident management and response
Improve our incident management lifecycle to identify, mitigate, and learn from reliability risks
Reads signals in aggregate to develop deeper insights into the quality of experience for our users to help inform business decisions
Experience with complex sociotechnical systems and their successful operations at scale
Experience conducting blame-aware incident reviews
Strong analytical and problem-solving skills
Site Reliability Engineer, Systems
By Anthropic At , San Francisco, Ca
Automate operations and infrastructure management
Have significant experience with Kubernetes and cloud-native infrastructure
Have strong communication skills to work with a range of technical and non-technical colleagues
Python and Linux SysAdmin skills
Significant experience with Kubernetes architecture and administration
Strong Linux skills and cloud infrastructure expertise
Site Reliability Engineer (L4/5) - Core
By Netflix At , Los Gatos, Ca
Experience in risk management and/or analysis
Improve our incident management lifecycle to identify, mitigate, and learn from reliability risks
Read signals and metrics to develop deeper insights into our customers’ quality of experience to help inform business decisions
Strong writing and presentation skills
Development experience with Java, JavaScript/Node.js, Python, Go
Knowledge of cloud platforms (i.e. AWS, GCP, etc.) and microservices architecture
Sr. Site Reliability Engineer
By CCC At , Chicago, Il
Experience preparing and presenting operational artifacts to senior management
Gain and disseminate knowledge of our complex applications
2+ years experience working with the Azure tech stack in a production capacity
5+ years operational experience working with Microsoft technologies
Comfort and experience with Ops environment growing at a rapid scale.
Knowledge of Virtualization, Cloud Infrastructure and APIs
Site Reliability Engineer Jobs
By Nike At Beaverton, OR, United States

This overview explains our hiring process for corporate roles. Note there may be different hiring steps involved for non-corporate roles

Site Reliability Engineer (Sre) - $700,000
By Thurn Partners At New York, NY, United States
3+ years' experience in a similar software engineering or site reliability engineering position
Experience with SQL database operations
Experience with Kafka, CICD pipelines and virtualisation a bonus
Beautiful office space with generous overall benefits package
Proficiency with either Python or Golang
Extremely competitive compensation including performance bonuses
Senior Site Reliability Engineer
By NVIDIA At California, United States
BS degree in Computer Science or a related technical field involving coding (e.g., physics or mathematics), or equivalent experience
Technical leadership beyond development that includes scoping, requirements capturing, leading and influencing multiple teams of engineers on broad development initiatives.
Experience with the ELK and Prometheus stacks as a power user and administrator.
Prior experience driving production issues and helping with on-call support.
Experience with Cuda, PyTorch, TensorRT, TensorFlow, and/or Triton.
Experience with StackStorm and similar automation platforms is a bonus.
Application Support – Site Reliability Engineer
By Morgan Stanley At New York, NY, United States
Good working knowledge of trading and risk management business concepts
Ensure efficient incident management, ensuring accurate communication to impacted groups and timely resolution.
Familiarity with SDLC processes and management tools (Jira/GIT/Stashblue)
Network diagnostic skills and experience with networks and realtime messaging technologies (multicast, TCP/IP, UDP, SNMP)
Facilitate root cause investigations and manage the implementation of corrective and preventative measures.
Manage coverage during Asian and European market hours, including weekend pre-open ready-for-business checks.
Site Operations Manager Jobs
By KNAPP North America At Joliet, IL, United States
Resolve any resourcing issues beyond the Resident Site Manager’s control or responsibilities
Provide management of the supply chain and, in particular, ensure the cultural alignment of sub-suppliers
Provided leadership in the management of maintenance interfacing with KNAPP’s supplier(s) and sub-supplier(s)
Manage site budgets and associated commercial activities
Oversee tasks by developing team skill sets to ensure delivery of defined Service Level Agreements (SLA)
Oversee all training requirements, both technical and regulatory
Site Reliability Engineer Jobs
By Spotify At Greater Chicago Area, United States
• 4+ years of IT experience needed
• Experience working in a Linux environment
• Good knowledge of Unix
• Basic experience in writing SQL queries
• Good verbal communicative skills
• Ability to manage priorities and deadlines