Site Reliability Engineer (Remote)
By Liberty IT Solutions At Atlanta, GA, United States

Job Description Summary: Manages, supports and maintains a reliable environment for the site in order to ensure the stability and security of multiple systems/platforms that are run or operated in ...

Site Reliability Engineer - Entry Level (Technology Rotational Development Program)
By Equifax At Alpharetta, GA, United States
Bachelor’s Degree in Computer Science, Information Technology, Project Management, or equivalent field; Completion of coursework by May 2024.
Ability to gain experience by cross-training in the various areas within the Technology organization and other key related functions.
Excellent leadership, teamwork and service skills.
Excellent oral and written communication skills.
Experienced working with and developing with Java
Exposure/knowledge of cloud technologies (Google Cloud Platform (GCP), Amazon Web Services (AWS), or Azure)
Site Reliability Engineer Jobs
By Ascendion At , Alpharetta
Knowledge of the cloud and managed services such as MS Flex Server or AWS RDS.
Strong experience as a database administrator.
Strong experience in PostgreSQL and/or MySQL.
Automation skill in Bash, Golang, Python a plus.
Knowledge of IaC and CI/CD tools such as Terraform and GitHub Actions a plus.
Experience in query optimization and performance improvement.
Senior Site Reliability Engineer
By UKG (Ultimate Kronos Group) At , Alpharetta, Ga
Actively participate in incident response, including on-call responsibilities
Engineering degree, or a related technical discipline, or equivalent work experience
Experience coding in higher-level languages (e.g., Python, JavaScript, C++, or Java)
Working experience with industry standards like Terraform, Ansible
3+ years of hands-on experience working in Engineering or Cloud
3+ years of experience with public cloud platforms (e.g. GCP, AWS, Azure)
Site Reliability Engineer Jobs
By Global Payments (Beamery) At , Alpharetta, Ga
Experience working on containerization technologies.
Hands-on experience on setting up, managing, maintaining, upgrading vanilla Kubernetes or using Openshift.
Create Platform automation to create/manage clusters and Deployment automation for all CI/CD pipelines.
Experience writing programs in Python or Go.
Provide deep and detailed levels of monitoring/alerting capabilities across all levels of the application.
Typically minimum of 8 years - Professional Experience in IT infrastructure operations.
Aws Site Reliability Engineer
By Zeektek At United States
Help set up and manage our AWS EKS environment.
Help set up and manage our GitLab CI/CD pipeline.
Can engage and manage the heterogenous CI/CD and deployment environments of the teams we collaborate with
Site Reliability Engineer, DevOps manager
1.5+ years experience in SRE/DevOps or equivalent role
Work with other teams to assist in deploying our microservices and code into their environments (on prem and AWS)
Sr. Site Reliability Engineer - Remote Us
By SitusAMC At , Remote $100,000 - $125,000 a year

SitusAMC is where the best and most passionate people come to transform our client’s businesses and their own careers. Whether you’re a real estate veteran, a passionate technologist, or looking to ...

Saas Site Reliability Engineer And Automation Developer
By Siemens Digital Industries Software At , Costa Mesa, 92627 $116,900 - $210,400 a year
Develop and maintain automation tools, scripts, and frameworks to streamline deployment, configuration management, and monitoring processes.
Design and implement infrastructure solutions using configuration management tools, such as Ansible, Puppet, or Chef.
Proficiency in automation and configuration management tools (e.g., Ansible, Puppet, Chef).
In-depth knowledge and hands-on experience with cloud platforms such as AWS, Azure, or Google Cloud and their scalability features.
Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent work experience).
Strong programming skills in languages such as Python, Go, or Ruby.
Staff Site Reliability Engineer, Multi-Cloud
By Okta At ,
Extensive experience with configuration management tools like Chef, Ansible, or Puppet and infrastructure-as-code tools such as Terraform
Experience with multi-cloud infrastructure is desired
Proficiency in distributed systems design, with a comprehensive understanding of failure modes, benefits, and potential drawbacks
In-depth knowledge of various types of data stores, including both SQL and NoSQL
Core contributor driving Okta’s multi-cloud initiatives
Design, build, and operate Okta's global production infrastructure
Senior Engineer Ii - Digital Site Reliability
By Lululemon At , Seattle $132,300 - $173,500 a year
Contribute to engineering automation, management or development of pre-prod and production systems
Mentor and guide junior team members, sharing knowledge and expertise to foster a culture of learning and continuous improvement.
Eight+ years of engineering experience
Five+ years experience with CI/CD tools, GitLab preferred
Proficiency in at least one programming language (e.g., Python, Go, Java) and experience with scripting and automation.
Acknowledge the presence of choice in every moment and take personal responsibility for your life.
Senior Site Reliability Engineer, Trello
By Atlassian At , San Francisco
3+ years of hands-on experience with public cloud offerings such as AWS,GCP or Azure
Familiarity with Incident management, post-incident analysis and participation in on-call rotation
3+ years experience operating high-availability, fault-tolerant, scalable, distributed software in production: building monitoring, tweaking dashboards, defining alerts, writing runbooks, etc.
Engineering microservices and tools across one or more programming languages (e.g. Go, Python,Bash)
Automation and Infrastructure-as-Code projects and tooling (e.g. Ansible, Puppet, Terraform)
Build and maintain a continuous integration and delivery pipeline (e.g. Bamboo, Bitbucket Pipelines, Github Actions)
Site Reliability Engineer Jobs
By Adobe At , Lehi, 84043 $92,100 - $161,000 a year

What you need to succeed:

An understanding of SRE standard methodologies:

Infrastructure Site Reliability Engineer
By CVS Health At , Hartford $75,400 - $162,700 a year
A year or more experience with incident management, performance monitoring, and capacity planning tools.
Multiple years’ demonstrated proficiency in at least one configuration management tool such as Ansible, Puppet, or Chef.
Minimum of 5 years of experience in Infrastructure Engineering, System Administration, or related roles.
Multiple years’ experience with cloud platforms (e.g., Amazon Web Services, Microsoft Azure) and infrastructure-as-code tools (e.g., Terraform, CloudFormation).
Multiple years’ experience with containerization technologies such as Docker and container orchestration platforms like Kubernetes.
Multiple years’ demonstrated knowledge of networking principles and protocols, including TCP/IP, DNS, load balancing, and firewalls.
Site Reliability Engineer (Sre) - Evening Shift
By Brightspot At , Chicago $100,000 - $115,000 a year
Automate manual tasks and build tools for system monitoring, deployment, and configuration management.
2+ years of relevant experience in Cloud Operations
Proven troubleshooting and problem-solving skills in a cloud-based application environment
Outstanding communication skills with the ability to work in a client-facing role
Monitor the availability, performance, and reliability of our systems and applications during the evening shift.
Investigate and resolve incidents, troubleshooting any issues that arise and ensuring prompt resolution to minimize downtime.
Associate Site Reliability Engineer (Remote)
By Patterson Technology Center At , Minneapolis-Saint Paul
Bachelor's degree in Computer Science, Management Information Sciences or area of functional responsibility preferred, or equivalent years of industry work experience
Knowledge of aspects of application development and project life cycles design and development experience with engineering software design tools
Office environment – either in Patterson facility or at home/remote location
Plan, design, deploy, and operate Site Reliability Engineering capabilities for cloud products & services.
Continuously build, automate, and improve upon capabilities that are secure, scalable, performant, and resilient
Demonstrated knowledge and understanding of database and operating systems
Senior Site Reliability Engineer/Devops Engineer
By Zillow At , Remote
Knowledge and experience working with microservices
Leverage your knowledge to build technical consensus around architecture and technology choices
Build and manage StreetEasy's cloud infrastructure, contributing to our commitment to reliability and efficiency
A Bachelor's degree in Computer Science or a related technical field, or equivalent practical experience
1-3 years of experience in site reliability engineering, DevOps, or a related field
Experience with cloud service providers, preferably AWS
Site Reliability Engineer, Recommendation Infrastructure (San Jose)
By TikTok At , San Jose $112,200 - $205,000 a year
Plan, manage and optimize cloud resources utilization, ensuring SLA of large-scale clusters
Bachelor's degree or above majoring in Computer Science or related fields, with at least 1 years of related work experience
Experience in SRE of large-scale systems deployment with high reliability and scalability
Familiar with system operation skills in Linux and network
Experience programming in at least one of the following languages: Python, Perl, Go, or C/C++
Experience in designing, analyzing and troubleshooting large-scale distributed systems
(Remote) - Sr Site Reliability Engineer
By First American Financial Corporation At , Santa Ana $87,945 - $182,655 a year
Bachelor's degree in Computer Science, Information Technology, or equivalent education and experience.
Strong understanding of SRE practices: incident response, change/release management, capacity planning, infrastructure automation, elastic environments, chaos engineering and blameless postmortems.
Skilled in defining service level objectives, measuring service level indicators, and setting up error budgets.
Experienced in creating SRE adoption framework and onboarding procedure.
What You’ll Bring (At least 5-7 years' experience)
Maintain and improve reliability of core software systems.
Site Reliability Engineer, Product - Usds
By TikTok At , Los Angeles $119,000 - $289,000 a year
Gain a solid understanding of the various components and services that power the TikTok experience
Maintain services to meet service-level-agreements (SLAs) and service-level-objectives (SLOs) by measuring and monitoring availability, performance, and overall system health
Scale systems sustainability through mechanisms such as automation; evolve systems reliability, efficiency, and velocity by pushing for changes
Provide user support, incident responses and postmortems
In this role, you will:
Our time off and leave plans are:
Site Reliability Engineer Jobs
By Fisker Inc At , Manhattan Beach $60,900 - $169,650 a year
Experience with artifact management (Artifactory, Nexus)
Experience with strict security requirements and implementation
Design, provision, deploy, and manage Kubernetes clusters and resources
Bachelor’s degree in computer science or related technical field or equivalent experience
5+ years of SRE / DevOps Engineer experience
Experience with cloud infrastructure (AWS, GCP, Azure)
Site Reliability Engineer Jobs
By Zscaler At , San Jose
Strong Centos/UNIX skills, FreeBSD specific experience is a plus.
5 -7 years experience in a SaaS/ Cloud/Distributed environment growing at a rapid scale.
Minimum 3+ years of scripting experience in Python is required.
Hands-on experience with infrastructure as code and automation tools (Ansible, Chef, Puppet, Terraform).
Basic Networking skills (TCP/IP, DNS, LACP, CARP) for testing and troubleshooting are required.
Competitive salary and benefits, including equity
Senior Site Reliability Engineer
By Adyen At , Chicago
Have a good understanding of Infrastructure as Code and experience with configuration management and automation tools such as Puppet and Ansible;
Strong familiarity with SRE practices and methodologies such as defining SLOs, change management processes and incident response;
Together with the team lead the way in continuously improving our incident management and on-call processes
Have experience with building, operating and troubleshooting large-scale distributed systems spanning multiple data centers across the globe;
Skilled in one or more programming or scripting languages such as Python, Java or bash;
We use SLOs to drive platform stability and innovation
Site Reliability Engineer (Sre)
By Agama Solutions At , San Jose
5+ years of US experience as in a SRE role
Good communication (and listening) skills.
Some experience administering Linux “web” servers, at scale.
Working knowledge of DNS, HTTP, TLS, web security.
Experience with networking troubleshooting using tools such as TCP Dump.
Well versed in *nix Operating Systems (we use CentOS and Ubuntu LTS).
Sr. Software Engineer- Site Reliability (Remote)
By Home Depot / THD At , Atlanta, 30301 $160,000 a year
Knowledge of configuration management tools (e.g., Ansible, Puppet, or Chef)
This position typically reports to Software Engineer Manager or Sr. Manager
2-4 years of relevant work experience
Experience with cloud platforms (e.g., AWS, Azure, or GCP)
Experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack)
Knowledge of version control systems (e.g., Git)

Are you looking for an opportunity to join a fast-paced and innovative team? We are looking for a Site Reliability Engineer to join our team and help us ensure our systems are running smoothly and efficiently. You will be responsible for monitoring, troubleshooting, and resolving any issues that arise with our systems. You will also be responsible for developing and implementing strategies to improve system reliability and performance. If you are a self-starter with a passion for problem-solving and a knack for automation, then this is the job for you!

A Site Reliability Engineer (SRE) is responsible for ensuring the reliability, performance, and availability of a company’s websites, applications, and services. They are responsible for developing and maintaining automation tools, monitoring systems, and other processes to ensure the reliability of the company’s services.

What is Site Reliability Engineer Skills Required?

• Knowledge of Linux/Unix systems
• Knowledge of scripting languages such as Bash, Python, and Ruby
• Knowledge of distributed systems and cloud computing
• Knowledge of monitoring and logging tools such as Nagios, Splunk, and ELK
• Knowledge of configuration management tools such as Chef, Puppet, and Ansible
• Knowledge of container technologies such as Docker and Kubernetes
• Knowledge of version control systems such as Git
• Ability to troubleshoot and debug complex systems
• Ability to work in a fast-paced environment

What is Site Reliability Engineer Qualifications?

• Bachelor’s degree in Computer Science, Information Technology, or related field
• 5+ years of experience in a DevOps or SRE role
• Experience with automation and configuration management tools
• Experience with monitoring and logging tools
• Experience with container technologies
• Experience with version control systems

What is Site Reliability Engineer Knowledge?

• Knowledge