Senior Engineer II - Digital Site Reliability
By Lululemon At Seattle $132,300 - $173,500 a year
Contribute to engineering automation, management or development of pre-prod and production systems
Mentor and guide junior team members, sharing knowledge and expertise to foster a culture of learning and continuous improvement.
Eight+ years of engineering experience
Five+ years experience with CI/CD tools, GitLab preferred
Proficiency in at least one programming language (e.g., Python, Go, Java) and experience with scripting and automation.
Acknowledge the presence of choice in every moment and take personal responsibility for your life.
Senior Site Reliability Engineer
By Dremio At Seattle, WA $166,304 - $225,000 a year
Have moderate-to-advanced experience in Python/Go, and at least reading knowledge of Java.
10+ years of relevant experience in the following areas: SRE, DevOps, Distributed Systems, Cloud Operations, Software Engineering.
Have a systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive.
Hands-on experience with large-scale production Kubernetes clusters (<=1000 nodes).
Hands-on experience using Honeycomb for OpenTelemetry trace analysis.
Drive continuous improvements to our usage of Kubernetes, our Operators, and the GitOps deployment paradigm.
Senior Site Reliability Engineer
By Abbott Laboratories At Abbott Park, IL
Experience with Microsoft Azure DevOps, Release Management Tools
Experience with Windows Server Configuration Management
Experience in IIS Configuration Management
EDUCATION AND EXPERIENCE YOU’LL BRING
Produces and Manages Infrastructure as code
Manages Development, QA, and Production environment configuration
Site Reliability Engineer III
By JPMorgan Chase Bank, N.A. At Plano, TX
Required qualifications, capabilities, and skills
Formal training or certification on site reliability engineering concepts and 3+ years applied experience
Guides and assists others in the areas of building appropriate level designs and gaining consensus from peers where appropriate
Collaborates with other software engineers and teams to design and implement deployment approaches using automated continuous integration and continuous delivery pipelines
Collaborates with other software engineers and teams to design, develop, test, and implement availability, reliability, and scalability solutions in their applications
Implements infrastructure, configuration, and network as code for the applications and platforms in your remit
Site Reliability Engineer Jobs
By Motorola Solutions At Allen, TX 75002

Company Overview At Motorola Solutions, we believe that everything starts with safety. It’s the constant that empowers people to confidently move forward. It can fill a flight or sell out a stadium. ...

Sr Site Reliability Engineer
By Tesla At Austin, TX
Deploy, configure, manage, and automate CI/CD pipelines using Jenkins and GitHub Actions, with Git for version control.
5+ years’ experience working in a manufacturing or material flow setting.
5+ years’ experience integrating manufacturing or material flow systems.
5+ years’ experience in a high-level language such as Go, Python and/or Java.
5+ years’ experience with SQL (MySQL, Postgres, MSSQL)
5+ years’ experience with Docker and Kubernetes.
DevOps Site Reliability Engineer
By Reynolds and Reynolds At Houston, TX 77001
Bachelor's degree in MIS, CIS, Computer Science, Engineering, or equivalent work experience
Experience with virtualization, scripting and automation, server hardware, and/or network communications desired
Experience with both Windows and Linux server operating systems preferred
Preferred industry standard certifications include: A+, Server+, Security+, Linux+, Network+, CCNA, MCSA
Desire and ability to quickly learn and apply new skills
Strong verbal and written communication skills
Site Reliability Engineer Jobs
By Visionary Recruiting Solutions At Corpus Christi, TX, United States
Completes Management of Change where appropriate
Provides input to a Risk Management Plan to anticipate reliability-related and non-reliability-related risks that could adversely impact plant operation.
Provides technical support to production, maintenance management, and technical personnel.
Identify training needs to maintain the required skills and knowledge to perform the job to the necessary standard.
Three years of experience as Reliability Engineer required; in the chemical industry preferred.
Certifications in Six Sigma (Green Belt, Lean) preferred.
Site Reliability Operations Engineer
By Fox Corporation At Tempe, AZ 85283
Experience with operation and management of cloud-based services, including operational processes
Familiarity with modern operations concepts such as Agile and Incident Management
Experience / knowledge of the Broadcast industry
Operate and support live events, delivering smooth video experience to the audience
Hands-on experience in a production / operational role
Experience using enterprise monitoring tools of any kind
Site Reliability Engineer-Remote Jobs
By Dynata At Plano, TX 75024 $90,000 - $112,000 a year
Experience with configuration management tools like Chef, Puppet, or Ansible
Learning Management System available through the Intranet providing free access to nearly 500 online training modules and personal development programs
Previous experience in an SRE or related role: DevOps, platform engineering, software engineering
Experience with distributed / highly available systems architecture, theory and practice.
Experience with an infrastructure-as-code tool (Terraform, CloudFormation, etc.) [Terraform preferred]
Previous experience building and maintaining production systems in the cloud (AWS preferred)
Site Reliability Engineer Jobs
By Blue Yonder At Dallas, TX $88,525 - $125,575 a year
Solid understanding of large-scale applications, Cloud Observability, monitoring and fault management, and understanding of Network Architectures
Respond to technical business requirements around availability, performance, and planned maintenance activities to ensure a well-operating solution and SLA compliance.
Minimum of 5 years’ experience developing, managing, or supporting distributed systems in a Cloud/IaaS environment, Azure preferred
Experience working with monitoring and visualization tools such as Splunk and AppDynamics
Experience coordinating between support and development teams to ensure effective delivery of monitoring services to the end-user.
Experience implementing best practices and industry standards for operational monitoring aligned to ITIL.
Site Reliability Engineer Jobs
By Autodesk At Atlanta, GA $109,400 - $188,760 a year
Use modern administration tools like Docker, Terraform, AWS CloudFormation/CDK to manage and deploy containers and virtual machines
Collaborate with stakeholders to understand requirements, understand use cases and build towards a cohesive technical strategy
Experience in large-scale cloud-based production infrastructure (AWS preferred)
Expert experience with Docker and other container technology
Strong experience with Log Analysis and Monitoring tools such as CloudWatch, Splunk, New Relic, Grafana
Experience with any relevant language (Python, JavaScript, Ruby, Rust, Bash, etc.)
Site Reliability Engineer (Remote)
By Home Depot / THD At Atlanta, GA 30301 $130,000 a year
This position typically reports to Software Engineer Manager or Sr. Manager
Demonstrable knowledge of Linux systems, TCP/IP, HTTP, and multi-tier web application architectures
Excellent written and interpersonal communication and documentation skills
Practical knowledge of various aspects of service design, including application protocols, caching strategies, and software design principles
Practical, solid knowledge of shell scripting, Java and at least one systems programming language (Go preferred)
BS in Computer Science or equivalent experience
Senior Software Engineer - Site Reliability (Remote)
By Home Depot / THD At Atlanta, GA 30301 $180,000 a year
This position typically reports to Software Engineer Manager or Sr. Manager
2-4 years of relevant work experience
Experience with security frameworks for user and services authorization and authentication
Experience with creating and executing unit, functional, destructive and performance tests
Experience with modern debugging and root cause analysis techniques
Experience with version control system
IT Ops Senior Reliability Engineer
By Shell At Houston, TX
Strong stakeholder management skills, and development, operations, and security experience
Proven coordination and consulting management skills
Must have a Minimum of eight (8) years in applications support and IT services and operations management.
Sound knowledge of Downstream and Retail business work processes, business requirements and their impact on operations and customers
Proven ability to influence without line management control
End-to-end responsibility for the IT products' run-and-maintain operational processes, in line with service requirements
Staff Site Reliability Engineer
By Procore Technologies At Austin, TX $136,000 - $187,000 a year
Bachelor’s Degree in Computer Science or a related field is preferred, or comparable work experience
8+ years of industry experience as an SRE or Software Engineer
Experience supporting and working with cross-functional teams in a dynamic environment
Strong oral and written communication skills
2+ years of experience working with Ruby on Rails
Provide technical efforts around building a robust and scalable observability pipeline to support billions of events
Senior Site Reliability Engineer, Trello
By Atlassian At San Francisco
3+ years of hands-on experience with public cloud offerings such as AWS, GCP, or Azure
Familiarity with Incident management, post-incident analysis and participation in on-call rotation
3+ years experience operating high-availability, fault-tolerant, scalable, distributed software in production: building monitoring, tweaking dashboards, defining alerts, writing runbooks, etc.
Engineering microservices and tools across one or more programming languages (e.g. Go, Python, Bash)
Automation and Infrastructure-as-Code projects and tooling (e.g. Ansible, Puppet, Terraform)
Build and maintain a continuous integration and delivery pipeline (e.g. Bamboo, Bitbucket Pipelines, GitHub Actions)
Reliability CAE Senior Engineer I
By Honda Dev. and Mfg of Am., LLC At Raymond
Experience in data analysis and communication of complex information to engineering management is desired.
Experience with following software or similar is desired
Ability to communicate concerns and ideas through remote work environment
Education reimbursement for continued learning
BS in Mechanical / Automotive Engineering
Proficient in Microsoft Excel, Word, and PowerPoint
Senior Reliability Engineer Jobs
By Digital Diagnostics, Inc. At Remote
Location – Chicago, IL | Coralville, IA | or Remote-US
What We Have to Offer
Lead or participate in deploying updates or improvements as needed.
Lead or participate in support activities.
Identify performance and scalability bottlenecks in Digital Diagnostics’ global technical infrastructure.
Identify and work to eliminate waste in cloud infrastructure costs.
Senior Site Reliability Engineer/DevOps Engineer
By Zillow At Remote
Knowledge and experience working with microservices
Leverage your knowledge to build technical consensus around architecture and technology choices
Build and manage StreetEasy's cloud infrastructure, contributing to our commitment to reliability and efficiency
A Bachelor's degree in Computer Science or a related technical field, or equivalent practical experience
1-3 years of experience in site reliability engineering, DevOps, or a related field
Experience with cloud service providers, preferably AWS
Senior Site Reliability Engineer
By Adyen At Chicago
Have a good understanding of Infrastructure as Code and experience with configuration management and automation tools such as Puppet and Ansible;
Strong familiarity with SRE practices and methodologies such as defining SLOs, change management processes and incident response;
Together with the team lead the way in continuously improving our incident management and on-call processes
Have experience with building, operating and troubleshooting large-scale distributed systems spanning multiple data centers across the globe;
Skilled in one or more programming or scripting languages such as Python, Java or Bash;
We use SLOs to drive platform stability and innovation
Cloud Senior Site Reliability Engineer
By Bank of America At New York, NY
Perform deep dives into systemic and latent reliability issues, incident management, problem management
Understanding of cost management, inventory management, FinOps model
Identifying, analyzing, and resolving infrastructure vulnerabilities and application deployment issues.
Evaluating and automating the scaling and capacity requirements within Azure environments
BS/MS degree in Computer Science or related technical field involving systems or equivalent practical experience.
Minimum 8+ years of hands-on experience maintaining cloud platforms on a major cloud service provider.
Site Reliability Engineer (SRE) - Mid/Senior
By Vanilla Technologies Inc. At Remote
Project management tools such as Jira, Git, and Confluence
Accounting for and addressing software vulnerabilities
Securing infrastructure, applications, and code
Ensuring high SLA for uptime & security
Quick, continuous automation and deployment of updates
Preserving infrastructure and stability of code
Senior Site Reliability Engineer
By NVIDIA At California, United States
BS degree in Computer Science or a related technical field involving coding (e.g., physics or mathematics), or equivalent experience
Technical leadership beyond development that includes scoping, requirements capturing, leading and influencing multiple teams of engineers on broad development initiatives.
Experience with the ELK and Prometheus stacks as a power user and administrator.
Prior experience driving production issues and helping with on-call support.
Experience with CUDA, PyTorch, TensorRT, TensorFlow, and/or Triton.
Experience with StackStorm and similar automation platforms is a bonus.
Senior Site Reliability Engineer
By Business Wire At United States
Strong experience with AWS cloud infrastructure and container orchestration (Kubernetes, Docker)
Strong experience with monitoring and alerting systems such as Prometheus, Grafana, Nagios, etc.
Strong experience with at least one programming language. Java is highly preferred but other languages such as Python will be considered
Advanced experience with Linux system administration, Java based applications, and network architecture
Ability to work remotely 100%
Excellent health benefits that begin on your first day of employment
Senior Site Reliability Engineer (Remote)
By The Hartford At Hartford, CT
Progressively implement preventative controls and drive increased automation and self-healing capabilities. Continue to improve cost efficiency baselines
Hands-on experience with Performance and Observability tools such as DynaTrace, Splunk, TrueSight, CloudWatch, CloudTrail, and related tools.
Experience with continuous integration and DevOps methodologies, preferred tools such as GitHub, Jenkins, Nexus, Rally, SonarQube etc.
Knowledge of complex traditional and modern enterprise architectures and systems (understand more than the component itself).
Strong hybrid cloud experience (private and public) across various service delivery models – IaaS, PaaS, SaaS.
Strong communication (verbal and written), collaboration, and negotiation skills, working in a diverse team across business units
Senior Site Reliability Engineer
By Humana At Phoenix, AZ 85050
Detail oriented with excellent organizational and project management skills
Bachelor's degree or equivalent experience
Experienced in Java, Python, or similar coding experience
3+ years of experience working with voice technologies / IVR
2+ years of project leadership experience
Project-based experience driving changes and improvements to IVR solutions.
Senior Site Manager Jobs
By RSPB At Boardman, OH, United States
Head up & develop the site's management team to effectively deliver agreed objectives.
Be responsible for ensuring the safe management of all site operations.
The ability & experience to be a natural leader with excellent communication skills, who can successfully engage & influence.
Have applied or can apply a long-term vision approach to ensure best use of resources, balancing conservation management with visitor operations.
Able to understand and manipulate data to inform operational decision-making and ensure sustainable business management.
Have led and monitored compliance in land management obligations and health and safety.
Senior Site Reliability Engineer
By Akamai $113,430 - $170,043 a year
Acting as an escalation point for operational, network and managerial teams to ensure network/customer issues are resolved
Have 4 years of relevant experience and a Bachelor's degree in Engineering, Computer Science, or related discipline
Experience in SQL, working in UNIX/Linux environments along with managing and running RDBMS (MySQL, PostgreSQL, etc.) clusters
Possess good experience with Internet protocols (DNS, HTTP, TLS, TCP/IP, SSH, etc.)
Experience with Web Services & Cloud API, cloud computing, hosting, and networking
Have experience in Python and/or Bash scripting
Senior Site Reliability Engineer - Apple Maps
By Apple At Cupertino, CA
Incident management experience is a plus.
Cloud Native SRE experience (ideally 5-10 years).
Experience setting up and managing services running on Kubernetes.
Multi Cloud environment experience such as AWS and Google Cloud is preferred but not required.
Ability to learn and adapt. Experience matters but curiosity and adaptability are even more important.
Linux System and Network Administration.
Senior Service Reliability Engineer
By Amadeus At Portsmouth, NH 03801
Change and Release Management: Manage and execute change, release and test processes and drive automation of these processes
Develop standardized automation to control, build artifact and deploy managed services
Leverage, improve, design, and implement services that automate application provisioning and manage the underlying infrastructure as a service
Manage application operations for Amadeus Core Services end to end.
Manage the full application stack (OS, Data Bases and Data Stores,
Play a key role in accelerating the organization's ability to deliver changes reliably and consistently to Amadeus Hospitality customers
Senior Site Reliability Engineer
By Microsoft At Redmond, WA 98052 $112,000 - $218,400 a year
6+ years of experience in a site reliability engineering role with large-scale, distributed infrastructures
5+ years’ experience with scripting languages such as PowerShell, Python etc.
6+ years’ experience troubleshooting, investigating, and fixing production issues in large scale cloud and/or hosted environments
4+ years experience with building infrastructure using Microsoft Azure technology
Technical Knowledge and Domain-Specific Expertise
This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
Senior Site Reliability Engineer
By UKG (Ultimate Kronos Group) At Alpharetta, GA
Actively participate in incident response, including on-call responsibilities
Engineering degree, or a related technical discipline, or equivalent work experience
Experience coding in higher-level languages (e.g., Python, JavaScript, C++, or Java)
Working experience with industry standards like Terraform, Ansible
3+ years of hands-on experience working in Engineering or Cloud
3+ years of experience with public cloud platforms (e.g. GCP, AWS, Azure)
Staff Site Reliability Engineer
By Netskope At Santa Clara, CA
You will be part of a high caliber engineering team in the exciting space of cloud tools and infrastructure management.
Drive efficiencies in systems and processes: capacity planning, configuration management, performance tuning, monitoring and root cause analysis.
You will solve complex, exciting challenges and improve the depth and breadth of your technical and analytical skills
Partner closely with our development teams and product managers to architect and build features that are highly available, performant and secure
Gain deep knowledge of our application stack
Experience improving the performance of microservices and solving scaling/performance issues

Are you an experienced Senior Site Reliability Engineer looking for a new challenge? We are looking for a motivated individual to join our team and help us ensure our systems are running smoothly and efficiently. You will be responsible for developing and maintaining our infrastructure, monitoring system performance, and troubleshooting any issues that arise. If you are passionate about technology and have a keen eye for detail, this could be the perfect opportunity for you!

What Skills Are Required for a Senior Site Reliability Engineer?

•Strong knowledge of Linux/Unix administration
•Experience with scripting languages such as Bash, Python, Ruby, etc.
•Experience with automation/configuration management using tools such as Chef, Puppet, Ansible, etc.
•Experience with cloud technologies such as AWS, Azure, Google Cloud Platform, etc.
•Experience with container technologies such as Docker, Kubernetes, etc.
•Experience with monitoring tools such as Nagios, Zabbix, etc.
•Experience with version control systems such as Git, SVN, etc.
•Strong troubleshooting and problem-solving skills
•Excellent written and verbal communication skills

What Qualifications Does a Senior Site Reliability Engineer Need?

•Bachelor’s degree in Computer Science, Information Technology, or related field
•5+ years of experience in a Site Reliability Engineer role
•Experience with DevOps practices and tools
•Experience with database technologies such as MySQL, PostgreSQL, etc.

What Knowledge Does a Senior Site Reliability Engineer Need?

•Knowledge of ITIL best practices
•Knowledge of network protocols and technologies
•Knowledge of security best practices
•Knowledge of software development lifecycle

What Experience Does a Senior Site Reliability Engineer Need?

•Experience with large-scale distributed systems