Senior Engineer II - Digital Site Reliability
By Lululemon At Seattle $132,300 - $173,500 a year
Contribute to engineering automation, management or development of pre-prod and production systems
Mentor and guide junior team members, sharing knowledge and expertise to foster a culture of learning and continuous improvement.
Eight+ years of engineering experience
Five+ years experience with CI/CD tools, GitLab preferred
Proficiency in at least one programming language (e.g., Python, Go, Java) and experience with scripting and automation.
Acknowledge the presence of choice in every moment and take personal responsibility for your life.
Senior Site Reliability Engineer
By Dremio At Seattle, WA $166,304 - $225,000 a year
Have moderate-advanced experience in Python/Go, and at least reading knowledge of Java.
10+ years of relevant experience in the following areas: SRE, DevOps, Distributed Systems, Cloud Operations, Software Engineering.
Have a systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive.
Hands-on experience with large-scale production Kubernetes clusters (<=1000 nodes).
Hands-on experience using Honeycomb for OpenTelemetry trace analysis.
Drive continuous improvements to our usage of Kubernetes, our Operators, and the GitOps deployment paradigm.
Sr Reliability Engineer - Remote
By Dart Container At Houston
Experience with change management and the ability to present effectively to groups or one-on-one at all levels of the organization
Experience using Computerized Maintenance Management Systems (CMMS) preferred
Develop, implement, and maintain a life cycle asset management process for components, equipment, and processes at all facilities
Bachelor’s degree with an emphasis in engineering or related field of study and six (6) years of related engineering experience
Associate’s degree and ten (10) years of related engineering experience
Experience in maintenance leadership positions or background in reliability fundamentals and maintenance workflow processes
Sr. Software Engineer - Site Reliability (Remote)
By Home Depot / THD At Atlanta, 30301 $160,000 a year
Knowledge of configuration management tools (e.g., Ansible, Puppet, or Chef)
This position typically reports to Software Engineer Manager or Sr. Manager
2-4 years of relevant work experience
Experience with cloud platforms (e.g., AWS, Azure, or GCP)
Experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack)
Knowledge of version control systems (e.g., Git)
Site Reliability Engineer Jobs
By Blue Yonder At Dallas, TX, United States
Solid understanding of large-scale applications, Cloud Observability, monitoring and fault management, and understanding of Network Architectures
Respond to technical business requirements around availability, performance, and planned maintenance activities to ensure a well-operating solution and SLA compliance.
Minimum of 5 years’ experience developing, managing, or supporting distributed systems in a
Experience working with monitoring and visualization tools such as Splunk and AppDynamics
Experience coordinating between support and development teams to ensure effective delivery of monitoring services to the end-user.
Experience implementing best practices and industry standards for operational monitoring aligned to ITIL.
Senior Site Reliability Engineer
By Abbott Laboratories At Abbott Park, IL
Experience with Microsoft Azure DevOps, Release Management Tools
Experience with Windows Server Configuration Management
Experience in IIS Configuration Management
EDUCATION AND EXPERIENCE YOU’LL BRING
Produces and Manages Infrastructure as code
Manages Development, QA, and Production environment configuration
Site Reliability Engineer, Infrastructure
By FullStory At Atlanta, GA
Be a go-to person for one or more areas such as infrastructure, configuration management, or automation.
Grow your family. We offer a global fertility and family building benefit that encompasses all journeys to growing your family.
Design, create, and manage high-performance infrastructure and tooling for our external and internal services.
Experience with one or more of the following programming languages: Golang, Python, C++, etc.
Experience architecting and maintaining systems in a public cloud environment (e.g., GCP, AWS, Azure, or similar).
Experience with modern metrics, monitoring, and logging frameworks and services (e.g., Prometheus, Grafana, Stackdriver).
Site Reliability Engineer III
By JPMorgan Chase Bank, N.A. At Plano, TX
Required qualifications, capabilities, and skills
Formal training or certification on site reliability engineering concepts and 3+ years applied experience
Guides and assists others in the areas of building appropriate level designs and gaining consensus from peers where appropriate
Collaborates with other software engineers and teams to design and implement deployment approaches using automated continuous integration and continuous delivery pipelines
Collaborates with other software engineers and teams to design, develop, test, and implement availability, reliability, and scalability solutions in their applications
Implements infrastructure, configuration, and network as code for the applications and platforms in your remit
Site Reliability Engineer Jobs
By Motorola Solutions At Allen, 75002, TX

Company Overview At Motorola Solutions, we believe that everything starts with safety. It’s the constant that empowers people to confidently move forward. It can fill a flight or sell out a stadium. ...

Sr Site Reliability Engineer
By Tesla At Austin, TX
Deploy, configure, manage, and automate CI/CD pipelines using Jenkins, GitHub Actions, and Git for version control.
5+ years’ experience working in a manufacturing or material flow setting.
5+ years’ experience integrating manufacturing or material flow systems.
5+ years’ experience in a high-level language such as Go, Python and/or Java.
5+ years’ experience with SQL (MySQL, Postgres, MSSQL)
5+ years’ experience with Docker and Kubernetes.
DevOps Site Reliability Engineer
By Reynolds and Reynolds At Houston, 77001, TX
Bachelor’s degree in MIS, CIS, Computer Science, Engineering, or equivalent work experience
Experience with virtualization, scripting and automation, server hardware, and/or network communications desired
Experience with both Windows and Linux server operating systems preferred
Preferred industry standard certifications include: A+, Server+, Security+, Linux+, Network+, CCNA, MCSA
Desire and ability to quickly learn and apply new skills
Strong verbal and written communication skills
Site Reliability Engineer Jobs
By Visionary Recruiting Solutions At Corpus Christi, TX, United States
Completes Management of Change where appropriate
Provides input to a Risk Management Plan to anticipate reliability-related and non-reliability-related risks that could adversely impact plant operation.
Provides technical support to production, maintenance management, and technical personnel.
Identify training needs to maintain the required skills and knowledge to perform the job to the necessary standard.
Three years of experience as Reliability Engineer required; in the chemical industry preferred.
Certifications in Six Sigma (Green Belt, Lean) preferred.
Site Reliability Operations Engineer
By Fox Corporation At Tempe, 85283, AZ
Experience with operation and management of cloud-based services, including operational processes
Familiarity with modern operations concepts such as Agile and Incident Management
Experience / knowledge of the Broadcast industry
Operate and support live events, delivering smooth video experience to the audience
Hands-on experience in a production / operational role
Experience using enterprise monitoring tools of any kind
Clinical Site Payment, Manager (Remote)
By Vertex Pharmaceuticals At Boston, 02110, MA $120,080 - $180,120 a year
Functional management and oversight of resources and assignments within the Site Payment group
Providing knowledge related to global requirements for payments
Collaborate with Site Contracting and Data Management counterparts to ensure study budgets are set up and configured appropriately for payments
Typically requires 4 years of experience or the equivalent combination of education and experience
Previous experience with site payments and clinical site budgets
Ability to effectively manage multiple priorities
Site Reliability Engineer-Remote Jobs
By Dynata At Plano, 75024, TX $90,000 - $112,000 a year
Experience with configuration management tools like Chef, Puppet, or Ansible
Learning Management System available through the Intranet providing free access to nearly 500 online training modules and personal development programs
Previous experience in an SRE or related role: DevOps, platform engineering, software engineering
Experience with distributed / highly available systems architecture, theory and practice.
Experience with an infrastructure-as-code tool (Terraform, CloudFormation, etc.) [Terraform preferred]
Previous experience building and maintaining production systems in the cloud (AWS preferred)
Site Reliability Engineer Jobs
By Blue Yonder At Dallas, TX $88,525 - $125,575 a year
Solid understanding of large-scale applications, Cloud Observability, monitoring and fault management, and understanding of Network Architectures
Respond to technical business requirements around availability, performance, and planned maintenance activities to ensure a well-operating solution and SLA compliance.
Minimum of 5 years’ experience developing, managing, or supporting distributed systems in a Cloud/IaaS environment, Azure preferred
Experience working with monitoring and visualization tools such as Splunk and AppDynamics
Experience coordinating between support and development teams to ensure effective delivery of monitoring services to the end-user.
Experience implementing best practices and industry standards for operational monitoring aligned to ITIL.
Site Reliability Engineer Jobs
By Autodesk At Atlanta, GA $109,400 - $188,760 a year
Use modern administration tools like Docker, Terraform, AWS CloudFormation/CDK to manage and deploy containers and virtual machines
Collaborate with stakeholders to understand requirements, understand use cases and build towards a cohesive technical strategy
Experience in large-scale cloud-based production infrastructure (AWS preferred)
Expert experience with Docker and other container technology
Strong experience with Log Analysis and Monitoring tools such as CloudWatch, Splunk, New Relic, Grafana
Experience with any relevant language (Python, JavaScript, Ruby, Rust, Bash, etc.)
Site Reliability Engineer (Remote)
By Home Depot / THD At Atlanta, 30301, GA $130,000 a year
This position typically reports to Software Engineer Manager or Sr. Manager
Demonstrable knowledge of Linux systems, TCP/IP, HTTP, and multi-tier web application architectures
Excellent written and interpersonal communication and documentation skills
Practical knowledge of various aspects of service design, including application protocols, caching strategies, and software design principles
Practical, solid knowledge of shell scripting, Java and at least one systems programming language (Go preferred)
BS in Computer Science or equivalent experience
Senior Software Engineer - Site Reliability (Remote)
By Home Depot / THD At Atlanta, 30301, GA $180,000 a year
This position typically reports to Software Engineer Manager or Sr. Manager
2-4 years of relevant work experience
Experience with security frameworks for user and services authorization and authentication
Experience with creating and executing unit, functional, destructive and performance tests
Experience with modern debugging and root cause analysis techniques
Experience with version control systems
Staff Site Reliability Engineer
By Procore Technologies At Austin, TX $136,000 - $187,000 a year
Bachelor’s Degree in Computer Science or a related field is preferred, or comparable work experience
8+ years of industry experience as an SRE or Software Engineer
Experience supporting and working with cross-functional teams in a dynamic environment
Strong oral and written communication skills
2+ years of experience working with Ruby on Rails
Provide technical efforts around building a robust and scalable observability pipeline to support billions of events

Are you looking for a challenging and rewarding role as a Remote Site Reliability Engineer? We are looking for a talented individual to join our team and help us ensure our systems are reliable and secure. You will be responsible for monitoring, troubleshooting, and resolving issues with our systems, as well as developing and implementing strategies to improve system performance. If you have a passion for technology and a desire to make a difference, this is the job for you!

Overview:

A Remote Site Reliability Engineer is responsible for ensuring the reliability, availability, and scalability of a company’s remote systems and services. This role requires a combination of technical and operational skills to ensure that the remote systems are running optimally and securely. The Remote Site Reliability Engineer will work with the development, operations, and security teams to ensure that the remote systems are reliable, secure, and available.

Detailed Job Description:

The Remote Site Reliability Engineer will be responsible for the following tasks:

• Design, implement, and maintain remote systems and services.
• Monitor and troubleshoot remote systems and services.
• Develop and maintain automation and configuration management systems.
• Develop and maintain security policies and procedures.
• Develop and maintain system and service performance metrics.
• Develop and maintain system and service availability metrics.
• Develop and maintain system and service scalability metrics.
• Develop and maintain system and service reliability metrics.
• Develop and maintain system and service security metrics.
• Develop and maintain system and service documentation.
• Develop and maintain system and service monitoring and alerting systems.
• Develop and maintain system and service backup and recovery systems.
• Develop and maintain system and service disaster recovery plans.
• Develop and maintain system and service capacity planning.
• Develop and maintain system and service performance tuning.
• Develop and maintain system and service patching and upgrades.
• Develop and maintain system and service security hardening.
• Develop and maintain system and service change management processes.
• Develop and maintain system and service incident response plans.
• Develop and maintain system and service root cause analysis processes.

What Skills Are Required for a Remote Site Reliability Engineer Job?

• Expertise in remote systems and services.
• Expertise in automation and configuration management systems.
• Expertise in security policies and procedures.
• Expertise in system and service performance metrics.
• Expertise in system and service availability metrics.
• Expertise in system and service scalability metrics.
• Expertise in system and service reliability metrics.
• Expertise in system and service security metrics.
• Expertise in system and service documentation.
• Expertise in system and service monitoring and alerting systems.
• Expertise in system and service backup and recovery systems.
• Expertise in system and service disaster recovery plans.
• Expertise in system and service capacity planning.
• Expert