Remote Site Reliability Engineer Jobs in United States, Employment

Site Reliability Engineer Jobs

By Ascendion At Alpharetta

Knowledge of the cloud and managed services such as MS Flex Server or AWS RDS.

Strong experience as a database administrator.

Strong experience in PostgreSQL and/or MySQL.

Automation skills in Bash, Golang, or Python are a plus.

Knowledge of IaC and CI/CD tools such as Terraform and GitHub Actions is a plus.

Experience in query optimization and performance improvement.

Sr. Software Engineer- Site Reliability (Remote)

By Home Depot / THD At Atlanta, 30301 $160,000 a year

Knowledge of configuration management tools (e.g., Ansible, Puppet, or Chef)

This position typically reports to Software Engineer Manager or Sr. Manager

2-4 years of relevant work experience

Experience with cloud platforms (e.g., AWS, Azure, or GCP)

Experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack)

Knowledge of version control systems (e.g., Git)

Site Reliability Engineer - Remote

By Sheetz At Claysburg, 16625

(Equivalent combinations of education, licenses, certifications and/or experience may be considered)

Responsible for the availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning for their assigned system(s).

A four-year degree in Computer Science, Management Information Systems, or Computer Engineering is preferred.

6 years of applicable experience in a technology environment, preferably with time spent in an engineering capacity, is required.

Coding experience beyond simple scripts is required.

A four-year degree that includes courses or training in computer programming, systems analysis, system development, or systems engineering is required.

Site Reliability Engineer II - Remote

By Akamai At Remote $93,656 - $140,803 a year

Defining requirements as part of the product lifecycle to influence the new designs and standards

Have 2 years of relevant experience and a Bachelor's degree or its equivalent

Have proven experience as a systems performance/site reliability or DevOps engineer

Have experience working with NoSQL databases, such as Cassandra or Redis

Have experience with orchestration tools, e.g., Chef and/or Ansible

Join our highly skilled Security team

Lead SRE (Site Reliability Engineer)

By Concentrix At Remote

Team lead experience with offshore resources

Expected experience even if not deep in these areas:

Nice to have experience (not required):

Ability to create structure and process for a greenfield dev team

React.js & responsive web app dev

DevOps & CI/CD: specific tooling is related to full-stack Java and automation

CDN Site Reliability Engineer (L5) - Open Connect

By Netflix At Remote

Knowledge of and proven experience with CDNs and HTTP cache/proxy technologies

Service Reliability/Operational experience running large-scale, high-performance systems and internet services with a focus on security and reliability

Expert-level knowledge of Unix or Linux system administration at scale. We happen to use FreeBSD.

Knowledge of networking concepts and application protocols, especially TCP/IP, BGP, HTTP/S and DNS

Experience with distributed analytic processing technologies (Hive, Presto/Trino, Spark SQL, etc.)

Some experience with container and container orchestration technologies (Docker, Kubernetes)

Sr. Site Reliability Engineer

By eHealth At Remote $113,500 - $141,900 a year

A security certification and/or knowledge of DevSecOps would be a plus

5+ years of experience as a systems engineer or SRE (DevOps culture)

Strong Linux skills and excellent skills in one major programming language (Python or Java would be great).

Hands-on experience implementing and maintaining a container stack with all the security and compliance considerations.

Experience managing hybrid infrastructure and configuration using tools like Terraform, Ansible, and Puppet.

Understanding of CI/CD and experience with Jenkins, Pipeline as code

Site Reliability Engineer Jobs

By eBay At San Jose, 95125, CA $168,400 - $262,900 a year

Develop automation systems for implementing eBay Traffic management

Manage eBay’s traffic infrastructure including SLB, CDN, etc.

Solid programming experience in languages like Golang, Java, C/C++

Experience with Kubernetes and Docker is a must

Experience working with public cloud is a plus

Experience with software load balancers (IPVS, Envoy, Istio, Cilium, etc.) is a plus

Cloud Senior Site Reliability Engineer

By Bank of America At New York, NY

Perform deep dives into systemic and latent reliability issues, incident management, problem management

Understanding of cost management, inventory management, FinOps model

Identifying, analyzing, and resolving infrastructure vulnerabilities and application deployment issues.

Evaluating and automating the scaling and capacity requirements within Azure environments

BS/MS degree in Computer Science or related technical field involving systems, or equivalent practical experience.

Minimum 8 years of hands-on experience maintaining cloud platforms on a major cloud service provider.

Reliability & Maintainability Engineer (R&M) (Remote) - Huntsville, AL

By Davidson Technologies, Inc. At Remote

Ensure the associated requirements and tasks are properly flowed through specifications and SOWs

Primary focus will be developing Failure Modes and Effects Analysis (FMEA)/Critical Items List (CIL) as defined in SLS SOW paragraph 5.8.2.

Perform all tasks in accordance with SLS-RQMT-014 and SLS-RQMT-016

Participate in the R&M working group, R&M team meetings, and customer meetings as required

All hardware failure modes will be considered in the analysis

For each postulated failure mode, potential failure causes will be identified and documented

Site Reliability Engineer - Remote

By Regal Rexnord At Morehead, 40351, KY

Experience with and understanding of DevOps, Cloud Resiliency, Performance Engineering, Release Engineering, Application Performance Management and Capacity Planning, Caching, JavaScript, and .NET

Responsible for Application Performance Monitoring tool administration and management by monitoring availability and taking a holistic view of system health

Reduce organizational ‘toil’ via automation, scripting, and implementation and management of toolsets

Understand business / technical requirements and the overall business objectives of applications

2+ years of experience in software application development or test automation

5+ years of Performance Engineer or related experience with high-traffic, large-scale distributed systems, client-server architectures both on-prem and cloud (Primarily Azure)

Site Reliability Engineer (SRE) - Mid/Senior

By Vanilla Technologies Inc. At Remote

Project management tools such as Jira, Git, and Confluence

Accounting for and addressing software vulnerabilities

Securing infrastructure, applications, and code

Ensuring high SLA for uptime & security

Quick, continuous automation and deployment of updates

Preserving infrastructure and stability of code

Site Reliability Engineer, Netflix Technology

By Netflix At Remote

Experience with incident management and response

Improve our incident management lifecycle to identify, mitigate, and learn from reliability risks

Reads signals in aggregate to develop deeper insights into the quality of experience for our users to help inform business decisions

Experience with complex sociotechnical systems and their successful operations at scale

Experience conducting blame-aware incident reviews

Strong analytical and problem-solving skills

Site Reliability Engineer, Systems

By Anthropic At San Francisco, CA

Automate operations and infrastructure management

Have significant experience with Kubernetes and cloud-native infrastructure

Have strong communication skills to work with a range of technical and non-technical colleagues

Python and Linux SysAdmin skills

Significant experience with Kubernetes architecture and administration

Strong Linux skills and cloud infrastructure expertise

Site Reliability Engineer (L4/5) - Core

By Netflix At Los Gatos, CA

Experience in risk management and/or analysis

Improve our incident management lifecycle to identify, mitigate, and learn from reliability risks

Read signals and metrics to develop deeper insights into our customers’ quality of experience to help inform business decisions

Strong writing and presentation skills

Development experience with Java, JavaScript/Node.js, Python, Go

Knowledge of cloud platforms (e.g., AWS, GCP) and microservices architecture

Director Of Engineering, Site Reliability

By OneStudyTeam At Remote

Experience implementing security controls for AWS environments, including setup and management of authentication controls, VPNs, KMS, etc.

Be the product manager for your vertical, defining the roadmap, requirements, goals and acceptance criteria

Learn more about our global benefits offerings on our careers site: https://careers.onestudyteam.com/us-benefits

Manage vendors, contracts, and spend associated with operational infrastructure

Experience managing a team of 5+ SREs

Experience managing a global AWS footprint

Sr. Site Reliability Engineer

By CCC At Chicago, IL

Experience preparing and presenting operational artifacts to senior management

Gain and disseminate knowledge of our complex applications

2+ years of experience working with the Azure tech stack in a production capacity

5+ years of operational experience working with Microsoft technologies

Comfort and experience with an Ops environment that is scaling rapidly.

Knowledge of Virtualization, Cloud Infrastructure and APIs

Site Reliability Engineer Jobs

By Nike At Beaverton, OR, United States

This overview explains our hiring process for corporate roles. Note there may be different hiring steps involved for non-corporate roles

Site Reliability Engineer (SRE) - $700,000

By Thurn Partners At New York, NY, United States

3+ years' experience in a similar software engineering or site reliability engineering position

Experience with SQL database operations

Experience with Kafka, CI/CD pipelines, and virtualisation is a bonus

Beautiful office space with generous overall benefits package

Proficiency with either Python or Golang

Extremely competitive compensation including performance bonuses

Senior Site Reliability Engineer

By NVIDIA At California, United States

BS degree in Computer Science or a related technical field involving coding (e.g., physics or mathematics), or equivalent experience

Technical leadership beyond development that includes scoping, requirements capturing, leading and influencing multiple teams of engineers on broad development initiatives.

Experience with the ELK and Prometheus stacks as a power user and administrator.

Prior experience driving production issues and helping with on-call support.

Experience with CUDA, PyTorch, TensorRT, TensorFlow, and/or Triton.

Experience with StackStorm and similar automation platforms is a bonus.

Are you looking for a challenging and rewarding role as a Remote Site Reliability Engineer? We are looking for a talented individual to join our team and help us ensure our systems are reliable and secure. You will be responsible for monitoring, troubleshooting, and resolving issues with our systems, as well as developing and implementing strategies to improve system performance. If you have a passion for technology and a desire to make a difference, this is the job for you!

Overview:

A Remote Site Reliability Engineer is responsible for ensuring the reliability, availability, and scalability of a company’s remote systems and services. This role requires a combination of technical and operational skills to ensure that the remote systems are running optimally and securely. The Remote Site Reliability Engineer will work with the development, operations, and security teams to ensure that the remote systems are reliable, secure, and available.

Detailed Job Description:

The Remote Site Reliability Engineer will be responsible for the following tasks:

• Design, implement, and maintain remote systems and services.

• Monitor and troubleshoot remote systems and services.

• Develop and maintain automation and configuration management systems.

• Develop and maintain security policies and procedures.

• Develop and maintain system and service performance metrics.

• Develop and maintain system and service availability metrics.

• Develop and maintain system and service scalability metrics.

• Develop and maintain system and service reliability metrics.

• Develop and maintain system and service security metrics.

• Develop and maintain system and service documentation.

• Develop and maintain system and service monitoring and alerting systems.

• Develop and maintain system and service backup and recovery systems.

• Develop and maintain system and service disaster recovery plans.

• Develop and maintain system and service capacity planning.

• Develop and maintain system and service performance tuning.

• Develop and maintain system and service patching and upgrades.

• Develop and maintain system and service security hardening.

• Develop and maintain system and service change management processes.

• Develop and maintain system and service incident response plans.

• Develop and maintain system and service root cause analysis processes.

What Skills Are Required for a Remote Site Reliability Engineer Job?

• Expertise in remote systems and services.

• Expertise in automation and configuration management systems.

• Expertise in security policies and procedures.

• Expertise in system and service performance metrics.

• Expertise in system and service availability metrics.

• Expertise in system and service scalability metrics.

• Expertise in system and service reliability metrics.

• Expertise in system and service security metrics.

• Expertise in system and service documentation.

• Expertise in system and service monitoring and alerting systems.

• Expertise in system and service backup and recovery systems.

• Expertise in system and service disaster recovery plans.

• Expertise in system and service capacity planning.

• Expert
