Sr. Site Reliability Engineer - Remote US
By SitusAMC, Remote, $100,000 - $125,000 a year

SitusAMC is where the best and most passionate people come to transform our clients’ businesses and their own careers. Whether you’re a real estate veteran, a passionate technologist, or looking to ...

Site Reliability Engineer II - Remote
By Akamai, Remote, $93,656 - $140,803 a year
Defining requirements as part of the product lifecycle to influence the new designs and standards
Have 2 years of relevant experience and a Bachelor's degree or its equivalent
Have proven experience as a systems performance/site reliability or DevOps engineer
Have experience working with NoSQL databases, such as Cassandra or Redis
Have experience with orchestration tools, e.g., Chef and/or Ansible
Join our highly skilled Security team
Lead SRE (Site Reliability Engineer)
By Concentrix, Remote
Team lead experience with offshore resources
Expected experience even if not deep in these areas:
Nice to have experience (not required):
Ability to create structure and process for a greenfield dev team
React.js & responsive web app dev
DevOps & CI/CD: specific tooling is related to full-stack Java and automation
CDN Site Reliability Engineer (L5) - Open Connect
By Netflix, Remote
Knowledge of and proven experience with CDNs and HTTP cache/proxy technologies
Service Reliability/Operational experience running large scale, high performance systems & internet services with focus on security and reliability
Expert-level knowledge of Unix or Linux system administration at scale. We happen to use FreeBSD
Knowledge of networking concepts and application protocols, especially TCP/IP, BGP, HTTP/S and DNS
Experience with distributed analytic processing technologies (Hive, Presto/Trino, Spark SQL, etc)
Some experience with container and container orchestration technologies (Docker, Kubernetes)
Sr. Site Reliability Engineer
By eHealth, Remote, $113,500 - $141,900 a year
A security certification and/or knowledge of DevSecOps would be a plus
5+ years of experience as a systems engineer or SRE (DevOps culture)
Strong Linux skills and excellent skills in one major programming language (Python or Java would be great).
Hands-on experience implementing and maintaining a container stack with all the security and compliance considerations.
Experience managing Hybrid infrastructure and configuration using tools like Terraform, Ansible and Puppet.
Understanding of CI/CD and experience with Jenkins, Pipeline as code
Site Reliability Engineer, Netflix Technology
By Netflix, Remote
Experience with incident management and response
Improve our incident management lifecycle to identify, mitigate, and learn from reliability risks
Reads signals in aggregate to develop deeper insights into the quality of experience for our users to help inform business decisions
Experience with complex sociotechnical systems and their successful operations at scale
Experience conducting blame-aware incident reviews
Strong analytical and problem-solving skills
Site Reliability Engineer (SRE)
By Luxoft, Remote
5+ years of experience administering Linux and at least 2 years supporting production environments;
Fluent developer skills in any popular programming language (C++, Python, Java, or Go; Java is preferred);
Experience with designing large-scale distributed solutions along with their capacity planning;
Experience with monitoring and alerting tools like Grafana, Datadog, Prometheus, etc.;
Strong knowledge of virtualization and containerization principles including orchestration tools;
Experience with relational and NoSQL DBMS
Backend Engineer (Site-Reliability) Jobs
By Terraform Labs, Remote
In-depth knowledge of database management systems, including relational databases (e.g., MySQL, PostgreSQL) and NoSQL databases (e.g., Cassandra, MongoDB, Redis).
Collaborate with cross-functional teams to understand requirements and translate them into technical designs and implementation plans.
An interest in DeFi, or background in finance / Fintech
3+ years of professional work experience
Proven experience as a Backend Software Engineer, with a focus on site reliability and DevOps.
Experience with containerization and orchestration technologies such as Docker and Kubernetes.
Site Reliability Engineer II
By Exact Sciences Corporation, Remote, $82,000 - $130,000 a year
Support and comply with the company’s Quality Management System policies and procedures.
3+ years of experience in systems engineering
3+ years of work and/or formal classroom experience with modern application design and cloud environments
3+ years of work and/or formal classroom experience working with software development and operations teams
1+ years of experience developing highly available systems architecture using modern technologies.
AWS Solutions Architect, AWS SysOps Administrator, or AWS Developer certification.
Site Reliability Engineer - Kubernetes
By Avantage Entertainment, Remote, $115,000 - $130,000 a year
Strong detail orientation, time management skills, dependability, and flexibility required (our team spans at least 12 time zones).
Support our DevOps team with management of application deployments using GitOps tooling in the Kubernetes environment.
Proactively researches new capabilities and trends and reports findings to senior leadership.
Bachelor's degree in computer science or equivalent occupational experience.
Experience in an AWS or other cloud environment.
In-depth experience in Kubernetes (Red Hat OpenShift preferred).
Principal Site Reliability Engineer
By GoDaddy, Remote, $168,000 - $252,000 a year
Process improvement, management, and development experience.
Translate core architecture and business requirements into technical cloud infrastructure solutions that consist of platform, network, software, cloud automation, security, etc.
3+ years of experience in complex distributed networking, system performance tuning, and monitoring.
Experience with CI/CD development using Kubernetes, Docker, etc.
Experience in virtualization technologies such as KVM and OpenStack.
Experience with back-end services, highly distributed and scalable services, and deployment automation.
Site Reliability Engineer (SRE)
By Synchronoss Technologies, Remote
Proven ability to deliver a superior operations support experience working directly with corporate clients’ technology teams and associated change management.
Experience with monitoring tools such as Prometheus, Thanos, and Grafana
Experience with Terraform and Ansible.
Experience with Cloud platforms such as AWS
Excellent verbal, written and analytical skills, with the ability to tailor communication to the intended audience.
Experience working with ticketing systems.
Site Reliability Engineer (SRE)
By Synchronoss Technologies, Remote
Experience with configuration management automation tools (Chef or Puppet).
Deploy and manage Kubernetes (EKS) based Docker applications in AWS/OCI.
Solid experience in building a solution on AWS or Oracle Cloud or other public cloud services using Terraform.
Knowledge in Infrastructure monitoring tools (ELK stack, Prometheus, Grafana, or similar)
Knowledge of AWS/OCI best practices. Keen to learn new technologies and flexible about working on new platforms, environments, and models such as Agile/Scrum.
Excellent written and verbal skills.
Site Reliability Engineer - Cloud Infrastructure
By Lambda, Remote, $147,000 - $229,000 a year
Have experience with configuration management and infrastructure-as-code tooling
Experience building and maintaining internal tools and infrastructure (secrets management, artifact storage, CI/CD platforms, identity management)
Build abstractions that simplify and unify the management of development and staging environments
Work with multiple engineering teams to gather requirements and translate them into tooling and infrastructure projects
Have experience building and maintaining CI/CD pipelines
Have experience deploying and monitoring infrastructure in public cloud environments
SRE - Site Reliability Engineer (Ambra Team)
By Intelerad, Remote
Experience with Systems Lifecycle Management Products (Foreman, Katello, RedHat Satellite)
Demonstrated knowledge of configuration management tools like Puppet, Chef and Ansible
Own system designs, documentation, platform management, and capacity planning for Enterprise Imaging Systems in your area of responsibility
University or college education in science, technology, engineering, or equivalent industry experience
Build software and systems to manage platform infrastructure and applications
Excellent verbal and written communication skills and ability to communicate technical subjects to a broad range of stakeholders
Site Reliability/DevOps Engineer
By Axoni, Remote
Experience with automation and configuration management tools (Terraform, Ansible, Salt, Chef, Puppet)
Experience troubleshooting issues on a remote distributed system
Manage and configure all pre-production, production, and client facing infrastructure
Coordinate with the Applications team to satisfy all non-functional project requirements (security, performance, scalability, and resiliency)
Experience with at least one of the following scripting languages: Bash and/or Python
Experience with Docker (Docker Compose, YAML files, etc.)
AWS Site Reliability Engineer
By Derivative Path, Remote
Excellent communication, organizational and time-management skills
Work closely with architects, software engineers, quality engineers, product owners, and management to design scalable, robust systems using cloud architecture
Participate in system design consulting, platform management, and capacity planning
Proficient with AWS; certification preferred
Prior experience within Capital Markets, Financial Services, and IT & Services
Design and implement fully automated CI/CD Pipelines using industry tools
Software Engineer, Site Reliability
By Packback Inc, Remote, $108,000 - $140,000 a year
2+ years of DevOps experience using Docker and Kubernetes
Experience with CI/CD pipelines, containerization, and orchestration
Experience reviewing code to both give and receive constructive feedback.
Experience with Helm and Terraform
Experience working on highly scalable cloud infrastructures
Startup or small company experience
Senior Site Reliability Engineer
By Lumin Digital, Remote, $170,000 - $200,000 a year
Expert-level knowledge of at least one configuration management system (Chef, Ansible, Puppet, etc.).
Exceptional full stack and environment troubleshooting skills.
Exceptional written and verbal communication skills.
Experience with a microservice architecture running in containers (Docker or other containerization technology).
Experience with Terraform and Kubernetes
2+ years of experience as a software engineer. C#, Angular, JavaScript preferred.
Site Reliability Engineer (SRE) - Evening Shift
By Brightspot, Chicago, $100,000 - $115,000 a year
Automate manual tasks and build tools for system monitoring, deployment, and configuration management.
2+ years of relevant experience in Cloud Operations
Proven troubleshooting and problem-solving skills in a cloud-based application environment
Outstanding communication skills with the ability to work in a client-facing role
Monitor the availability, performance, and reliability of our systems and applications during the evening shift.
Investigate and resolve incidents, troubleshooting any issues that arise and ensuring prompt resolution to minimize downtime.

Are you looking for an opportunity to join a fast-paced and innovative team as a Principal Site Reliability Engineer? We are looking for a highly motivated individual to join our team and help us build and maintain reliable, secure, and scalable systems. You will be responsible for developing and implementing strategies to ensure the availability, performance, and security of our systems. If you have a passion for technology and a drive to make a difference, this is the job for you!

Overview:

A Principal Site Reliability Engineer is responsible for ensuring the reliability, availability, and scalability of a company’s IT infrastructure. They are responsible for developing, implementing, and maintaining systems and processes that ensure the highest levels of performance and reliability. They must be able to troubleshoot and resolve complex technical issues quickly and efficiently.

Detailed Job Description:

The Principal Site Reliability Engineer is responsible for designing, developing, and maintaining systems and processes that ensure the highest levels of performance and reliability. They must be able to troubleshoot and resolve complex technical issues quickly and efficiently. They must be able to identify potential problems and develop solutions to prevent them from occurring. They must be able to work with other teams to ensure that the systems and processes are properly implemented and maintained. They must be able to provide technical guidance and support to other teams.

What Skills Are Required for a Principal Site Reliability Engineer Job?

• Strong technical knowledge of IT infrastructure, including hardware, software, and networking
• Knowledge of system and process design
• Knowledge of system and process automation
• Knowledge of system and process monitoring
• Knowledge of system and process optimization
• Knowledge of system and process security
• Knowledge of system and process scalability
• Knowledge of system and process troubleshooting
• Ability to work independently and as part of a team
• Ability to work under pressure and meet deadlines
• Excellent problem-solving and analytical skills
• Excellent communication and interpersonal skills

What Qualifications Are Required for a Principal Site Reliability Engineer Job?

• Bachelor’s degree in Computer Science, Information Technology, or related field
• 5+ years of experience in IT infrastructure, system and process design, system and process automation, system and process monitoring, system and process optimization, system and process security, system and process scalability, and system and process troubleshooting
• Experience with cloud technologies such as AWS, Azure, or GCP
• Experience with scripting languages such as Python, Bash, or PowerShell
• Experience with configuration management tools such as Chef, Puppet, or Ansible
• Experience with monitoring tools such as Nagios, Zabbix, or Splunk
• Experience with container technologies such as Docker or Kubernetes

What Knowledge Is Required for a Principal Site Reliability Engineer Job?

• Knowledge of IT infrastructure, including hardware, software, and networking
• Knowledge of system and process design
• Knowledge of system and process automation
• Knowledge of system and process monitoring
• Knowledge of system and process optimization
• Knowledge of system