Principal Site Reliability Jobs in United States, Employment

Director Of Engineering, Site Reliability

By OneStudyTeam At Remote

Experience implementing security controls for AWS environments, including setup and management of authentication controls, VPNs, KMS, etc.

Be the product manager for your vertical, defining the roadmap, requirements, goals and acceptance criteria

Learn more about our global benefits offerings on our careers site: https://careers.onestudyteam.com/us-benefits

Manage vendors, contracts and spend associated with operational infrastructure

Experience managing a team of 5+ SREs

Experience managing a global AWS footprint

Principal Reliability Engineer Jobs

By Novartis At Cambridge, MA, United States

3+ years of experience in people leadership, project management, and collaborating across boundaries

Experience in Data Management & Systems, preferably in data security

Broadly experienced specialist managing a small unit or project. May be responsible for managing others. Leads or co-leads novel projects within the team

Experience in implementing DevOps tools and practices for product and services teams

Experience handling a large volume of data

Experience with AWS and containers

Sr Site Reliability Eng.

By ENGIE Impact At United States

Strong communication and interpersonal skills with all levels of management.

Design and implement build, deployment, and configuration management.

Deploy and manage both IaaS and PaaS services in development and production.

Manage CI and CD tools with the team.

BS/MS Computer Science degree preferred, or equivalent experience.

At least four years of software engineering or site reliability engineering experience.

Principal, Site Reliability Engineer

By BNY Mellon At Lake Mary, FL

In this role, you’ll make an impact in the following ways:

To be successful in this role, we’re seeking the following:

Best Places to Work for Disability Inclusion, Disability:

Principal Site Reliability Engineer

By GoDaddy At Remote $168,000 - $252,000 a year

Process improvement, management, and development experience.

Translate core architecture and business requirements into technical cloud infrastructure solutions that consist of platform, network, software, cloud automation, security, etc.

3+ years of experience in complex distributed networking, system performance tuning, and monitoring.

Experience with CI/CD development using Kubernetes, Docker, etc.

Experience in virtualization technologies such as KVM and OpenStack.

Experience with back-end services, highly distributed and scalable services, and deployment automation.

Site Reliability Operations Engineer

By Fox Corporation At Tempe, 85283, AZ

Experience with operation and management of cloud-based services, including operational processes

Familiarity with modern operations concepts such as Agile and Incident Management

Experience with or knowledge of the broadcast industry

Operate and support live events, delivering a smooth video experience to the audience

Hands-on experience in a production / operational role

Experience using enterprise monitoring tools of any kind

Site Reliability Engineer Jobs

By University of Washington At Seattle, 98195, WA $7,554 - $11,667 a month

To request disability accommodation in the application process, contact the Disability Services Office at 206-543-6450 or [email protected].

Site Reliability Engineer Jobs

By Lawrence Berkeley National Laboratory At San Francisco Bay Area, CA $9,739 - $11,905 a month

Minimum of three years of experience with UNIX or Linux, networking, and IT infrastructure, plus management experience in a distributed-computing environment.

Knowledge of standard operating procedures and best practices for implementation and change management.

Past experience with Incident Management and a good understanding of IT service management.

Experience with network security: configuring/maintaining ACLs, knowledge of firewalls

Bachelor’s degree in Computer Science or a similar discipline, or equivalent years of experience.

Strong hands-on knowledge of the Linux shell and working in a command-line (e.g. SSH) environment.

Principal Site Reliability Engineer

By Oracle At Redwood City, 94065, CA

Develop and implement various database life-cycle management flows.

Certification of Database products for cloud integration

Participate in Product Feature Review, Certification experiments and User Document reviews.

Research and acquire skills in new technologies as needed

6-14 years of Oracle database administration experience in large production environments

Hands-on database skills, especially in database and system troubleshooting and administration

Site Reliability Engineer Jobs

By DNV At Corvallis, OR

Support web operations workflow automation using configuration management and continuous deployment frameworks

Systems engineering or DevOps experience

Good knowledge of a scripting language such as PowerShell, Bash, or Python

Experience working on cloud-based infrastructure (e.g. Azure)

Strong written and verbal English communication skills

Experience securing cloud/web applications strongly desired

Site Reliability Engineer Jobs

By Blue Yonder At Dallas, TX $88,525 - $125,575 a year

Solid understanding of large-scale applications, Cloud Observability, monitoring and fault management, and understanding of Network Architectures

Respond to technical business requirements around availability, performance, and planned maintenance activities to ensure a well-operating solution and SLA compliance.

Minimum of 5 years of strong experience developing, managing, or supporting distributed systems in a Cloud/IaaS environment, Azure preferred

Experience working with monitoring and visualization tools such as Splunk and AppDynamics

Experience coordinating between support and development teams to ensure effective delivery of monitoring services to the end-user.

Experience implementing best practices and industry standards for operational monitoring aligned to ITIL.

Site Reliability Engineer Jobs

By Motion Industries At Irondale, 35210, AL

Participates in system design, platform management and capacity planning.

Understands debugging and applies troubleshooting skills.

Cloud Services experience with Google Cloud Platform (GCP).

Experience with API, service-based or microservice-based architecture.

Architecture-level knowledge of Windows and Linux and Infrastructure systems.

Experience with production deployment, monitoring and operational support for enterprise-class applications (Dynatrace a plus).

Site Reliability Engineer Jobs

By JPMorgan Chase Bank, N.A. At Jersey City, 07310, NJ Up to $200,000 a year

Minimum education and experience required:

Site Reliability Engineer Jobs

By Sezzle At Minneapolis, MN $75,000 - $90,000 a year

Maintain and develop monitoring and alerting solutions to improve the on-call experience

Bachelor's in computer science (preferred) or equivalent related experience

Basic knowledge of a Microservice Architecture

Basic knowledge of AWS, Kubernetes, Docker

Knowledge of Relational Databases, SQL and ORM technologies

Collaborative workspace, commuter benefits, fully stocked kitchen, weekly lunches and much more!

Sr. Site Reliability Engineer

By Rockset At San Mateo, CA $140,000 - $185,000 a year

Experience with Terraform, Salt, Chef, Packer, or similar configuration management tools

Willing to learn new skills and technologies

Bachelor's or Master's degree in Computer Science or a related field, or relevant work experience

Experience as an SRE for 3+ years

Experience building and operating public-facing 24x7 web applications at scale

Experience working with cloud infrastructure and patterns (AWS preferred)

Site Reliability Engineer Jobs

By Cogito Corporation At , $94,000 - $110,000 a year

Experience with configuration management tools, such as Ansible, Chef, Puppet, etc.

Experience with Kubernetes networking and knowledge of how traffic flows within pods, load balancers, and the internet

Proven experience with designing, building, securing, and managing Kubernetes at scale

Experience with Continuous Integration / Delivery tooling, such as Jenkins, Travis CI, CircleCI, etc.

Cross-functional collaboration skills and relationship-building skills are a must

Design and reimplement Cogito's core infrastructure in a containerized, cloud-native manner, making use of Docker, Kubernetes, Istio, and AWS infrastructure

Site Reliability Engineer Jobs

By Autodesk At Atlanta, GA $109,400 - $188,760 a year

Use modern administration tools like Docker, Terraform, AWS CloudFormation/CDK to manage and deploy containers and virtual machines

Collaborate with stakeholders to understand requirements, understand use cases and build towards a cohesive technical strategy

Experience in large-scale cloud-based production infrastructure (AWS preferred)

Expert experience with Docker and other container technology

Strong experience with Log Analysis and Monitoring tools such as CloudWatch, Splunk, New Relic, Grafana

Experience with any relevant language (Python, JavaScript, Ruby, Rust, Bash, etc.)

Staff Site Reliability Engineer

By Collective Health At San Mateo, 94401, CA $140,000 - $210,000 a year

Expertise in management and use of relational databases.

10+ years of work experience in DevOps, Site Reliability Engineering, or Software Engineering.

Experience creating and monitoring SLIs and SLOs in order to set and remain within error budgets.

Experience in supporting customer-facing production systems and responding to incidents as part of an on-call rotation.

Knowledge of data structures, algorithms, distributed systems, and information retrieval.

Experience in diagnosing and resolving incidents that involve application, OS, network, infrastructure, partners, people, and process.

Site Reliability Engineer Jobs

By ASAPP At New York, 10013, NY $140,000 - $194,000 a year

Knowledge of AWS services, containers and container management frameworks

4+ years of relevant experience bringing software to production at high scale

BS or MS degree in the Computer Science field, or equivalent hands-on experience.

Experience in product-oriented environments

Work with product engineering teams on service architecture and implementation

Deliver configuration as code and automate everything

AWS Site Reliability Engineer

By Derivative Path At Remote

Excellent communication, organizational and time-management skills

Work closely with architects, software engineers, quality engineers, product owners, and management to design scalable, robust systems using cloud architecture

Participate in system design consulting, platform management, and capacity planning

Proficient with AWS; certification preferred

Prior experience within the Capital Markets, Financial Services, and IT & Services industries

Design and implement fully automated CI/CD Pipelines using industry tools

Are you looking for an opportunity to make a real impact on the reliability and scalability of a product? We are looking for a Principal Site Reliability Engineer to join our team and help us build and maintain a reliable and scalable platform. You will be responsible for ensuring the availability and performance of our services, as well as developing and implementing strategies to improve system reliability. If you are passionate about technology and have a strong background in system engineering, we want to hear from you!

What skills are required for a Principal Site Reliability job?

• Expertise in system architecture, system design, and system engineering

• Knowledge of cloud computing, distributed systems, and DevOps

• Ability to troubleshoot complex systems and identify root causes

• Experience with automation and scripting languages such as Python, Bash, and PowerShell

• Knowledge of monitoring and logging tools such as Splunk, ELK, and Prometheus

• Understanding of networking protocols and technologies

• Ability to work in a fast-paced environment

What qualifications are required for a Principal Site Reliability job?

• Bachelor’s degree in Computer Science, Information Technology, or related field

• 5+ years of experience in system engineering, system architecture, or related field

• Experience with cloud computing platforms such as AWS, Azure, or GCP

• Experience with container technologies such as Docker and Kubernetes

• Experience with configuration management tools such as Chef, Puppet, or Ansible

• Experience with automation and scripting languages such as Python, Bash, and PowerShell

• Knowledge of monitoring and logging tools such as Splunk, ELK, and Prometheus

What knowledge is required for a Principal Site Reliability job?

• Knowledge of system architecture, system design, and system engineering

• Knowledge of
