Principal Site Reliability Engineer Jobs in Cass, Texas , Employment

Sr. Software Engineer- Site Reliability (Remote)

By Home Depot / THD At , Atlanta, 30301 $160,000 a year

Knowledge of configuration management tools (e.g., Ansible, Puppet, or Chef)

This position typically reports to Software Engineer Manager or Sr. Manager

2-4 years of relevant work experience

Experience with cloud platforms (e.g., AWS, Azure, or GCP)

Experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack)

Knowledge of version control systems (e.g., Git)

Site Reliability Engineer, Infrastructure

By FullStory At , Atlanta, Ga

Be a go-to person for one or more areas such as infrastructure, configuration management, or automation.

Grow your family. We offer a global fertility and family building benefit that encompasses all journeys to growing your family.

Design, create, and manage high-performance infrastructure and tooling for our external and internal services.

Experience with one or more of the following programming languages: Golang, Python, C++, etc.

Experience architecting and maintaining systems in a public cloud environment. (e.g., GCP, AWS, Azure or similar)

Experience with modern metrics, monitoring, and logging frameworks and services. (e.g., Prometheus, Grafana, Stackdriver)

Site Reliability Engineer Jobs

By Autodesk At , Atlanta, Ga $109,400 - $188,760 a year

Use modern administration tools like Docker, Terraform, AWS CloudFormation/CDK to manage and deploy containers and virtual machines

Collaborate with stakeholders to understand requirements, understand use cases and build towards a cohesive technical strategy

Experience in large-scale cloud-based production infrastructure (AWS preferred)

Expert experience with Docker and other container technology

Strong experience with Log Analysis and Monitoring tools such as CloudWatch, Splunk, New Relic, Grafana

Experience with any relevant language (Python, JavaScript, Ruby, Rust, Bash, etc.)

Site Reliability Engineer (Remote)

By Home Depot / THD At , Atlanta, 30301, Ga $130,000 a year

This position typically reports to Software Engineer Manager or Sr. Manager

Demonstrable knowledge of Linux systems, TCP/IP, HTTP, and multi-tier web application architectures

Excellent written and interpersonal communication and documentation skills

Practical knowledge of various aspects of service design, including application protocols, caching strategies, and software design principles

Practical, solid knowledge of shell scripting, Java and at least one systems programming language (Go preferred)

BS in Computer Science or equivalent experience

Senior Software Engineer - Site Reliability (Remote)

By Home Depot / THD At , Atlanta, 30301, Ga $180,000 a year

This position typically reports to Software Engineer Manager or Sr. Manager

2-4 years of relevant work experience

Experience with security frameworks for user and services authorization and authentication

Experience with creating and executing unit, functional, destructive and performance tests

Experience with modern debugging and root cause analysis techniques

Experience with version control system

Site Reliability Engineer Jobs

By Blue Yonder At Dallas, TX, United States

Solid understanding of large-scale applications, Cloud Observability, monitoring and fault management, and understanding of Network Architectures

Respond to technical business requirements around availability, performance, and planned maintenance activities to ensure a well-operating solution and SLA compliance.

Strong experience of min 5 years’ experience developing, managing, or supporting distributed systems in a

Experience working with monitoring and visualization tools such as Splunk and AppDynamics

Experience coordinating between support and development teams to ensure effective delivery of monitoring services to the end-user.

Experience implementing best practices and industry standards for operational monitoring aligned to ITIL.

Senior Site Reliability Engineer

By Dremio At , Seattle, Wa $166,304 - $225,000 a year

Have moderate-advanced experience in Python/Go, and at least reading knowledge of Java.

10+ years of relevant experience in the following areas: SRE, DevOps, Distributed Systems, Cloud Operations, Software Engineering.

Have a systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive.

Hands-on experience with large-scale production Kubernetes clusters (<=1000 nodes).

Hands-on experience using Honeycomb for OpenTelemetry trace analysis.

Drive continuous improvements to our usage of Kubernetes, our Operators, and the GitOps deployment paradigm.

Senior Site Reliability Engineer

By Abbott Laboratories At , Abbott Park, Il

Experience with Microsoft Azure DevOps, Release Management Tools

Experience with Windows Server Configuration Management

Experience in IIS Configuration Management

EDUCATION AND EXPERIENCE YOU’LL BRING

Produces and Manages Infrastructure as code

Manages Development, QA, and Production environment configuration

Site Reliability Engineer Iii

By JPMorgan Chase Bank, N.A. At , Plano, Tx

Required qualifications, capabilities, and skills

Formal training or certification on site reliability engineering concepts and 3+ years applied experience

Guides and assists others in the areas of building appropriate level designs and gaining consensus from peers where appropriate

Collaborates with other software engineers and teams to design and implement deployment approaches using automated continuous integration and continuous delivery pipelines

Collaborates with other software engineers and teams to design, develop, test, and implement availability, reliability, scalability, and solutions in their applications

Implements infrastructure, configuration, and network as code for the applications and platforms in your remit

Site Reliability Engineer Jobs

By Motorola Solutions At , Allen, 75002, Tx

Company Overview At Motorola Solutions, we believe that everything starts with safety. It’s the constant that empowers people to confidently move forward. It can fill a flight or sell out a stadium. ...

Sr Site Reliability Engineer

By Tesla At , Austin, Tx

Deploy, configure, manage, and automate CI/CD pipelines using Jenkins, Github Actions and Git for version control.

5+ years’ experience working in a manufacturing or material flow setting.

5+ years’ experience integrating manufacturing or material flow systems.

5+ years’ experience in a high-level language such as Go, Python and/or Java.

5+ years’ experience with SQL (MySQL, Postgres, MSSQL)

5+ years’ experience with Docker and Kubernetes.

Site Reliability Engineer Jobs

By Visionary Recruiting Solutions At Corpus Christi, TX, United States

Completes Management of Change where appropriate

Provides input to a Risk Management Plan to anticipate reliability-related and non-reliability-related risks that could adversely impact plant operation.

Provides technical support to production, maintenance management, and technical personnel.

Identify training needs to maintain the required skills and knowledge to perform the job to the necessary standard.

Three years of experience as Reliability Engineer required; in the chemical industry preferred.

Certifications in Six Sigma (Green Belt, Lean) preferred.

Site Reliability Operations Engineer

By Fox Corporation At , Tempe, 85283, Az

Experience with operation and management of cloud-based services, including operational processes

Familiarity with modern operations concepts such as Agile and Incident Management

Experience / knowledge of the Broadcast industry

Operate and support live events, delivering smooth video experience to the audience

Hands on experience in a production / operational role

Experience using enterprise monitoring tools of any kind

Site Reliability Engineer-Remote Jobs

By Dynata At , Plano, 75024, Tx $90,000 - $112,000 a year

Experience with configuratioon management tools like Chef, Puppet, or Ansible

Learning Management System available through the Intranet providing free access to nearly 500 online training modules and personal development programs

Previous experience in an SRE or related role: DevOps, platform engineering, software engineering

Experience with distributed / highly available systems architecture, theory and practice.

Experience with an infrastructure-as-code tool (terraform, cloudformation, etc) [tf preferred]

Previous experience building and maintaining production systems in the cloud (AWS preferred)

Site Reliability Engineer Jobs

By Blue Yonder At , Dallas, Tx $88,525 - $125,575 a year

Solid understanding of large-scale applications, Cloud Observability, monitoring and fault management, and understanding of Network Architectures

Respond to technical business requirements around availability, performance, and planned maintenance activities to ensure a well-operating solution and SLA compliance.

Strong experience of min 5 years’ experience developing, managing, or supporting distributed systems in a Cloud/IaaS environment, Azure preferred

Experience working with monitoring and visualization tools such as Splunk and AppDynamics

Experience coordinating between support and development teams to ensure effective delivery of monitoring services to the end-user.

Experience implementing best practices and industry standards for operational monitoring aligned to ITIL.

Staff Site Reliability Engineer

By Procore Technologies At , Austin, Tx $136,000 - $187,000 a year

Bachelor’s Degree in Computer Science or a related field is preferred, or comparable work experience

8+ years of industry experience as an SRE or Software Engineer

Experience supporting and working with cross-functional teams in a dynamic environment

Strong oral and written communication skills

2+ years of experience working with Ruby on Rails

Provide technical efforts around building a robust and scalable observability pipeline to support billions of events

Site Reliability Engineer - Entry Level (Technology Rotational Development Program)

By Equifax At Alpharetta, GA, United States

Bachelor’s Degree in Computer Science, Information Technology, Project Management, or equivalent field; Completion of coursework by May 2024.

Ability to gain experience by cross-training in the various areas within the Technology organization and other key related functions.

Excellent leadership, teamwork and service skills.

Excellent oral and written communication skills.

Experienced working with and developing with Java

Exposure/knowledge of cloud technologies (Google Cloud Platform (GCP), Amazon Web Services (AWS), or Azure)

Aws Site Reliability Engineer

By Zeektek At United States

Help set up and manage our AWS EKS environment.

Help set up and manage our GitLab CI/CD pipeline.

Can engage and manage the heterogenous CI/CD and deployment environments of the teams we collaborate with

Site Reliability Engineer, DevOps manager

1.5+ years experience in SRE/DevOps or equivalent role

Work with other teams to assist in deploying our microservices and code into their environments (on prem and AWS)

Sr. Site Reliability Engineer - Remote Us

By SitusAMC At , Remote $100,000 - $125,000 a year

SitusAMC is where the best and most passionate people come to transform our client’s businesses and their own careers. Whether you’re a real estate veteran, a passionate technologist, or looking to ...

Saas Site Reliability Engineer And Automation Developer

By Siemens Digital Industries Software At , Costa Mesa, 92627 $116,900 - $210,400 a year

Develop and maintain automation tools, scripts, and frameworks to streamline deployment, configuration management, and monitoring processes.

Design and implement infrastructure solutions using configuration management tools, such as Ansible, Puppet, or Chef.

Proficiency in automation and configuration management tools (e.g., Ansible, Puppet, Chef).

In-depth knowledge and hands-on experience with cloud platforms such as AWS, Azure, or Google Cloud and their scalability features.

Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent work experience).

Strong programming skills in languages such as Python, Go, or Ruby.

Site Reliability Engineer Jobs

By Adobe At , Lehi, 84043 $92,100 - $161,000 a year

What you need to succeed:

An understanding of SRE standard methodologies:

Infrastructure Site Reliability Engineer

By CVS Health At , Hartford $75,400 - $162,700 a year

A year or more experience with incident management, performance monitoring, and capacity planning tools.

Multiple years’ demonstrated proficiency in at least one configuration management tool such as Ansible, Puppet, or Chef.

Minimum of 5 years of experience in Infrastructure Engineering, System Administration, or related roles.

Multiple years’ experience with cloud platforms (e.g., Amazon Web Services, Microsoft Azure) and infrastructure-as-code tools (e.g., Terraform, CloudFormation).

Multiple years’ experience with containerization technologies such as Docker and container orchestration platforms like Kubernetes.

Multiple years’ demonstrated knowledge of networking principles and protocols, including TCP/IP, DNS, load balancing, and firewalls.

Site Reliability Engineer (Sre) - Evening Shift

By Brightspot At , Chicago $100,000 - $115,000 a year

Automate manual tasks and build tools for system monitoring, deployment, and configuration management.

2+ years of relevant experience in Cloud Operations

Proven troubleshooting and problem-solving skills in a cloud-based application environment

Outstanding communication skills with the ability to work in a client-facing role

Monitor the availability, performance, and reliability of our systems and applications during the evening shift.

Investigate and resolve incidents, troubleshooting any issues that arise and ensuring prompt resolution to minimize downtime.

(Remote) - Sr Site Reliability Engineer

By First American Financial Corporation At , Santa Ana $87,945 - $182,655 a year

Bachelor's degree in Computer Science, Information Technology, or equivalent education and experience.

Strong understanding of SRE practices: incident response, change/release management, capacity planning, infrastructure automation, elastic environments, chaos engineering and blameless postmortems.

Skilled in defining service level objectives, measuring service level indicators, and setting up error budgets.

Experienced in creating SRE adoption framework and onboarding procedure.

What You’ll Bring (At least 5-7 years' experience)

Maintain and improve reliability of core software systems.

Site Reliability Engineer, Product - Usds

By TikTok At , Los Angeles $119,000 - $289,000 a year

Gain a solid understanding of the various components and services that power the TikTok experience

Maintain services to meet service-level-agreements (SLAs) and service-level-objectives (SLOs) by measuring and monitoring availability, performance, and overall system health

Scale systems sustainability through mechanisms such as automation; evolve systems reliability, efficiency, and velocity by pushing for changes

Provide user support, incident responses and postmortems

In this role, you will:

Our time off and leave plans are:

Site Reliability Engineer Jobs

By Fisker Inc At , Manhattan Beach $60,900 - $169,650 a year

Experience with artifact management (Artifactory, Nexus)

Experience with strict security requirements and implementation

Design, provision, deploy, and manage Kubernetes clusters and resources

Bachelor’s degree in computer science or related technical field or equivalent experience

5+ years of SRE / DevOps Engineer experience

Experience with cloud infrastructure (AWS, GCP, Azure)

Site Reliability Engineer Jobs

By Zscaler At , San Jose

Strong Centos/UNIX skills, FreeBSD specific experience is a plus.

5 -7 years experience in a SaaS/ Cloud/Distributed environment growing at a rapid scale.

Minimum 3+ years of scripting experience in Python is required.

Hands-on experience with infrastructure as code and automation tools (Ansible, Chef, Puppet, Terraform).

Basic Networking skills (TCP/IP, DNS, LACP, CARP) for testing and troubleshooting are required.

Competitive salary and benefits, including equity

Site Reliability Engineer (Sre)

By Agama Solutions At , San Jose

5+ years of US experience as in a SRE role

Good communication (and listening) skills.

Some experience administering Linux “web” servers, at scale.

Working knowledge of DNS, HTTP, TLS, web security.

Experience with networking troubleshooting using tools such as TCP Dump.

Well versed in *nix Operating Systems (we use CentOS and Ubuntu LTS).

Site Reliability Engineer Jobs

By Ascendion At , Alpharetta

Knowledge of the cloud and managed services such as MS Flex Server or AWS RDS.

Strong experience as a database administrator.

Strong experience in PostgreSQL and/or MySQL.

Automation skill in Bash, Golang, Python a plus.

Knowledge of IaC and CI/CD tools such as Terraform and GitHub Actions a plus.

Experience in query optimization and performance improvement.

Site Reliability Engineer - Remote

By Sheetz At , Claysburg, 16625

(Equivalent combinations of education, licenses, certifications and/or experience may be considered)

Responsible for the availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning for their assigned system(s).

A four year degree in Computer Science, Management Information Systems, Computer Engineering is preferred.

6 years of applicable experience in a technology environment, preferably with time spent in an engineering capacity, is required.

Coding experience beyond simple scripts is required.

A four year degree which includes courses or training in computer programming, systems analysis, system development, or systems engineering, is required.

Site Reliability Engineer Ii - Remote

By Akamai At , Remote $93,656 - $140,803 a year

Defining requirements as part of the product lifecycle to influence the new designs and standards

Have 2 years of relevant experience and a Bachelors degree or its equivalent

Have proven experience as a systems performance/site reliability or DevOps engineer

Have experience of working with NoSQL databases, such as Cassandra or Redis

Have experience with orchestration tools e.g. Chef and/or Ansible

Join our highly skilled Security team

Are you looking for an opportunity to join a fast-paced and innovative team as a Principal Site Reliability Engineer? We are looking for a highly motivated individual to join our team and help us build and maintain reliable, secure, and scalable systems. You will be responsible for developing and implementing strategies to ensure the availability, performance, and security of our systems. If you have a passion for technology and a drive to make a difference, this is the job for you!

Overview:

A Principal Site Reliability Engineer is responsible for ensuring the reliability, availability, and scalability of a company’s IT infrastructure. They are responsible for developing, implementing, and maintaining systems and processes that ensure the highest levels of performance and reliability. They must be able to troubleshoot and resolve complex technical issues quickly and efficiently.

Detailed Job Description:

The Principal Site Reliability Engineer is responsible for designing, developing, and maintaining systems and processes that ensure the highest levels of performance and reliability. They must be able to troubleshoot and resolve complex technical issues quickly and efficiently. They must be able to identify potential problems and develop solutions to prevent them from occurring. They must be able to work with other teams to ensure that the systems and processes are properly implemented and maintained. They must be able to provide technical guidance and support to other teams.

What is Principal Site Reliability Engineer Job Skills Required?

• Strong technical knowledge of IT infrastructure, including hardware, software, and networking

• Knowledge of system and process design

• Knowledge of system and process automation

• Knowledge of system and process monitoring

• Knowledge of system and process optimization

• Knowledge of system and process security

• Knowledge of system and process scalability

• Knowledge of system and process troubleshooting

• Ability to work independently and as part of a team

• Ability to work under pressure and meet deadlines

• Excellent problem-solving and analytical skills

• Excellent communication and interpersonal skills

What is Principal Site Reliability Engineer Job Qualifications?

• Bachelor’s degree in Computer Science, Information Technology, or related field

• 5+ years of experience in IT infrastructure, system and process design, system and process automation, system and process monitoring, system and process optimization, system and process security, system and process scalability, and system and process troubleshooting

• Experience with cloud technologies such as AWS, Azure, or GCP

• Experience with scripting languages such as Python, Bash, or PowerShell

• Experience with configuration management tools such as Chef, Puppet, or Ansible

• Experience with monitoring tools such as Nagios, Zabbix, or Splunk

• Experience with container technologies such as Docker or Kubernetes

What is Principal Site Reliability Engineer Job Knowledge?

• Knowledge of IT infrastructure, including hardware, software, and networking

• Knowledge of system and process design

• Knowledge of system and process automation

• Knowledge of system and process monitoring

• Knowledge of system and process optimization

• Knowledge of system

Latest vacancies

Systems Analyst - Excel, Xml, Sql, Scripting
By CyberCoders At Salt Lake City, UT, United States 8 months ago
(Senior) Finance & Shared Services Manager
By Catholics For Choice At Washington, DC, United States 8 months ago
Paralegal - Probate Administration
By CyberCoders At Miami, FL, United States 8 months ago
Account Executive - Automotive Software
By ECW Search At United States 8 months ago
Construction Project Coordinator Jobs
By CyberCoders At River Falls, WI, United States 8 months ago

Principal Site Reliability Engineer at