Aws Site Reliability Engineer
By Zeektek At United States
Help set up and manage our AWS EKS environment.
Help set up and manage our GitLab CI/CD pipeline.
Can engage and manage the heterogenous CI/CD and deployment environments of the teams we collaborate with
Site Reliability Engineer, DevOps manager
1.5+ years experience in SRE/DevOps or equivalent role
Work with other teams to assist in deploying our microservices and code into their environments (on prem and AWS)
Site Reliability Engineer (.Net Engineer)
By Suzy At United States
Exposure to a Configuration Management System (Puppet, Chef, Salt, etc)
Optimize: Observe and improve performance, reduce cost, and improve the experience for millions of users
3+ years of experience in Software Engineering, Site Reliability Engineering, or a Development focused DevOps role.
Experience with Kubernetes and Cloud systems
Experience with the development and operation of high-traffic backend systems
Troubleshooting skills that span applications, networking (TCP/IP), and systems
Site Reliability Engineer - All Levels
By FedEx Dataworks At United States
Experience in FinOps - Cloud cost management
Experience/knowledge in capacity planning, demand forecast based on production KPIs and provisioning.
Two (2) years equivalent work experience in information technology or engineering environment. A related advanced degree may offset the experience requirements.
Bachelor's Degree in Computer Science, Engineering, Information Systems and/or related field or equivalent formal training or work experience.
Strong SRE background, with experience in Cloud platforms, Software Development, DevOps, and Data Engineering
Strong skills in Python, SQL, Azure or other Cloud technologies
Senior Site Reliability Engineer
By Business Wire At United States
Strong experience with AWS cloud infrastructure and container orchestration (Kubernetes, Docker)
Strong experience with monitoring and alerting systems such as Prometheus, Grafana, Nagios, etc.
Strong experience with at least one programming language. Java is highly preferred but other languages such as Python will be considered
Advanced experience with Linux system administration, Java based applications, and network architecture
Ability to work remotely 100%
Excellent health benefits that begin on your first day of employment
Site Reliability Engineer Jobs
By Xforia Global Talent Solutions At United States
Support system design consulting, platform management, and capacity planning
Excellent communication skills and a high degree of technical leadership skills.
As Site Reliability Engineer you will:
Support the production environment by monitoring availability and the system health.
Improve reliability, quality, and time-to-release of the changes.
Provide primary operational support and engineering for multiple large-scale distributed software applications.
Senior Software Engineer - Site Reliability Engineer
By Gopuff At , Independence, 67301, Ks
Participate in system design consulting, platform management, and capacity planning
Build software and systems to manage platform infrastructure and applications
Experience building and operating services in a distributed environment
Production experience with managing public cloud infrastructure (Azure Preferred)
Strong knowledge of UNIX and TCP/IP and HTTP fundamentals.
Experience with monitoring, metrics, and visualization tools (Application Insights, Icinga, Graphite, Prometheus, ELK, etc.)

Are you looking for an opportunity to join a fast-paced and innovative team as a Principal Site Reliability Engineer? We are looking for a highly motivated individual to join our team and help us build and maintain reliable, secure, and scalable systems. You will be responsible for developing and implementing strategies to ensure the availability, performance, and security of our systems. If you have a passion for technology and a drive to make a difference, this is the job for you!

Overview:

A Principal Site Reliability Engineer is responsible for ensuring the reliability, availability, and scalability of a company’s IT infrastructure. They are responsible for developing, implementing, and maintaining systems and processes that ensure the highest levels of performance and reliability. They must be able to troubleshoot and resolve complex technical issues quickly and efficiently.

Detailed Job Description:

The Principal Site Reliability Engineer is responsible for designing, developing, and maintaining systems and processes that ensure the highest levels of performance and reliability. They must be able to troubleshoot and resolve complex technical issues quickly and efficiently. They must be able to identify potential problems and develop solutions to prevent them from occurring. They must be able to work with other teams to ensure that the systems and processes are properly implemented and maintained. They must be able to provide technical guidance and support to other teams.

What is Principal Site Reliability Engineer Job Skills Required?

• Strong technical knowledge of IT infrastructure, including hardware, software, and networking
• Knowledge of system and process design
• Knowledge of system and process automation
• Knowledge of system and process monitoring
• Knowledge of system and process optimization
• Knowledge of system and process security
• Knowledge of system and process scalability
• Knowledge of system and process troubleshooting
• Ability to work independently and as part of a team
• Ability to work under pressure and meet deadlines
• Excellent problem-solving and analytical skills
• Excellent communication and interpersonal skills

What is Principal Site Reliability Engineer Job Qualifications?

• Bachelor’s degree in Computer Science, Information Technology, or related field
• 5+ years of experience in IT infrastructure, system and process design, system and process automation, system and process monitoring, system and process optimization, system and process security, system and process scalability, and system and process troubleshooting
• Experience with cloud technologies such as AWS, Azure, or GCP
• Experience with scripting languages such as Python, Bash, or PowerShell
• Experience with configuration management tools such as Chef, Puppet, or Ansible
• Experience with monitoring tools such as Nagios, Zabbix, or Splunk
• Experience with container technologies such as Docker or Kubernetes

What is Principal Site Reliability Engineer Job Knowledge?

• Knowledge of IT infrastructure, including hardware, software, and networking
• Knowledge of system and process design
• Knowledge of system and process automation
• Knowledge of system and process monitoring
• Knowledge of system and process optimization
• Knowledge of system