Senior Engineer II - Digital Site Reliability
By Lululemon At Seattle $132,300 - $173,500 a year
Contribute to engineering automation, management or development of pre-prod and production systems
Mentor and guide junior team members, sharing knowledge and expertise to foster a culture of learning and continuous improvement.
Eight+ years of engineering experience
Five+ years experience with CI/CD tools, GitLab preferred
Proficiency in at least one programming language (e.g., Python, Go, Java) and experience with scripting and automation.
Acknowledge the presence of choice in every moment and take personal responsibility for your life.
Senior Site Reliability Engineer, Trello
By Atlassian At San Francisco
3+ years of hands-on experience with public cloud offerings such as AWS, GCP or Azure
Familiarity with incident management, post-incident analysis and participation in on-call rotation
3+ years experience operating high-availability, fault-tolerant, scalable, distributed software in production: building monitoring, tweaking dashboards, defining alerts, writing runbooks, etc.
Engineering microservices and tools across one or more programming languages (e.g. Go, Python, Bash)
Automation and Infrastructure-as-Code projects and tooling (e.g. Ansible, Puppet, Terraform)
Build and maintain a continuous integration and delivery pipeline (e.g. Bamboo, Bitbucket Pipelines, GitHub Actions)
Senior Reliability Engineer Jobs
By Digital Diagnostics, Inc. At Remote
Location – Chicago, IL | Coralville, IA | or Remote-US
What We Have to Offer
Lead or participate in deploying updates or improvements as needed.
Lead or participate in support activities.
Identify performance and scalability bottlenecks in Digital Diagnostics’ global technical infrastructure.
Identify and work to eliminate waste in cloud infrastructure costs.
Associate Site Reliability Engineer (Remote)
By Patterson Technology Center At Minneapolis-Saint Paul
Bachelor's degree in Computer Science, Management Information Sciences or area of functional responsibility preferred, or equivalent years of industry work experience
Knowledge of application development and project life cycles; design and development experience with engineering software design tools
Office environment – either in Patterson facility or at home/remote location
Plan, design, deploy, and operate Site Reliability Engineering capabilities for cloud products & services.
Continuously build, automate, and improve upon capabilities that are secure, scalable, performant, and resilient
Demonstrated knowledge and understanding of database and operating systems
Senior Site Reliability Engineer/DevOps Engineer
By Zillow At Remote
Knowledge and experience working with microservices
Leverage your knowledge to build technical consensus around architecture and technology choices
Build and manage StreetEasy's cloud infrastructure, contributing to our commitment to reliability and efficiency
A Bachelor's degree in Computer Science or a related technical field, or equivalent practical experience
1-3 years of experience in site reliability engineering, DevOps, or a related field
Experience with cloud service providers, preferably AWS
Senior Site Reliability Engineer
By Adyen At Chicago
Have a good understanding of Infrastructure as Code and experience with configuration management and automation tools such as Puppet and Ansible;
Strong familiarity with SRE practices and methodologies such as defining SLOs, change management processes and incident response;
Together with the team, lead the way in continuously improving our incident management and on-call processes
Have experience with building, operating and troubleshooting large-scale distributed systems spanning multiple data centers across the globe;
Skilled in one or more programming or scripting languages such as Python, Java or Bash;
We use SLOs to drive platform stability and innovation
Cloud Senior Site Reliability Engineer
By Bank of America At New York, NY
Perform deep dives into systemic and latent reliability issues, incident management, problem management
Understanding of cost management, inventory management, FinOps model
Identifying, analyzing, and resolving infrastructure vulnerabilities and application deployment issues.
Evaluating and automating the scaling and capacity requirements within Azure environments
BS/MS degree in Computer Science or related technical field involving systems or equivalent practical experience.
8+ years of hands-on experience maintaining cloud platforms on a major cloud service provider.
Site Reliability Engineer (SRE) - Mid/Senior
By Vanilla Technologies Inc. At Remote
Project management tools such as Jira, Git, and Confluence
Accounting for and addressing software vulnerabilities
Securing infrastructure, applications, and code
Ensuring high SLA for uptime & security
Quick, continuous automation and deployment of updates
Preserving infrastructure and stability of code
Senior Site Reliability Engineer
By NVIDIA At California, United States
BS degree in Computer Science or a related technical field involving coding (e.g., physics or mathematics), or equivalent experience
Technical leadership beyond development that includes scoping, requirements capturing, leading and influencing multiple teams of engineers on broad development initiatives.
Experience with the ELK and Prometheus stacks as a power user and administrator.
Prior experience driving production issues and helping with on-call support.
Experience with CUDA, PyTorch, TensorRT, TensorFlow, and/or Triton.
Experience with StackStorm and similar automation platforms is a bonus.
Engineer Associate - Reliability Engineering
By Best Buy At Richfield, MN, United States
Bachelor’s degree in Computer Information Systems, Engineering, Management Information Systems, Computer Science, or another related technical, highly quantitative discipline.
Experience using scripting languages (e.g., JavaScript, Python, Perl, PowerShell, Ruby)
Customize, enhance, and support internally developed self-service monitoring tools.
Extend and develop through automation custom monitoring scripts and integrations.
Bachelor's degree, Associate's degree, or Coding Bootcamp/Code School
Ability to write code and solve problems in cloud (e.g., AWS, Azure, etc.) environments.
Senior Site Reliability Engineer
By Business Wire At United States
Strong experience with AWS cloud infrastructure and container orchestration (Kubernetes, Docker)
Strong experience with monitoring and alerting systems such as Prometheus, Grafana, Nagios, etc.
Strong experience with at least one programming language. Java is highly preferred but other languages such as Python will be considered
Advanced experience with Linux system administration, Java based applications, and network architecture
Ability to work remotely 100%
Excellent health benefits that begin on your first day of employment
Senior Site Reliability Engineer (Remote)
By The Hartford At Hartford, CT
Progressively implement preventative controls and drive increased automation and self-healing capabilities. Continue to improve cost efficiency baselines
Hands-on experience with Performance and Observability tools such as Dynatrace, Splunk, TrueSight, CloudWatch, CloudTrail, and related tools.
Experience with continuous integration and DevOps methodologies, preferred tools such as GitHub, Jenkins, Nexus, Rally, SonarQube etc.
Knowledge of complex traditional and modern enterprise architectures and systems (understand more than the component itself).
Strong hybrid cloud experience (private and public) across various service delivery models – IaaS, PaaS, SaaS.
Strong communication (verbal and written), collaboration, and negotiation skills, working in a diverse team across business units
Senior Site Reliability Engineer
By Humana At Phoenix, 85050, AZ
Detail-oriented with excellent organizational and project management skills
Bachelor's degree or equivalent experience
Experience in Java, Python, or a similar programming language
3+ years of experience working with voice technologies / IVR
2+ years of project leadership experience
Project-based experience driving changes and improvements to IVR solutions.
Associate Site Reliability Engineer
By ConstructConnect, Inc At Cincinnati, 45209, OH

To apply for the Associate Site Reliability Engineer role, please click on the iCIMS link below. The link will guide you to ConstructConnect's new Careers page portal. Associate Site Reliability ...

Senior Site Reliability Engineer
By Dremio At Seattle, WA $166,304 - $225,000 a year
Have moderate-advanced experience in Python/Go, and at least reading knowledge of Java.
10+ years of relevant experience in the following areas: SRE, DevOps, Distributed Systems, Cloud Operations, Software Engineering.
Have a systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive.
Hands-on experience with large-scale production Kubernetes clusters (>=1000 nodes).
Hands-on experience using Honeycomb for OpenTelemetry trace analysis.
Drive continuous improvements to our usage of Kubernetes, our Operators, and the GitOps deployment paradigm.
Senior Site Reliability Engineer
By Akamai, $113,430 - $170,043 a year
Acting as an escalation point for operational, network and managerial teams to ensure network/customer issues are resolved
Have 4 years of relevant experience and a Bachelor's degree in Engineering, Computer Science, or related discipline
Experience in SQL, working in UNIX/Linux environments along with managing and running RDBMS (MySQL, PostgreSQL, etc.) clusters
Possess good experience with Internet protocols (DNS, HTTP, TLS, TCP/IP, SSH, etc.)
Experience with Web Services & Cloud API, cloud computing, hosting, and networking
Have experience in Python and/or Bash scripting
Senior Site Reliability Engineer
By Abbott Laboratories At Abbott Park, IL
Experience with Microsoft Azure DevOps, Release Management Tools
Experience with Windows Server Configuration Management
Experience in IIS Configuration Management
EDUCATION AND EXPERIENCE YOU’LL BRING
Produces and Manages Infrastructure as code
Manages Development, QA, and Production environment configuration
Senior Site Reliability Engineer - Apple Maps
By Apple At Cupertino, CA
Incident management experience is a plus.
Cloud Native SRE experience (ideally 5-10 years).
Experience setting up and managing services running on Kubernetes.
Multi-cloud environment experience such as AWS and Google Cloud is preferred but not required.
Ability to learn and adapt. Experience matters but curiosity and adaptability are even more important.
Linux System and Network Administration.
Senior Service Reliability Engineer
By Amadeus At Portsmouth, 03801, NH
Change and Release Management: Manage and execute change, release and test processes and drive automation of these processes
Develop standardized automation to control, build artifact and deploy managed services
Leverage, improve, design, and implement services that automate application provisioning and manage the underlying infrastructure as a service
Manage application operations for Amadeus Core Services end to end.
Manage the full application stack (OS, Data Bases and Data Stores,
Play a key role in accelerating the organization's ability to deliver changes reliably and consistently to Amadeus Hospitality customers
Associate Service Reliability Engineer
By Papa John's At Louisville, 40299, KY
Good working knowledge of SRE concepts like Availability, Observability/Monitoring, Scalability, SLI, SLO, SLA, MTTR, MTTF
Experience in Java performance, Linux system monitoring, Database SQL tuning and basic networking services
Strong knowledge of resiliency frameworks and patterns for applications (Hystrix, resilience4j, circuit breaker, bulkhead), as well as infrastructure
Hands-on experience with tools like AppDynamics, Splunk (or Kibana/ELK), SolarWinds and Cloud monitoring tools (Stackdriver, Google monitoring, CloudWatch)
Cloud environments (GCP, AWS, Azure) and Automation experience
Knowledge of containerization (Docker) and container orchestration (Kubernetes, GKE, Docker Swarm)

Are you looking for an exciting opportunity to join a team of experienced Site Reliability Engineers? We are looking for a Senior Associate Site Reliability Engineer to join our team and help us ensure our systems are reliable and secure. You will be responsible for developing and maintaining our systems, monitoring performance, and troubleshooting any issues that arise. If you have a passion for technology and a desire to work in a fast-paced environment, this could be the perfect job for you!

Overview:

Senior Associate Site Reliability Engineers are responsible for ensuring the reliability, scalability, and performance of a company’s web applications and services. They are responsible for developing and maintaining automation and monitoring systems, as well as troubleshooting and resolving issues that arise. Senior Associate Site Reliability Engineers must have a strong understanding of web technologies, networking, and system administration.

Detailed Job Description:

Senior Associate Site Reliability Engineers develop and maintain automation and monitoring systems to ensure the reliability, scalability, and performance of a company’s web applications and services, and they troubleshoot and resolve any issues that arise. The role requires a strong understanding of web technologies, networking, and system administration, as well as the ability to work with other teams to ensure that the systems are running optimally.

What Skills Are Required for a Senior Associate Site Reliability Engineer Job?

• Strong understanding of web technologies, networking, and system administration
• Ability to develop and maintain automation and monitoring systems
• Ability to troubleshoot and resolve issues
• Ability to work with other teams to ensure optimal system performance
• Knowledge of scripting languages such as Python, Bash, and PowerShell
• Knowledge of cloud technologies such as AWS, Azure, and GCP
• Knowledge of container technologies such as Docker and Kubernetes
• Knowledge of monitoring tools such as Prometheus, Grafana, and ELK

What Are the Qualifications for a Senior Associate Site Reliability Engineer Job?

• Bachelor’s degree in Computer Science, Information Technology, or related field
• 5+ years of experience in web technologies, networking, and system administration
• Experience with scripting languages such as Python, Bash, and PowerShell
• Experience with cloud technologies such as AWS, Azure, and GCP
• Experience with container technologies such as Docker and Kubernetes
• Experience with monitoring tools such as Prometheus, Grafana, and ELK

What Knowledge Is Needed for a Senior Associate Site Reliability Engineer Job?

• Knowledge of web technologies, networking, and system administration
• Knowledge of scripting languages such as Python, Bash, and PowerShell
• Knowledge of cloud technologies such as AWS, Azure, and GCP
• Knowledge of container technologies such as Docker and Kubernetes
• Knowledge of monitoring tools such as Prometheus, Grafana, and ELK