Senior Site Reliability Engineer, Trello
By Atlassian At , San Francisco
3+ years of hands-on experience with public cloud offerings such as AWS,GCP or Azure
Familiarity with Incident management, post-incident analysis and participation in on-call rotation
3+ years experience operating high-availability, fault-tolerant, scalable, distributed software in production: building monitoring, tweaking dashboards, defining alerts, writing runbooks, etc.
Engineering microservices and tools across one or more programming languages (e.g. Go, Python,Bash)
Automation and Infrastructure-as-Code projects and tooling (e.g. Ansible, Puppet, Terraform)
Build and maintain a continuous integration and delivery pipeline (e.g. Bamboo, Bitbucket Pipelines, Github Actions)
Staff Site Reliability Engineer
By Netskope At , Santa Clara, Ca
You will be part of a high caliber engineering team in the exciting space of cloud tools and infrastructure management.
Drive efficiencies in systems and processes: capacity planning, configuration management, performance tuning, monitoring and root cause analysis.
You will solve complex, exciting challenges and improve the depth and breadth of your technical and analytical skills
Partner closely with our development teams and product managers to architect and build features that are highly available, performant and secure
Gain deep knowledge of our application stack
Experience improving the performance of micro-services and solve scaling/performance issues
Senior System Reliability Engineer
By NVIDIA At , Santa Clara, Ca $132,000 - $212,750 a year
Good project management skills and ability to balance multiple simultaneous projects during development and production stages.
BS (or equivalent experience) in Engineering, Material Science, Physics, or a related field. MS or PHD preferred.
Deep understanding and hands-on experience in theoretical and practical Reliability concepts as it relates to high-tech electronic enterprise and consumer products.
Hands-on experience with Reliability demonstration & testing along with accelerated life methods for components, subassemblies, and complete products.
Good verbal and writing skills as well as the ability to communicate at a high level.
ASQ certification is desired but not a must.
Site Reliability Engineer Jobs
By Lawrence Berkeley National Laboratory At , San Francisco Bay Area, Ca $9,739 - $11,905 a month
Minimum of three years of experience in UNIX or Linux, Networking, IT infrastructure environment and management experience in a distributed-computing environment.
Knowledge of the processes for standard operating procedures, and best practices for implementation and change management.
Past experience with Incident Management and a good understanding of IT service management.
Experience with network security: configuring/maintaining ACLs, knowledge of firewalls
Bachelor’s Degree in a Computer Science or similar discipline or equivalent years of experience.
Strong hands-on knowledge of the Linux shell and working in a command-line (e.g. SSH) environment.
Principal Site Reliability Engineer
By Oracle At , Redwood City, 94065, Ca
Develop and implement various database life-cycle management flows.
Certification of Database products for cloud integration
Participate in Product Feature Review, Certification experiments and User Document reviews.
Research and acquire skills on new technologies as needed from time to time
6-14 years of Oracle database administration experience on large production environments
Database hands on skills especially around database and system troubleshooting and administration
Senior Site Reliability Engineer (Sre)
By Apple At , San Diego, Ca
Experience in a DevOPS or SRE role
Experience with modern web-scale services including servers, VIPs, load balancers, proxies
Highly experienced with one of these: Puppet, Chef, Saltstack, Ansible
Bonus: Native Kubernetes implementation including CNI, Kafka, etcd experience
Bonus: Experience with Cisco, Juniper, or Arista routing and switching hardware (+OS), including wireless
Able to write software needed to build and operate a large scale platform 24x7 including the development and staging platforms.
Sr. Site Reliability Engineer
By rockset At , San Mateo, Ca $140,000 - $185,000 a year
Experience with Terraform, Salt, Chef, Packer, or similar configuration management tools
Willing to learn new skills and technologies
Bachelor's or Master's degree in Computer Science or a related field, or relevant work experience
Experience as an SRE for 3+ years
Experience building and operating public-facing 24x7 web applications at scale
Experience working with cloud infrastructure and patterns (AWS preferred)
Staff Site Reliability Engineer
By Collective Health At , San Mateo, 94401, Ca $140,000 - $210,000 a year
Expertise in management and use of relational databases including.
10+ years of work experience in DevOps, Site Reliability Engineering, or Software Engineering.
Experience creating and monitoring SLIs and SLOs in order to set and remain within error budgets.
Experience in supporting customer-facing production systems and responding to incidents as part of an oncall rotation.
Knowledge of data structures, algorithms, distributed systems, and information retrieval.
Experience in solving diagnosing and resolving incidents that involve application, OS, network, infrastructure, partners, people, and process.
Manager, Site Reliability Engineer - Remote
By KPMG-UnitedStates At , San Diego, Ca
Manager, Site Reliability Engineer - Remote
Experience in supporting various enterprise class solutions and services including Windows server administration and security issue remediation
Be the Technical Lead representing the SRE\Tier 3 team for operational initiatives or project support
Improve reliability, quality, and time-to-market of our suite of software solutions
Create sustainable systems and services through automation and uplifts
Bachelor's degree from an accredited college or university is preferred

Are you an experienced Senior Site Reliability Engineer looking for a new challenge? We are looking for a motivated individual to join our team and help us ensure our systems are running smoothly and efficiently. You will be responsible for developing and maintaining our infrastructure, monitoring system performance, and troubleshooting any issues that arise. If you are passionate about technology and have a keen eye for detail, this could be the perfect opportunity for you!

What is Senior Site Reliability Engineer Skills Required?

•Strong knowledge of Linux/Unix administration
•Experience with scripting languages such as Bash, Python, Ruby, etc.
•Experience with automation/configuration management using tools such as Chef, Puppet, Ansible, etc.
•Experience with cloud technologies such as AWS, Azure, Google Cloud Platform, etc.
•Experience with container technologies such as Docker, Kubernetes, etc.
•Experience with monitoring tools such as Nagios, Zabbix, etc.
•Experience with version control systems such as Git, SVN, etc.
•Strong troubleshooting and problem-solving skills
•Excellent written and verbal communication skills

What is Senior Site Reliability Engineer Qualifications?

•Bachelor’s degree in Computer Science, Information Technology, or related field
•5+ years of experience in a Site Reliability Engineer role
•Experience with DevOps practices and tools
•Experience with database technologies such as MySQL, PostgreSQL, etc.

What is Senior Site Reliability Engineer Knowledge?

•Knowledge of ITIL best practices
•Knowledge of network protocols and technologies
•Knowledge of security best practices
•Knowledge of software development lifecycle

What is Senior Site Reliability Engineer Experience?

•Experience with large-scale distributed systems