Director Of Engineering, Site Reliability
By OneStudyTeam At , Remote
Experience implementing security controls for AWS environments, including setup and management of authentication controls, VPN’s, KMS, etc
Be the product manager for your vertical, defining the roadmap, requirements, goals and acceptance criteria
Learn more about our global benefits offerings on our careers site: https://careers.onestudyteam.com/us-benefits
Manage vendors, contracts and spend associated to operational infrastructure
Experience managing a team of 5+ SREs
Experience managing a global AWS footprint
Principal Reliability Engineer Jobs
By Novartis At Cambridge, MA, United States
3+ years of people leadership, project management, and in collaborating across boundaries experience
Experience in Data Management & Systems, preferably in data security
Broadly experienced specialists managing a small unit OR project. May be responsible for managing others -Leads/co‐leads novel projects within the team
Experience in implementing DevOps tools and practices for product and services teams
Experience handling a large volume of data
Experience with AWS and containers
Sr Site Reliability Eng.
By ENGIE Impact At United States
Strong communication and interpersonal skills with all levels of management.
Design and implement build, deployment, and configuration management.
Deploy and manage both Iaas and Pass services in development and production.
Manage CI and CD tools with team.
BS/MS Computer Science degree preferred, or equivalent experience.
At least four years of software engineering or site reliability engineering experience.
Principal, Site Reliability Engineer
By BNY Mellon At , Lake Mary, Fl
In this role, you’ll make an impact in the following ways:
To be successful in this role, we’re seeking the following:
Best Places to Work for Disability Inclusion , Disability:
Principal Site Reliability Engineer
By GoDaddy At , Remote $168,000 - $252,000 a year
Process improvement, management, and development experience.
Translate core architecture and business requirements into technical cloud infrastructure solutions that consist of platform, network, software, cloud automation, security, etc.
3+ years of experience in complex distributed networking, system performance tuning, and monitoring.
Experience with CI/CD development using Kubernetes, Docker, etc.
Experience in virtualization technologies such as KVM, and OpenStack.
Experience with back-end services, highly distributed and scalable services, and deployment automation.
Site Reliability Operations Engineer
By Fox Corporation At , Tempe, 85283, Az
Experience with operation and management of cloud-based services, including operational processes
Familiarity with modern operations concepts such as Agile and Incident Management
Experience / knowledge of the Broadcast industry
Operate and support live events, delivering smooth video experience to the audience
Hands on experience in a production / operational role
Experience using enterprise monitoring tools of any kind
Site Reliability Engineer Jobs
By University of Washington At , Seattle, 98195, Wa $7,554 - $11,667 a month

To request disability accommodation in the application process, contact the Disability Services Office at 206-543-6450 or [email protected].

Site Reliability Engineer Jobs
By Lawrence Berkeley National Laboratory At , San Francisco Bay Area, Ca $9,739 - $11,905 a month
Minimum of three years of experience in UNIX or Linux, Networking, IT infrastructure environment and management experience in a distributed-computing environment.
Knowledge of the processes for standard operating procedures, and best practices for implementation and change management.
Past experience with Incident Management and a good understanding of IT service management.
Experience with network security: configuring/maintaining ACLs, knowledge of firewalls
Bachelor’s Degree in a Computer Science or similar discipline or equivalent years of experience.
Strong hands-on knowledge of the Linux shell and working in a command-line (e.g. SSH) environment.
Principal Site Reliability Engineer
By Oracle At , Redwood City, 94065, Ca
Develop and implement various database life-cycle management flows.
Certification of Database products for cloud integration
Participate in Product Feature Review, Certification experiments and User Document reviews.
Research and acquire skills on new technologies as needed from time to time
6-14 years of Oracle database administration experience on large production environments
Database hands on skills especially around database and system troubleshooting and administration
Site Reliability Engineer Jobs
By DNV At , Corvallis, Or
Support web operations workflow automation using configuration management and continuous deployment frameworks
Systems engineering or DevOps experience
Good knowledge of a scripting language like Powershell, Bash, Python
Experience working on cloud-based infrastructure (e.g. Azure)
Strong written and verbal English communication skills
Experience securing cloud/web applications strongly desired
Site Reliability Engineer Jobs
By Blue Yonder At , Dallas, Tx $88,525 - $125,575 a year
Solid understanding of large-scale applications, Cloud Observability, monitoring and fault management, and understanding of Network Architectures
Respond to technical business requirements around availability, performance, and planned maintenance activities to ensure a well-operating solution and SLA compliance.
Strong experience of min 5 years’ experience developing, managing, or supporting distributed systems in a Cloud/IaaS environment, Azure preferred
Experience working with monitoring and visualization tools such as Splunk and AppDynamics
Experience coordinating between support and development teams to ensure effective delivery of monitoring services to the end-user.
Experience implementing best practices and industry standards for operational monitoring aligned to ITIL.
Site Reliability Engineer Jobs
By Motion Industries At , Irondale, 35210, Al
Participates in system design, platform management and capacity planning.
Understands debugging and applying troubleshooting skills.
Cloud Services experience with Google Cloud Platform (GCP).
Experience with API, service-based or microservice-based architecture.
Architecture-level knowledge of Windows and Linux and Infrastructure systems.
Experience with production deployment, monitoring and operational support for enterprise-class applications (Dynatrace a plus).
Site Reliability Engineer Jobs
By JPMorgan Chase Bank, N.A. At , Jersey City, 07310, Nj Up to $200,000 a year

Minimum education and experience required:

Site Reliability Engineer Jobs
By Sezzle At , Minneapolis, Mn $75,000 - $90,000 a year
Maintain and develop monitoring and alerting solutions to improve the on-call experience
Bachelor's in computer science (preferred) or equivalent related experience
Basic knowledge of a Microservice Architecture
Basic knowledge of AWS, Kubernetes, Docker
Knowledge of Relational Databases, SQL and ORM technologies
Collaborative workspace, commuter benefits, full-stocked kitchen, weekly lunches and much more!
Sr. Site Reliability Engineer
By rockset At , San Mateo, Ca $140,000 - $185,000 a year
Experience with Terraform, Salt, Chef, Packer, or similar configuration management tools
Willing to learn new skills and technologies
Bachelor's or Master's degree in Computer Science or a related field, or relevant work experience
Experience as an SRE for 3+ years
Experience building and operating public-facing 24x7 web applications at scale
Experience working with cloud infrastructure and patterns (AWS preferred)
Site Reliability Engineer Jobs
By Cogito Corporation At , $94,000 - $110,000 a year
Experience with configuration management tools, such as Ansible, Chef, Puppet, etc.
Experience with Kubernetes networking and knowledge of how traffic flows within pods, load balancers, and the internet
Proven experience with designing, building, securing, and managing Kubernetes at scale
Experience with Continuous Integration / Delivery tooling, such as Jenkins, Travis CI, CircleCI, etc.
Cross-functional collaboration skills and relationship-building skills are a must
Design and reimplement Cogito's core infrastructure in a containerized, cloud-native manner, making use of Docker, Kubernetes, Istio, and AWS infrastructure
Site Reliability Engineer Jobs
By Autodesk At , Atlanta, Ga $109,400 - $188,760 a year
Use modern administration tools like Docker, Terraform, AWS CloudFormation/CDK to manage and deploy containers and virtual machines
Collaborate with stakeholders to understand requirements, understand use cases and build towards a cohesive technical strategy
Experience in large-scale cloud-based production infrastructure (AWS preferred)
Expert experience with Docker and other container technology
Strong experience with Log Analysis and Monitoring tools such as CloudWatch, Splunk, New Relic, Grafana
Experience with any relevant language (Python, JavaScript, Ruby, Rust, Bash, etc.)
Staff Site Reliability Engineer
By Collective Health At , San Mateo, 94401, Ca $140,000 - $210,000 a year
Expertise in management and use of relational databases including.
10+ years of work experience in DevOps, Site Reliability Engineering, or Software Engineering.
Experience creating and monitoring SLIs and SLOs in order to set and remain within error budgets.
Experience in supporting customer-facing production systems and responding to incidents as part of an oncall rotation.
Knowledge of data structures, algorithms, distributed systems, and information retrieval.
Experience in solving diagnosing and resolving incidents that involve application, OS, network, infrastructure, partners, people, and process.
Site Reliability Engineer Jobs
By ASAPP At , New York, 10013, Ny $140,000 - $194,000 a year
Knowledge of AWS services, containers and container management frameworks
+4 years of relevant experience bringing software to production at high scale
BS or MS degree in the Computer Science field, or equivalent hands-on experience.
Experience in product oriented environments
Work with product engineering teams on service architecture and implementation
Deliver configuration as code and automate everything
Aws Site Reliability Engineer
By Derivative Path At , Remote
Excellent communication, organizational and time-management skills
Work closely with architects, software engineers, quality engineers, product owners, and management to design scalable, robust systems using cloud architecture
Participate in system design consulting, platform management, and capacity planning
Proficient with AWS certification preferred
Prior experience within the Capital Markets, Financial Services, and IT & Services
Design and implement fully automated CI/CD Pipelines using industry tools

Are you looking for an opportunity to make a real impact on the reliability and scalability of a product? We are looking for a Principal Site Reliability Engineer to join our team and help us build and maintain a reliable and scalable platform. You will be responsible for ensuring the availability and performance of our services, as well as developing and implementing strategies to improve system reliability. If you are passionate about technology and have a strong background in system engineering, we want to hear from you!

What is Principal Site Reliability job Skills Required?

• Expertise in system architecture, system design, and system engineering
• Knowledge of cloud computing, distributed systems, and DevOps
• Ability to troubleshoot complex systems and identify root causes
• Experience with automation and scripting languages such as Python, Bash, and PowerShell
• Knowledge of monitoring and logging tools such as Splunk, ELK, and Prometheus
• Understanding of networking protocols and technologies
• Ability to work in a fast-paced environment

What is Principal Site Reliability job Qualifications?

• Bachelor’s degree in Computer Science, Information Technology, or related field
• 5+ years of experience in system engineering, system architecture, or related field
• Experience with cloud computing platforms such as AWS, Azure, or GCP
• Experience with container technologies such as Docker and Kubernetes
• Experience with configuration management tools such as Chef, Puppet, or Ansible
• Experience with automation and scripting languages such as Python, Bash, and PowerShell
• Knowledge of monitoring and logging tools such as Splunk, ELK, and Prometheus

What is Principal Site Reliability job Knowledge?

• Knowledge of system architecture, system design, and system engineering
• Knowledge of