Unfortunately, this job posting is expired.
Don't worry, we can still help! Below, please find related information to help you with your job search.
Some similar recruitments
Site Reliability Engineer - Remote
Recruited by Sheetz 8 months ago Address , Claysburg, 16625
Site Service Engineer Jobs
Recruited by Winbro 9 months ago Address Springfield, OH, United States
Associate Site Reliability Engineer
Recruited by ConstructConnect, Inc 11 months ago Address , Cincinnati, 45209, Oh
Senior Site Manager Jobs
Recruited by RSPB 11 months ago Address Boardman, OH, United States
Senior Service Reliability Engineer
Recruited by Amadeus 11 months ago Address , Portsmouth, 03801, Nh
Site Reliability Engineer Jobs
Recruited by Kohler 11 months ago Address , Somerville, Ma
Site Reliability Engineer Jobs
Recruited by Leidos 11 months ago Address , Springfield, 22151, Va $118,300 - $245,700 a year
Site Reliability Engineer- Remote
Recruited by Cognizant 11 months ago Address , Allentown, Pa
Entry Level Site Reliability Engineer (Devops)
Recruited by Reynolds and Reynolds 1 year ago Address , Dayton, 45430, Oh

Site Reliability Engineer Jobs

Company

Motion Industries

Address , Irondale, 35210, Al
Employment type FULL_TIME
Salary
Expires 2023-06-26
Posted at 1 year ago
Job Description
SUMMARY:
The Site Reliability Engineer (SRE) is responsible for improving system reliability and
resilience. This role focuses on building automation to reduce manual effort and prevent
service-impacting incidents. The SRE combines software and systems engineering to
build and support large-scale, distributed, fault-tolerant systems. This role ensures that
critical platforms are available, reliable and able to support a fast rate of improvement.
This role relies on monitoring platforms and is continually taking a holistic view of system
health and performance. The SRE will enhance and support cloud-based
transformations, and is focused on pushing capabilities forward, staying ahead of
customer needs and innovating for continuous improvement. The SRE provides
operational support and engineering for multiple large-scale distributed software
applications
JOB DUTIES
  • Gathers and analyzes metrics from monitoring platforms to assist in performance tuning
and fault tolerance.
  • Partners with development teams to improve services through testing and release
procedures.
  • Balances feature development speed and reliability with service-level objectives.
  • Works closely with the incident response team and restoring service to normal operation.
  • Utilizes monitoring systems and dashboards for proactive changes and alerting.
  • Participates in system design, platform management and capacity planning.
  • Understands debugging and applying troubleshooting skills.
  • Investigates, blocks and rate-limits unwanted traffic.
  • Establishes continuous process improvement cycles where the process, performance,
and supporting technologies are reviewed and enhanced where applicable.
  • Performs other duties as assigned.

EDUCATION & EXPERIENCE
Typically requires a bachelor's degree and five (5) to seven (7) years of experience in a
technology and/or software engineering role or an equivalent combination.
KNOWLEDGE, SKILLS, ABILITIES
  • Understanding of Kubernetes, containers, clusters and elastic scalability.
  • Mindset of continually finding ways to drive scalability, stability and performance.
  • Expertise in SRE principles.
  • Experience with API, service-based or microservice-based architecture.
  • Cloud Services experience with Google Cloud Platform (GCP).
  • Proficiency in infrastructure, network, database, operating systems or security
troubleshooting and remediation.
  • Architecture-level knowledge of Windows and Linux and Infrastructure systems.
  • Experience with production deployment, monitoring and operational support for enterprise-class applications (Dynatrace a plus).
  • Experience working with Continuous Integration/ Continuous Deployment tools.
  • Experience in performance diagnostics, capacity planning, performance architecture
design, performance tuning and performance monitoring.
  • Experience with Azure DevOps (ADO), Dynatrace, Prometheus, Terraform and Grafana.
  • A strong mix of software engineering and operational support skills.
  • Knowledge of web technologies – HTTP, proxy, java, etc.