Application Support – Site Reliability Engineer
By Morgan Stanley At New York, NY, United States
Good working knowledge of trading and risk management business concepts
Ensure efficient incident management, ensuring accurate communication to impacted groups and timely resolution.
Familiarity with SDLC processes and management tools (Jira/GIT/Stashblue)
Network diagnostic skills and experience with networks and realtime messaging technologies (multicast, TCP/IP, UDP, SNMP)
Facilitate root cause investigations and manage the implementation of corrective and preventative measures.
Manage coverage during Asian and European market hours, including weekend pre-open ready-for-business checks.
Site Reliability Engineer Jobs
By Spotify At Greater Chicago Area, United States
• 4+ years of IT experience needed
• Experience working in a Linux environment
• Good knowledge of Unix
• Basic experience in writing SQL queries
• Good verbal communicative skills
• Ability to manage priorities and deadlines
Site Reliability Engineer (.Net Engineer)
By Suzy At United States
Exposure to a Configuration Management System (Puppet, Chef, Salt, etc)
Optimize: Observe and improve performance, reduce cost, and improve the experience for millions of users
3+ years of experience in Software Engineering, Site Reliability Engineering, or a Development focused DevOps role.
Experience with Kubernetes and Cloud systems
Experience with the development and operation of high-traffic backend systems
Troubleshooting skills that span applications, networking (TCP/IP), and systems
Site Acquisition Specialist - Remote
By AFL At Sacramento, CA, United States

Let us connect you to your next career opportunity!

Site Reliability Engineer - Usds
By TikTok At Seattle, WA, United States

Responsibilities TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. TikTok has global offices including Los Angeles, New York, ...

Site Reliability Engineer - All Levels
By FedEx Dataworks At United States
Experience in FinOps - Cloud cost management
Experience/knowledge in capacity planning, demand forecast based on production KPIs and provisioning.
Two (2) years equivalent work experience in information technology or engineering environment. A related advanced degree may offset the experience requirements.
Bachelor's Degree in Computer Science, Engineering, Information Systems and/or related field or equivalent formal training or work experience.
Strong SRE background, with experience in Cloud platforms, Software Development, DevOps, and Data Engineering
Strong skills in Python, SQL, Azure or other Cloud technologies
Sr Site Reliability Eng.
By ENGIE Impact At United States
Strong communication and interpersonal skills with all levels of management.
Design and implement build, deployment, and configuration management.
Deploy and manage both Iaas and Pass services in development and production.
Manage CI and CD tools with team.
BS/MS Computer Science degree preferred, or equivalent experience.
At least four years of software engineering or site reliability engineering experience.
Senior Site Reliability Engineer
By Business Wire At United States
Strong experience with AWS cloud infrastructure and container orchestration (Kubernetes, Docker)
Strong experience with monitoring and alerting systems such as Prometheus, Grafana, Nagios, etc.
Strong experience with at least one programming language. Java is highly preferred but other languages such as Python will be considered
Advanced experience with Linux system administration, Java based applications, and network architecture
Ability to work remotely 100%
Excellent health benefits that begin on your first day of employment
Site Reliability Engineer Jobs
By Therapy Brands At Birmingham, AL, United States
2+ years of experience programming or scripting. C# or Python is preferred.
1+ years of experience with cloud environments: AWS and Azure
1+ years of experience with SQL: writing basic select and update statements
Primary Responsibilities Of This Position
Familiarity with networking fundamentals: TCP/IP, DNS resolution
Familiarity with tools including or similar to: Grafana, InfluxDB, OpenTelemetry
Site Reliability Engineer Jobs
By Xforia Global Talent Solutions At United States
Support system design consulting, platform management, and capacity planning
Excellent communication skills and a high degree of technical leadership skills.
As Site Reliability Engineer you will:
Support the production environment by monitoring availability and the system health.
Improve reliability, quality, and time-to-release of the changes.
Provide primary operational support and engineering for multiple large-scale distributed software applications.
Site Reliability Engineer Jobs
By Sohum Inc At San Francisco Bay Area, United States
Full time opportunity that offers excellent benefits.
• Configuration Management and IAC - Salt, Pulumi (Terraform will work)
• Bachelor’s degree in CS / other highly technical discipline, or equivalent experience
• 5+ years of experience and 3+ years experience as Site Reliability Engg
• Strong networking and firewall knowledge
• Exceptional problem solving and troubleshooting skills
Site Reliability Engineer Jobs
By Insight GlobalProject Manager At Hampton, VA, United States
● (5+) years of experience working in Software Engineering, or Site Reliability Engineering
● Experience building and maintaining Container Orchestration across hybrid-cloud infrastructure
● Experience deploying and configuring modern observability tooling for monitoring and alerting.
● Experience programming in Java, JavaScript, and SQL dialects with the Spring framework and React library
● Experience writing or troubleshooting software delivery pipelines, eg: GitLab CI and Concourse
Active DOD security clearance or the ability to obtain an interim secret within 60 days of hire
Site Reliability Engineer Jobs
By Blue Yonder At Dallas, TX, United States
Solid understanding of large-scale applications, Cloud Observability, monitoring and fault management, and understanding of Network Architectures
Respond to technical business requirements around availability, performance, and planned maintenance activities to ensure a well-operating solution and SLA compliance.
Strong experience of min 5 years’ experience developing, managing, or supporting distributed systems in a
Experience working with monitoring and visualization tools such as Splunk and AppDynamics
Experience coordinating between support and development teams to ensure effective delivery of monitoring services to the end-user.
Experience implementing best practices and industry standards for operational monitoring aligned to ITIL.
Site Reliability Engineer (Remote)
By Liberty IT Solutions At Atlanta, GA, United States

Job Description Summary: Manages, supports and maintains a reliable environment for the site in order to ensure the stability and security of multiple systems/platforms that are run or operated in ...

Site Reliability Engineer Jobs
By Techfellow Limited At Chicago, IL, United States
Develop cutting-edge solutions following robust engineering principles alongside an experienced team
Proficient scripting skills in PowerShell, Python, or comparable languages
Harness Python and PowerShell scripting to streamline build, configuration, deployment, and admin tasks
Bolster communication and collaboration, serving as a nexus between business users and tech teams
Deploy, oversee, and refine Windows infrastructure
Identify and actualise system enhancements for optimal performance
Site Reliability Engineer Jobs
By WalkWater Technologies At Cupertino, CA, United States
Experience with SSL/mTLS and certificate management
Hands-on experience with cloud orchestration platforms such as Kubernetes or Nomad
Setting up CD/CD pipelines using GitHub hooks, TeamCity, Docker, and Artifactory
Familiarity with load balancers, traffic-envoys, and proxies
Familiarity with Java runtime / JVM
Familiarity with observability systems such as Prometheus or Open Metrics
Senior Site Reliability Engineer (Remote)
By The Hartford At , Hartford, Ct
Progressively implement preventative controls and drive increased automation and self-healing capabilities. Continue to improve cost efficiency baselines
Hands on experience with Performance and Observability tools such as DynaTrace, Splunk, TrueSight, CloudWatch, CloudTrail, and related tools.
Experience with continuous integration and DevOps methodologies, preferred tools such as GitHub, Jenkins, Nexus, Rally, SonarQube etc.
Knowledge of complex traditional and modern enterprise architectures and systems (understand more than the component itself).
Strong hybrid cloud experience (private and public) across various service delivery models – IaaS, PaaS, SaaS.
Strong communication (verbally and written) / collaboration / negotiation skill, working in a diverse team cross business units
Senior Site Reliability Engineer
By Humana At , Phoenix, 85050, Az
Detail oriented with excellent organizational and project management skills
Bachelor's degree or equivalent experience
Experienced in Java, Python, or similar coding experience
3+ years of experience working with voice technologies / IVR
2+ years of project leadership experience
Project-based experience driving changes and improvements to IVR solutions.
Lead Site Reliability Engineer (Remote)
By IQVIA At , Remote
Bachelor’s Degree in Computer Science, Software Engineering, or equivalent professional experience
Significant (7+ years) experience building, managing, and supporting cloud-based IT infrastructure (IaC)
Thorough knowledge of Unix and/or Linux fundamentals and system administration
Experience with infrastructure-as-code (IaC) tools or technologies (notably Terraform)
Solid foundational knowledge of TCP/IP networking
Knowledge of source control systems and workflow (notably git)
Associate Site Reliability Engineer
By ConstructConnect, Inc At , Cincinnati, 45209, Oh

To apply for the Associate Site Reliability Engineer role, please click on the iCIMS link below. The link will guide you to ConstructConnect's new Careers page portal. Associate Site Reliability ...

Are you looking for a challenging and rewarding role as a Remote Site Reliability Engineer? We are looking for a talented individual to join our team and help us ensure our systems are reliable and secure. You will be responsible for monitoring, troubleshooting, and resolving issues with our systems, as well as developing and implementing strategies to improve system performance. If you have a passion for technology and a desire to make a difference, this is the job for you!

Overview:

A Remote Site Reliability Engineer is responsible for ensuring the reliability, availability, and scalability of a company’s remote systems and services. This role requires a combination of technical and operational skills to ensure that the remote systems are running optimally and securely. The Remote Site Reliability Engineer will work with the development, operations, and security teams to ensure that the remote systems are reliable, secure, and available.

Detailed Job Description:

The Remote Site Reliability Engineer will be responsible for the following tasks:

• Design, implement, and maintain remote systems and services.
• Monitor and troubleshoot remote systems and services.
• Develop and maintain automation and configuration management systems.
• Develop and maintain security policies and procedures.
• Develop and maintain system and service performance metrics.
• Develop and maintain system and service availability metrics.
• Develop and maintain system and service scalability metrics.
• Develop and maintain system and service reliability metrics.
• Develop and maintain system and service security metrics.
• Develop and maintain system and service documentation.
• Develop and maintain system and service monitoring and alerting systems.
• Develop and maintain system and service backup and recovery systems.
• Develop and maintain system and service disaster recovery plans.
• Develop and maintain system and service capacity planning.
• Develop and maintain system and service performance tuning.
• Develop and maintain system and service patching and upgrades.
• Develop and maintain system and service security hardening.
• Develop and maintain system and service change management processes.
• Develop and maintain system and service incident response plans.
• Develop and maintain system and service root cause analysis processes.

What is Remote Site Reliability Engineer Job Skills Required?

• Expertise in remote systems and services.
• Expertise in automation and configuration management systems.
• Expertise in security policies and procedures.
• Expertise in system and service performance metrics.
• Expertise in system and service availability metrics.
• Expertise in system and service scalability metrics.
• Expertise in system and service reliability metrics.
• Expertise in system and service security metrics.
• Expertise in system and service documentation.
• Expertise in system and service monitoring and alerting systems.
• Expertise in system and service backup and recovery systems.
• Expertise in system and service disaster recovery plans.
• Expertise in system and service capacity planning.
• Expert