Unfortunately, this job posting is expired.
Don't worry, we can still help! Below, please find related information to help you with your job search.
Some similar recruitments
Software Engineer (Hybrid) Jobs
Recruited by Boomerang Healthcare 8 months ago Address Portland, OR, United States
Transition Of Care Nurse (Remote/Hybrid)
Recruited by Blue Cross Blue Shield of Arizona 8 months ago Address , Phoenix, 85021
Senior Software Engineer Jobs
Recruited by Nike 10 months ago Address Beaverton, OR, United States
Senior Software Engineer Jobs
Recruited by Warner Bros. Discovery 1 year ago Address , , Or $95,550 - $177,450 a year

Senior Devops Engineer (Hybrid Eligibile)

Company

Oak Ridge National Laboratory

Address , Oak Ridge, 37830
Employment type
Salary
Expires 2023-09-18
Posted at 8 months ago
Job Description

Requisition Id 9682

Overview:

The National Center for Computational Sciences (NCCS) at Oak Ridge National Lab (ORNL), which hosts several of the world’s most powerful computer systems, is seeking highly qualified individuals to play a key role in improving the security, performance, and reliability of the NCCS computing environments.


In this role, you will work within the HPC Clusters Group inside of the NCCS Systems Section to support numerous activities of the center.


The HPC Clusters Group administers and supports the division’s HPC computing infrastructure, which includes system installation, deployment, acceptance, performance testing, upgrades, problem diagnosis, and troubleshooting.


The Systems Section within National Center for Computational Sciences Division (NCCS). The HPC Systems Section administers and supports the division’s computing, networking, and storage systems.


The NCCS provides state-of-the-art computational and data science infrastructure, coupled with dedicated technical and scientific professionals, to accelerate scientific discovery and engineering advances across a broad range of disciplines. NCCS hosts the Oak Ridge Leadership Computing Facility, one of DOE’s National User Facilities.


Major Duties/Responsibilities:

  • Work with team to adopt a software-defined infrastructure and infrastructure as code paradigm
  • Identify automation opportunities to improve DevOps operations, make recommendations to management, and lead the implementation of improvements.
  • Install and configure software, both commercial packages, and various open-source packages.
  • Evaluate new technology options and vendor products; rely on expertise to recommend new technology and products to management.
  • Answer escalated helpline calls in addition to primary project work.
  • Review architecture and offer recommendations for improvements; lead implementation efforts.
  • Embrace continuous integration and continuous delivery (CI/CD) processes. Train and mentor junior-level staff on these processes.
  • Maintain documentation/notes on software builds and installs.
  • System troubleshooting and problem-solving across multiple platforms (dev/test/prod)
  • Work with other systems engineers and vendors to resolve hardware and software issues.
  • Automate systems administration tasks utilizing open-source configuration management tools
  • Work with the team to define and implement best practices and standards within the organization
  • Ensure the secure and effective operation of computing systems through compliance with ORNL procedures and IT Internal Operating Procedures.
  • Identify and document IT best practices that will improve the systems deployment function
  • Configuration Management - i.e. Puppet, Ansible, etc.
  • CI/CD technologies – Gitlab runners, etc.
  • Monitor systems performance.


Basic Qualifications:

  • A minimum of 7 years of experience managing UNIX/Linux Systems.
  • A minimum of 2 years utilizing configuration management and automation tools such as Git, Ansible, Puppet or other CI/CD pipeline tools.
  • Bachelor's degree in Computer Science or related technical subjects or equivalent combination of education and experience.
  • Fluency in at least one scripting language such as Bash, Python, Go or equivalent
  • A minimum of 2 years of experience managing container infrastructure using docker.


Preferred Qualifications:

  • Demonstrated ability to balance complex research and security requirements.
  • Excellent interpersonal skills suitable for user support and ability to work well with peer system administrators.
  • Ability to work independently and demonstrated analytical and problem-solving skills.
  • Experience with performance and diagnostic tools for benchmarking, analysis, and tuning of systems, networking, and storage.
  • Working knowledge of multiple operating systems.
  • Experience with RHEL7/8
  • Previous experience working in a government, scientific, or other highly technical environment.
  • Knowledge of networking fundamentals including TCP/IP, traffic analysis, common protocols, and network diagnostics.
  • Excellent written and verbal communication skills.
  • Experience with Nagios, Zabbix, Ganglia, and other network and device monitoring systems.
  • Technical documentation skills, including the ability to prepare simple documentation web pages.
  • Background of contributing to open-source projects or avocational endeavors such as hacker/maker spaces is desirable.


#LI-DC1


This position will remain open for a minimum of 5 days after which it will close when a qualified candidate is identified and/or hired.

We accept Word (.doc, .docx), Adobe (unsecured .pdf), Rich Text Format (.rtf), and HTML (.htm, .html) up to 5MB in size. Resumes from third party vendors will not be accepted; these resumes will be deleted and the candidates submitted will not be considered for employment.


If you have trouble applying for a position, please email [email protected].


ORNL is an equal opportunity employer. All qualified applicants, including individuals with disabilities and protected veterans, are encouraged to apply. UT-Battelle is an E-Verify employer.