Unfortunately, this job posting is expired.
Don't worry, we can still help! Below, please find related information to help you with your job search.
Some similar recruitments
Technical Support (Devops Team)
Recruited by Tech Talent Link, Inc 9 months ago Address Wilsonville, OR, United States
Software Engineer Iii Jobs
Recruited by Nike 10 months ago Address Beaverton, OR, United States
Java Engineer Jobs
Recruited by Mainz Brady Group 10 months ago Address Portland, OR, United States
Devops Engineer - 14597 Jobs
Recruited by Enlighten 11 months ago Address , Columbia, Md
Software Engineer Iii Jobs
Recruited by Thermo Fisher Scientific 11 months ago Address , Eugene, 97402, Or
Qa Engineer Jobs
Recruited by TransUnion 1 year ago Address Portland, OR, United States

Devops Engineer Jobs

Company

Oak Ridge National Laboratory

Address , Oak Ridge, 37830, Tn
Employment type
Salary
Expires 2023-06-27
Posted at 1 year ago
Job Description

Requisition Id 9683

Overview:

The National Center for Computational Sciences (NCCS) at Oak Ridge National Lab (ORNL), which hosts several of the world’s most powerful computer systems, is seeking highly qualified individuals to play a key role in improving the security, performance, and reliability of the NCCS computing environments.


In this role, you will work within the HPC Clusters Group inside of the NCCS Systems Section to support numerous activities of the center.


The HPC Clusters Group administers and supports the division’s HPC computing infrastructure, which includes system installation, deployment, acceptance, performance testing, upgrades, problem diagnosis, and troubleshooting.


The Systems Section within National Center for Computational Sciences Division (NCCS). The HPC Systems Section administers and supports the division’s computing, networking, and storage systems.


The NCCS provides state-of-the-art computational and data science infrastructure, coupled with dedicated technical and scientific professionals, to accelerate scientific discovery and engineering advances across a broad range of disciplines. NCCS hosts the Oak Ridge Leadership Computing Facility, one of DOE’s National User Facilities.


Major Duties/Responsibilities:

  • Answer escalated helpline calls in addition to primary project work.
  • Install and configure software, both commercial packages, and various open-source packages.
  • Identify automation opportunities to improve DevOps operations
  • Ensure the secure and effective operation of computing systems through compliance with ORNL procedures and IT Internal Operating Procedures.
  • Evaluate new technology options and vendor products
  • Work with the team to define and implement best practices and standards within the organization
  • Maintain documentation/notes on software builds and installs.
  • Automate systems administration tasks utilizing open-source configuration management tools
  • Embrace continuous integration and continuous delivery (CI/CD) processes
  • CI/CD technologies – Gitlab runners, etc.
  • Work with other systems engineers and vendors to resolve hardware and software issues.
  • Monitor systems performance.
  • Identify and document IT best practices that will improve the systems deployment function
  • System troubleshooting and problem-solving across multiple platforms (dev/test/prod)
  • Work with team to adopt a software defined infrastructure and infrastructure as code paradigm
  • Review architecture and offer recommendations for improvements
  • Configuration Management - i.e. Puppet, Ansible, etc.


Basic Qualifications:

  • A minimum of 2 years utilizing configuration management and automation tools such as Git, Ansible, Puppet or other CI/CD pipeline tools.
  • Fluency in at least one scripting language such as Bash, Python, Go or equivalent
  • Bachelor's degree in Computer Science or related technical subjects and 5 years of relevant experience. An equivalent combination of education and experience may be considered.
  • A minimum of 2 years of experience managing container infrastructure using docker.
  • A minimum of 5 years of experience managing UNIX/Linux Systems.


Preferred Qualifications:

  • Excellent written and verbal communication skills.
  • Experience with RHEL7/8
  • Experience with Nagios, Zabbix, Ganglia, and other network and device monitoring systems.
  • Demonstrated ability to balance complex research and security requirements.
  • Previous experience working in a government, scientific, or other highly technical environment.
  • Working knowledge of multiple operating systems.
  • Knowledge of networking fundamentals including TCP/IP, traffic analysis, common protocols and network diagnostics.
  • Excellent interpersonal skills suitable for user support and ability to work well with peer system administrators.
  • Experience with performance and diagnostic tools for benchmarking, analysis and tuning of systems, networking and storage.
  • Technical documentation skills, including ability to prepare simple documentation web pages.
  • Background of contributing to open-source projects or avocational endeavors such as hacker/maker spaces is desirable.
  • Ability to work independently and demonstrated analytical and problem-solving skills.


#LI-DC1


This position will remain open for a minimum of 5 days after which it will close when a qualified candidate is identified and/or hired.

We accept Word (.doc, .docx), Adobe (unsecured .pdf), Rich Text Format (.rtf), and HTML (.htm, .html) up to 5MB in size. Resumes from third party vendors will not be accepted; these resumes will be deleted and the candidates submitted will not be considered for employment.


If you have trouble applying for a position, please email [email protected].


ORNL is an equal opportunity employer. All qualified applicants, including individuals with disabilities and protected veterans, are encouraged to apply. UT-Battelle is an E-Verify employer.