Principal Reliability Engineer Jobs
By Novartis At Cambridge, MA, United States
3+ years of people leadership, project management, and in collaborating across boundaries experience
Experience in Data Management & Systems, preferably in data security
Broadly experienced specialists managing a small unit OR project. May be responsible for managing others -Leads/co‐leads novel projects within the team
Experience in implementing DevOps tools and practices for product and services teams
Experience handling a large volume of data
Experience with AWS and containers

Are you looking for an opportunity to make a real impact on the reliability and scalability of a product? We are looking for a Principal Site Reliability Engineer to join our team and help us build and maintain a reliable and scalable platform. You will be responsible for ensuring the availability and performance of our services, as well as developing and implementing strategies to improve system reliability. If you are passionate about technology and have a strong background in system engineering, we want to hear from you!

What is Principal Site Reliability job Skills Required?

• Expertise in system architecture, system design, and system engineering
• Knowledge of cloud computing, distributed systems, and DevOps
• Ability to troubleshoot complex systems and identify root causes
• Experience with automation and scripting languages such as Python, Bash, and PowerShell
• Knowledge of monitoring and logging tools such as Splunk, ELK, and Prometheus
• Understanding of networking protocols and technologies
• Ability to work in a fast-paced environment

What is Principal Site Reliability job Qualifications?

• Bachelor’s degree in Computer Science, Information Technology, or related field
• 5+ years of experience in system engineering, system architecture, or related field
• Experience with cloud computing platforms such as AWS, Azure, or GCP
• Experience with container technologies such as Docker and Kubernetes
• Experience with configuration management tools such as Chef, Puppet, or Ansible
• Experience with automation and scripting languages such as Python, Bash, and PowerShell
• Knowledge of monitoring and logging tools such as Splunk, ELK, and Prometheus

What is Principal Site Reliability job Knowledge?

• Knowledge of system architecture, system design, and system engineering
• Knowledge of