Unfortunately, this job posting is expired.
Don't worry, we can still help! Below, please find related information to help you with your job search.
Some similar recruitments
Saas Site Reliability Engineer And Automation Developer
Recruited by Siemens Digital Industries Software 8 months ago Address , Costa Mesa, 92627 $116,900 - $210,400 a year
Senior Site Reliability Engineer, Trello
Recruited by Atlassian 8 months ago Address , San Francisco
Site Reliability Engineer, Product - Usds
Recruited by TikTok 8 months ago Address , Los Angeles $119,000 - $289,000 a year
Site Reliability Engineer, Systems
Recruited by Anthropic 8 months ago Address , San Francisco, Ca
Site Reliability Engineer (L4/5) - Core
Recruited by Netflix 8 months ago Address , Los Gatos, Ca
Software Engineer Iii, Site Reliability Engineering, Google Cloud
Recruited by Google 9 months ago Address Sunnyvale, CA, United States
Site Acquisition Specialist - Remote
Recruited by AFL 9 months ago Address Sacramento, CA, United States
Site Reliability Engineer Jobs
Recruited by Sohum Inc 10 months ago Address San Francisco Bay Area, United States
Site Reliability Engineer Jobs
Recruited by WalkWater Technologies 10 months ago Address Cupertino, CA, United States

Principal Site Reliability Engineer

Company

Oracle

Address , Redwood City, 94065, Ca
Employment type
Salary
Expires 2023-06-26
Posted at 1 year ago
Job Description

Job description

We are looking for dynamic, energetic and forward-looking engineers to join our database cloud engineering team. Candidate must have Oracle Database Administration experience as a Site Reliability Engineer or DBA on large production environments. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of large-scale implementation of Oracle database.

Responsibilities includes development and implementation of critical test cases, certification of the mission critical technology stacks for cloud integration with focus on security, resiliency, scale, and performance. Partner with development teams and work towards addressing and fixing production issues on cloud, defining and implementing product enhancements. Collaborate with various cloud operations teams to understand the production issues and work towards to create a reproducible test cases in the lab environment to present to the development teams.

Detailed Responsibilities

  • Certification of Database products for cloud integration

o Design & Develop Highly Automated Multi-Tier/Multi-Stack Stress Test Suites/Workloads for Testing and Certifications

  • Independently setup and configure large database environments with RAC, data-guard, and Oracle Goldengate.
  • Conduct extensive testing of Oracle database HA, data protection and disaster recovery.
  • Independently install, configure, patch and upgrade large database clusters on Linux
  • Develop and implement various database life-cycle management flows.
  • Work on large engineered cloud environments on OCI and EXaCS
  • Implement and validate all Maximum Availability Architecture best practices and asses how these helps to prevent, detect, tolerate and repair from various outages.

o Develop test cases and configurations to simulate application/business critical workloads and usage scenarios.

o Develop and Maintain Test Specs/Plans/Methodologies and then Design, and Implement End-to-End Test Suites/Frameworks simulating Real-world production systems.

  • Extensive upgrade and Patching testing while simulating production like load scenarios.

  • Research and Development

o Reviewing production issues and identifying gaps in the test suite and implementing these as test cases. This may include DB Schema Design/Normalization, Data generation, Load generation and Application/Business Logic programming in Oracle SQL, PL-SQL, Perl/Shell/Python and JDBC/JMS or Python.

o Carry-out independent research and review new functionality/features in nextgen Oracle Database releases and other products.

o Performance Tuning and pro-active measurements of future planning.

o Backup & Recovery Strategy.

  • Develop Automation tools, Simulation Apps and Re-usable Framework for efficient System/Stress Testing.
  • Research and acquire skills on new technologies as needed from time to time
  • Participate in Product Feature Review, Certification experiments and User Document reviews.
  • Log and track product defects (bugs), Collaborating closely with Development teams to resolve problems encountered in these Multi-tier test simulations.

Technical qualifications

Minimum Requirements for this job role

  • 6-14 years of Oracle database administration experience on large production environments
  • Database hands on skills especially around database and system troubleshooting and administration
  • GoldenGate setup, administration and tuning
  • RAC setup and administration
  • Strong Linux/UNIX OS understanding including OS Architecture & Internals (Networking, File Systems, Process/Memory Monitoring/Tuning/Linux Virtualization etc).
  • Programming/Scripting skills in one or more of below languages is required.

o Scripting - Perl / Shell / Python, Microservices/REST APIs

o Programming - Oracle SQL, PL/SQL, Java/JDBC, Python

  • B.E, M.E./MS in CS/ECE/EE, MCA from Reputed Engineering Colleges preferred.

Preferred Requirements for this job role

  • Architect and administrating large mission critical Oracle database systems
  • Data Guard administration and tuning
  • Experience on any ERP applications like Oracle E-biz suite, Oracle Fusion Apps or Netsuite
  • Good written communication skills to create best practice and solutions
  • Experience in developing test-cases, automation and orchestration

Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Responsible for the design and delivery of the mission critical stack, with focus on security, resiliency, scale, and performance. Authority for end-to-end performance and operability. Partner with development teams in defining and implementing improvements in service architecture. Articulate technical characteristics of services and technology areas and guide Development Teams to engineer and add premier capabilities to the Oracle Cloud service portfolio. Understand and communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack. Demonstrate clear understanding of automation and orchestration principles. Act as ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs). Utilize a deep understanding of service topology and their dependencies required to troubleshoot issues and define mitigations. Understand and explain the affect of product architecture decisions on distributed systems. Professional curiosity and a desire to a develop deep understanding of services and technologies.