Software DevOps Engineer - Logging developer - 20000Q8D
Applicants are required to read, write, and speak the following languages: English
Preferred Qualifications
Cloud Reliability Services (CRS) is a strategic component that will transform the cloud operations of Oracle's Global Business Units (GBUs).
As cloud service operations evolve from a predominantly reactive model (i.e., responding to emergencies with high levels of human involvement) to a proactive model (i.e., preventing emergencies and outages with intelligent tools, services, and automation), the mission of CRS is to ensure that all GBUs can efficiently operate ultra-scalable and highly reliable SaaS / Foundation services across multiple operating models as they iterate to become Cloud Native.
Team description
The Cloud Reliability Services (CRS) team will build, deliver, and operate standardized services that enable Oracle's GBUs to operate their SaaS applications more effectively.
The CRS team is a fast-paced, highly motivated team that embodies "we" rather than "I". CRS will be a geographically diverse team, which will allow it to respond quickly to customer engagements and challenges and to adapt quickly to incoming incidents while still delivering on committed features and enhancements.
Roles & Responsibilities
The candidate will work with highly skilled, highly motivated engineers using Agile methodologies based on Scrum or Kanban, incorporating enterprise agile practices from the Scaled Agile Framework (SAFe).
The team embraces a DevOps environment: the developers are the operators. The team treats everything as code (code, configuration, infrastructure, pipelines, everything) to achieve the highest-quality product as efficiently as possible.
You will work alongside a software development team within the greater Oracle Cloud Reliability Engineering team where you will develop new features as well as expand and support existing features.
One week you may be writing automated tests for an existing feature. The next week you may be developing a new feature (design, code, test, and deploy) for a customer in our environment.
The next week you may be providing support to a customer on your new feature. You will learn new technologies based on what we already deploy and use.
You will also research and learn about new technologies and bring them to the team to improve our offerings. You will play a key role in building more intelligence into the CRS services that we deliver, so that SaaS services function more and more autonomously over time.
Per team roles and responsibilities
- Work with the Product Owner and team members to build new features and enhancements, while supporting existing capabilities that are heavily utilized across all Oracle Global Business Units
- Plan, design, code, document, and test new Oracle Cloud Reliability services used by multiple Oracle Software as a Service products
- Develop software using Agile methodologies and participate as a member of scrum development teams
- Use Everything-As-Code methodologies to ensure traceability, configurability, immutability, repeatability, and governability
- Participate in a follow-the-sun model for 24x7 support of CRS services, with designated engineers on a rotating basis
- Manage and continuously improve existing CRS capabilities
- Review and approve the work products of other team members
- Support the operation of services using DevOps methodologies for the rapid introduction to production of new services and operational enhancements
- Provide technical thought leadership and mentor junior colleagues
- Attend training as required to meet Oracle and CRS compliance and regulatory standards
- Perform daily tasks in accordance with compliance and regulatory standards
- Other duties as assigned

General Qualifications
The candidate will:
- Explore and learn multiple cutting-edge technologies in the cloud industry
- Solve complex technical problems and communicate effectively in a team environment
- Understand CI/CD best practices
- Be self-motivated
- Advance automation of standard / recurring tasks
- Assimilate and apply new technologies
- Work with remote, global teams as well as individually
- Produce documentation for application engineers in support of developed work

The candidate should also have:
- Experience with development / test in an open source environment, including operation of SSH and shell functions
- Strong networking knowledge
- Experience with SCM tools and software engineering best practices
- Agile methodology knowledge

Preferred Qualifications
An ideal candidate will have expertise with:
- Elasticsearch, Logstash, and Kibana (ELK Stack); experience is expected
- ELK alternatives such as Splunk, Loggly, and other log management tools (an advantage)
- Linux / Unix development (Oracle Linux preferred)
- CI/CD (Jenkins and GitLab CI)
- Containers and orchestration (Docker, Kubernetes, and docker-compose)
- Cloud computing platform (Oracle Cloud Infrastructure Services)
- Programming and scripting languages (Python, bash, JavaScript; additional experience with Groovy, Java, Ansible, and / or Go is a plus)
- Git version control and collaboration (GitLab)
- Issue tracking and collaboration (Jira and Confluence)
- Product / Service ownership or Project Management experience (a plus)
- ITIL V3 or V4; Foundation Level certification preferred

Support
Oracle GBUs provide services to many critical systems globally, requiring 24x7 support.
DevOps engineers will rotate with other team members in a designated on-call status, following in-country requirements. CRS's primary support model is follow-the-sun, utilizing geographically diverse team members during their normal working hours to provide support.
CRS will strive to have subject matter experts distributed globally. Geographic diversity brings a broader array of national and regional holidays, which requires a flexible support schedule across multiple geographies.
Additionally, coverage is required through the weekend. To provide the required support to Oracle customers, CRS will use additional compensation for extended business hours and / or on-call pay, based on in-country laws and Oracle policy.
In general, there will be a Primary and Secondary engineer designated in advance to provide coverage for select services.
If numerous services are supported in a specific geographic region, there may be more than one set of Primary / Secondary engineers selected.
Software engineers and database administrators will engage in activities to restore services that are down or degraded. This may be as simple as running an existing script to restart a service or executing a standard operating procedure, or may require code changes with review steps, integration, testing, and software deployments to restore a service to normal operation.
In addition, engagement with other Oracle development and support teams may be needed.