Job Description
FRE provides the complete centrally managed Continuous Integration & Delivery (CI/CD) solution for Oracle Fusionapps Cloud Application Portfolio. It caters to a large number of developers and release management team members (several thousands) and a business critical platform solution. As a member of FRE DevOps Team, candidate will be responsible for CI/CD Platform Activities like Observability / Monitoring / Triaging / Supporting in the various phases of CI/CD.
As a member of FRE DevOps Team, candidate will be responsible for CI/CD Platform Activities like Observability / Monitoring / Triaging / Supporting in the various phases of CI/CD.
1. Reliability and Observability Engineering (using Cloud native stands using OCI)
- Design and implement end-to-end observability (metrics, logs, traces) across applications and infrastructure
- Implement alerting strategies to reduce noise and improve incident response
- Extend Clout native monitoring solutions using tools like Prometheus, Grafana, , ELK and others
- Create dashboards for system health, performance, and business KPIs
- Integrate observability tools into CI/CD pipelines
- Implement monitoring for microservices, APIs, databases, and message queues
Incident Management:
Coordinate and collaborate with multiple teams like Fusionapps Development, QA, Corporate IT or Development DevOps etc to identify, triage and resolve the blocker issues in the scope of CI/CD Operations and implement preventive actions to minimize the Mean Time to Resolution (MTTR).
2. Hybrid CI/CD and Process Automation
Deployment Automation: Design build and maintain CI/CD pipelines
Toil Reduction: Identify, design, and implement automation solutions to eliminate repetitive manual tasks and operational toil.
Configuration/Management of Jenkins/oke clusters and other application servers/services required for apps
3.Build and Release management
- Manage and optimize builds using Maven and Gradle
- Handle dependency management and multi module projects
- Trouble shoot build, dependency , and pipeline failures
4. Operational Data and AI Knowledge Base
Knowledge Structuring: Support the effort to organize operational knowledge (runbooks, post-mortems, environment details) into a standardized, machine-readable format to enable AI and automation tools.
Context Engineering Support: Focus on linking monitoring data (metrics, logs, traces) with relevant contextual metadata (deployment ID, application version, host details) to improve the efficacy of troubleshooting and future AIOps modeling.
Technical Skills & Qualifications
Experience: Minimum 7+ years of experience in a hands-on SRE, DevOps, or highly technical Operations role. Cloud Platform: Proven professional experience working with Oracle Cloud Infrastructure (OCI) services or any cloud provider (AWS, Azure or Google).
Observability Tools: Deep practical experience with OCI Logging Analytics, OCI Monitoring, and OCI APM (or equivalent modern tools like Prometheus, Grafana, Jaeger).
Containerization: Proficiency with Docker and orchestrators like Kubernetes .
CI/CD: Hands-on experience building, maintaining, and troubleshooting CI/CD pipelines. Solid experience with Maven/Gradle
Soft skills : Strong communication and analytical skills
Systems: Expert-level knowledge of Linux operating system internals, networking (TCP/IP, DNS, Load Balancers), and distributed systems concepts.
Bonus: Exposure to AI/ML concepts, knowledge management systems, or data modelling for operational data.Exposure to OpenTelemtry.
Job Classification
Industry: IT Services & Consulting
Functional Area / Department: Engineering - Software & QA
Role Category: Software Development
Role: Data Platform Engineer
Employement Type: Full time
Contact Details:
Company: Oracle
Location(s): Hyderabad
Keyskills:
Maven
Linux
Networking
Configuration management
DNS
Incident management
Oracle
Release management
Analytics
Monitoring