Your browser does not support javascript! Please enable it, otherwise web will not work for you.

Site Reliability Engineer @ Kripya Engineering

Home > IT Infrastructure Services

 Site Reliability Engineer

Job Description

Role & responsibilities

  • Monitor Cloud Infrastructure: Continuously monitor cloud environments (AWS and Azure & GCP) using DataDog, AppTio, and Nagios to ensure optimal performance and availability.
  • Incident Management: Detect, analyse, and resolve Cloud/Infrastructure issues in a timely manner to minimize downtime and impact on services.
  • Performance Tuning: Identify and implement optimizations to improve cloud infrastructure performance.
  • Reporting: Generate and analyse monitoring reports to provide insights and recommendations for infrastructure improvements.
  • Collaboration: Work closely with development, DevOps, and Cloud/Infrastructure teams to ensure seamless integration and performance of cloud services.
  • Documentation: Maintain comprehensive documentation of monitoring setups, procedures, and best practices.
  • Compliance: Ensure all monitoring practices adhere to industry standards and compliance requirements.

Required Qualifications

  • Experience: Minimum of 1 to 2 years of experience in cloud monitoring or a related field.
  • Tools and Technologies: Proficient in using DataDog, Grafana, and Nagios for monitoring and analysis.
  • Cloud Platforms: Strong knowledge of AWS and Azure services and architecture, GCP.
  • Scripting and Automation: Experience with scripting languages (e.g., Python, Shell) and automation tools.
  • Incident Response: Proven experience in incident management and resolution.
  • Analytical Skills: Strong analytical and problem-solving skills.
  • Communication: Excellent verbal and written communication skills, with the ability to convey complex technical concepts to non-technical stakeholders.
  • OS Competency - Linux & Windows, Micro Services - Docker

Preferred Qualifications

  • Experience with Other Monitoring Tools: Familiarity with other monitoring tools and platforms is a plus.
  • DevOps Practices: Understanding of DevOps principles and practices.

Job Classification

Industry: IT Services & Consulting
Functional Area / Department: IT & Information Security
Role Category: IT Infrastructure Services
Role: Incident Management
Employement Type: Full time

Contact Details:

Company: Kripya Engineering
Location(s): Chennai

+ View Contactajax loader


Keyskills:   NAGIOS Cloud Monitoring Datadog Sre New Relic Solarwinds Prometheus Dynatrace Splunk Grafana Monitoring Tools Appdynamics

 Fraud Alert to job seekers!

₹ -5 Lacs P.A

Similar positions

Azure Infrastructure Engineer

  • Capgemini
  • 4 - 6 years
  • Mumbai
  • 3 days ago
₹ Not Disclosed

Aws Devops Engineer

  • Cognizant
  • 6 - 11 years
  • Bengaluru
  • 3 days ago
₹ Not Disclosed

Senior Devops Engineer

  • Cognizant
  • 8 - 12 years
  • Chennai
  • 3 days ago
₹ Not Disclosed

Urgent Opening For Tech Support Role With Gcp Cloud Engineer

  • Cognizant
  • 2 - 6 years
  • Coimbatore
  • 10 days ago
₹ Not Disclosed

Kripya Engineering

Kripya Engineering Private Limited Kripya Group of Companies is founded with the mission of Creating Value. Kripya LLC based in Seattle (USA) together with Kripya Engineering Private Limited and Kripya Technologies (India) Private Limited based in Chennai (India) create value to customers by off...