Your browser does not support javascript! Please enable it, otherwise web will not work for you.

Sr. Site Reliability Engineer, DocCloud @ Adobe

Home > Software Engineer

 Sr. Site Reliability Engineer, DocCloud

Job Description


Our Company
Changing the world through digital experiences is what Adobe's all about. We give everyone-from emerging artists to global brands-everything they need to design and deliver exceptional digital experiences! We're passionate about empowering people to create beautiful and powerful images, videos, and apps, and transform how companies interact with customers across every screen.
We're on a mission to hire the very best and are committed to creating exceptional employee experiences where everyone is respected and has access to equal opportunity. We realize that new ideas can come from everywhere in the organization, and we know the next big idea could be yours!
Position summary
The AdobeDocument CloudSite Reliabilityteam is responsible for delivering a scalable, reliable and secure computing environment to support the millions of transactions that happen every day. We are looking to expand ourSite Reliability Engineeringteam as we embark on a new phase of growth for our product. We are a metrics-driven organization that strives to deliver world-class service both externally and internally. The team strongly believes in the DevOps/SRE methodology and works very closely with our peers on the development team.
Responsibilities
  • Implement and support AdobeDocument Cloudhosted web applications, virtual machines, databases, storage systems, and service buses in cloud deployments by working with engineering organizations in support of development and test functions
  • Engage with product and engineering todrive and improve the whole lifecycle of operational readiness - from inception and design, through deployment, operation and refinement proactively.
  • Troubleshoot issues across multiple systems or domains of varied complexities
  • Contributes to architecture, design, and code while factoring in business priorities
  • Support various UNIX-based services to ensure maximum uptime, performance and security and participate in RCAs
  • Document every action so that your learning turn into repeatable actions and then into automation
  • Collaborate effectively and mentor other engineers in the team and in the larger reliability engineering org
  • Analyze performance trends across a variety of systems for capacity planning
  • Work closely with engineering and QA teams to roll out new products and services
  • Handle day-to-day system administration tasks such as account management, patching, application deployment, system installations, and other routine maintenance
  • Own and enforce security compliance processes and controls
  • Programmatically automate routine cloud deployment, administration and monitoring tasks
  • Seeks out and learns new technologies & techniques and advocates for their use
  • Participate in 24x7 on-call pager rotation

Requirements
  • 8+ years of experience in a production (Web Facing) Linux, Solaris or *BSD environments at medium to large scale
  • Deep experience with AWS,Azureincluding migrating services to AWS, Azure
  • Ability and determination to solve complex system/application problems
  • Relentless approach to getting to the bottom of any problem
  • Experience with MySQL, Java, Apache, & Tomcat
  • Experience with configuration management tools like Chef, Puppet or CFengine
  • Experience with containerization with Docker, Kubernetes/EKS/AKS
  • Experience with CI/CD with Jenkins, Groovy DSL
  • Familiarity with Prometheus, Cortex, Grafana, NewRelic, DataDog, and Splunk
  • Knowledge of key protocols including TCP/IP, SSH, DNS, SMTP, SNMP, SSL, HTTP and LDAP
  • Experience with different caching architectures
  • Knowledge of security compliance frameworks, such as SOC II, PCI, HIPPA, ISO27001 and FedRAMP
  • Strong programming skills, particularly with anyone of Go(preferred), Python and Java
  • Knowledge of well-known open source tools for monitoring, trending and configuration management
  • A desire to provide a reliable, secure and scalable environment that supports millions of users
  • Ability to architect and help create a highly automated environment
  • Participate in the incident management process
  • Assist in the creation and refinement of operational documentation
  • Manage our uptime and performance using service level indicators and objectives
  • Excellent verbal and written communication skills
  • Self-driven, eager to gets things done

Employement Category:

Employement Type: Full time
Industry: IT
Functional Area: IT
Role Category: Software Engineer
Role/Responsibilies: Sr. Site Reliability Engineer, DocCloud

Contact Details:

Company: Adobe
Location(s): Noida, Gurugram

+ View Contactajax loader


 Job seems aged, it may have been expired!
 Fraud Alert to job seekers!

₹ Not Specified

Adobe

www.adobe.comwww.adobeindia.com