Job Description
Our CompanyChanging the world through digital experiences is what Adobe's all about. We give everyone-from emerging artists to global brands-everything they need to design and deliver exceptional digital experiences! We're passionate about empowering people to create beautiful and powerful images, videos, and apps, and transform how companies interact with customers across every screen.
We're on a mission to hire the very best and are committed to creating exceptional employee experiences where everyone is respected and has access to equal opportunity. We realize that new ideas can come from everywhere in the organization, and we know the next big idea could be yours!
Our companyAt Adobe, we are changing the world. How We give people the tools to bring their ideas to life and create content that makes life more fun and work more meaningful. We give businesses and organizations the power to truly engage their customers. We're the ones behind the gorgeously designed content that streams across your laptop, TV, phone, and tablet every day-and we're the ones who harness the massive power of big data to help companies move from data to insight and insight to action by delivering content that people crave most.
We're a company that understands that product innovation comes from people innovation, and that's why we invest in cultivating leaders throughout the organization. If you are passionate about leading from where you sit, join us.
Roles and Responsibilities:. Improving & Optimizing Availability, Performance, Capacity of infrastructure, and applications.
. Building and implementing observability for applications health/performance/capacity.
. Optimizing On-call rotations and processes.
. Documenting 'tribal' knowledge and spread the knowledge across the team
. Building Auto-remediation/Event Driven Automation solutions on platforms
- Cloud Platforms (AWS/Azure)
- Mesos/Kubernetes
- CICD
- Observability (Prometheus/New Relic/ELK)Cloud Platforms (AWS/ Azure)
- Databases (MySQL/Mongo/Cassandra)
- Data Platforms Infrastructure (Hadoop)
- Stackstorm (Preferred Event Driven Auto-remediation platform)
. Providing help in onboarding new auto-remediations solutions
. Providing reports/dashboards on services SLO/Error Budgets/Alerts and Operational Overhead.
. Working with various teams to define SLO/Error Budgets/Alerts/identifying and reducing toil via auto-remediation.
. Working with the Dev team to have an in-depth understanding of the application architecture
and its bottlenecks.
. Identifying observability gaps in product services, infrastructure and working with stakeholders to fix it.
. Managing Outages and doing detailed RCA with developers and identifying ways to avoid that situation.
. Managing/Automating upgrades of the infrastructure services.
. Automate toil work
Experience & Skills:. 7+ years of experienceas an SRE/DevOps/Infrastructure Engineer on large scale microservices and infrastructure.
. A collaborative spirit with the ability to work across disciplines to influence, learn and deliver
. Self-Learner and Passion towards automation, innovation, avoid repetitive manual tasks
. Contributor or Follower in Open Source technologies, active in developer forums, solving problems, providing recommendations for code fixes
. Accountability and Ownership - Self-drive towards delivery
. Thirst for Innovation & Challenges
. A deep understanding of computer science, software development, and networking principles.
. Demonstrated experience with languages, such as Python, Java, Golang, etc.
. Strong programming on Python (Core - Strong, Advanced - Foundational)
. Basic Linux Programming (shell script, awk script, regular expressions, grep, sed)
. Extensive experience with Linux administration and a good understanding of the various Linux kernel subsystems (memory, storage, network, etc).
. Extensive experience in DNS, TCP/IP, UDP, GRPC, Routing, and Load Balancing.
. Expertise in GitOps, Infrastructure as Code tools such as Terraform, etc.. and Configuration Management Tools such as Chef, Puppet, Saltstack, Ansible.
. Expertise of Amazon Web Services (AWS) and/or other relevant Cloud Infrastructure solutions like Microsoft Azure or Google Cloud.
. Experience in building CI/CD solutions with tools such as Jenkins, GitLab, Spinnaker, Argo etc.
. Experience in managing and deploying containerized environments using Docker, Mesos/Kubernetes is a plus.
. Experience with multiple datastores is a plus (MySQL, PostgreSQL, Aerospike, Couchbase, Scylla, Cassandra, Elasticsearch).
. Experience with data platforms tech stacks like Hadoop, Hive, Presto, etc is a plus.
. Understanding of ssh, sftp, Message queues protocols (AMQP, SQS, MQTT, STOMP, Kafka, RabbitMQ, ActiveMQ)
. Data Structures & Algorithms, Solid OOPS Understanding
. Working knowledge of MEAN stack & Front end UI/UX technologies (MongoDB, angularJS, node.js)
. Building & Managing Webservices in Python/Java (REST/SOAP)
. Working Understanding on JIRA, Agile scrum processes (JIRA)
Employement Category:
Employement Type: Full time
Industry: IT
Functional Area: IT
Role Category: Software Engineer
Role/Responsibilies: Computer Scientist
Contact Details:
Company: Adobe
Location(s): Bengaluru