Changing the world through digital experiences is what Adobe’s all about. We give everyone—from emerging artists to global brands—everything they need to design and deliver exceptional digital experiences! We’re passionate about empowering people to create beautiful and powerful images, videos, and apps, and transform how companies interact with customers across every screen.
We’re on a mission to hire the very best and are committed to creating exceptional employee experiences where everyone is respected and has access to equal opportunity. We realize that new ideas can come from everywhere in the organization, and we know the next big idea could be yours!
You will be a member of the Site Reliability Engineering team of the Digital Experience. We are looking for a Senior Site Reliability Engineer with a software engineering background who is passionate about developing software to help scale monitoring, alerting, provisioning and configuration management. We are a multi-cloud environment (Azure/AWS), are security-focused and are helping customers succeed. This individual should be self-motivated and have a drive for quality.
What you'll do Engage with product and engineering to drive and improve the whole lifecycle of operational readiness - from inception and design, through deployment, operation and refinement proactively Write software layers, scripts, deployment frameworks, tracers, monitors, self-healing/auto remediation tools and automate the processes Build and maintain software modules for use and re-use in cloud and on-premise systems automation Maintain business continuity by identifying and driving opportunities to make systems highly resilient and human-free Assist our software engineering team to ensure accurate monitoring and metrics are being built into applications before going to production Maintain up-to-date documentation on deployments, processes, and standard operating procedures/run-books
What you need to succeed Bachelor’s degree in Computer Science or equivalent, and 5 years of relevant work experience Advanced experience with Linux, Internet Protocols, and Large-Scale Operations Programming (Python and Bash are our preferred scripting/shell languages) and automation skills Troubleshooting and system engineering exposure in Linux production environments; experience with Linux, Internet Protocols, and Large-Scale Operations Experience with AWS and/or Azure stack – particularly in the areas of networking (VPCs, security groups), VMs (EC2), databases (RDS), load balancing (ELB, ALB) Excellent information management practices, such as thorough documentation, usage of wikis, and other collaboration tools Ability to scope project work, estimate effort and then break down work into sub-tasks Strong intuition about system design, robustness, and scalability Excellent written and verbal communication skills, demonstrating the ability to effectively convey technical information to both technical and non-technical audiences Experience with relational databases such as MySQL, Postgres, and document stores such as MongoDB is preferred Experience deploying applications in containers using Docker and Kubernetes is preferred