Our Company
Changing the world through digital experiences is what Adobe’s all about. We give everyone, from emerging artists to global brands, everything they need to design and deliver exceptional digital experiences! We’re passionate about empowering people to create beautiful and powerful images, videos, and apps, and transform how companies interact with customers across every screen. We’re on a mission to hire the very best and are committed to creating exceptional employee experiences where everyone is respected and has access to equal opportunity. We realize that new ideas can come from everywhere in the organization, and we know the next big idea could be yours!
The Cloud Technology Organization builds platform and client services that are foundational building blocks for many other Adobe products and services. Areas of focus include identity, security, cloud storage, e-commerce, workflow management, synchronization, customer-facing web apps, scalability, infrastructure management, and search, to name a few. Our mission is to build highly scalable, available, and resilient services that fulfill the business objectives of Adobe.
The SRE team within this organization has an exciting and challenging mission: build, deploy, operate, scale, and maintain cloud platforms for customer-facing Adobe SaaS solutions used by billions of users worldwide. Adobe is looking for a Senior Manager (SRE CoE) with multiple years of experience in cloud deployments to drive critical initiatives across multiple teams.
Build an SRE Center of Excellence (CoE) to coordinate and drive initiatives that improve the reliability of services across multiple Engineering teams. Be the CoE for technologies related to Cloud, Observability, Database, FedRAMP, etc.
Ensure the highest level of uptime and Quality of Service (QoS) to Adobe’s customers through operational excellence
Drive initiatives across multiple services, such as automation, Chaos Engineering, defining critical metrics (SLIs/SLOs/SLAs), reducing MTTR and MTTD, EDA (Event Driven Automation), RPA (Robotic Process Automation), AI/ML techniques, improving end-to-end observability in microservices environments, and defining service launch readiness criteria
Partner with Engineering teams on architectural discussions to increase adoption and migration of cloud services to AWS and Azure
Hire, mentor, and build a strong team to meet our customers' ongoing need for highly reliable services
Consistent track record of building highly available distributed systems. At least 5 years of production-level experience with reliable and secure cloud-scale infrastructure (AWS and/or Azure)
5+ years of personnel management experience, preferably with employees in multiple locations
Experience driving critical initiatives related to reliability and efficiency across Cloud Engineering and Operations teams
Strong programming experience in one or more of the following languages: Java, Python, Go
Production-level expertise with Kubernetes and common Observability tools (New Relic, Splunk, Datadog, Prometheus, OpenTelemetry, etc.)
Experience with Agile methodologies and CI/CD tools
Familiarity with security frameworks such as FedRAMP, ISO 27001, SOC 2, PCI DSS, and/or HIPAA
Strong written and oral communication skills with a high degree of comfort speaking with developers and leadership
B.S. degree in Computer Science or related technical field