Leader, Site Reliability Engineering

  • Location
    Bangalore, India
  • Area of Interest
    Information Technology
  • Job Type
  • Technology Interest
  • Job Id


The Role: 

An SRE Manager is ultimately responsible for system reliability, developer productivity, and reducing time to market by reducing the technical debt of the services the SRE team supports. We seek managers who are passionate about system reliability to influence and drive the strategic SRE mission.

As a Systems Reliability Engineer working on critical services, your mission will be to ensure our services are fast, highly available, scalable, and able to withstand unprecedented increases in load. You will be at the heart of solving production problems, with a scope that runs from the kernel to the application. The position requires the flexibility to take a holistic approach to troubleshooting and the ability to delve deeply into technical details. You will be co-located with the various application development teams so that you acquire the domain knowledge needed to troubleshoot and repair an outage effectively. You will build automation tools for system health and production acceptance tests to validate production changes, and you will ensure the system is well instrumented and highly fault tolerant.

Key Leadership Responsibilities: 

  • Engage, influence, and evangelize SRE practices with development, operational and product groups to align technology service/solution delivery. 

  • Drive quality accountability within the organization with well-defined processes, metrics, and goals for process quality. This includes leading effective postmortems and ensuring action items are followed up.

  • Manage the availability, latency, scalability, and efficiency of hybrid cloud offerings by instilling reliability engineering into our development life cycle, with a focus on fault-tolerant approaches.

  • Drive capacity planning, performance analysis, instrumentation and other non-functional systems requirements. 

  • Define and report progress on strategic initiatives and project-level tasks to all stakeholders, including senior executives and clients, using communication approaches suited to each constituency.

  • Implement metrics-driven processes to ensure service quality targets are met.


Who you will work with: 

Cisco is transforming its platforms to run the next generation of cloud-native and multi-cloud services. This role offers a superb opportunity to transform how hybrid cloud platforms are managed with full-stack automation. The team is responsible for managing hybrid cloud platforms that provide self-service provisioning and management capabilities to application developers while remaining highly available, with self-healing, full lifecycle monitoring, and management capabilities.

You will be part of the Cloud Infrastructure and Platform Services team, leading a team of infrastructure software development engineers and site reliability engineers in evaluating and developing the capabilities needed to fully manage hybrid cloud platforms. You will collaborate closely with partners on the application, security, and networking teams to understand their policies and requirements. You will also engage with the CI/CD, security, and code quality metrics teams to keep our code and third-party software components in compliance.


Key skills: 

You have expert knowledge of all aspects of designing, developing, and managing hybrid cloud platforms.

You have project and process management skills.

You have prior successful experience as a systems performance or site/systems reliability engineer.

You have experience with Linux, OpenStack, VMware, Containers, and Kubernetes. 

You have experience with AWS, GCP, or Azure.

You have 10+ years of prior hands-on software development experience in Python, Java, C, or Golang. 

Exposure to CI/CD and continuous integration tools: Jenkins, SonarQube, Git.

You enjoy doing code reviews and providing technical mentorship. 

Demonstrated experience working in large, complex systems environments. 

Deep understanding of internet and networking protocols. 

A passion for performance excellence and robustness, and an engineering mindset.

You have 3+ years of demonstrated ability working in a technical management position. 

MS in Computer Science or related technical field. 

You have good interpersonal and communication skills (written and verbal).



Why Cisco 

The Internet of Everything is a phenomenon driving new opportunities for Cisco and it's redefining our customers' businesses worldwide. We are pioneers and have been since the early days of connectivity. Today, we are building teams that are rapidly growing our technology solutions in the mobile, cloud, security, IT, and big data spaces, including software and consulting services. As Cisco delivers the network that powers the Internet, we are connecting the unconnected. Imagine crafting unprecedented disruption. Your groundbreaking ideas will impact everything from retail, healthcare, and entertainment, to public and private sectors, and far beyond. Collaborate with like-minded innovators in a lively and flexible culture that has earned Cisco global recognition as a phenomenal Place To Work. With roughly 10 billion connected things in the world now and over 50 billion estimated in the future, your career has exponential possibilities at Cisco. 


At Cisco, each person brings their rare talents to work as a team and make a difference. 

Yes, our technology changes the way the world works, lives, plays and learns, but our edge comes from our people. 

  • We connect everything – people, process, data and things – taking results-oriented risks to craft the technologies that give us smart cities, connected cars, and handheld hospitals. And we do it in style with rare personalities who aren't afraid to change.

  • We innovate everywhere - From launching a new era of networking that adapts, learns and protects, to building Cisco Services that accelerate businesses and business results. Our technology powers entertainment, retail, healthcare, education and more – from cities to your everyday devices.

  • We benefit everyone - We do all of this while striving for a culture that empowers every person to be the difference, at work and in our communities. 

  • Cisco is an Affirmative Action and Equal Opportunity Employer and all qualified individuals will receive consideration for employment without regard to race, color, religion, gender, sexual orientation, national origin, genetic information, age, disability, veteran status, or any other legally protected basis. Cisco will consider for employment, on a case by case basis, applicants with arrest and conviction records. 

  • We are leaders with vision, tech geeks, pop culture aficionados, and we even have a few purple haired rock stars. We celebrate the creativity and diversity that fuels our innovation. We are dreamers and we are doers. 

Colorful hair? Don’t care. Tattoos? Show off your ink. Like polka dots? That’s cool. Pop culture geek? Many of us are. Be you, with us! 



Cisco Covid-19 Vaccination Requirements
The health and safety of Cisco's employees, customers, and partners is a top priority. Our goal is to protect and mitigate the spread of COVID-19 infection for strong business resiliency during the pandemic. Therefore, Cisco may require new hires to be fully vaccinated against COVID-19 if the role requires business-related travel, meeting with customers/partners (including visiting third-party sites on behalf of Cisco), attending trade events, and Cisco office entry, unless otherwise prohibited by applicable law, and in countries where COVID-19 vaccination is legally required. The company will consider legally required accommodations/exceptions for medical, religious, and other reasons as per the requirements of the role and in accordance with applicable law. Additional information will be provided to candidates about the requirements and accommodation process at the offer time based on region.