Lead SRE Engineer IT

  • Location
    RTP, North Carolina, US
  • Additional Location(s)
  • Area of Interest
    Information Technology
  • Job Type
  • Technology Interest
    Cloud and Data Center, Networking, Security
  • Job Id

Location is open!

It’s an exciting time to be a part of Cisco IT’s Infrastructure and Container Services. Our team is responsible for building and operating on-premises clouds on Cisco ACI software-defined networking in a DevSecOps model. We are enabling cloud native infrastructure within Cisco.

Who You Are:

You are a success-driven Lead Site Reliability Engineer with proven leadership skills and a passion for enterprise cloud infrastructure automation and DevOps frameworks. You have a consistent record of designing, developing, and leading cloud infrastructure operations code using open source technologies.

What You’ll Do:

Site Reliability Engineers are responsible for reliability, scalability, automation, and other issues related to the uptime and availability of our on-premises cloud. You will need solid skills in the following areas:

  • Design, write and build tools to improve the reliability, availability and scalability of our OpenStack/VMware/OpenShift clouds.
  • Augment existing instrumentation to build a cohesive picture of the characteristics of our systems with special attention to points of failure.
  • Design and develop resilience-focused improvements to our production systems to achieve and surpass SLOs.
  • Help improve our operational practices to minimize service disruptions.
  • Work with our Service Assurance team to modernize and improve our monitoring and alerting stack.
  • Design new monitoring tools and smart alerts that help us discover failures or issues before our customers do.
  • Work with engineers to identify root causes and fix issues.
  • Influence, design and create new architectures, standards and methods for large-scale enterprise systems.
  • Maintain services once they are live by measuring and monitoring availability, latency and overall system health.

Who You'll Work With:

We are a DevOps team inside Cisco IT building the next-generation cloud platform that will be used by all of Cisco as we move to cloud native applications. This is a small team of highly motivated individuals using Agile Scrum. We move at a fast pace and are passionate about cloud and automation. Giving back and contributing to the open source projects we leverage is encouraged. Where no tool or project delivers what we need, we develop it. We have a history of building clouds at a large scale and are looking for someone as passionate about cloud as we are.

Technical Expertise:

Experience with tools like Elasticsearch, Logstash, Nagios, Grafana, Graphite, InfluxDB, StatsD, and collectd

Experience with building and maintaining Red Hat or CentOS Linux

Experience with configuration automation using Ansible

Experience with public clouds like AWS, GCP, or Azure

Experience with on-premises cloud technologies using VMware or OpenStack

Experience with container technologies like OpenShift, Kubernetes, and Docker

Experience with the software development lifecycle, including design, development, testing, packaging, deployment, upgrade and support

Experience with software development tools like Git, Gerrit, Spinnaker, and Jenkins

Python, Go, or similar programming experience

Experience with QA and testing of your own code and the entire platform

Understanding of security, including OS hardening, firewalls, iptables, and working with InfoSec

Understanding of network basics like routers and switches

Non-Technical Requirements:

Leadership in crafting and maintaining SRE technologies

Leadership in Agile software development practices

Leading geographically distributed teams

Understanding of IT processes, including architecture, design, implementation, and operations

Open source development experience

Self-motivated, able and willing to help where help is needed

Able to build relationships, be culturally sensitive, align on goals, and learn quickly

Typically requires BS/BA and 10+ years of relevant experience.

Why Cisco:

#WeAreCisco, where each person is unique, but we bring our talents to work as a team and make a difference. Here’s how we do it.

We embrace digital, and help our customers implement change in their digital businesses. Some may think we’re “old” (30 years strong!) and only about hardware, but we’re also a software company. And a security company. A blockchain company. An AI/Machine Learning company. We even invented an intuitive network that adapts, predicts, learns and protects. No other company can do what we do – you can’t put us in a box!

But “Digital Transformation” is an empty buzz phrase without a culture that allows for innovation, creativity, and yes, even failure (if you learn from it).

Day to day, we focus on the give and take. We give our best, we give our egos a break and we give of ourselves (because giving back is built into our DNA). We take accountability, we take bold steps, and we take difference to heart. Because without diversity of thought and a commitment to equality for all, there is no moving forward.

So, you have colorful hair. Don’t care. Tattoos. Show off your ink. Like polka dots. That’s cool. Pop culture geek. Many of us are. Passion for technology and changing the world? Be you, with us.

We Are Cisco