Lead SRE Engineer IT
Location:RTP, North Carolina, US
Area of InterestInformation Technology
Technology InterestCloud and Data Center, Networking, Security
Location is open!
It’s an exciting time to be a part of Cisco IT’s Infrastructure and Container Services. Our team is responsible to build and operate on-premise clouds on Cisco ACI Software Defined Network in a DevSecOps model. We are enabling Cloud Native Infrastructure within Cisco.
Who You Are:
You are a success driven Lead Site Reliability Engineer with shown leadership skills and who has a passion for enterprise cloud infrastructure automation and DevOps frameworks. You have a consistent record of designing, developing and leading cloud infrastructure Ops code using open source technologies.
What You’ll Do:
Site Reliability Engineers are responsible for reliability, scalability, automation, and other issues related to uptime and availability of our on-premise cloud. You will need to have solid skills in following areas:
- Design, write and build tools to improve the reliability, availability and scalability of our Openstack/VMWare/Openshift clouds.
- Augment existing instrumentation to build a cohesive picture of the characteristics of our systems with special attention to points of failure.
- Design and develop improvements, focused on resilience, to our production systems to achieve and surpass SLOs
- Help improve our operational practices to minimize service disruptions
- Work with our Service Assurance team to modernize and improve our monitoring and alerting stack.
- Design new tools to monitor and smart alerts that help discover failures or issues before our customers.
- Work with engineers to identify root cause and fix issues
- Influence, design and create new architectures, standards and methods for large-scale enterprise systems.
- Maintain services once they are live by measuring and monitoring availability, latency and overall system health.
Who You'll Work With:
We are a DevOps team inside Cisco IT building the next generation cloud platform that will be used by all of Cisco as we move to cloud native applications. This is a small team of highly motivated individuals using Agile scrum. We move at a fast pace and are passionate about cloud and automation. Giving back and contributing to the Opensource projects we leverage is encouraged. Where there is not a tool or project to deliver what we need we develop it. We have a history of building clouds at a large scale and are looking for someone with just as much passion about cloud as we have.
Experience with tools like Elasticsearch, Logstash, Nagios, Grafana, Graphite, InfluxDB, StatsD, and CollectD
Experience with building and maintaining Redhat or Centos Linux
Experience with configuration automation using Ansible
Experience with public cloud like AWS, GCP, or Azure
Experience with on-premise cloud technologies using VMware or Openstack
Experience with container technologies like Openshift, Kubernetes, and Docker
Software development lifecycle including design, development, testing, packaging, deployment, upgrade and support.
Experience with software development tools like Git, Gerrit, Spinnaker, and Jenkins
Python, Go, or similar programming experience.
QA and testing experience of your code and the entire platform.
Understanding of security including OS hardening, firewalls, iptables, and working with Infosec
Understanding of network basics like routers and switches
Leadership in crafting and maintaining SRE technologies
Leadership in Agile software development practices
Leading geographically distributed teams
Understand IT processes, including: architecture, design, implementation, and operations
Opensource development experience
Self-motivated, able and willing to help where help is needed
Able to build relationships, be culturally sensitive, have goal alignment, have learning agility
Typically requires BS/BA and 10+ yrs of relevant experience.
#WeAreCisco, where each person is unique, but we bring our talents to work as a team and make a difference. Here’s how we do it.
We embrace digital, and help our customers implement change in their digital businesses. Some may think we’re “old” (30 years strong!) and only about hardware, but we’re also a software company. And a security company. A blockchain company. An AI/Machine Learning company. We even invented an intuitive network that adapts, predicts, learns and protects. No other company can do what we do – you can’t put us in a box!
But “Digital Transformation” is an empty buzz phrase without a culture that allows for innovation, creativity, and yes, even failure (if you learn from it.)
Day to day, we focus on the give and take. We give our best, we give our egos a break and we give of ourselves (because giving back is built into our DNA.) We take accountability, we take bold steps, and we take difference to heart. Because without diversity of thought and a commitment to equality for all, there is no moving forward.
So, you have colorful hair. Don’t care. Tattoos. Show off your ink. Like polka dots. That’s cool. Pop culture geek. Many of us are. Passion for technology and world changing? Be you, with us.
We Are Cisco