AppD Site Reliability Engineer - EUM

  • Location
    Offsite, RTP, North Carolina, US
  • Alternate Location
    Anywhere in the US remote
  • Area of Interest
    Engineer - Software
  • Job Type
  • Technology Interest
    Cloud and Data Center
  • Job Id

About Us

Cisco AppDynamics is an application performance monitoring solution that uses machine learning and artificial intelligence (AI) to provide real-time visibility and insight into IT environments. With our outstanding AIOps solution, you can take the right action at exactly the right time with automated anomaly detection, rapid root-cause analysis, and a unified view of your entire application ecosystem, including private and public clouds. Using Cisco AppDynamics, you’ll finally align IT, DevOps, and the business around the information that helps you protect your bottom line and deliver detailed customer experiences at scale.

About You

First and foremost, you have strong troubleshooting and problem resolution skills. You stay cool under pressure and have strong written and verbal communication skills. You pride yourself on being a self-starter who has experience working in a constantly evolving environment! You also have:

  • Bachelor's degree in CSE, EE, CSM, or related technical field; MS degree desired
  • A combined 5+ years of Site Reliability, DevOps, and/or Software Development experience, ideally in a growth-stage environment
  • Experience operating and supporting sophisticated SaaS production or revenue-critical 24/7 web services environments
  • Must have experience developing and operationalizing system installations and upgrades
  • Strong experience with Unix/Linux system administration, especially Red Hat Linux (CentOS)
  • Solid understanding of microservice architectures, principles, and patterns
  • Strong understanding of SQL and NoSQL, including experience working with one or more databases (e.g., MySQL, Druid, DynamoDB, or Elasticsearch)
  • Experience running services in AWS or other cloud platforms (Azure, GCP)
  • Significant experience with scripting/coding languages, ideally with Terraform or Python
  • Experience with big data platform engineering
  • Experience with scaling and operationalizing distributed data stores, file systems, and services (Kafka, Elasticsearch, etc.)
  • Experience with virtualization and containerization platforms (Docker), container orchestration tools (Kubernetes), and Kubernetes tooling that eases delivery (Istio/Helm/kube2iam)
  • Availability for occasional on-call after-hours support

About the Role

End-User Monitoring (EUM), also known as Real User Monitoring (RUM), measures the experience of real users by collecting performance data on end-user devices such as browsers, mobile applications, and IoT devices. Another aspect of EUM is Synthetic User Monitoring (SUM), where we allow customers to test scripted flows against their websites from browsers deployed around the world. All the performance data is processed in a highly scalable infrastructure built on Amazon Web Services (AWS).

We are looking for a key member of the team to support our EUM Platform in a DevOps/SRE role. Primary responsibilities include:

  • Helping to build an infrastructure to facilitate rapid service deployments
  • Documenting findings and recommendations for improvement
  • Helping lead full-stack platform infrastructure projects
  • Using configuration management tools and cookbooks (Ansible/Chef)
  • Maintaining and enhancing deployment tools and methodologies, and playing a lead role in advancing our infrastructure-as-code architecture
  • Scaling products with AWS infrastructure components, namely EKS, S3, load balancing, DynamoDB, EMR, Kinesis, SQS, API Gateway, etc.
  • Working toward zero downtime, zero data drop, continuous deployment, and multi-tenant, multi-version operation
  • Making recommendations to, and collaborating with, engineering to ensure 100% application uptime
  • Monitoring the SaaS environment and working with Development, QA and Performance teams to identify and solve problems
  • Ensuring that failover mechanisms are in place and working correctly
  • Responding to and resolving technical emergencies


We know that the award-winning culture at AppDynamics is something to brag about, but here are more reasons to be excited to get out of bed and come in each morning:

  • Medical, dental, vision coverage
  • 401k match (4.5%)
  • Wellness perks (gym, hobbies, education, store discounts, personal finance)
  • 4 weeks PTO, 5 days VTO, 14 holidays (including 1 birthday PTO and 1 floating holiday)
  • Company-wide shutdown between Christmas and New Year's!

Just a Note

Note to Recruiters and Placement Agencies: AppDynamics does not accept unsolicited agency resumes. Please do not forward unsolicited agency resumes to our website or to any AppDynamics employee. AppDynamics will not pay fees to any third-party agency or firm and will not be responsible for any agency fees associated with unsolicited resumes. Unsolicited resumes received will be considered property of AppDynamics.

AppDynamics is an equal opportunity employer and considers all qualified applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, age, protected veteran status, or any other unlawful factor. AppDynamics follows all applicable laws, including those regarding consideration of qualified applicants with criminal histories (such as the San Francisco Fair Chance Ordinance). If your disability makes it difficult for you to use this site, please contact us. AppDynamics participates in E-Verify.


Cisco COVID-19 Vaccination Requirements
The health and safety of Cisco's employees, customers, and partners is a top priority. Our goal is to protect against and mitigate the spread of COVID-19 infection for strong business resiliency during the pandemic. Therefore, Cisco may require new hires to be fully vaccinated against COVID-19 if the role requires business-related travel, meeting with customers/partners (including visiting third-party sites on behalf of Cisco), attending trade events, and Cisco office entry, unless otherwise prohibited by applicable law, and in countries where COVID-19 vaccination is legally required. The company will consider legally required accommodations/exceptions for medical, religious, and other reasons per the requirements of the role and in accordance with applicable law. Additional information about the requirements and accommodation process will be provided to candidates at offer time, based on region.