Job Information

CVS Health Cloud Engineer - Observability Platforms in Saint Paul, Minnesota

Bring your heart to CVS Health. Every one of us at CVS Health shares a single, clear purpose: Bringing our heart to every moment of your health. This purpose guides our commitment to deliver enhanced human-centric health care for a rapidly changing world. Anchored in our brand — with heart at its center — our purpose sends a personal message that how we deliver our services is just as important as what we deliver. Our Heart At Work Behaviors™ support this purpose. We want everyone who works at CVS Health to feel empowered by the role they play in transforming our culture and accelerating our ability to innovate and deliver solutions to make health care more personal, convenient and affordable.

Job Description

A Brief Overview

Location: Hybrid or Remote potential

  • Woonsocket, RI

  • Florham Park, NJ

  • Chantilly, VA

  • Alpharetta, GA

  • Soho, NY

  • Hartford, CT

CVS Health Enterprise Engineering plays a critical part in shaping the future of CVS Health. If you’re looking for the chance to leverage advanced technology to redefine the CVS Application Platform landscape, enhance the customer experience, and improve engineers’ lives on a day-to-day basis, this is the opportunity for you. Join us and challenge your IT expertise and analytical skills to help create a better engineering experience for our customers within CVS Health.

The platform is focused on providing a seamless developer experience, identifying and analyzing system design weaknesses, and troubleshooting complex technical issues. In addition, this role will primarily provide technical support to the team on Observability Platform capabilities and on Alerting and Monitoring Solutions. You should be able to learn quickly, shadow senior engineers on the team, and communicate clearly. You should also have excellent Software Engineering and collaboration skills. A successful candidate will be a highly motivated, collaborative individual who is driven to achieve results in a fast-paced environment.


  • Implement technical capabilities and collaborate with other Observability Engineers on the team on technical solutions.

  • Collaborate with the Team, Principal Engineer, Architect, and Engineering Leaders to understand the Roadmap and help the team deliver those capabilities on time.

  • Build and maintain monitoring and alerting systems that provide timely feedback on the performance and health of our systems/applications. Continuously improve infrastructure and applications to ensure 99.99% uptime while removing architectural complexity.

  • Adopt and implement best coding practices and documentation standards.

  • Improve the Application Observability product. You will build dashboards and provide guidance on how to monitor various technologies covered by OpenTelemetry components.

  • Develop and maintain non-functional requirements for Observability Metrics such as SLAs and SLOs.

  • Help monitor and maintain Production environments using experience with Loki, Grafana, Prometheus, and Alertmanager.

  • Hands-on experience creating Grafana operational dashboards and data visualizations.

  • Achieve material improvements in system performance based on insights from observability metrics.

  • Provide 24/7 operations support for production and other critical environments to ensure 99.99% availability of our systems.

  • Work on OpenTelemetry-based solutions. We have plans to ship Grafana OpenTelemetry distributions for Java, React, Angular, Node.js, etc.

  • Clear understanding of SRE best practices, performance management, capacity analysis, and creating fault-tolerant deployment patterns.

  • Write documentation. As an instrumentation expert, you will write documentation that makes it easy for Grafana Cloud users to instrument their applications with OpenTelemetry and get started with Grafana Cloud.

  • Teach others. Share your knowledge of OpenTelemetry, semantic conventions, and various technologies and frameworks with both Grafana squads and customers.

  • Help customers on a priority basis whenever they reach out on support channels.

  • A passion for staying up to date with the latest trends and technologies in public cloud environments.

  • Exceptional analytical skills, with the ability to apply knowledge and experience in decision-making to arrive at creative and commercial solutions.

  • Excellent verbal and written communication skills.

What You’ll Do Day To Day

  • Design and implement observability platform strategies.

  • Improve reliability, stability, and performance of production systems.

  • Implement automation of engineering and operations processes for the observability platform.

  • Optimize observability practices.

  • 24/7 on-call rotation support as needed. On-call support will be rotated among team members.

  • Maintenance and administration of source control systems for the observability platform.

  • Help customers on a priority basis whenever they reach out on support channels.

Minimum requirements for this role:

  • 3+ years of experience in Cloud Engineering, Site Reliability Engineering

  • 2+ years of experience with Observability Platform tooling such as Grafana, Loki, Prometheus

  • 2+ years of experience with Docker, Kubernetes, and Helm

  • 2+ years of experience with a cloud platform (Google Cloud or AWS); GCP is preferred

Preferred Skills:

  • Exposure to MySQL, PostgreSQL or any other RDS databases

  • 2+ years of experience with CD tools such as Argo CD, Harness, etc. Argo CD is preferred.

  • 2+ years of experience with OpenTelemetry (OTel) tooling

  • 2+ years of experience in open-source frameworks and 1+ years of experience with Tempo

  • 2+ years of strong Linux OS-level, command-line, and scripting knowledge (e.g., Go, Python, Bash), and of configuration management principles

  • Experience with SaaS Architecture and with the development and operation of high-traffic backend systems

  • Experience with infrastructure as code using Ansible, Terraform, etc.

  • Strong exposure to microservices and web app architecture

  • Experience in architecting, implementing, and managing monitoring tools such as Prometheus/Grafana, CloudWatch, New Relic, and ELK in the cloud

Educational Qualification:

  • A Bachelor's or Master's degree in Computer Science, Electrical Engineering, a related field, or equivalent industry experience.

Pay Range

The typical pay range for this role is:

$64,890.00 - $154,500.00

This pay range represents the base hourly rate or base annual full-time salary for all positions in the job grade within which this position falls. The actual base salary offer will depend on a variety of factors including experience, education, geography and other relevant factors. This position is eligible for a CVS Health bonus, commission or short-term incentive program in addition to the base pay range listed above. In addition to your compensation, enjoy the rewards of an organization that puts our heart into caring for our colleagues and our communities. The Company offers a full range of medical, dental, and vision benefits. Eligible employees may enroll in the Company’s 401(k) retirement savings plan, and an Employee Stock Purchase Plan is also available for eligible employees. The Company provides a fully-paid term life insurance plan to eligible employees, and short-term and long-term disability benefits. CVS Health also offers numerous well-being programs, education assistance, free development courses, a CVS store discount, and discount programs with participating partners. As for time off, Company employees enjoy Paid Time Off (“PTO”) or vacation pay, as well as paid holidays throughout the calendar year. The number of paid holidays, sick time, and other time off is provided consistent with relevant state law and Company policies. For more detailed information on available benefits, please visit jobs.CVSHealth.com/benefits.

We anticipate the application window for this opening will close on: 07/14/2024