Cloud Monitoring Engineer · Bengaluru, India

Naveen Kumar
Where Clouds Meet
Command.

Cloud Infrastructure · NOC Operations · DevOps in Progress

3+ years keeping enterprise cloud platforms healthy at 3AM. I monitor production SaaS environments at scale, investigate infrastructure incidents across AWS and Azure, and build automation tools that make the whole team faster.

☁️ AWS 📊 Grafana 🔭 Sumo Logic ⚡ PagerDuty 🐙 Kubernetes 🌊 Spot by NetApp 🔷 Azure
Naveen Kumar
Open to Opportunities

The Engineer Behind the Dashboard

I'm Naveen Kumar — a Cloud Monitoring Engineer based in Bengaluru, India, with 3+ years of experience in 24x7 production operations, enterprise cloud monitoring, and incident management for global SaaS platforms.

Currently at Flexera Software, I monitor production environments using Grafana, AWS CloudWatch, Sumo Logic, PagerDuty, and Spot by NetApp — ensuring platform availability for cloud-native infrastructure running on AWS and Azure.

Before that, I spent time at Amazon in Seller Partner Support, where I developed deep instincts for diagnosing production issues under pressure, communicating across engineering teams, and owning the full incident lifecycle.

I'm actively building toward a DevOps Engineering role — adding Terraform, Kubernetes administration, and CI/CD pipelines to my toolkit. My longer-term goal is to work with international cloud teams, with Europe as my target destination.

3+ Years in Cloud Ops
24x7 Production Monitoring
P1→P3 Full Incident Coverage
99.9% Uptime Mindset

What I Work With

A toolkit built for production-grade cloud observability — with DevOps skills in active development.

☁️
Cloud Platforms
Amazon Web Services Microsoft Azure EC2 Ocean Clusters Elastigroup ALB Auto Scaling SaaS Infrastructure
🔭
Monitoring & Observability
Grafana AWS CloudWatch Sumo Logic PagerDuty Spot by NetApp Alert Validation Event Correlation
🚨
Incident Management
Root Cause Analysis P1/P2/P3 Triage SLA Management ITIL Incident Mgmt Shift Handover Runbooks Post-mortems
🏗️
Infrastructure & DevOps
Kubernetes Linux Docker Terraform Git CI/CD Bash Scripting
🛠️
Tools & Platforms
Salesforce Jira Zendesk SQL (Basics) Microsoft Excel Chrome Extensions
🤝
Soft Skills
Cross-team Collaboration Stakeholder Communication Analytical Thinking Documentation Time Management
🔷
Azure Fundamentals (AZ-900)
Microsoft Certified
Earned
☁️
Cloud Computing with AWS
Internshala
Earned
⚙️
DevOps Beginners to Advanced
Udemy
Earned
🏆
AWS DevOps Engineer Professional
Amazon Web Services
In Progress
CKA – Kubernetes Administrator
CNCF
Planned
🔶
HashiCorp Terraform Associate
HashiCorp
Planned

Career Journey

From Amazon's global operations to enterprise cloud monitoring — building toward DevOps.

Cloud Monitoring Engineer
Flexera Software India Pvt. Ltd.
Feb 2026 – Present
  • Monitor enterprise production environments using Grafana, AWS CloudWatch, Sumo Logic, PagerDuty, Salesforce, and Spot by NetApp to ensure platform availability and operational health.
  • Perform 24x7 incident monitoring, alert validation, event correlation, and production support for cloud-native SaaS environments across AWS and Azure.
  • Investigate P1, P2, and P3 incidents by analysing metrics, logs, dashboards, and infrastructure behaviour to identify root causes.
  • Monitor AWS EC2, Ocean Clusters, Elastigroups, Stateful Instances, Kubernetes workloads, ALBs, and cloud infrastructure services end-to-end.
  • Collaborate with SRE, DevOps, Engineering, and Customer Support teams during major incidents to restore production services within SLA.
  • Manage customer-facing production incidents through Salesforce, ensuring timely communication, accurate documentation, and operational updates.
🏆 Notable: Developed a Salesforce Chrome Extension automation utility to improve monitoring efficiency and reduce investigation time — recognised for strong incident ownership and SLA adherence.
Grafana AWS CloudWatch Sumo Logic PagerDuty Spot by NetApp Kubernetes Salesforce RCA
Seller Partner Support Associate
Amazon Development Centre India
Jun 2023 – Nov 2025
  • Supported Amazon Seller Platform by resolving production-related issues via chat, email, and phone while maintaining high CSAT scores.
  • Investigated workflow failures, listing issues, account problems, and system anomalies by validating application behaviour and operational data.
  • Prioritised incidents based on business impact, customer severity, and SLA commitments before escalating to engineering teams.
  • Collaborated with Product, Engineering, and Operations teams to resolve customer-impacting issues efficiently.
  • Maintained detailed case documentation and communicated timely updates throughout the incident lifecycle.
  • Consistently achieved quality, productivity, and customer satisfaction targets while supporting global operations.
Incident Management SLA Management RCA Customer Communication Global Operations Escalation Management
Education
Rajkiya Engineering College, Ambedkarnagar
Graduated 2021
  • Bachelor of Technology in Information Technology
  • Dr. A.P.J Abdul Kalam Technical University
B.Tech – Information Technology AKTU

Things I've Built

A mix of professional tools and personal projects — spanning monitoring automation, cloud infrastructure, and DevOps workflows.

🔌
Salesforce Chrome Extension
Built at Flexera to automate monitoring workflows within Salesforce. Reduced incident investigation time and improved efficiency for the NOC team during production support shifts.
JavaScript Chrome Extension API Salesforce Automation
🔭
Cloud Infrastructure Monitoring Setup
End-to-end observability stack for AWS environments — Grafana dashboards, CloudWatch metric alarms, and Sumo Logic queries for log correlation. Includes alert runbooks and shift handover templates.
Grafana CloudWatch Sumo Logic PagerDuty AWS
⚙️
Terraform AWS Infrastructure (IaC)
Personal DevOps project to provision a complete AWS environment using Terraform — VPC, EC2, S3, IAM roles, and security groups — with remote state management in S3 and DynamoDB locking.
Terraform AWS VPC IAM S3 Backend
🐳
Containerised App with CI/CD Pipeline
Dockerised a Python Flask app and configured a GitHub Actions pipeline for automated testing, container builds, and deployment to an AWS EC2 instance — exploring the DevOps path hands-on.
Docker GitHub Actions Python Flask EC2
🔗 More projects and open-source contributions are available on my LinkedIn profile →

Let's Work Together

Open to full-time DevOps / Cloud roles, remote and European opportunities, and freelance cloud consulting engagements.

Whether you're a recruiter with a cloud or DevOps role, a team that needs monitoring expertise, or looking for freelance cloud support in the US timezone — I'd love to connect. I'm particularly interested in remote-first teams and roles with European relocation potential.