Data Science

OpsGuard

Operational Risk & Performance Analytics Platform

Overview

OpsGuard provides comprehensive operational risk monitoring and performance analytics. The platform uses machine learning-based anomaly detection with Isolation Forest to identify potential issues before they impact SLA compliance.

Key Highlights

  • Built an operational analytics system using Azure Data Factory, Synapse Analytics, and Power BI to process operational records per month in simulated service environments
  • Deployed anomaly detection and threshold-based alerting pipelines using Isolation Forest and statistical control charts to monitor SLA compliance, proactively flag operational risks, and reduce simulated SLA violations by ~16%
  • Automated root-cause analysis reports, reducing incident resolution time by ~21% in controlled testing scenarios

Technologies Used

SnowflakeBigQueryRedshiftSynapse AnalyticsdbtAirflowAzure Data FactoryInformatica