Overview
OpsGuard provides comprehensive operational risk monitoring and performance analytics. The platform uses machine learning-based anomaly detection with Isolation Forest to identify potential issues before they impact SLA compliance.
Key Highlights
- Built an operational analytics system using Azure Data Factory, Synapse Analytics, and Power BI to process operational records per month in simulated service environments
- Deployed anomaly detection and threshold-based alerting pipelines using Isolation Forest and statistical control charts to monitor SLA compliance, proactively flag operational risks, and reduce simulated SLA violations by ~16%
- Automated root-cause analysis reports, reducing incident resolution time by ~21% in controlled testing scenarios
Technologies Used
SnowflakeBigQueryRedshiftSynapse AnalyticsdbtAirflowAzure Data FactoryInformatica