Finding Needles in the Haystack - A CEP Approach to Detect Recurring Grid Issues

dc.contributor.authorLarsson, Erik
dc.contributor.authorNgo, Josef
dc.contributor.departmentChalmers tekniska högskola / Institutionen för data och informationstekniksv
dc.contributor.departmentChalmers University of Technology / Department of Computer Science and Engineeringen
dc.contributor.examinerMassimiliano Gulisano, Vincenzo
dc.contributor.supervisorMassimiliano Gulisano, Vincenzo
dc.date.accessioned2025-10-28T14:26:11Z
dc.date.issued2025
dc.date.submitted
dc.description.abstractThe digitalization of electricity grids through the Advanced Metering Infrastructure (AMI) has led to unprecedented volumes of data and automated event generation from smart meters. While this enhanced monitoring capability provides valuable insights into grid conditions, the high frequency of generated events creates chal lenges for utility companies who must distinguish between routine fluctuations and genuine operational issues. This thesis investigates the application of Complex Event Processing (CEP) to im prove anomaly detection in smart meter data by correlating meter-generated events (e.g., overvoltage or physical tampering) with time-series measurement data (e.g., electricity consumption or voltage). Using real-world datasets from Göteborg En ergi’s AMI system three CEP-based queries were implemented in Apache Flink: A query that detects consumption anomalies following physical tampering events (terminal cover dismounted), an overvoltage query that identifies sustained voltage problems rather than momentary spikes, and a duplicate timestamp query that re veals systematic data collection issues. The terminal cover dismounted query achieved filtering ratios between 0.46% and 1.00%, reducing 1,413 events to 6-13 actionable anomalies with 46-50% precision. The overvoltage query demonstrated filtering ratios from 0.006% to 1.44%, effectively reducing 98,547 events to manageable numbers for analyst review. The duplicate timestamp query discovered a previously unknown systematic issue affecting nearly 269,000 meters. The queries were evaluated using throughput, CPU usage, memory usage, and la tency. Performance evaluation demonstrates strong scalability characteristics with processing throughput of approximately 400,000 tuples per second, significantly ex ceeding the estimated production data flow of 4,000 tuples per second. The system maintained consistent CPU usage and conservative memory requirements (∼10GB peak), supporting practical deployment with resources available at utility compa nies. Latency evaluation for the terminal cover dismounted query and the duplicate timestamp query showed a median between 21 and 50 seconds while the overvoltage query had an increased median between 230 and 270 seconds with outliers above 2600 seconds. These findings show that CEP can enhance anomaly detection in AMI systems, enabling automated correlation of events and measurements while reducing false positives and analyst workload. The approach shows promise for grid monitoring applications and provides a foundation for more sophisticated anomaly detection v systems using CEP. Future research can investigate ways to further minimize latency, enabling operators to detect and respond to anomalies more promptly.
dc.identifier.urihttp://hdl.handle.net/20.500.12380/310682
dc.setspec.uppsokTechnology
dc.subjectComplex Event Processing
dc.subjectStream Processing
dc.subjectSmart Grid
dc.subjectAdvanced Metering Infrastructure
dc.subjectAnomaly Detection
dc.subjectApache Flink
dc.titleFinding Needles in the Haystack - A CEP Approach to Detect Recurring Grid Issues
dc.type.degreeExamensarbete för masterexamensv
dc.type.degreeMaster's Thesisen
dc.type.uppsokH
local.programmeComputer systems and networks (MPCSN), MSc

Ladda ner

Original bundle

Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
CSE 25-138 EL JN.pdf
Storlek:
1.41 MB
Format:
Adobe Portable Document Format

License bundle

Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
license.txt
Storlek:
2.35 KB
Format:
Item-specific license agreed upon to submission
Beskrivning: