Monitoring Threshold Functions over Distributed Data Streams with Node Dependent Constraints

Date

2012-09-18

Department

Program

Citation of Original Publication

Malinovsky, Yaakov, and Jacob Kogan. “Monitoring Threshold Functions over Distributed Data Streams with Node Dependent Constraints.” Algorithms 5, no. 3 (September 2012): 379–97. https://doi.org/10.3390/a5030379.

Rights

Attribution 3.0 Unported

Abstract

Monitoring data streams in a distributed system has attracted considerable interest in recent years. The task of feature selection (e.g., by monitoring the information gain of various features) requires a very high communication overhead when addressed using straightforward centralized algorithms. While most of the existing algorithms deal with monitoring simple aggregated values such as frequency of occurrence of stream items, motivated by recent contributions based on geometric ideas we present an alternative approach. The proposed approach enables monitoring values of an arbitrary threshold function over distributed data streams through stream dependent constraints applied separately on each stream. We report numerical experiments on a real-world data that detect instances where communication between nodes is required, and compare the approach and the results to those recently reported in the literature.