Load Shedding Methods for Big Data Stream with Sparsity
CSTR:
Author:
Affiliation:

Clc Number:

TP338

Fund Project:

  • Article
  • |
  • Figures
  • |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
  • |
  • Materials
  • |
  • Comments
    Abstract:

    How to improve the accuracy of load shedding under the premise of ensuring real-time performance is an important problem. Sparsity is a widespread feature of the big data stream. Therefore, we propose two load-shedding methods of the big data stream with sparsity in two scenarios. In the normal business scenario, we model the big data stream with the high dimensional space. Then we propose a load shedding method based on centrifugation, which uses the elastic distance to measure the distance of data. In the anomaly-monitoring scenario, we analyze the feature of the big data stream and propose a load shedding method based on equivalence class, which uses the combined similarity to divide the data set into equivalence classes. The combined similarity was composed of processing behavior similarity and data similarity to measure the difference between data. Repeated test results show that the two load shedding methods in this paper can significantly improve the accuracy compared with the conventional load shedding methods.

    Reference
    Related
    Cited by
Get Citation

WANG Shun, LI Zhenxing, LIAN Zengshen, ZENG Guosun, DING Chunling. Load Shedding Methods for Big Data Stream with Sparsity[J].同济大学学报(自然科学版),2020,48(02):276~286

Copy
Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:February 21,2019
  • Revised:January 13,2020
  • Adopted:December 06,2019
  • Online: February 26,2020
  • Published:
Article QR Code