Lagged-kNN based data imputation approach for multi-stream building systems data.

Summary

Increasing advancements in building digitization, smart sensing, and metering technologies have allowed large amount of data to be collected and saved for monitoring, analyzing, and controlling building systems. However, due to sensors or communications failure, the data collected are often incomplete and poor in quality. Data imputation approaches to replace the missing values, specifically based on both statistical and predictive models have been widely adopted for multivariate datasets. It is hence of interest to find an effective way to impute building system data by leveraging the mutual information from strongly correlated sensors. In this paper, we evaluate multiple data imputation approaches using data collected from a medium sized, mixed-use institution building situated in Stockholm, Sweden. Sensors with widely varying characteristics from the case study building were selected to test the imputation methods. Artificial test data with ground-truth was first created for validation by removing randomly selected portions of data. The imputation accuracy was computed for each method and the impact of the chosen method on a short-term building forecasting model was evaluated. Results demonstrate that incorporating time-lagged cross correlations within the knearest neighbor (kNN) model provide the most accurate imputations.

Available documents

Format PDF

Pages: 10 p.

Available

Free

Details

  • Original title: Lagged-kNN based data imputation approach for multi-stream building systems data.
  • Record ID : 30030212
  • Languages: English
  • Source: 2022 Purdue Conferences. 7th International High Performance Buildings Conference at Purdue.
  • Publication date: 2022
  • Document available for consultation in the library of the IIR headquarters only.

Links


See other articles from the proceedings (39)
See the conference proceedings