Combining multiway Principal Component Analysis (MPCA) and clustering for efficient data mining of historical data sets of SBR processes

Kris Villez, Magda Ruiz, Gürkan Sin, Joan Colomer, Christian Rosén, Peter A. Vanrolleghem

Research output: Contribution to journalArticlepeer-review

49 Scopus citations

Abstract

A methodology based on Principal Component Analysis (PCA) and clustering is evaluated for process monitoring and process analysis of a pilot-scale SBR removing nitrogen and phosphorus. The first step of this method is to build a multi-way PCA (MPCA) model using the historical process data. In the second step, the principal scores and the Q-statistics resulting from the MPCA model are fed to the LAMDA clustering algorithm. This procedure is iterated twice. The first iteration provides an efficient and effective discrimination between normal and abnormal operational conditions. The second iteration of the procedure allowed a clear-cut discrimination of applied operational changes in the SBR history. Important to add is that this procedure helped identifying some changes in the process behaviour, which would not have been possible, had we only relied on visually inspecting this online data set of the SBR (which is traditionally the case in practice). Hence the PCA based clustering methodology is a promising tool to efficiently interpret and analyse the SBR process behaviour using large historical online data sets.

Original languageEnglish
Pages (from-to)1659-1666
Number of pages8
JournalWater Science and Technology
Volume57
Issue number10
DOIs
StatePublished - 2008
Externally publishedYes

Keywords

  • LAMDA clustering
  • MPCA
  • Nutrient removal
  • On-line monitoring
  • SBR

Fingerprint

Dive into the research topics of 'Combining multiway Principal Component Analysis (MPCA) and clustering for efficient data mining of historical data sets of SBR processes'. Together they form a unique fingerprint.

Cite this