This application guides you through the process of eliminating data columns that are useless or even harmful to your analysis. The average error (in %) from a cross-validation procedure serves as the measure of data set quality. The cross-validation is ten-fold and, in this workflow, based on a decision tree. Final decisions are recorded in an audit report and saved in the file auditReport.xls. The whole workflow has been implemented to run interactively on the KNIME WebPortal. On the WebPortal, press START to begin.
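
To make the quality measure described above concrete, here is a minimal Python sketch. It is not the KNIME workflow itself, which is built from nodes and run on the WebPortal. It rests on several assumptions: a depth-1 decision tree (a "stump") stands in for the workflow's decision tree, rows are assigned to the ten folds round-robin without shuffling, and all function names (`fit_stump`, `cv_error_percent`, `error_without_each_column`) as well as the toy data are hypothetical.

```python
from collections import Counter


def majority(labels):
    """Most frequent label (ties go to the label seen first)."""
    return Counter(labels).most_common(1)[0][0]


def fit_stump(rows, labels):
    """Fit a depth-1 decision tree: the single split with the fewest training errors."""
    default = majority(labels)
    best_errors = sum(label != default for label in labels)
    model = (None, None, default, default)  # no split: always predict the majority
    for col in range(len(rows[0])):
        values = sorted({row[col] for row in rows})
        # Candidate thresholds are midpoints between neighbouring distinct values.
        for low, high in zip(values, values[1:]):
            threshold = (low + high) / 2
            left = [l for r, l in zip(rows, labels) if r[col] <= threshold]
            right = [l for r, l in zip(rows, labels) if r[col] > threshold]
            left_label, right_label = majority(left), majority(right)
            errors = (sum(l != left_label for l in left)
                      + sum(l != right_label for l in right))
            if errors < best_errors:
                best_errors = errors
                model = (col, threshold, left_label, right_label)
    return model


def predict(model, row):
    """Predict the label of one row with a fitted stump."""
    col, threshold, left_label, right_label = model
    if col is None:
        return left_label
    return left_label if row[col] <= threshold else right_label


def cv_error_percent(rows, labels, folds=10):
    """Average test error in % over `folds` folds (rows assigned round-robin)."""
    fold_errors = []
    for k in range(folds):
        test_idx = set(range(k, len(rows), folds))
        train = [i for i in range(len(rows)) if i not in test_idx]
        model = fit_stump([rows[i] for i in train], [labels[i] for i in train])
        wrong = sum(predict(model, rows[i]) != labels[i] for i in test_idx)
        fold_errors.append(100.0 * wrong / len(test_idx))
    return sum(fold_errors) / folds


def error_without_each_column(rows, labels):
    """Map each column index to the CV error obtained when that column is removed."""
    return {
        col: cv_error_percent([r[:col] + r[col + 1:] for r in rows], labels)
        for col in range(len(rows[0]))
    }


# Made-up toy data: column 0 separates the classes, column 1 is unrelated.
rows = [[i * 0.01 if i < 20 else 0.8 + (i - 20) * 0.01, i % 2] for i in range(40)]
labels = ["low" if i < 20 else "high" for i in range(40)]

baseline = cv_error_percent(rows, labels)
errors = error_without_each_column(rows, labels)
print(f"all columns: {baseline:.1f}% error")
for col, err in errors.items():
    print(f"without column {col}: {err:.1f}% error")
```

On this toy data, removing column 1 leaves the error at the baseline, so it is a candidate for elimination, while removing column 0 raises the error to 50%. The sketch only reports the numbers. As in the workflow, the final decision about each column is left to the person reviewing them.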

URL: Data Cleaning from a Web Browser https://www.knime.org/files/white-papers/DataCleaning_WebPortal.pdf




EXAMPLES Server: 50_Applications/25_DataCleaning_WebPortal/DataCleaning_WebPortal*
Download a zip-archive



* Find more about the Examples Server here.
The link opens the workflow directly in KNIME Analytics Platform (requirements: Windows; KNIME Analytics Platform must be installed with the Installer, version 3.2.0 or higher). In other cases, please use the link to the zip-archive or open the provided path manually.