Metallurgical.App

Data Cleaner


Mineral Processing App

PLANT TRIAL DATA CLEANER:

Due to various reasons such as shutdowns, changes in feed, operating conditions or processing equipments,
other concurrent trials and data recording; plant trial data require extensive cleaning prior to conducting statistical analyses.

The raw data Excel file should have the parameters or variables'names in the top cell of each column at the top row (e.g., date/timestamp, throughput, feed grade, recovery, etc..).
The parameters' values are in the rows below the parameters'names. See example in table below.
DateMill availabilityThroughputFeed gradeConcentrate gradeTailing grade
1/01/202292%12001.2125.40.150
2/01/202295%12201.1826.10.145
..................
31/07/202297%11801.1914.80.147

To use:
  • Choose the raw data Excel (.xlsx) file and click on "Upload to a table".

  • Use the data visualisation/graph below to plot the data.

  • Select parameters to clean successive duplicates and chose whether to remove the entire row or just the successive duplicates.
    Unlike Microsoft Excel, non successive duplicates that are genuine values will not be removed.


  • Select parameters to clean non numerical data and chose whether to remove the entire row or just the NaN (not a number).

  • Select parameters and specify their normal range to delete out of range values; and chose whether to remove the entire row or just the out of range data.

  • Specify normal data relationships (e.g., tailing grade < feed grade). Choose whether to remove the left, right, both parameters or the entire row to clean the data.

  • Specify the unit (non numerical characters) to delete for selected parameters (e.g., "tph").

  • Specify the categorical data to delete (e.g., "Shift" of type "night shift").

  • Click on "Clean data".

  • Save the cleaned data using the button "Save data" (The original Excel file will not be altered).

  • To clean again, re-upload the cleaned Excel file.

  • Watch a Youtube video example below.



    Note: Select "delete rows" for the parameters you want to complete comparison of regression lines analyses.

    large files may take time to upload, please wait until the data appear in a table at the bottom fo this page
    Clean successive duplicates Clean non numerical data (NaN) Clean data out of normal ranges Clean data out of normal ranges
    < <
    < <
    < <
    < <
    < <
    < <
    < <
    < <
    < <
    < <
    < <
    < <
    < <
    < <
    < <
    < <
    < <
    < <
    < <
    < <




    Clean meaningless data Clean unit characters Clean unit characters Clean categorical data
    <
    <
    <
    <
    <
    :
    :
    :
    :
    :
    :
    :
    :
    :
    :
    of type:
    of type:
    of type:
    of type:
    of type:


    Enter exactly the unit characters as they appear in the table below Selected data rows will be deleted


    (*)To clean again, re-upload the cleaned Excel file.

    Data visualisation
    versus








    DATA PRE-PROCESSING AND CLEANING