Saturday, January 28, 2012

New Deduplication Tool in BlueBox2.0

The BlueBox2.0 Database Module now includes a new data deduplication tool to help scrub out annoying duplicate entries.

The new tool can be found in the Database Tools list in Admin->Database and works as follows:
  • Select the class/module that requires cleansing (e.g. bb_users).
  • Then select the fields that, together or separately, identify the duplicates (e.g. company_name and street_zipcode will find all users where the company_name and the street_zipcode are the same).
  • You can also filter the results to a certain range, choose how many to show, and opt for 'and' or 'or' selectivity.
  • The duplicate entries found in the database are displayed clustered in 'groups'. In each group you select the entry to keep, which in turn specifies which entries will be deleted from the database. Entries that are an exact match are shown in red to highlight severe duplications.
  • Finally, the system processes your requests and deletes the duplicated entries. Most helpfully, it also runs through every table in the database and replaces references to the deleted entries with the entry you have chosen to keep, so that all your related data is automatically cleansed in the process.
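
For readers who want to picture what the steps above amount to, here is a rough sketch in Python. It is not BlueBox's actual code: the function names, the in-memory list-of-dicts tables, the `orders` table, and the explicit list of referencing columns are all assumptions made for illustration. It groups rows by the chosen fields with 'and' or 'or' matching, flags exact matches, and then repoints related rows to the kept entry before deleting the duplicates.

```python
from collections import defaultdict


def find_duplicate_groups(rows, fields, mode="and"):
    """Cluster rows (dicts with an 'id' key) whose selected fields match.

    mode="and": all fields must match. mode="or": one matching field is enough.
    """
    parent = {row["id"]: row["id"] for row in rows}

    def find(x):
        # Follow parent links to the cluster representative (with path halving).
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    # 'and' compares the fields as one combined key; 'or' compares each alone.
    key_sets = [tuple(fields)] if mode == "and" else [(f,) for f in fields]

    for key_fields in key_sets:
        first_seen = {}
        for row in rows:
            key = tuple(row[f] for f in key_fields)
            if key in first_seen:
                parent[find(row["id"])] = find(first_seen[key])
            else:
                first_seen[key] = row["id"]

    groups = defaultdict(list)
    for row in rows:
        groups[find(row["id"])].append(row)
    # Only clusters with more than one row count as duplicates.
    return [group for group in groups.values() if len(group) > 1]


def is_exact_match(group):
    """True when every row in the group is identical apart from its id."""
    stripped = [{k: v for k, v in row.items() if k != "id"} for row in group]
    return all(s == stripped[0] for s in stripped)


def merge_group(tables, table_name, keep_id, delete_ids, references):
    """Repoint references to the kept row, then delete the duplicates.

    references: (table, column) pairs holding ids from table_name. How the
    real tool discovers these columns is not described, so they are passed in.
    """
    delete_ids = set(delete_ids)
    for ref_table, ref_column in references:
        for row in tables[ref_table]:
            if row[ref_column] in delete_ids:
                row[ref_column] = keep_id
    tables[table_name] = [
        row for row in tables[table_name] if row["id"] not in delete_ids
    ]
```

Calling `find_duplicate_groups(tables["bb_users"], ["company_name", "street_zipcode"])` would return the groups to review, and `merge_group(tables, "bb_users", keep_id, delete_ids, [("orders", "user_id")])` would then apply one group's decision. Repointing references before deleting means related rows never point at an entry that no longer exists.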
