Changes between Version 15 and Version 16 of BluePrintDeduplication
- Timestamp:
- 05/07/12 04:12:02 (13 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
BluePrintDeduplication
v15 v16 5 5 == Process == 6 6 7 1. Identifying Duplicate Records (using the [http://en.wikipedia.org/wiki/Jaro%E2%80%93Winkler_distance Jaro Winkler Distance])7 1. Identifying Duplicate Records for each Resource (using the [http://en.wikipedia.org/wiki/Jaro%E2%80%93Winkler_distance Jaro Winkler Distance]) 8 8 * Comparing the fields used in the "represent" of this table is a good start. 9 9 * In order to determine if the records are in fact duplicate, the user should have the option to open up the records and somehow see where they are referred to. 10 1. Merging Duplicate Records (see [http://wiki.sahanafoundation.org/lib/exe/fetch.php/foundation:gsoc_kohli:import:resolve_duplicates.jpg wireframe]) 11 1. Replacing Duplicate Records (must work with offline instances over sync too) 10 * For specific resources different processes could be used to identify duplicates (See below) 11 [[Image(http://wiki.sahanafoundation.org/lib/exe/fetch.php/foundation:gsoc_kohli:import:indentify_duplicates.jpg)]] 12 1. Merging Duplicate Records 13 [[Image(http://wiki.sahanafoundation.org/lib/exe/fetch.php/foundation:gsoc_kohli:import:resolve_duplicates.jpg )]] 14 See also: 15 - [wiki:BluePrintRecordMerger Record Merger] 16 1. Replace Reference to the record with has been merged (must work with offline instances over sync too) 12 17 13 A complete specifications can be foundat [http://wiki.sahanafoundation.org/doku.php/foundation:gsoc_kohli:import:duplicates]18 There is a similar specifications at [http://wiki.sahanafoundation.org/doku.php/foundation:gsoc_kohli:import:duplicates] 14 19 15 20 Update: Some of the above part has been merged into sahana code. 16 21 17 See also:18 - [wiki:BluePrintRecordMerger Record Merger]19 20 22 == Different Processes Identifying Duplicate Records == 21 22 23 Some resources may have unique processes for identifying duplicates: 23 24 … … 52 53 53 54 === Ideas === 54 55 56 55 * if the Document duplicate check was triggered inside a 'colorbox' popup in a main form, populate the main record with the existing record instead of just the error message (error could be downgraded to a warning) 57 56