Changes between Version 17 and Version 18 of BluePrintDeduplication
- Timestamp:
- 05/07/12 04:26:34 (13 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
BluePrintDeduplication
v17 v18 5 5 == Process == 6 6 7 1. Identifying Duplicate Records for each Resource (using the [http://en.wikipedia.org/wiki/Jaro%E2%80%93Winkler_distance Jaro Winkler Distance]) 7 === 1. Identifying Duplicate Records === 8 * Display for each Resource 9 * using the [http://en.wikipedia.org/wiki/Jaro%E2%80%93Winkler_distance Jaro Winkler Distance]) 8 10 * Comparing the fields used in the "represent" of this table is a good start. 9 11 * In order to determine if the records are in fact duplicate, the user should have the option to open up the records and somehow see where they are referred to. … … 11 13 [[Image(http://wiki.sahanafoundation.org/lib/exe/fetch.php/foundation:gsoc_kohli:import:indentify_duplicates.jpg)]] 12 14 (different resources should be on different tables) 13 1. Display & Merge Duplicate Records 14 [[Image(http://wiki.sahanafoundation.org/lib/exe/fetch.php/foundation:gsoc_kohli:import:resolve_duplicates.jpg )]] 15 === 2. Display & Merge Duplicate Records === 16 [[Image(http://wiki.sahanafoundation.org/lib/exe/fetch.php/foundation:gsoc_kohli:import:resolve_duplicates.jpg )]][[BR]] 17 18 Or 19 [[Image(https://docs.google.com/drawings/pub?id=1Axk0OvnG7oSrFNT4VYfafYMJ2TjadL_MVxudfF0OeMM&w=960&h=720)]] 15 20 * See also: [wiki:BluePrintRecordMerger Record Merger] 16 21 * For some records it could be important to display virtual fields to show that they are not duplicates (eg a person'a email/phone/address) 17 1. Replace Reference to the record with has been merged (must work with offline instances over sync too) 22 === 3. Replace References === 23 Replace References to the record which has been merged (must work with offline instances over sync too) 18 24 19 25 There is a similar specifications at [http://wiki.sahanafoundation.org/doku.php/foundation:gsoc_kohli:import:duplicates]