Changes between Version 11 and Version 12 of UserGuidelines/Import/UpdateDetection
- Timestamp:
- 09/13/12 10:20:34 (12 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
UserGuidelines/Import/UpdateDetection
v11 v12 30 30 Person records are primarily identified by: 31 31 32 - an exact match of first name and last name(if both are present in the import item), or33 - alternatively, an exact match of the initials(if present in the import item)32 - an exact match of '''first name''' and '''last name''' (if both are present in the import item), or 33 - alternatively, an exact match of the '''initials''' (if present in the import item) 34 34 35 35 If any matching records can be found, they will be ranked by: … … 46 46 These criteria are weighted by a schema to satisfy a wide range of cases: 47 47 48 - first name: match +2, mismatch -2, missing from either record 0 points49 - last name: match +2, mismatch -2, missing from either record 0 points50 - date of birth: match +3, mismatch -2, missing from either record 0 points51 - email address: match +2, mismatch -5, missing from import item -2 if initials present or -3 if no email in the database or otherwise -4 points, missing from the database 0 points52 - initials: match +4, mismatch -1, missing from either record 0 points53 - mobile phone number: match +1, mismatch -1, missing from either record 0 points48 - '''first name''': match +2, mismatch -2, missing from either record 0 points 49 - '''last name''': match +2, mismatch -2, missing from either record 0 points 50 - '''date of birth''': match +3, mismatch -2, missing from either record 0 points 51 - '''email address''': match +2, mismatch -5, missing from import item -2 if initials present or -3 if no email in the database or otherwise -4 points, missing from the database 0 points 52 - '''initials''': match +4, mismatch -1, missing from either record 0 points 53 - '''mobile phone number''': match +1, mismatch -1, missing from either record 0 points 54 54 55 55 '''DEVELOPERS note:''' ''the exact schema needed for a deployment depends on the typical quality of the import data, which may vary. The more consistent and detailed the import items are, the safer the schema works. It is possible (and maybe necessary) to adjust these weights to particular situations by using a set of unit test cases like in {{{!PersonDeduplicateTests}}} in {{{modules/unit_tests/eden/pr.py}}}. However, it should not be expected that this schema can reliably detect any possible edge-case - as per its purpose it is much more important to maintain a manageable set of rules how data sources would have to indicate updates, and adapt the data sources to them.''