Automatically Cleaned Data Values
The data cleaning feature in data product configuration enables you to replace known bad values in your source data with null when Tamr finds an exact, case-insensitive match for an attribute value. This helps ensure that these values are not used for matching or included in your golden records.
In addition to the values you specify for data cleaning, Tamr automatically replaces the following values with null wherever it identifies an exact, case-insensitive match in the mapped source record fields:
- ABSENT
- BLANK
- CC ONLY DO NOT INACTIVATE
- CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
- COPYFROM
- COPY FROM
- DO NOT INACTIVATE
- DO NOT DEACTIVATE
- EMPTY
- INVALID
- MISSING
- N/A
- NA
- NAN
- NIL
- NO CO SPECIFIED
- NO DATA
- NO INFO
- NO STREET
- NONE
- NONEXISTENT
- NOT APPLICABLE
- NOT AVAILABLE
- NOT FOUND
- NOT PROVIDED
- NOT PROVIDED ACCOUNT - UNKNOWN
- NOT SET
- NOT VALID
- NULL
- [email protected]
- STREET
- TBD
- UNAVAILABLE
- UNDEFINED
- UNKNOWN
- VACANT
- [[UNKNOWN]]
- [UNKNOWN STREET]
- [UNKNOWN STREET)
- UNKNOWN VALUE
- UNKNOWN ZIP
- UNRESOLVED
- UNSET
- UNSPECIFIED
- VOID
Healthcare Provider NPI Cleaning
In the Healthcare Provider data product, Tamr replaces the following invalid National Provider Identifier (NPI) values with null:
- NPIs that include 5 or more of the same digit repeated
- NPIs that contain only zeros
- NPIs that contain the substring 123456789
Additionally, Tamr removes any decimal zeros. For example, if the value is 555.000, Tamr changes the value to 555.
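The NPI rules above can be sketched in a few lines of Python. This is an illustrative approximation only, not Tamr's implementation: the function name `clean_npi` is hypothetical, "5 or more of the same digit repeated" is assumed to mean five or more consecutive identical digits, and "decimal zeros" is assumed to mean a decimal part consisting only of zeros that is removed before the other checks run.

```python
import re
from typing import Optional


def clean_npi(value: Optional[str]) -> Optional[str]:
    """Hypothetical sketch of the NPI cleaning rules; returns None for null."""
    if value is None:
        return None

    # Assumption: drop a decimal part made up only of zeros ("555.000" -> "555").
    text = re.sub(r"\.0+$", "", value)

    # Rule: NPIs that contain only zeros.
    if re.fullmatch(r"0+", text):
        return None

    # Rule (assumed to mean consecutive repeats): 5 or more of the same digit.
    if re.search(r"(\d)\1{4,}", text):
        return None

    # Rule: NPIs that contain the substring 123456789.
    if "123456789" in text:
        return None

    return text
```

Under these assumptions, `clean_npi("555.000")` yields `"555"`, while `clean_npi("0000000000")` and `clean_npi("1234567893")` both yield `None`.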
Example
Consider this example, in which Tamr automatically replaces attribute values of STREET with null.
The following attribute values are replaced with null:
- street
- Street
- STREET
The following values are not cleaned; the full value remains in the attribute:
- streets
- 123 main street
- ELM STREET
- St.
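The exact, case-insensitive matching shown in this example can be approximated with the short Python sketch below. It is not Tamr's code: the names `AUTO_CLEANED` and `clean_value` are hypothetical, the set holds only a small subset of the automatically cleaned values, and the sketch assumes that no whitespace trimming happens before the comparison.

```python
from typing import Optional

# Hypothetical subset of the automatically cleaned values, stored case-folded.
AUTO_CLEANED = {v.casefold() for v in ("ABSENT", "BLANK", "N/A", "NULL", "STREET", "UNKNOWN")}


def clean_value(value: Optional[str]) -> Optional[str]:
    """Return None when the whole value matches a known bad value, ignoring case."""
    if value is None:
        return None
    # Exact match on the full value; substrings such as "ELM STREET" do not match.
    if value.casefold() in AUTO_CLEANED:
        return None
    return value


for raw in ("street", "Street", "STREET", "streets", "123 main street", "ELM STREET", "St."):
    print(repr(raw), "->", repr(clean_value(raw)))
```

Comparing the whole value rather than searching within it is what keeps legitimate addresses such as 123 main street intact.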