Mastered Entity Attributes for Contacts

By default, Contacts data products include mastered entity attributes and attributes provided by data quality services. You can also add custom attributes.

From the entities page, you can configure which attributes appear in the data product's entities table and the order in which they appear. See Configuring Entity Attribute Display.

When configuring publish destinations and datasets, you can also select which attributes are included in the published output and set their names and order. See Publishing Data Products.

Contacts (Golden Records) Dataset Attributes

The contacts (golden records) dataset includes the following types of attributes:

  • Primary attributes, which are the unified schema attributes to which source columns are mapped.
  • Curation attributes, which are calculated values that can help with data curation, such as the number of similar clusters and cluster size.
  • Uniformity attributes, which are the calculated uniformity scores for clusters and selected attributes.

Primary Contacts (Golden Records) Attributes

The following attributes are provided by default in the unified schema:

Attribute Name
tamr_id
name_prefix
first_name
middle_name
family_name
full_name
name_suffix
address
address_line_2
city
state
postal_code
country
latitude
longitude
address_type
professional_title
email
phone
alternative_phone
org_name
org_alternative_names
org_website

Curation Attributes

Curation attributes are calculated metrics that can aid in data review and curation. These include:

Attribute                   Description
number_of_source_records    For each golden record, the number of source records in the cluster.
number_of_source_datasets   For each golden record, the number of source datasets from which source records in a cluster originated.
number_of_similar_entities  For each golden record, the number of similar golden records (entities).
maximum_similarity          For each golden record, the maximum similarity score for similar golden records.
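
As a rough illustration of how number_of_source_records and number_of_source_datasets relate to the source records dataset, the sketch below groups source records by tamr_id, then counts the records and the distinct source_name values in each group. It assumes that tamr_id in the source records dataset identifies the golden record (cluster) a source record belongs to. The function name curation_counts and the sample rows are hypothetical, and this is not a description of how the product calculates these values internally.

```python
from collections import defaultdict


def curation_counts(source_records):
    """Count source records and distinct source datasets per tamr_id.

    Assumption: each source record carries the tamr_id of the golden
    record (cluster) it belongs to, plus a source_name.
    """
    record_counts = defaultdict(int)
    datasets = defaultdict(set)
    for rec in source_records:
        cluster_id = rec["tamr_id"]
        record_counts[cluster_id] += 1
        datasets[cluster_id].add(rec["source_name"])
    return {
        cluster_id: {
            "number_of_source_records": record_counts[cluster_id],
            "number_of_source_datasets": len(datasets[cluster_id]),
        }
        for cluster_id in record_counts
    }


# Hypothetical sample rows, not real data.
sample = [
    {"tamr_id": "e1", "tamr_record_id": "r1", "source_name": "crm"},
    {"tamr_id": "e1", "tamr_record_id": "r2", "source_name": "crm"},
    {"tamr_id": "e1", "tamr_record_id": "r3", "source_name": "billing"},
    {"tamr_id": "e2", "tamr_record_id": "r4", "source_name": "billing"},
]

counts = curation_counts(sample)
print(counts["e1"])
# {'number_of_source_records': 3, 'number_of_source_datasets': 2}
```

In this sample, cluster e1 has three source records drawn from two source datasets, while e2 has one record from one dataset.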

Uniformity Attributes

Uniformity attributes are calculated values that measure the similarity (uniformity) of records within a cluster, and of the values of selected attributes within a cluster. By default, an overall cluster_uniformity_score is calculated for each golden record. On the Configure Data Product page, you can also choose to calculate a uniformity score for specific attributes.

Uniformity scores range from 0 to 1. For example, a uniformity score of 1 for an attribute means that all records in the cluster have the same value for that attribute, while a uniformity score of 0 means that all records in the cluster have different values for that attribute.
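
To make the two endpoints concrete, here is a minimal sketch of one score that behaves this way. The formula (1 minus the share of additional distinct values) and the function name toy_uniformity are assumptions chosen purely for illustration. This page does not specify how the actual score is calculated, so values between 0 and 1 in the product may differ from what this sketch returns.

```python
def toy_uniformity(values):
    """Illustrative score only, not the product's formula.

    Returns 1.0 when every value is identical and 0.0 when every value
    is different, matching the endpoints described above.
    """
    n = len(values)
    if n <= 1:
        # Assumption: a cluster with at most one record is treated as uniform.
        return 1.0
    distinct = len(set(values))
    return 1.0 - (distinct - 1) / (n - 1)


print(toy_uniformity(["Boston", "Boston", "Boston"]))  # 1.0
print(toy_uniformity(["Boston", "Austin", "Denver"]))  # 0.0
print(toy_uniformity(["Boston", "Boston", "Austin"]))  # 0.5
```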

Source Records Dataset Attributes

Attribute Name
tamr_id
tamr_record_id
source_name
primary_key
name_prefix
first_name
middle_name
family_name
full_name
name_suffix
address
address_line_2
city
state
postal_code
country
latitude
longitude
address_type
professional_title
email
phone
alternative_phone
org_name
org_alternative_names
org_website