Steps Completed by B2B Site Mastering
When you create a data product using the B2B site mastering template, Tamr Cloud creates a mastering flow in Designer with steps specific to site mastering.
The following table describes each step in the B2B site mastering flow, and explains which steps usually need to be edited for your data.
|Usually Requires Changes||Step||Description|
|Add Data||You verify that source data meets both general requirements and template-specific requirements. Then, you add source data.|
|Align to Customer Model||You map input columns to attributes in the supplied schema.|
||This transformation step ensures that each source record has a unique primary key across all
source datasets by adding a new primary key field:
For data products created before June 1, 2023, this step produces a
For data products created on June 1, 2023 or later, this step produces a
Important: If records within the same source dataset have duplicate primary key values, the
If you mapped an empty placeholder column to the
|Prepare Data for Enrichment||This step transforms the data in the unified dataset to match the expected inputs to the enrichers included in the mastering flow.
This step also adds the following fields to the unified schema:
|Standardize URL||This step provides a cleaned version of the website domain in
|Enrich Phone Number||This step validates, standardizes, and enriches phone number data. See Phone Number Enrichment.|
|Enrich Address||This step standardizes and validates address information, and enriches addresses with latitude, longitude, and detailed address information. See Address Standardization, Validation, and Geocoding.|
|Enrich Company Name||This step cleans and enriches company name data. See Company Name Enrichment.|
|Prepare for Clustering||This step transforms the data in the unified dataset to create the fields used by the
trained clustering model to identify similar and matching records.
The fields created as input to the model are prefixed with
|Apply Clustering Model||This step groups records that refer to the same entity into a cluster, using the trained model. See Features of B2B Site Mastering.|
|Consolidate Records||This step applies rules to produce a single record, called the mastered entity record, that best represents a cluster. For most fields, these rules select the most common value from the clustered records.
Additionally, this step adds a Tamr ID (
If you added new attributes in the Schema Mapping step, add lines in the transformations to tell Tamr Cloud what value to set for each attribute when creating the mastered entity. See Modifying Record Consolidation Transformations.
|Deliver Data to Studio||You configure how mastered entity attributes appear in Studio, Curator, and published datasets See Configuring Data Display in Studio.|
Updated 2 days ago