Modifying Record Consolidation Transformations
You can add or modify rules to select the appropriate values for mastered entity record fields.
While a mastering flow can contain multiple transformation steps, you typically need to modify only one of these steps: Consolidate Records. The Consolidate Records step applies transformation rules to produce a single record, called the mastered entity record, that best represents a cluster. For most fields, these rules select the most common value from the clustered source records.
If you added new output fields in the Schema Mapping step, you also need to add transformation rules to select the appropriate values for each of these fields in the mastered entity record.
Common transformation rules:
- To set the value to the most common value across source record:
mode(<field>) as <field>
- To set the value to the sum of the values across source records:
`sum(to_double()) as
If you require a more advanced rule to select the appropriate mastered entity record field value, see the Tamr Core transformations documentation for complete transformation function details. This link opens the Tamr Core documentation in a new window.
Note: Tamr Cloud supports scripted transformations for a single input dataset, resulting in a single output dataset.
To update transformations to consolidate records:
- In Designer, select the data product tile to open the flow.
- Select the Consolidate Records step to open it.
- For each new output field, add the transformation rule under
//Groupby
.
Example:mode(<field>) as <field>
to select the most common value for this field - When you have finished the transformation updates, navigate back to the flow by selecting the back arrow next to the step description.
Updated 8 days ago