Product Update Summary - September 8, 2023

We recently released new insights into source data and improved data product templates, along with other recent updates and fixes.

New Insights: Source Details

We added Source Details Insights to help you understand how your source datasets are being used in the mastering flow, and how your source datasets overlap. In the Insights page, select Source Details from the dropdown in the top right corner.

Sample insights

Sample insights

See Gaining Insights with Data Product Metrics for more.

Data Product Template Improvements

Improved Matching in Person Mastering Data Products

The Person Mastering data product now includes a first name enrichment service that improves match results. This service examines first name values and identifies common first name variations and nicknames (for approximately 1,000 first names). For example, common variations for Robert include Rob, Robbie, and Bob. The clustering step uses the original first name value and the enriched values when evaluating first name similarity. These first name variations are not included in the data product output. This change is available in new data products created with this template.

Learn more about this data product template.

Improved Firmographic Enrichment Results

In the Company Matching with Firmographic Enrichment template, we improved enrichment results through better handling of:

  • German address data.
  • Variations of AND and & in company names.

These improvements are available in new data products created with this template.

Learn more about this data product template.

Updated Schema for Legal Entity Data Products

In the Legal Entity data product template, we expanded the default unified schema and output schema to include more attributes commonly used for capturing company information. The new attributes include: Associated_Persons, Company_Registration_Number, Company_Type, Founding_Year, Previous_Names, Stock_Exchange, Tax_IDs, Ticker_Symbol, and Type_Of_Address. These changes are available in new data products created with this template.

Learn more about this data product template.

Other Recent Improvements

General styling and user experience improvements.

Fixed Issues

  • In the Curator > Configure Flow page, users were not able to open, edit, or add steps in the flow. For new data products, users could not access the flow in the Configure Flow tab.
  • In the Curator entities table, if you filter to only bookmarked entities and then select Actions > Remove Bookmarks, the filtered view continues to display the un-bookmarked records.
  • In the Manage Cluster Details page, if you scroll to the right in a table, the table automatically scrolls back to the first column.
  • Bookmarking in tables with over 1 million records took a long time.
  • In the mastering flow for data products created with the People Mastering template, step output for the First Name Enrich step was empty. However, the flow completed successfully.
  • Record-level delta metrics incorrectly reported added records when no change occurred.
  • For AWS S3 connections, source data would not refresh.
  • Publishing datasets resulted in an error in certain conditions.
  • In Sources, users were able to start a refresh job when a refresh job was already running for the same source, resulting in the refresh failing.
  • The Grammarly Chrome extension caused issues with rendering user interface elements.
  • Flow failed to run under certain conditions.