Publishing Data Product Data

You make different data product datasets available to downstream destinations by exporting them to a cloud storage destination.

Datasets include Entities, Source Records by Entity, and/or Entities by Similarity. When configuring a publish destination, you select which datasets to publish to that location and which columns to include in those datasets. See Available Published Datasets for information on these datasets.

Your ability to use Publish and the data products you can access in Publish depend on your user role and permissions.

When you publish data to cloud storage, any data already published to the destination for the data product is overwritten.

Before You Begin:
Because publishing overwrites any data already published to the destination, back up the target file or table before publishing.

important Important Notes for Snowflake:

  • If you are publishing to Snowflake and have added or removed output fields since the last publish, you must either update the destination Snowflake table to match the updated schema or delete the destination table. Otherwise, publishing will fail. If you delete the destination table before publishing, Publish recreates the table with the updated schema.
  • In order to view the published dataset, you must have read access in Snowflake to the table to which it was published. Contact your Snowflake administrator if you are not able to view the published datasets.

To publish data product datasets:

  1. Navigate to Publish.
  2. Open the data product tile.
  3. If the Publish table does not include the publish destination that you want to use, add a new destination. See Adding a Publish Destination for instructions.
  4. In the Publish table, select Publish Publish for the destination to which to publish the data.
  5. Confirm publishing.
    The datasets configured for that publish destination are published to the cloud storage location.

You can monitor the progress of the publish job.