Managing Publish Destinations

You can add, share, and edit publish destinations.

If you have permission to publish a data product, you can:

  • Configure the destinations for publishing and the datasets to publish.
  • Select which columns to include in the published datasets.
  • Publish the datasets.
  • Share publish destinations for others to use.

Adding a Publish Destination

On the Publish page in Curator, you can add destinations to which to publish the Mastered Entities, Source Records by Cluster, and Cluster by Similarity datasets for use in downstream applications.

When adding a publish destination, you specify the cloud storage connection, which datasets to publish, and which columns to include in those datasets.

Required Information for Connection Types

The information required to configure a publish destination depends on the connection type:

Connection TypeRequired Information
ADLS Gen2- Storage Account Name
- Container Name
- Storage Access Key
Amazon S3- Bucket
- Region
- Access Key ID
- Secret Access Key
BigQuery- BigQuery Project
- BigQuery Dataset
- BigQuery Key File
Google Cloud Storage- Project Bucket
- Google Cloud Storage Key File
Snowflake- User
- Password
- Organization Name
- Account Name
- Database
- Schema
- Warehouse

Before You Begin:
Before you can add a publish destination, you must have access to one or more connections.

To add a publish destination:

  1. Open the data product from the home page.
  2. Select the Publish page.
  3. Select Add Destination.
  4. Select an existing connection or create a new one, and then select Next.
  5. Provide a name and optional description for this destination.
    This is the name of the destination as it will appear in Tamr Cloud, and does not affect the name of the published dataset.
  6. Enter any required information for the connection type, then select Next once you are finished.
  7. Select the datasets to be published to this destination: Mastered Entities, Source Records by Cluster, and Cluster by Similarity. See Datasets Available for Export for information on these datasets.
  8. For each datasets, specify which columns to include when published. All columns are automatically selected; deselect columns to exclude them. You can sort and filterto find specific columns to include in the dataset.
    Note: See the Column Names in Published Datasets section below for information on the names of columns in your published datasets.
  9. Select Save Destination.

Sharing a Publish Destination

Share a publish destination to allow other users to publish to that destination.

When you share a publish destination, you select the level of permissions that user will have for that destination. See User Roles and Permissions for more information.

To share a publish destination:

  1. Open the data product from the home page.
  2. Select the Publish page.
  3. Select Share Share for the destination.
  4. Select the user with whom to share the destination and their permissions for that destination:
    • Editor: allows the user to edit and publish to the destination.
    • Viewer: allows the user to publish to the destination.
      Note: Admin users and users whose accounts have been disabled do not appear in the list of users; admins have full access to all publish destinations.
  5. Select Share.

Changing Permissions for a Publish Destination

To change a user's permissions for a data product:

  1. Open the data product from the home page.
  2. Select the Publish page.
  3. Select Share Share for the destination.
    The Share dialog opens. The dialog lists the users with whom the destination is shared.
  4. Change the permission levels for users as needed, or select Delete trash icon to remove a user's permissions.
  5. Select Save.

Editing and Deleting a Publish Destination

Only admins and the owner of the publish destination can delete the destination.

If you have editor permissions for a publish destination, you can edit the destination settings, including connection details, which datasets to publish, and which columns to include in those datasets. These settings include the destination's name, description, output directory, and file type.

important Important: If you edit a destination shared with other users, the destination also updates for those users.

To edit a publish destination:

  1. Open the data product from the home page.
  2. Select the Publish page.
  3. Select Edit edit icon for the publish destination.
  4. Edit the destination settings as needed.
  5. Select Save Destination.

Column Names in Published Datasets

When publishing to ADLS2, S3, or GCS, the names of columns in the published datasets are the display names configured in Tamr Cloud at the time of publishing. Display names are set in the Field column in the Configure Attributes step in the flow, as shown in the image below.

When publishing to Snowflake or BigQuery tables, the names of columns in the published datasets are the original attribute names, as shown in the Mappings column below.