Connecting to External Data Repositories

You use connections to configure access to cloud storage locations or databases. These locations store source datasets (source connections) or published entity data (destination connections).

Use connections to import data from or export data to external locations. You can configure access to the cloud storage locations that store source datasets (source connections) or that will store exported data (destination connections).

Your ability to add, use, and edit connections depends on your user role and permissions.

If you have permission to use a connection and access to Designer, you can add datasets stored in the connection's cloud storage location to your mastering flow.

If you have permission to use a connection and access to Publish, you can select the connection when configuring a export destination for mastered entity record data.

Supported Connection Types

Cloud Storage Connections

  • Amazon S3
  • Google Cloud Storage

You can download datasets exported using these connections. See Sharing a Link to an Exported Dataset and Managing Export Destinations for more information.

Database Connections

  • Azure Synapse Analytics
  • Snowflake

important Important Known Issues for Snowflake Connections: Do not use the DEFAULT warehouse for Snowflake Source connections. Additionally, the password for the Snowflake user cannot contain special characters.

Configuring Cloud Connections

See the following topics for more:

Tamr-Provided Connections

Tamr Cloud comes with several pre-configured connections. You can use these connections to access sample data, export downloadable datasets, and upload files. These connections include:

Connection NameDescription
Demo ConnectionThis connection provides access to sample datasets stored in a Tamr-managed Google Cloud Storage (GCS) bucket. These datasets are automatically included in new data product mastering flows to be used as sample data.
Managed DestinationThis connection provides access to exported datasets stored in a Tamr-managed GCS bucket through shared links. You can also download datasets exported using this connection. See Sharing a Link to an Exported Dataset and Managing Export Destinations for more information on using this connection.
Managed File UploadThis Tamr-managed connection provides the ability to upload source files directly into Tamr Cloud. See Managing Source Datasets.