Connecting to External Data Repositories
You use connections to configure access to cloud storage locations or databases. These locations store source datasets (source connections) or published entity data (destination connections).
Use connections to import data from or export data to external locations. You can configure access to the cloud storage locations that store source datasets (source connections) or that will store exported data (destination connections).
Your ability to add, use, and edit connections depends on your user role and permissions.
If you have permission to use a connection and access to Designer, you can add datasets stored in the connection's cloud storage location to your mastering flow.
If you have permission to use a connection and access to Publish, you can select the connection when configuring a export destination for mastered entity record data.
Supported Connection Types
Cloud Storage Connections
- Amazon S3
- Google Cloud Storage
You can download datasets exported using these connections. See Sharing a Link to an Exported Dataset and Managing Export Destinations for more information.
Database Connections
- Azure Synapse Analytics
- Snowflake
Important Known Issues for Snowflake Connections: Do not use the DEFAULT warehouse for Snowflake Source connections. Additionally, the password for the Snowflake user cannot contain special characters.
Configuring Cloud Connections
See the following topics for more:
Tamr-Provided Connections
Tamr Cloud comes with several pre-configured connections. You can use these connections to access sample data, export downloadable datasets, and upload files. These connections include:
Connection Name | Description |
---|---|
Demo Connection | This connection provides access to sample datasets stored in a Tamr-managed Google Cloud Storage (GCS) bucket. These datasets are automatically included in new data product mastering flows to be used as sample data. |
Managed Destination | This connection provides access to exported datasets stored in a Tamr-managed GCS bucket through shared links. You can also download datasets exported using this connection. See Sharing a Link to an Exported Dataset and Managing Export Destinations for more information on using this connection. |
Managed File Upload | This Tamr-managed connection provides the ability to upload source files directly into Tamr Cloud. See Managing Source Datasets. |
Updated 11 days ago