Adding Source Data
You can add new source datasets from cloud storage locations or database connections, or upload files directly into Tamr Cloud.
Before you begin, see Requirements for Source Datasets.
For BigQuery and Snowflake, you can add a single source table at a time.
For ADLS, Delta Lake, GCS, and S3, you can add either a single file or multiple files from the same location. If adding multiple files, all the files must have the same column headers. Tamr combines these files into a single source dataset.
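Because files with differing headers cannot be combined, it can help to check the headers locally before adding a multi-file source. The following is a minimal sketch, not part of Tamr Cloud. It assumes the files are delimited text files available on local disk, and it treats "same column headers" as identical names in identical order, which may be stricter than Tamr requires. The function names `read_header` and `find_header_mismatches` are hypothetical.

```python
import csv


def read_header(path, delimiter=","):
    """Return the first row of a delimited text file as a list of column names."""
    with open(path, newline="", encoding="utf-8") as handle:
        return next(csv.reader(handle, delimiter=delimiter), [])


def find_header_mismatches(paths, delimiter=","):
    """Compare every file's header to the header of the first file.

    Returns a dict mapping each mismatching path to the header found there.
    Assumption: headers must match exactly, including column order.
    """
    paths = [str(p) for p in paths]
    if not paths:
        return {}
    expected = read_header(paths[0], delimiter)
    mismatches = {}
    for path in paths[1:]:
        header = read_header(path, delimiter)
        if header != expected:
            mismatches[path] = header
    return mismatches
```

An empty result means every file shares the first file's header; any entry in the result identifies a file to fix before adding the source.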
To add a new source:
- Navigate to Menu > Sources.
- In the bottom right, select Add Source.
- Enter a Source Name and optional description.
- Select the type of source you are adding: ADLS, BigQuery, Delta Lake, GCS, S3, Snowflake, or file Upload.
- Select a configured connection for adding this source.
  Note: If a connection has not been configured for the source type, select Add Connection to set up a new connection. See Managing Connections.
- Depending on the source type:
  - For BigQuery or Snowflake, enter the source table.
  - For ADLS, Delta Lake, GCS, or S3, enter the path to the file. If you are adding multiple files, enter the path to the location where these files are stored.
  - For file Upload, browse to the file to upload.
- If you are adding a source file, specify the following format options:
  - Delimiter character. Default is comma (,).
  - Quote character used in field values. Default is double quote (").
  - Escape character for quote characters inside field values. Default is double quote (").
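To see how the three format options interact, the sketch below parses sample text with Python's standard `csv` module. It illustrates the general CSV convention only and is not Tamr's parser. It assumes that an escape character equal to the quote character means a quote inside a quoted value is written twice (""). The helper name `parse_rows` is hypothetical.

```python
import csv
import io


def parse_rows(text, delimiter=",", quotechar='"', escapechar='"'):
    """Parse delimited text using a delimiter, quote, and escape character.

    When the escape character equals the quote character, embedded quotes
    are expected to be doubled, which maps to csv's doublequote=True.
    Otherwise the escape character precedes each embedded quote.
    """
    if escapechar == quotechar:
        options = {"doublequote": True, "escapechar": None}
    else:
        options = {"doublequote": False, "escapechar": escapechar}
    reader = csv.reader(
        io.StringIO(text),
        delimiter=delimiter,
        quotechar=quotechar,
        **options,
    )
    return list(reader)


# Defaults: comma delimiter, double-quote quoting, double-quote escaping.
default_rows = parse_rows('id,comment\n1,"She said ""hello"", then left"\n')

# Non-default example: pipe delimiter with a backslash escape character.
custom_rows = parse_rows(
    'id|comment\n1|"She said \\"hi\\""\n',
    delimiter="|",
    escapechar="\\",
)
```

With the defaults, the quoted value keeps its embedded comma and quotes as a single field. If a file uses a different convention, such as the pipe and backslash variant shown, the format options need to be changed to match.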
You can now use this source in a data product.