Importing Source Data from an External Data Repository

You can add new source datasets from cloud storage locations or database connections for use in mastering flows.

Adding a Source from Cloud Storage Locations

You can add source dataset files from cloud storage locations, such as S3, ADLS2, or GCS. Before you begin, see Requirements for Source Datasets.

  1. Navigate to Menu menu > Sources.
  2. In the bottom right, select Add Source.
  3. Enter a Source Name and optional description, then select Remote Connection.
  4. Select a connection type, enter the required information, then select Add.

You can now use the file as source data in a mastering flow.

Adding a Source from a Database Connection

You can add source datasets from connected databases, such as Snowflake or BigQuery. Before you begin, see Requirements for Source Data. You can use both database tables and views as sources.

  1. Navigate to Menu menu > Sources.
  2. In the bottom right, select Add Source.
  3. Enter a Source Name and optional description, then select Remote Connection.
  4. Select a connection type, enter the required information, then select Add.
    Note: If using a database view as a source, enter the view name in the Table field.

You can now use the table or view as source data in a mastering flow.