Please first read how to use the Discovery Application.
Discover Parquet or Delta files
To discover Parquet or Delta files on the local file system with the Discovery Application, please choose the following source system type:
Then fill in the following information:
- Name: discovery name
- Example: Parquet_AW2019_Local
- File Path Type:
- Directory With Multiple Entities (Delta or Parquet): select this option if you want to discover several entities contained in a folder:
- Example of 4 entities in a source folder:
- Each entity folder contains the Parquet file to discover:
- Example of 4 entities in a source folder:
- Folder for a Single Entity (Delta or Parquet): select this option to discover a folder containing all the Parquet or delta files concerning an Entity:
- Example of 1 entity Credit Card with multiple Parquet files stored by partition (date):
- Example of 1 entity Credit Card with multiple Parquet files stored by partition (date):
- Single File (Parquet only): select this option if you want to discover a single Parquet file
- Example of a single Parquet file:
- Example of a single Parquet file:
- Directory With Multiple Entities (Delta or Parquet): select this option if you want to discover several entities contained in a folder:
- File Path: according to the File Path Type selected, fill:
-
- Directory With Multiple Entities (Delta or Parquet): the path to the folder containing all the entity's folders
- Example: C:\Discovery Application\Parquet on local file system\source
- Directory With Multiple Entities (Delta or Parquet): the path to the folder containing all the entity's folders
-
- Folder for a Single Entity (Delta or Parquet): the path to the folder containing all the entity files
- Example: C:\Users\Discovery Application\Parquet on local file system\source_partition\CreditCard
- Single File (Parquet only): the path to the single file
- Example: C:\Discovery Application\Parquet on local file system\source\CreditCard
- Folder for a Single Entity (Delta or Parquet): the path to the folder containing all the entity files