Discover CSV files on AWS

Please first read how to use the Discovery Application.

Discover CSV files

To discover CSV files on AWS with the Discovery Application, please choose the following source system type:

Then fill in the following information:

  • Name: discovery name
    • Example: CSV_AW2019_AWS
  • S3 bucket: name of the S3 bucket on AWS containing the files to discover
    • Example: bigenius-data-training-user-01
  • Access Key: AWS access key ID 
    • Example: AKIAWW2SPUKXNWWXEUH2

To create an access key on AWS:

  • Connect to your account at https://signin.aws.amazon.com/
  • Open the security credentials menu under your account name:
  • Create an access key by clicking on the Create access key button:
  • Copy the Access key ID and the Secret access key in a safe place:
  • Secret Access Key: AWS secret access key
  • File Path Type:

    • Single CSV File: select this option if you want to discover a single CSV file
      • Example of a single CSV file:
    • Directory: select this option to discover a S3 folder containing several CSV files:
      • Example of 4 CSV files in the same folder:
  • File Path (without URL): according to the File Path Type selected, fill:
    • Single CSV File: the path to the single file
      • Example: csv/CreditCard.csv
    • Directory: the path to the folder containing all the CSV files
      • Example: csv/
  • Column Delimiter: delimiter character used to separate the values between the columns in your CSV file.
    • Example: |
  • Row Delimiter: row delimiter that separates the rows in the source CSV file:

    • Example: \r\n
  • Field Quote (optional): specify which character surrounds the string values in the file
    • Example: "
  • Field Length (optional): if you want to specify a length for each column in your file, you can enter it here. 
    • Example: 400
      • All the columns from the file will be varchar(400)