Azure Synapse - Load data - 1.4

Before loading the data from your Source System(s) to Azure Synapse, please:

You are now ready to load the data.

Load the data

This section explains how to load the data in the target environment, Azure Synapse.

To load the data:

  • Open Azure Synapse Analytics with the Workspace web URL provided in the Azure Synapse Workspace resource.
  • Azure Synapse Analytics opens.
  • Click on the Develop menu on the left-hand side.
  • Open the 500_Deploy_and_Load_DataVault_Synapse.ipynb file, which contains three steps:
    1. %run Deployment.ipynb: Deploy the code
    2. %run DocumentationSparkDataVault_SimpleLoadExecution.ipynb: Load the data
    3. %run DocumentationSparkDataVault_SelectResults.ipynb: Display the results
  • Select the Apache Spark Pool (bgaasspark33v2 in our example).
  • Execute steps 2 and 3.
  • The data is now loaded:
    • You should have the target Parquet files created for each Target Model Object, for example, for the Stage CreditCard.
    • Step 3 displays a summary of the number of rows loaded for each Target Model Object.

You can now check that your data were correctly loaded with the following script:

# Create a new step with the following code:
mydf = spark.sql("select * from `rawvault`.`rdv_hub_creditcard_hub`")
mydf.show(truncate=False)

The output shows the content of your Target Parquet file.
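
If you want to check several Target Model Objects at once, the same kind of check can be sketched as a small helper that builds one row-count query per table. This is only an illustration: the helper name build_count_query and the list tables_to_check are hypothetical and not part of the generated notebooks, and the only table listed is the one from the example above. The sketch assumes it runs in a Synapse notebook step where the spark session is predefined; outside a notebook it simply prints the queries.

```python
def build_count_query(schema: str, table: str) -> str:
    """Return a Spark SQL row-count query with backtick-quoted identifiers.

    Hypothetical helper for illustration; not part of the generated notebooks.
    """
    for identifier in (schema, table):
        # Keep the sketch simple: refuse names that would break the quoting.
        if not identifier or "`" in identifier:
            raise ValueError(f"Unsupported identifier: {identifier!r}")
    return f"select count(*) as row_count from `{schema}`.`{table}`"


# Table taken from the example above; add your own Target Model Objects here.
tables_to_check = [("rawvault", "rdv_hub_creditcard_hub")]

queries = [build_count_query(schema, table) for schema, table in tables_to_check]

# In a Synapse notebook, `spark` is predefined; elsewhere we only print the SQL.
spark_session = globals().get("spark")
for query in queries:
    if spark_session is not None:
        spark_session.sql(query).show(truncate=False)
    else:
        print(query)
```

You can compare the counts returned by these queries with the summary displayed by step 3 of the notebook.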