Partition Filter (Target Term)
Description
The Partition Filter Property is available on a Target Term for mapping in the Spark Generator.
It indicates whether the mapped source column should be used for partition pruning.
Partition pruning in Spark is a performance optimization that limits the number of files and partitions Spark reads when executing a query.
Once the data is partitioned, queries that filter on the partition columns run faster because Spark reads only a subset of the directories and files.
Parquet source files can be split into partitions.
The goal is to skip partitions that were already loaded, so that they are not reloaded on every load.
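The idea of skipping already-loaded partitions can be illustrated with a small sketch. It is not the code the Spark Generator produces; it only imitates, in plain Python, what pruning does on a Hive-style directory layout (`column=value` folders). The column name `load_date`, the helper names, and the "last loaded value" watermark are all assumptions made for this illustration.

```python
import tempfile
from pathlib import Path
from typing import List, Optional


def partition_value(directory: Path, column: str) -> Optional[str]:
    """Return the value encoded in a Hive-style folder name such as
    'load_date=2024-01-02', or None if the folder belongs to another column."""
    key, sep, value = directory.name.partition("=")
    return value if sep and key == column else None


def prune_partitions(root: Path, column: str, last_loaded: str) -> List[Path]:
    """Keep only the partition folders whose value is greater than the
    last loaded value. ISO dates compare correctly as plain strings."""
    selected = []
    for directory in sorted(root.iterdir()):
        if not directory.is_dir():
            continue
        value = partition_value(directory, column)
        if value is not None and value > last_loaded:
            selected.append(directory)
    return selected


def build_demo_layout(root: Path) -> None:
    """Create three empty, hypothetical partitions for the demonstration."""
    for day in ("2024-01-01", "2024-01-02", "2024-01-03"):
        part = root / f"load_date={day}"
        part.mkdir()
        (part / "part-0000.parquet").touch()


with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    build_demo_layout(root)
    # Assume the partition for 2024-01-01 was loaded in a previous run.
    to_read = [p.name for p in prune_partitions(root, "load_date", "2024-01-01")]
    print(to_read)  # ['load_date=2024-01-02', 'load_date=2024-01-03']
```

Only two of the three folders are selected, so the files of the already-loaded partition are never opened. In Spark the same effect comes from filtering on the partition column, which lets the engine skip the non-matching directories.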
Format
The Partition Filter is a Boolean.
The possible values are True and False.
Default Value
The default value is False.