Skip to main content

Incremental Nodes

Overview

Incremental node types

Installation

  1. Copy the Package ID: @coalesce/databricks/incremental-nodes
  2. In Coalesce, open the Workspace where you wish to install the package.
  3. Go to the Build Setting of the Workspace, tab Packages, and click the Install button on the top right of the page.
  4. Paste the Package ID, and proceed with the installation process.

Description

Databricks-Incremental-nodes

The Coalesce Incremental Package includes:

Incremental Load

The Coalesce Incremental load node is a versatile node that allows you to develop and deploy a Stage table/view in Databricks where we can perform incremental load in comparison with a persistent table added on top of it.

Incremental Load Node Configuration

Incremental Load Node Properties

PropertyDescription
Storage LocationStorage Location where the Incremental node will be created.
Node TypeName of template used to create node objects.
DescriptionA description of the node's purpose.
Deploy Enabled- If TRUE the node will be deployed / redeployed when changes are detected.
- If FALSE the node will not be deployed or will be dropped during redeployment.

Incremental Load Options

OptionsDescription
Create AsProvides option to choose materialization type as table or view.
Filter data based on Persistent table- True - provides option to perform incremental load.
- False - a normal initial load of data from source is done.
Persistent table location(required)The Coalesce storage location.
Persistent table name(required)The table name of the persistent table.
Incremental load column(date)A date column based on which incremental data is loaded.

Incremental Load Example Workflow

  1. Add a source node.
  2. Add the Incremental UDN.
  3. Leave the 'Filter data based on Persistent Table' option set to False.
  4. Create the node.
  5. Add the Persistent table to the view.
  6. Create and Run the node.
  7. Go to the Incremental UDN and change the 'Filter data based on Persistent Table' option to true.
  8. Use the pattern based option to match the persistent table (alter the definition of the UDN, if necessary), or add the table name manually in the last config item.
  9. Remove the existing (basic) join and use the 'Copy To Editor' to add the new join, including sub-select.
  10. Re-run the Incremental UDN.

Incremental Load Deployment

Incremental Load Initial Deployment

When deployed for the first time into an environment the Incremental load node of materialization type table will execute the Create State Table.

StageDescription
Create Stage TableThis will execute a CREATE OR REPLACE statement and create a table in the target environment. When deployed for the first time into an environment the Work node of materialization type view will execute the Create Stage View.
Create Stage ViewThis will execute a CREATE OR REPLACE statement and create a view in the target environment.

Incremental Load Redeployment

After the Incremental Load has been deployed for the first time into a target environment, subsequent deployments with column level changes or table level changes may result in altering the target Table.

Incremental Load Altering the Stage Tables

There are few column or table changes if made in isolation or all-together will result in an ALTER statement to modify the Work Table in the target environment.

  • Changing the table name
  • Dropping an existing column
  • Altering Column data type
  • Adding a new column

The following stages are executed:

StageDescription
Clone TableCreates an internal table.
Rename Table | Alter Column | Delete Column | Add Column | Edit table descriptionAlter table statement is executed to perform the alter operation.
Swap cloned TableUpon successful completion of all updates, the clone replaces the main table ensuring that no data is lost.
Delete TableDrops the internal table.

Incremental Load Recreating the Stage Views

The subsequent deployment of Incremental load node of materialization type view with changes in view definition,adding table description or renaming view results in deleting the existing view and recreating the view.

The following stages are executed:

StageDescription
Delete ViewDelete the view
Create ViewCreate a new view

Incremental Load Undeployment

If a Incremental load node of materialization type table is deleted from a Workspace, that Workspace is committed to Git and that commit deployed to a higher level environment then the stage table in the target environment will be dropped.

This is executed in two stages:

StageDescription
Delete TableCoalesce Internal table is dropped.
Delete TableTarget table in Snowflake is dropped.

If a Incremental load node of materialization type view is deleted from a Workspace, that Workspace is committed to Git and that commit deployed to a higher level environment then the StageView in the target environment will be dropped.

The stage executed:

StageDescription
Delete ViewDrops the existing stage view from target environment.

Code

Incremental Load Code

Versions

Available versions of the package.

Version #Release DateNotes
1.0.0November 14, 2025First release of Databricks Incremental node type