Skip to main content

Coalesce Copilot

Coalesce Copilot is an AI assistant integrated into the Coalesce platform. It helps you build and modify data transformation pipelines using natural language. Instead of manually configuring Nodes through the Coalesce UI, you can describe your requirements in plain language. Copilot translates them into working DAG Nodes with transformations, columns, and relationships.

Think of it as a proficient Coalesce teammate. You describe your data needs in plain English or paste SQL, and Copilot translates that into actual DAG Nodes with proper transformations, columns, and relationships.

Copilot Is Evolving

As with any AI-powered tool, Copilot is under active development. We regularly ship improvements based on feedback and testing, but you may occasionally encounter unexpected Node configurations, intermittent service interruptions, or changes to available capabilities as we refine features.

Always review generated Nodes and SQL before deployment. If Copilot becomes unresponsive, wait a few moments and start a new conversation in a fresh chat window.

Prerequisites

Before using Copilot, ensure you have:

  • Access to Coalesce version 7.27 or later
  • Active Workspace permissions to create and modify Nodes
  • Version control checkpoint created before starting work with Copilot (recommended for easy rollback)
  • Available Storage Locations (databases and schemas) configured in your Environment

Enable Copilot

Org administrators can go to Org Settings > Preferences and turn the features on or off for the whole organization. These features are not enabled by default. Reach out to your Coalesce account manager to sign up.

A screenshot of the Coalesce Org Settings page with Copilot & AI Features selected in the left navigation. The main content shows the Coalesce Copilot section with a description of the AI assistant and a Copilot toggle in the Enabled state. Below is the Organization-Wide Instructions section with its toggle in the Disabled state, a text input labeled 'Type your Copilot instructions' showing 0/2000 characters, and a disabled Save Instructions button.

On your build tab, click the AI button Coalesce Copilot logo.

A screenshot of the Coalesce DAG Exploration page showing a visual graph of data Nodes. The left panel contains navigation links for Nodes, Subgraphs, and Jobs. The main canvas displays interconnected data nodes, including SOURCE nodes like NATION, CUSTOMER, and LINEITEM (each with a brown border), and various TARGET nodes such as DIM_NATION (blue background), CDIM_NATION (dark blue border), FCT_NATION (orange border), STG_NATION (green border), CPSTG_NATION (light blue border), STG_CUSTOMER (green border), STG_CUSTOMER1 (green border), V_CUSTOMER1 (light blue border), STG_LINEITEM (green border), and PSTG_LINEITEM (light blue border). These nodes are arranged in a flow layout, connected by lines indicating data flow. Above the canvas, there are View as Graph options, a filter button, and a Run All button. A red arrow points to the top right corner of the application header, highlighting the blue sparkle AI Assist button.

Organization-Wide Instructions

Set organization wide settings such as Node and column name standards and other standards.

Copilot Organization-Wide Instructions panel with an enabled toggle and Active at Org Level badge. The text area contains instructions for node naming conventions (including date format dd_mm_yyyy), with a character count of 175/2000 and a Save Instructions button below
Set organization level instructions
Three-pane Coalesce pipeline interface: left sidebar with Nodes, Subgraphs, and Jobs; center graph view with source nodes (PART, SUPPLIER, ORDERS, CUSTOMER) and stage nodes (SUPPLIER_STAGE_12_12_2024, STG_CUSTOMER1); right panel showing the Copilot prompt Create a stage node from supplier called SUPPLIER_STAGE and confirmation that SUPPLIER_STAGE_12_12_2024 was created with all 7 columns from SOURCE.SUPPLIER
Organization instructions will apply to all users

Core Capabilities

Copilot can perform the following operations:

Build and Modify Nodes

  • Create Nodes from natural language descriptions
  • Add, remove, or update columns in existing Nodes
  • Configure joins and relationships between Nodes
  • Set business keys and system columns

Import SQL

  • Convert existing SQL queries into Coalesce DAG Nodes
  • Automatically create predecessor Nodes and dependencies
  • Parse complex joins and aggregations into proper Node structures

Answer Questions

  • Explain Workspace structure and Node configurations
  • Provide guidance on Coalesce best practices
  • Describe Node dependencies and column lineage

Multi-Step Operations

  • Execute complex workflows requiring multiple Nodes
  • Build complete pipelines from high-level requirements

Platform Awareness

  • Adapts SQL syntax for Snowflake or Databricks
  • Understands Coalesce-specific Node Types such as Stage, Dimension, Fact, View, and Persistent Stage
  • Respects your Workspace context and existing Nodes

Operating Modes

Copilot has two operating modes that determine whether it modifies your Workspace or just provides information.

Edit Mode

Copilot makes actual changes to your Workspace. Use this mode when you want to build or modify Nodes.

What it does:

  • Creates new Nodes and updates existing ones
  • Imports SQL and executes multi-step operations
  • Makes actual changes to your Workspace

Example prompts:

  • "Create a customer dimension with these columns..."
  • "Add a lifetime_value column to dim_customer"
  • [Paste SQL query]
  • "Build a complete customer 360 pipeline..."

Non-Edit Mode

Copilot provides answers, explanations, and recommendations without modifying your Workspace. This mode is safe for exploration and learning.

What it does:

  • Answers questions about your Workspace
  • Explains configurations and best practices
  • Provides guidance without making changes

Example prompts:

  • "What columns are in the sales_fact table?"
  • "How should I structure a slowly changing dimension?"
  • "Which Nodes depend on customer_dim?"
  • "What's the difference between Persistent Stage and View?"

Communicating With Copilot

Copilot appears as a chat window in the Coalesce builder. Type your request naturally—no special syntax needed. The conversation is contextual, so you can have back-and-forth exchanges:

You: "Create a customer dimension"

Copilot: Creates dim_customer Node "I've created a customer dimension table. What columns would you like?"

You: "Add customer_id, name, email, and signup_date"

Copilot: Adds the columns "I've added those columns. Would you like me to set customer_id as the business key?"

Basic Workflow

  1. Create a version control checkpoint before starting with Copilot
  2. Open the Copilot chat in the Coalesce builder
  3. Type what you want to build or ask a question
  4. Review Copilot's actions in your Workspace
  5. Refine through follow-up messages if needed
  6. Deploy when ready

Example Prompt Workflows

Copilot adapts to different working styles. Choose the approach that fits your workflow.

This approach works well when you're exploring or aren't sure of all requirements upfront.

  1. "Create a basic customer dimension"
  2. Review the Node Copilot creates
  3. "Add loyalty_tier and lifetime_value columns"
  4. "Set customer_id as the business key"

Example 1: Creating a Dimension Node

Prompt:

"Create a dimension table for customers with customer_id as the business key. Include columns for customer_id, first_name, last_name, email, and signup_date."

What Copilot does:

  • Creates a Dimension Node Type
  • Adds specified columns with appropriate data types
  • Sets customer_id as the business key
  • Configures necessary predecessor relationships

Example 2: Importing SQL

You can add SQL directly into Copilot.

Prompt:

SELECT 
c.customer_id,
c.name,
SUM(o.amount) as total_spent
FROM customers c
JOIN orders o ON c.id = o.customer_id
GROUP BY c.customer_id, c.name

What Copilot does:

  • Analyzes the SQL query
  • Creates predecessor Nodes for customers and orders
  • Builds a transformation Node with the join
  • Sets up the aggregation logic
  • Configures column mappings
A screenshot of a Coalesce Copilot panel titled Customer Spending Analysis with SQL Query. A blue code box on the right displays a SQL query that selects customer information and total spending. Below, Copilot explains how it will convert the SQL into Coalesce Nodes and shows two actions: Read Nodes with two matched Nodes and Create Node for TARGET.CUSTOMER_SPENDING. A summary describes creating the CUSTOMER_SPENDING Node by joining customers and orders. At the bottom is a chat input labeled Chat with Copilot and an Allow Edits toggle set to on.

Example 3: Multi-Step Pipeline

Prompt:

"Create a complete customer 360 view starting with raw customer data, add their order history, calculate lifetime value, and join with marketing campaign responses."

What Copilot does:

  • Creates staging Nodes for raw data sources
  • Builds dimension tables for customers
  • Creates fact tables for orders and campaign responses
  • Adds calculated columns for lifetime value
  • Wires all Nodes together with proper relationships

Example 4: Modifying Existing Nodes

Prompt:

"Add a column called lifetime_value to dim_customer that calculates the sum of all order amounts for each customer."

What Copilot does:

  • Finds the dim_customer Node
  • Adds the new column
  • Writes the aggregation logic
  • References the appropriate predecessor Node for order data

What Copilot Knows About Your Workspace

Copilot Has Awareness Of

  • Your current Workspace: All existing Nodes, their columns, and relationships
  • Your data platform: Whether you're using Snowflake or Databricks, and adapts SQL syntax accordingly
  • Available Node Types: How to build Dimension, Fact, View, and Persistent Stage
  • Your Storage Locations: Which databases and schemas are available
  • Your viewing context: Understands what you're currently looking at

What Copilot Doesn't Know

  • Copilot doesn't see the actual data in your tables. It only sees metadata like Node names, columns, and schemas
  • Data quality issues unless you describe them
  • Query performance metrics or optimization opportunities based on runtime data

Getting Better Results

  • Keep relevant Nodes visible in your Workspace
  • Reference specific Node names using clear identifiers
  • Mention target Subgraphs when organizing Nodes
  • Ask questions about existing Nodes: "What columns are in sales_fact?"
  • Request explanations: "Why did you choose a Persistent Stage instead of a View?"
  • Get recommendations: "What's the best Node Type for aggregated metrics?"

When to Use Copilot Versus the Coalesce UI

Use Copilot WhenUse the Coalesce UI When
Building new Nodes from scratchFine-tuning individual column transformations with autocomplete
Converting existing SQL to CoalesceVisual exploration of complex DAGs
Making bulk updates to multiple NodesPrecise drag-and-drop column mapping
Learning Coalesce or exploring new featuresDetailed configuration of advanced Node properties
Rapid prototyping and iterationYou prefer visual interfaces over conversation
You have clear requirements and want to save timeMaking small tweaks where clicking is faster than typing
Working with standard Coalesce patterns
Best Approach

Use both. Let Copilot build the structure quickly, then use the Coalesce UI for fine-tuning and visual validation. Teams often describe it as "Copilot for speed, Coalesce UI for polish."


What's Next?