Skip to main content

Databricks connector

Ori Sagi avatar
Written by Ori Sagi
Updated yesterday

Ready to connect Databricks to Pecan? You're just a few clicks away from streaming data in or pushing predictions out. Let's make this happen together!

What You Can Do

With the Databricks connector, you can:

  • Import data from your Databricks tables and views

  • Schedule automatic data refreshes to keep your features current

  • Push predictions back to Databricks for real-world impact

Before You Start

Here's what you'll need from your Databricks workspace:

What You Need

Where to Find It

Why It's Important

A running Cluster or SQL Warehouse

Databricks console

This is the compute resource Pecan will query

Personal access token (PAT)

Settings > Developer > Access Tokens

Authenticates your connection to Pecan

Server hostname, Port, and HTTP path

Compute > ... > Connection Details

The endpoint Pecan contacts

Network access from Pecan's IPs

Admin console / firewall rules

Allows inbound JDBC traffic

Let's Connect!

Step 1: Generate Your Personal Access Token

  1. In Databricks, click your username > Settings > Developer

  2. Select Generate new token

  3. Add a comment (like "Pecan integration"), set an expiry date, and click Generate

  4. Copy the token right away - you'll only see it once!

πŸ’‘ Pro tip: Save this token somewhere secure. You'll need it in just a moment.

Step 2: Grab Your Connection Details

The details you need depend on whether you're using a SQL Warehouse or Cluster:

For SQL Warehouse:

  1. Navigate to SQL > SQL Warehouses

  2. Open your warehouse and click Connection details

  3. Copy the Server hostname, verify the Port (usually 443), and copy the HTTP path

For Cluster:

  1. Go to Compute and open your cluster

  2. Look under Advanced options > JDBC/ODBC

  3. Copy those same three values

If you're using Unity Catalog, also note your default Catalog (typically main) and the Schema you want Pecan to access.

Step 3: Set Up the Connection in Pecan

Now for the fun part - bringing it all together in Pecan!

  1. In Pecan, go to Connections > + New connection > Databricks

  2. Fill in the form with your details:

Field

What to Enter

Connection name

Any lowercase name with underscores (like sales_databricks)

This connection is used for

Choose Incoming data (default) or Outgoing predictions

Server hostname

The value you copied in Step 2

HTTP path

The value you copied in Step 2

Token

Your PAT from Step 1

Port

443 (unless your admin changed it)

Default catalog

Optional - usually main or your Unity Catalog name

Default schema

Optional - like analytics

3. Click Test connection - Pecan will check that everything's working

4. When you see the success message, click Create to save

Step 4: Put Your Connection to Work

Congratulations! Your Databricks connection is ready to roll. You'll find it in your Connections list, where you can:

  • Import tables: Browse and select datasets from your catalog and schema

  • Schedule refreshes: Tell Pecan how often to pull fresh data

  • Publish predictions: If you chose "Outgoing predictions," set up a target table or Delta Live view for your scored outcomes

Need Help? We've Got You

What's Happening

Likely Cause

Quick Fix

"Invalid token" error

Token expired or copied incorrectly

Generate a fresh PAT and update your connection

"Unknown host" or timeout

Firewall blocking the connection

Whitelist Pecan's egress IPs and confirm port 443 is open

Still stuck? No worries - our support team is here to help you succeed. Just reach out and we'll get you connected in no time!

Did this answer your question?