Warehouse: Managing Your Datasets

Learn how to create, deploy, and manage SQL-based datasets in the Warehouse, including scheduling refreshes, adding data quality tests, and organizing with folders.

The Warehouse is where you manage your datasets organized in folders. Datasets are SQL-based transformations of your source data that power your dashboards and reports.

Create a New Dataset

  1. Navigate to the Warehouse in the left sidebar
  2. Click Create Dataset
  3. Enter a name for your dataset
  4. Enter a SQL view name (must be SQL-compatible, no spaces)
  5. Select a folder to organize it in (or create a new folder)
  6. Click Create

Write Your SQL Query

  1. Open your new dataset
  2. Use the built-in SQL editor (powered by Monaco Editor) with syntax highlighting and auto-complete
  3. Write your SQL to select, join, filter, and aggregate data from your source tables
  4. Click Preview to see results before committing
  5. Review the output to make sure it looks correct

Deploy the Query

  1. Once your SQL is ready, click Deploy
  2. The platform creates a new version with a status:
    • DEPLOYED: Currently active and running on schedule
    • UNDEPLOYED: Saved but not active
    • FAILED: Deployment encountered an error
    • EDITED: Modified since last deployment
  3. You can roll back to any previous version at any time

Set a Refresh Schedule

  1. Open the dataset settings
  2. Choose a trigger type:
Trigger TypeDescription
ManualRun on demand when you need fresh data
DailyRuns once every day
WeeklyPick a specific day of the week
MonthlyPick a specific date (1-31)
Cron expressionCustom timing for advanced scheduling
  1. Preview the next run time in your timezone
  2. Save the schedule

The platform detects maintenance windows to prevent scheduling conflicts.

Add Data Quality Tests (Optional)

  1. Open your dataset and go to the Tests tab
  2. Click Create Test
  3. Write a SQL assertion that should return zero rows if the data is valid
  4. Choose the test severity:
    • BLOCKER: Prevents deployment of bad data. If the test fails, the dataset will not update.
    • WARNING: Flags potential issues without blocking the refresh.
  5. Save the test

Tests run automatically each time the dataset refreshes. View consolidated test results and download them for review.

Organize with Folders

  1. In the Warehouse, click Create Folder to add a new folder
  2. Drag datasets between folders to reorganize
  3. Use folder hierarchies to group by department, project, or data domain

Share a Dataset (Optional)

  1. Open the dataset you want to share
  2. Configure sharing to make it available to other workspaces
  3. Shared workspaces can see the results but cannot view or modify the underlying SQL

Downloading Results

  1. Open any dataset
  2. Click the Download button
  3. Choose CSV or XLSX format
  4. The file is generated and a secure download link is provided