DataForge ETL
✅ Great Expectations · 🔧 dbt · 🤖 AI Analysis

ETL Pipeline Studio

Upload → Profile → Quality → Transform → dbt → Load · Now with AI column analysis & natural language queries

📁 Drop your data file here, or click to browse
Supported formats: CSV · JSON · XLSX · TSV · Parquet

Pipeline Templates

Pre-built pipelines for common scenarios. Pick one, point it at your data, done.

Scheduled Pipelines

Automate ETL runs. Get notified via email or Slack when complete.

+ Create New Schedule
Active Schedules
No schedules yet.

Run History

📋 All Runs

Datasets

🔗 Join Datasets

Drag columns from the left dataset onto matching columns in the right dataset to create join key mappings.

Left ▸
◂ Right
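
As a rough illustration of what a join key mapping amounts to, the sketch below performs an inner join over two small in-memory datasets, driven by a left-to-right column mapping. The function name `inner_join`, the `right_` collision prefix, and the sample rows are assumptions made for this example; they are not taken from DataForge itself.

```python
from typing import Dict, List

Row = Dict[str, object]


def inner_join(left: List[Row], right: List[Row], key_map: Dict[str, str]) -> List[Row]:
    """Inner-join two lists of row dicts using left -> right column mappings."""
    left_cols = list(key_map.keys())
    right_cols = [key_map[c] for c in left_cols]

    # Index the right-hand rows by their join-key tuple for fast lookup.
    index: Dict[tuple, List[Row]] = {}
    for r in right:
        index.setdefault(tuple(r[c] for c in right_cols), []).append(r)

    joined: List[Row] = []
    for l in left:
        key = tuple(l[c] for c in left_cols)
        for r in index.get(key, []):
            merged = dict(l)
            for col, val in r.items():
                if col in right_cols:
                    continue  # the key value is already present via the left row
                # Prefix right-hand columns that would collide with left-hand ones.
                merged[col if col not in merged else f"right_{col}"] = val
            joined.append(merged)
    return joined


# Hypothetical sample data: "cust_id" on the left is mapped to "id" on the right.
orders = [
    {"cust_id": 1, "total": 30},
    {"cust_id": 2, "total": 12},
    {"cust_id": 9, "total": 5},
]
customers = [{"id": 1, "name": "Ada"}, {"id": 2, "name": "Lin"}]
result = inner_join(orders, customers, {"cust_id": "id"})
```

Here the order with `cust_id` 9 has no matching customer, so it is dropped, as expected for an inner join.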

📡 Schema Drift Detection

Compare two dataset versions to detect column changes, type mismatches, and statistical drift.

BASELINE (OLDER)
NEW (NEWER)
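
One way to picture the comparison described above is the minimal sketch below. It assumes each dataset version is summarised as a column-to-type mapping, and it uses a relative change in the mean as a stand-in for statistical drift. The helper names `schema_drift` and `mean_shift` and the sample schemas are hypothetical; the checks DataForge actually runs are not documented here.

```python
from statistics import mean
from typing import Dict, List


def schema_drift(baseline: Dict[str, str], new: Dict[str, str]) -> Dict[str, list]:
    """Report added columns, removed columns, and columns whose type changed."""
    added = sorted(set(new) - set(baseline))
    removed = sorted(set(baseline) - set(new))
    type_changes = sorted(
        (col, baseline[col], new[col])
        for col in set(baseline) & set(new)
        if baseline[col] != new[col]
    )
    return {"added": added, "removed": removed, "type_changes": type_changes}


def mean_shift(baseline_values: List[float], new_values: List[float]) -> float:
    """Relative change in the mean of a numeric column between two versions."""
    base = mean(baseline_values)
    if base == 0:
        raise ValueError("baseline mean is zero; relative shift is undefined")
    return (mean(new_values) - base) / abs(base)


# Hypothetical schemas for the baseline (older) and new (newer) versions.
baseline_schema = {"id": "int", "amount": "float", "region": "str"}
new_schema = {"id": "str", "amount": "float", "country": "str"}

report = schema_drift(baseline_schema, new_schema)
shift = mean_shift([10.0, 20.0, 30.0], [15.0, 25.0, 35.0])
```

With this sample, `report` lists `country` as added, `region` as removed, and `id` as changed from `int` to `str`; `shift` is 0.25, a 25% increase in the mean.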

🛡️ Data Quality Rules

Build rule sets to validate your datasets — check for nulls, ranges, patterns, uniqueness, and more.
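
To make the idea of a rule set concrete, the sketch below models each rule as a function that returns the indices of failing rows and covers the four checks named above: nulls, ranges, patterns, and uniqueness. The rule constructors, the rule names, and the sample rows are assumptions for illustration only and do not reflect DataForge's internal rule format.

```python
import re
from typing import Callable, Dict, List

Row = Dict[str, object]
Rule = Callable[[List[Row]], List[int]]  # returns indices of failing rows


def not_null(col: str) -> Rule:
    return lambda rows: [i for i, r in enumerate(rows) if r.get(col) is None]


def in_range(col: str, lo: float, hi: float) -> Rule:
    # Null values are skipped here; pair this rule with not_null to catch them.
    return lambda rows: [
        i for i, r in enumerate(rows)
        if r.get(col) is not None and not (lo <= r[col] <= hi)
    ]


def matches(col: str, pattern: str) -> Rule:
    rx = re.compile(pattern)
    return lambda rows: [
        i for i, r in enumerate(rows)
        if r.get(col) is not None and not rx.fullmatch(str(r[col]))
    ]


def unique(col: str) -> Rule:
    def check(rows: List[Row]) -> List[int]:
        seen = set()
        failing = []
        for i, r in enumerate(rows):
            value = r.get(col)
            if value is None:
                continue
            if value in seen:
                failing.append(i)  # flag the second and later occurrences
            seen.add(value)
        return failing
    return check


# Hypothetical dataset and rule set.
rows = [
    {"id": 1, "age": 34, "email": "a@example.com"},
    {"id": 2, "age": -5, "email": "not-an-email"},
    {"id": 2, "age": None, "email": "c@example.com"},
]
rule_set: Dict[str, Rule] = {
    "age_not_null": not_null("age"),
    "age_range": in_range("age", 0, 120),
    "email_pattern": matches("email", r"[^@\s]+@[^@\s]+\.[^@\s]+"),
    "id_unique": unique("id"),
}
failures = {name: rule(rows) for name, rule in rule_set.items()}
```

Returning row indices rather than a plain pass/fail keeps each rule's result traceable to the offending records.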

Rule Sets
No rule sets yet
DATASET

👤 My Profile

Manage your personal information and account security.

Data Engineer
Active
USERNAME
EMAIL
LOCATION
MEMBER SINCE
RUNS: 0
DATASETS: 0
SUCCESS: 0%
👤 Personal Information
FULL NAME
EMAIL
JOB TITLE
COMPANY
LOCATION
🔒 Change Password

⚙️ Settings

Manage your preferences — all settings are saved locally in your browser.

🤖 AI Configuration
AI Column Analysis
Automatically analyse columns with Groq/Gemini after every upload
Auto-run AI after pipeline
Run AI column analysis automatically after every pipeline run completes
Generate dbt models
Auto-generate a dbt SQL model after each pipeline run
📦 Export Preferences
Export Parquet alongside CSV
Save a Parquet copy of every cleaned output automatically
Default export format
Format used when clicking the download button
Show data preview after run
Display cleaned data preview table below run results
🔔 Notifications
Pipeline complete notification
Show a browser notification when a pipeline finishes
Error alerts
Show a browser notification when a pipeline fails
Pipeline Defaults
Auto-select all steps
Select all transform steps by default when the page loads
Default missing value strategy
Strategy used by the Fill Missing step
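
The strategies available to the Fill Missing step are not listed on this page. As a sketch only, the example below assumes four common ones (`mean`, `median`, `mode`, and `constant`) and shows how a default strategy might be applied to a numeric column. The function name `fill_missing` and its parameters are hypothetical.

```python
from statistics import mean, median, mode
from typing import List, Optional

# Assumed strategy names; DataForge's actual option list may differ.
STRATEGIES = ("mean", "median", "mode", "constant")


def fill_missing(
    values: List[Optional[float]],
    strategy: str = "mean",
    constant: float = 0.0,
) -> List[float]:
    """Replace None entries in a numeric column using the chosen strategy."""
    if strategy not in STRATEGIES:
        raise ValueError(f"unknown strategy: {strategy!r}")

    present = [v for v in values if v is not None]
    if strategy == "constant" or not present:
        # Fall back to the constant when there is nothing to compute from.
        fill = constant
    elif strategy == "mean":
        fill = mean(present)
    elif strategy == "median":
        fill = median(present)
    else:
        fill = mode(present)

    return [fill if v is None else v for v in values]


column = [1.0, None, 3.0, 8.0]
filled_mean = fill_missing(column, "mean")
filled_median = fill_missing(column, "median")
filled_const = fill_missing(column, "constant", constant=-1.0)
```

For this column, the mean strategy fills the gap with 4.0 and the median strategy with 3.0, which shows how the choice of default can change downstream statistics.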
🎨 Appearance
Show breadcrumb path
Show "www.dataforge.com / pipeline-studio" bar at the top
Show tool badges
Show Great Expectations · dbt · AI Analysis badges in header
⚠️ Danger Zone
Clear all run history
Permanently delete all pipeline run records — cannot be undone
Delete all datasets
Permanently delete all uploaded dataset files — cannot be undone
Reset all settings
Restore all preferences to their default values