ETL Pipeline Studio
Upload → Profile → Quality → Transform → dbt → Load · Now with AI column analysis & natural language queries
Drop your data file here
or click to browse
CSVJSONXLSXTSVParquet
📄
Pipeline Templates
Pre-built pipelines for common scenarios. Pick one, point it at your data, done.
Scheduled Pipelines
Automate ETL runs. Get notified via email or Slack when complete.
+ Create New Schedule
Active Schedules
No schedules yet.
Run History
All Runs
Datasets
🔗 Join Datasets
Drag columns from the left dataset onto matching columns in the right dataset to create join key mappings.
Left ▸
◂ Right
📡 Schema Drift Detection
Compare two dataset versions to detect column changes, type mismatches, and statistical drift.
BASELINE (OLDER)
NEW (NEWER)
🛡️ Data Quality Rules
Build rule sets to validate your datasets — check for nulls, ranges, patterns, uniqueness, and more.
Rule Sets
No rule sets yet
DATASET
👤 My Profile
Manage your personal information and account security.
?
—
Data Engineer
USERNAME
—
EMAIL
LOCATION
—
MEMBER SINCE
—
0
RUNS
0
DATASETS
0%
SUCCESS
Personal Information
FULL
NAME
—
EMAIL
—
JOB
TITLE
—
COMPANY
—
LOCATION
—
Change Password
⚙️ Settings
Manage your preferences — all settings are saved locally in your browser.
AI Configuration
✓ Saved
AI Column Analysis
Automatically analyse columns with Groq/Gemini after every upload
Auto-run AI after pipeline
Run AI column analysis automatically after every pipeline completes
Generate dbt models
Auto-generate dbt SQL model after each pipeline run
Export Preferences
✓ Saved
Export Parquet alongside CSV
Save a Parquet copy of every cleaned output automatically
Default export format
Format used when clicking the download button
Show data preview after run
Display cleaned data preview table below run results
Notifications
✓ Saved
Pipeline complete notification
Show a browser notification when a pipeline finishes
Error alerts
Show a browser notification when a pipeline fails
Pipeline Defaults
✓ Saved
Auto-select all steps
Select all transform steps by default when the page loads
Default missing value strategy
Strategy used by the Fill Missing step
Appearance
✓ Saved
Show breadcrumb path
Show "www.dataforge.com / pipeline-studio" bar at the top
Show tool badges
Show Great Expectations · dbt · AI Analysis badges in header
Danger Zone
Clear all run history
Permanently delete all pipeline run records — cannot be undone
Delete all datasets
Permanently delete all uploaded dataset files — cannot be undone
Reset all settings
Restore all preferences to their default values