Compare two versions of a data file to detect schema changes, finding changes, and stat deltas.

Usage

# Auto-detect: compare against git HEAD
goldencheck diff data.csv

# Explicit: two files
goldencheck diff old.csv new.csv

# Compare against a specific git ref
goldencheck diff data.csv --ref HEAD~3
goldencheck diff data.csv --ref main

# JSON output
goldencheck diff data.csv --json

Output

goldencheck diff — data.csv (current vs HEAD)

Schema changes:
  + new_column (String)
  - old_column (Int64)
  ~ status: String -> Int64

Finding changes:
  NEW   [email] 12 malformed emails
  FIXED [age] range violation resolved
  WORSE [status] 3 → 7 case variants

Stats:
  Rows: 10,000 -> 10,500 (+5%)
  Columns: 12 -> 13 (+1)

Git Integration

When given a single file, diff auto-detects if the file is tracked in git:

  • In git: compares against HEAD (or --ref)
  • Not in git: prints an error asking for a second file

What It Compares

Category Detection
Schema Added/removed columns, type changes
Findings New issues, resolved issues, worsened/improved
Stats Row count changes, column count changes