Data & Database Workflows (page 3 of 40)
PostgreSQL, SQL, CSV, JSON, Excel, PDF, and conversion pipelines — practical workflows for working with structured data safely.
- Validating CSV with TypeScript-friendly row types
A practical guide to turning CSV rows into trustworthy TypeScript types with runtime validation, header-safe mapping, and cleaner import pipelines.
- Versioning CSV schemas without breaking downstream consumers
A practical guide to versioning CSV schemas safely, with additive-change rules, header compatibility patterns, metadata sidecars, and rollout strategies for position-based and name-based consumers.
- When not to use CSV: formats worth the migration
A practical guide to when CSV is the wrong tool, what format to use instead, and how to migrate without turning a familiar file into a larger operational problem.
- When "valid CSV" still breaks your pipeline: a field guide
A field guide to the gap between “valid CSV” and “safe to ingest,” with a repeatable workflow for debugging real pipeline failures.
- Whitespace trimming: when aggressive cleanup corrupts data
A practical guide to whitespace trimming in CSV pipelines, focused on the difference between harmless cleanup and data corruption.
- Why accented characters break after a round trip through Excel
A practical guide to why accented characters break after a round trip through Excel, with specific Excel import and save behaviors, mojibake signatures, and safer CSV workflows.
- Why your CSV has a phantom last column (trailing delimiters)
A practical guide to phantom last columns in CSV files, focused on trailing delimiters, parser behavior, loader differences, and safer validation workflows.
- XLSX vs CSV for operational handoffs: decision criteria
A practical decision guide for choosing XLSX or CSV in operational handoffs, with clear criteria for automation, analyst review, typing, and downstream reliability.
- Quarantine tables: isolating bad CSV rows without losing audits
A practical guide to quarantine tables for CSV pipelines, including row-level error capture, source lineage, replay workflows, idempotency, and audit-friendly retention.
- Quoted newlines in CSV: how they fool naive splitters
A practical guide to embedded newlines in CSV fields, why naive line splitting fails, and how to parse, validate, stream, and quarantine multiline records safely.