Data & Database Workflows
PostgreSQL, SQL, CSV, JSON, Excel, PDF, and conversion pipelines — practical workflows for working with structured data safely.
- Power BI: CSV folder connectors vs single-file pitfalls
A practical guide to Power BI CSV folder connectors versus single-file imports, focused on schema drift, combine-files behavior, refresh expectations, and how to avoid brittle file-based pipelines.
- Power Query: robust CSV import settings for messy vendors
A practical guide to Power Query CSV import settings for messy vendor exports, with robust defaults for delimiter, encoding, quotes, locale, type detection, and schema drift.
- Preserving leading zeros when moving CSV into spreadsheets
A practical guide to preserving leading zeros when moving CSV into spreadsheets, with safer import workflows for Excel, Power Query, Google Sheets, and Calc.
- Profiling a 1GB CSV: what to measure first
A practical guide to profiling a 1GB CSV file without guessing, focused on correctness, memory pressure, parser behavior, bad rows, and the metrics that actually predict production failures.
- Header Checker: Naming Rules That Survive BI Tools
A practical guide to CSV header naming rules that stay stable across ingestion layers, warehouses, and BI tools without turning every dashboard fix into a schema cleanup project.
- Header Row Detection When the First Line Is Not a Header
A practical guide to deciding whether the first line in a CSV is a header, data, or just preamble, without relying on brittle guesswork.
- How to Review a Vendor CSV Spec in Under an Hour
A practical guide to reviewing a vendor CSV spec in under an hour so you can spot the risky assumptions before they hit production.
- HubSpot Import CSV: Common Rejection Reasons and Fixes
A practical guide to the HubSpot CSV import errors that block records or drop field values, with concrete fixes based on HubSpot’s current import requirements.
- Idempotent CSV Loads into PostgreSQL: Patterns and Pitfalls
A practical guide to making PostgreSQL CSV loads replay-safe so reruns, partial failures, and duplicate files do not silently corrupt your tables.
- Incident Response When a Bad CSV Corrupts Downstream Metrics
A practical playbook for containing and recovering from metric corruption caused by a bad CSV feed without making the warehouse state even worse.