Data-Quality on Andrea Bozzo

Recent content in Data-Quality on Andrea Bozzo | Blog
https://andreabozzo.pages.dev/en/tags/data-quality/
Mon, 23 Mar 2026 00:00:00 +0000

Guardrails for Tabular ML: A Data Engineer's Take on Data Leakage, Poisoning, and Brittle Pipelines
https://andreabozzo.pages.dev/en/posts/tabularmlpipes-blog/
Mon, 23 Mar 2026 00:00:00 +0000

Most ML pipeline failures are not exotic model bugs; they are data issues that nobody encoded as checks. This article walks through building guardrails using pandas, Apache DataFusion, data contracts, and the Arrow C Data Interface.