Here's one of the first things I do with SQL when I want to quickly assess the data quality in a table.
I would run a series of quick COUNTs, testing key attributes of the data, such as key columns being NULL, which can display the distribution of problematic data so it can be processed accordingly.
A good way to ensure data quality in a data pipeline is to have a good look at it in the first place.
<img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1717621805610/46bca7d8-54be-4a46-8daf-96a870193625.jpeg" alt class="image--center mx-auto" />
Found it useful? Subscribe to my Analytics newsletter at<a target="_blank" href="http://notjustsql.com"><a href="http://notjustsql.com" class="autolinkedURL autolinkedURL-url" target="_blank">notjustsql.com</a></a>.

Here's one of the first things I do with SQL when I want to quickly assess the data quality in a table.

I would run a series of quick COUNTs, testing key attributes of the data, such as key columns being NULL, which can display the distribution of problematic data so it can be processed accordingly.

A good way to ensure data quality in a data pipeline is to have a good look at it in the first place.

![](https://cdn.hashnode.com/res/hashnode/image/upload/v1717621805610/46bca7d8-54be-4a46-8daf-96a870193625.jpeg align="center")

*Found it useful? Subscribe to my Analytics newsletter at*[*notjustsql.com*](http://notjustsql.com)*.*

The first thing I do when analyzing a SQL table

Data Engineer with a passion for transforming complex data landscapes into insightful stories. Here on my blog, I share insights, challenges, and the ever-evolving dance of technology and business.


Explore Datawise - a blog on Analytics, SQL BigQuery and Python. Dive deep into tutorials, case studies, and the latest trends.