Data Engineer with a passion for transforming complex data landscapes into insightful stories. Here on my blog, I share insights, challenges, and the ever-evolving dance of technology and business.
What function do you use to explicitly de-duplicate in BigQuery? I normally use ROW_NUMBER(), but I've recently encountered a really interesting blog...
SELECT * is not a good practice in production, but I still use it for spot checks, when debugging, analyzing or validating data - especially when...
In Python you might encounter the "dunder" (so called because of the double underscore before and after their names) or "magic" methods, because...
I use STRUCTs in #BigQuery quite a bit. One interesting use case for them is auditing: a separate column containing metadata about how the...
One of the most common sightings in SQL code is the use of ranking functions. It's simple, but we must get it right. I use ROW_NUMBER very often for...
So, if you ever find yourself working with multiple window functions in BigQuery, leverage the named windows specification for tidier, leaner...