Casey Cheng in Towards Data Science
Mastering ExternalTaskSensor in Apache Airflow: How to Calculate Execution Delta
External Task Sensors stop bad data from trickling downstream in a data pipeline. Leverage them to create a reliable data infrastructure.
15 min read · May 8, 2023

Casey Cheng in Towards Data Science
The Art of Speeding Up Python Loop
There is no “best” looping technique in Python, only the most suitable.
10 min read · Oct 31, 2022

Casey Cheng in Towards Data Science
14 Best Practices to Tune BigQuery SQL Performance
With big data, querying is no longer just about writing the “correct” syntax, it needs to be cost-effective and fast, too. Here is how…
26 min read · May 3, 2022

Casey Cheng in Towards Data Science
Shannon Information: Discovering Atoms of Communication
Physical objects have atoms, information has bits. Claude Shannon believes that information, although intangible, can be quantified…
13 min read · Mar 21, 2022

Casey Cheng in Towards Data Science
Principal Component Analysis (PCA) Explained Visually with Zero Math
Principal Component Analysis (PCA) is an indispensable tool for visualization and dimensionality reduction for data science but is often…
12 min read · Feb 3, 2022