Six Amazing Unknown Python Libraries
Author(s): Dhilip Subramanian Originally published on Towards AI. Cool python libraries for Data Engineering and NLP Photo by Jamie Fenn on Unsplash Iβve been using Python extensively for the last five years. As a result, Iβm always looking for amazing libraries that …
Navigating the World of Data Engineering: A Beginnerβs Guide.
Author(s): Data Science meets Cyber Security Originally published on Towards AI. A GLIMPSE OF DATA ENGINEERING U+2764 IMAGE SOURCE: BY AUTHOR Data or data? No matter how you read or pronounce it, data always tells you a story directly or indirectly. Data …
Airflow Production Tips β Grouped Failures and Retries
Author(s): Guilherme Banhudo Originally published on Towards AI. Photo by Jackson Simmer on Unsplash Apache Airflow has become the de facto standard for Data Orchestration. However, throughout the years and versions, it accumulated a set of nuances and bugs which can hinder …
Airflow Production Tips β Proper Task (Not DAG) Catchup
Author(s): Guilherme Banhudo Originally published on Towards AI. Photo by Jackson Simmer on Unsplash Apache Airflow has become the de facto standard for Data Orchestration. However, throughout the years and versions, it accumulated a set of nuances and bugs which can hinder …
Jobs in Data: What the Data Tells Us About Skills And Salaries
Author(s): Jonty Haberfield Originally published on Towards AI. A brief analysis of over 9000 job specs for UK data professionals This member-only story is on us. Upgrade to access all of Medium. Photo by Eric Prouzet on Unsplash Itβs an interesting time …
The Universe of βData Scienceβ Roles
Author(s): Shahrokh Barati Originally published on Towards AI. Data Scientist vs. Data Analyst vs. Data Engineer vs. ML Engineer vs. MLOps Engineer vs. [insert your fancy role title here]β¦ This member-only story is on us. Upgrade to access all of Medium. Visual …
The Data Engineering Pipeline
Author(s): Rijul Singh Malik Originally published on Towards AI. A Blog about Discussing an in-depth discussion around building a data pipeline Photo by JJ Ying on Unsplash Data Engineers are at the heart of the engine room of any data-driven company. This …
Step by Step Guide on Web Scraping Using Scrapy In Python
Author(s): Songhao Wu Originally published on Towards AI. How to retrieve second-hand cars information in Singapore This member-only story is on us. Upgrade to access all of Medium. Photo by Tyler Franta on Unsplash In one of my previous articles, I introduced …
[DBT] Set Snowflake Query Tag for each DBT model [Tip-2]
Author(s): Karthikeyan Siva Baskaran Originally published on Towards AI. Software Engineering Query Tag feature in DBT is a database-specific configuration. In this article, let see how to customize it for Snowflake. Query tags are a Snowflake parameter that can be quite useful …
7 Ways to Identify and Handle Missing Data | 3 Ways You Should Not
Author(s): Raja Dev Originally published on Towards AI. Data Science 7 Ways to Identify and Handle Missing Data U+007C 3 Ways You Should Not 10 Strategies to Prepare High-Quality Data for ML Image from Canva Pro Good Data Scientists acknowledge the reasons …