Building trustworthy data pipelines because AI cannot learn from dirty data
Data engineers are data librarians or how to upgrade your data lake to 2500 BCE technology.
What can data engineers learn from (ancient) librarians?
11 Mar 2022
The problem with software testing in data engineering
What if we found a bug in our data pipelines? What if that bug were easy to fix, but it would require a lot of...
15 Jun 2020
Data flow - what functional programming and Unix philosophy can teach us about data streaming
What does stream processing have in common with functional programming and Unix?
04 May 2020
Four books to boost your programmer career
I quit my dream job because of a book
06 Jan 2020
What does kill IT projects?
What does kill IT projects? What you should avoid, at all costs, to ensure the success of your startup or software project
30 Nov 2022
How to write a growth plan as a programmer?
How to write a growth plan that helps you get promoted and doesn't get in the way when you want to focus on your hobbies
20 Nov 2022
Test-Driven Development in Python with Pytest
How to setup and use Pytest to test Python code
10 Nov 2022
Marketing for SaaS startups: how to describe your product?
How to use the "benefits over features" technique to advertise your SaaS product and get more clients than your competition
30 Oct 2022
How to pitch your idea
What a co-founder of DeepMind teaches us about pitching our ideas to investors
20 Oct 2022
MLOps at small companies
How to do MLOps while working on a small data engineering team
10 Oct 2022
Why should you practice TDD?
What are the benefits of TDD for programmers and companies that hire them?
30 Sep 2022
How to debug code
How to debug code and solve problems as fast as possible
20 Sep 2022
CUPID properties in data engineering
SOLID principles vs. CUPID properties in data engineering
10 Sep 2022
How to add tests to existing code in data transformation pipelines
How data engineers can write tests for legacy code in their ETL pipelines without breaking the existing implementation
30 Aug 2022
Software engineering practices in data engineering and data science
How to produce high-quality software in data teams
20 Aug 2022
How to sort a Pandas DataFrame by month name
How to use an ordered categorical variable to sort a Pandas Dataframe by months while displaying their names
15 Aug 2022