Building trustworthy data pipelines because AI cannot learn from dirty data
How to learn TDD
Learning Test-Driven Development is hard and there is nothing we can do about it
27 Aug 2021
Data Engineering - the first principles
What is true in every data engineering project?
20 Aug 2021
How to deploy MLflow on Heroku
How to deploy MLflow on Heroku using PostgreSQL as the database, S3 as the artifact storage, and BasicAuth for authentication
06 Aug 2021
What is MLOps? Do we need MLOps?
MLOps is not just DevOps applied to machine learning!
30 Jul 2021
How to add a new dataset to the Feast feature store
How to use Feast feature store in a local environment
09 Jul 2021
Building trustworthy data pipelines
How to build a trustworthy data pipeline?
02 Jul 2021
Theory of constraints in data engineering
Are you busy, but nothing ever gets done? Perhaps the theory of constraints will help you
25 Jun 2021
How writing can improve your programming skills
How writing texts for people makes you a better programmer
18 Jun 2021
The ugly truth about product demo storytelling in data teams
How to make product demos more engaging and persuade people to care about the data
11 Jun 2021
Multi-model deployment in SageMaker Endpoints
How to deploy multiple models in a single SageMaker Endpoint?
28 May 2021
How to speed up Pandas?
Is the Pandas library too slow? Here are two methods to speed it up!
21 May 2021
Data versioning with LakeFS
Why you should use LakeFS to build a data lake that supports data versioning
14 May 2021