Career Coaching for Data Professionals
Building trustworthy data pipelines because AI cannot learn from dirty data
PySpark-Check - data quality validation for PySpark 3.0.0
Last week, I was testing whether we can use AWS Deequ for data quality validation. I ran into a few problems. First of all, it...
06 Jul 2020
The problem with software testing in data engineering
What if we found a bug in our data pipelines? What if that bug were easy to fix, but it would require a lot of...
15 Jun 2020
Data flow - what functional programming and Unix philosophy can teach us about data streaming
What does stream processing have in common with functional programming and Unix?
04 May 2020
Four books to boost your programmer career
I quit my dream job because of a book
06 Jan 2020
How writing can improve your programming skills
How writing texts for people makes you a better programmer
18 Jun 2021
The ugly truth about product demo storytelling in data teams
How to make product demos more engaging and persuade people to care about the data
11 Jun 2021
Multimodel deployment in Sagemaker Endpoints
How to deploy multiple models in a single Sagemaker Endpoint?
28 May 2021
How to speed up Pandas?
Is the Pandas library too slow? Here are two methods to speed it up!
21 May 2021
Data versioning with LakeFS
Why you should use LakeFS to build a data lake that supports data versioning
14 May 2021
How to add custom preprocessing code to a Sagemaker Endpoint running a Tensorflow model
How to customize input/output of a Sagemaker Endpoint running a Tensorflow model
07 May 2021
How to A/B test Tensorflow models using Sagemaker Endpoints
How to deploy multiple model versions as one Sagemaker Endpoint
30 Apr 2021
How to predict the value of time series using Tensorflow and RNN
How to train the RNN model in Tensorflow to predict time series?
23 Apr 2021
How to deploy a REST API on AWS Lambda using Chalice and AWS Code Pipeline
How to create a REST API Endpoint using AWS Lambda, Chalice, and AWS Code Pipeline
16 Apr 2021
How to deploy a Tensorflow model using Sagemaker Endpoints and AWS Code Pipeline
How to build a Docker image using AWS Code Pipeline and deploy it as a Sagemaker Endpoint
09 Apr 2021
How to deal with days of the week in machine learning
How to encode week days as features for machine learning models
26 Mar 2021
On technical blogging
How to start blogging as a programmer
19 Mar 2021