Building trustworthy data pipelines because AI cannot learn from dirty data
Box and whiskers plot
How to plot and interpret the box and whiskers plot
28 Sep 2018
[book review] Team Geek
This book deserves a 3-star review on Amazon for many reasons.
26 Sep 2018
How I failed to plot parallel coordinates in Matplotlib
Built-in matplotlib functions are not enough in this case
24 Sep 2018
Import Jupyter Notebook from GitHub
The easiest way to access someone else’s code in your own notebook
21 Sep 2018
Fill missing values in Pandas
Use the next or previous value to fill the missing values in Pandas
19 Sep 2018
Forward feature selection in Scikit-Learn
Two workarounds to get an equivalent of forward feature selection in Scikit-Learn
17 Sep 2018
Heat map with Matplotlib
A short tutorial about generating a heat map of the values stored in a Pandas DataFrame
14 Sep 2018
Language is all about nouns
Programmers are afraid of nouns. We often replace them with poorly written descriptions of things.
12 Sep 2018
Outlier detection with Scikit-Learn
Z-score and DBSCAN (Density-Based Spatial Clustering of Applications with Noise)
10 Sep 2018
How to set the global random_state in Scikit-Learn
What to do if you keep forgetting to set the random_state?
31 Aug 2018
JUG Thüringen meetup - retrospective
My opinion about my presentation at a meetup in Erfurt, Germany.
29 Aug 2018
How to split a list inside a DataFrame cell into rows in Pandas
Step by step instructions to "explode" a list into DataFrame rows.
27 Aug 2018