Building trustworthy data pipelines because AI cannot learn from dirty data
Predicting customer churn using the Pareto/NBD model
How to use a Python lifetimes library to build a Pareto/NBD model.
18 Mar 2019
Business metrics that make no sense
There are three kinds of metrics that won’t destroy your business.
15 Mar 2019
Nested cross-validation in time series forecasting using Scikit-learn and Statsmodels
Tweaking the parameters of Statsmodels
13 Mar 2019
How to perform an A/B test correctly in Python
What can we expect from a correctly performed A/B test?
11 Mar 2019
[book review] The hundred-page machine learning book
I have mixed feelings about this book.
08 Mar 2019
A few useful things to know about machine learning
Pedro Domingo’s observations about feature engineering
06 Mar 2019
Recommendations vs. raw data — what is better?
Should we suggest an action when we visualize data?
04 Mar 2019
How to interpret ROC curve and AUC metrics
In my opinion, AUC is a metric that is both easy to use and easy to misuse. Do you want to know why? Keep reading ;)
01 Mar 2019
How to display mathematical equations in Jupyter Notebook
LaTeX support in Jupyter Notebook
27 Feb 2019
Apriori algorithm explained
25 Feb 2019
How to change plot size in Jupyter Notebook
Pyplot parameter that configures the chart size
22 Feb 2019
Looking for structure in data — Andrews curves plot explained
How to read Andrews curves chart
20 Feb 2019