Data Craft - making data engineering trustworthy because AI cannot learn from dirty data
Preprocessing the input Pandas DataFrame using ColumnTransformer in Scikit-learn
How to encode text/categorical variables and scale numerical values using only one Scikit-learn class
25 Mar 2019
How to install scikit-automl in a Kaggle notebook
error: command ‘swig’ failed with exit status 1 while installing scikit-automl
22 Mar 2019
Predicting customer lifetime value using the Pareto/NBD model and Gamma-Gamma model
How to estimate the CLV from a list of customer transactions using the lifetimes library in Python
20 Mar 2019
Predicting customer churn using the Pareto/NBD model
How to use a Python lifetimes library to build a Pareto/NBD model.
18 Mar 2019
Business metrics that make no sense
How to define metrics that won’t destroy your business.
15 Mar 2019
Nested cross-validation in time series forecasting using Scikit-learn and Statsmodels
Tweaking the parameters of Statsmodels
13 Mar 2019
How to perform an A/B test correctly in Python
What can we expect from a correctly performed A/B test?
11 Mar 2019
[book review] The hundred-page machine learning book
I have mixed feelings about this book.
08 Mar 2019
A few useful things to know about machine learning
Pedro Domingo’s observations about feature engineering
06 Mar 2019
Recommendations vs. raw data — what is better?
Should we suggest an action when we visualize data?
04 Mar 2019
How to interpret ROC curve and AUC metrics
In my opinion, AUC is a metric that is both easy to use and easy to misuse. Do you want to know why? Keep reading ;)
01 Mar 2019
How to display mathematical equations in Jupyter Notebook
LaTeX support in Jupyter Notebook
27 Feb 2019