mikulskibartosz.name
Career Coaching for Data Professionals
Speaker
Bartosz Mikulski
Building trustworthy data pipelines because AI cannot learn from dirty data
All Stories
Encoding categorical variables in machine learning
One-hot encoding, dummy coding, and effect coding in Scikit learn and Pandas
How To Avoid Data Leakage While Building A Machine Learning Model
What to do when your model works perfectly during testing but fails in production
Using scikit-automl for building a classification model
My first attempt to use scikit-automl and how I got it working
How to return rows with missing values in Pandas DataFrame
How does it work and why the most popular solution is wrong
Preprocessing the input Pandas DataFrame using ColumnTransformer in Scikit-learn
How to encode text/categorical variables and scale numerical values using only one Scikit-learn class
How to install scikit-automl in a Kaggle notebook
error: command ‘swig’ failed with exit status 1 while installing scikit-automl
Predicting customer lifetime value using the Pareto/NBD model and Gamma-Gamma model
How to estimate the CLV from a list of customer transactions using the lifetimes library in Python
Predicting customer churn using the Pareto/NBD model
How to use a Python lifetimes library to build a Pareto/NBD model.
Business metrics that make no sense
There are three kinds of metrics that won’t destroy your business.
Nested cross-validation in time series forecasting using Scikit-learn and Statsmodels
Tweaking the parameters of Statsmodels
How to perform an A/B test correctly in Python
What can we expect from a correctly performed A/B test?
[book review] The hundred-page machine learning book
I have mixed feelings about this book.
« Prev
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
Next »