Data Craft - making data engineering trustworthy because AI cannot learn from dirty data
Product/market fit - buidling a data-driven product
How to test a product idea?
30 Jun 2019
How to assign people to groups in a fair way using genetic algorithms
Using Helisa and Jenetics in Scala
21 Jun 2019
Genetic algorithms in Scala - solving optimization problems
Using Helisa and Jenetics to help Fallout players
19 Jun 2019
Re: DataOps Principles: How Startups Do Data The Right Way
Team vs. a bunch of individuals reporting work time in the same spreadsheet
17 Jun 2019
From Scala to Python - Python dataclasses
Domain model in Python
14 Jun 2019
Notetaking for data science
How to document a project?
12 Jun 2019
Wilson score in Python - example
10 Jun 2019
Using a surrogate model to interpret a machine learning model
How to explain a machine learning model?
07 Jun 2019
Generalized Linear Models — Using linear regression when the dependent variable does not follow Gaussian distribution
Understanding the GLM from the statsmodels package
05 Jun 2019
PCA — how to choose the number of components?
How many principal components do we need when using Principal Component Analysis?
03 Jun 2019
How to avoid bias against underrepresented target classes while training a machine learning model
The difference between KFold and StratifiedKFold in Scikit-learn
31 May 2019
How to get the value by rank from a grouped Pandas dataframe
How to rank a grouped data frame in Pandas
29 May 2019