I help data engineering tech leads #makeDataTrustworthy because AI cannot learn from dirty data
How to split a list inside a DataFrame cell into rows in Pandas
Step-by-step instructions to "explode" a list into DataFrame rows.
27 Aug 2018
Interactive plots in Jupyter Notebook
How to create a plot that supports zooming
24 Aug 2018
[book review] James Whittaker's Little Book of the Future
Read this book if you believe we can use A.I. and IoT to build a bright future.
22 Aug 2018
Probability plot - visually compare probability distributions
How to visually check whether your sample is normally distributed?
20 Aug 2018
Count unique elements of an infinite stream of objects
HyperLogLog - a probabilistic counting algorithm
19 Aug 2018
Live unit testing with sbt
Can I have the coolest Visual Studio feature in IntelliJ?
18 Aug 2018
Monte Carlo simulation in Python
How to make business decisions using the Monte Carlo simulation?
17 Aug 2018
Word cloud from a Pandas data frame
Create a nice visualization of the most popular words in your data frame
07 Aug 2018
Scala structural types with generics
A short example of defining a structural type that matches a generic class
05 Aug 2018
Visualize common elements of two datasets using NetworkX
How to use an undirected graph to visualize common elements of two Pandas data frames
03 Aug 2018
How to load data from Google Drive to Pandas running in Google Colaboratory
14 Jul 2018
Precision vs. recall - explanation
How to understand the difference between precision and recall?
15 Jun 2018