How to install scikit-automl in a Kaggle notebook
I wanted to give scikit-automl a try, but I don’t like to install many packages on my machine. Conda solves the problem of creating a mess, although it does not deal with the issue of running unknown code on my computer. Hence, I decided to try it on Kaggle.
I followed the installation instructions and did not expect any issues. After all, the instruction consists of only two steps:
1 2 !curl https://raw.githubusercontent.com/automl/auto-sklearn/master/requirements.txt | xargs -n 1 -L 1 pip install !pip install auto-sklearn
Parsing machine learning logs with Ahana, a managed Presto service, and Cube, a headless BI solution
Check out my article published on the Cube.dev blog!
What can go wrong?
Swig. Swig can go wrong. I was installing the packages, and at some point, I saw this error message. Pip was installing the pyrfr package but failed because of a problem with swig.
1 2 3 4 5 building 'pyrfr._regression' extension swigging pyrfr/regression.i to pyrfr/regression_wrap.cpp swig -python -c++ -modern -features nondynamic -I./include -o pyrfr/regression_wrap.cpp pyrfr/regression.i unable to execute 'swig': No such file or directory error: command 'swig' failed with exit status 1
It turns out that Kaggle has some old swig version installed on their machines, but we can quickly fix it:
1 2 3 !apt-get remove swig !apt-get install swig3.0 !ln -s /usr/bin/swig3.0 /usr/bin/swig
After that change, I ran the installation script, and everything worked fine.
You may also like
- Data/MLOps engineer by day
- DevRel/copywriter by night
- Python and data engineering trainer
- Conference speaker
- Contributed a chapter to the book "97 Things Every Data Engineer Should Know"
- Twitter: @mikulskibartosz