This is an article about setting up a new computer with the software needed to do Machine Learning on it. We’re going to use Ubuntu 18.04 LTS. If you plan to use Windows on this machine as well, you’ll need to set up dual booting.
OS Installation
Install Ubuntu 18.04 LTS by downloading the image from Ubuntu’s website and imaging it to a USB drive.
There are tutorials from Ubuntu to install Ubuntu from Windows, Mac and Ubuntu I like to use Rufus for imaging if using a Windows machine. Boot up the computer with the USB drive and follow the installation instructions. You may need to set the BIOS up to boot from the USB drive. Make sure your computer is connected to the Internet before starting the installation and it will fetch and install any updates while installing. If your computer was not connected to the internet during installation be sure to run the following commands once it is connected:
sudo apt update
sudo apt upgrade
Why Ubuntu?
Ubuntu is a distribution of Linux maintained by Canonical. It is free and open-source and fairly user friendly with lots of community support available on the Internet.
Install ssh server
I recommend having a ssh server set up to log into your machine. This will be especially useful if you plan on having the computer be headless (no monitor attached).
sudo apt update
sudo apt install openssh-server
cd /etc/ssh
sudo chmod 777 sshd_config
GPU Acceleration
Install the proprietary NVIDIA drivers as well as CUDA and CUDNN to get the full capability of your NVidia GPUs.
You can go to the official NVIDIA website and find which version of the display driver you should use. You can download the drivers from the website or you can install from the terminal with the following command:
sudo apt-get install nvidia-<version number>
Then reboot and ensure the drivers installed by typing:
nvidia-smi
If you’re using the GUI you can also go to Additional Drivers and select the correct driver for you GPU.
Download and Install CUDA
You can go to the CUDA Toolkit website and get the commands for the latest version. For 10.2 on Ubuntu 18.04 use the following commands:
wget http://developer.download.nvidia.com/compute/cuda/10.2/Prod/local_installers/cuda_10.2.89_440.33.01_linux.run
sudo sh cuda_10.2.89_440.33.01_linux.run
Download and Install CUDNN
Go to the NVIDIA cuDNN home page and sign in with your account or create a new one if you don't have one yet.
Download CUDNN 7.x for CUDA 10.2. Extract the content, this will create a cuda directory, e.g. in ~/Downloads/cuda. After extracting copy the content to the CUDA folder, as follows:
sudo rsync -rl cuda/ /usr/local/cuda
Python Virtualenv
We’re going to install Python virtualenv to enable creating isolated Python environments. This will help with dependency management.
- First make sure you have all the prerequiestes:
sudo apt install python-dev python3-dev python-pip virtualenv
sudo pip install virtualenv virtualenvwrapper
Then add the following to your bashrc:
# virtualenv and virtualenvwrapper
export WORKON_HOME=$HOME/.virtualenvs
source /usr/local/bin/virtualenvwrapper.sh
- Create a new virtual environment named 'virtenv' and specify the python version you like to use on it.
virtualenv -p python3.5 virtenv
- Activate the virtual environment
source virtenv/bin/activate
- You can deactivate the virtenv by typing
deactivate
Make sure you are in your virtual environment and install the following with pip. Pip is a Python package manager and makes installing software very easy.
pip install numpy
pip install scipy
pip install pandas
pip install matplotlib
Check the installed packages in the virtual environment using:
pip list --format=columns
Machine Learning Packages
Let's now install some of the more popular Machine Learning packages.
Install OpenCV
OpenCV 3.2.0 can be installed from the Ubuntu 18.04 official repository.
sudo apt install python3-opencv
Install Scikit-learn
pip install scikit-learn
Install Tensorflow.
pip install --upgrade tensorflow-gpu
Install Keras
pip install keras
PyTorch Development Environment
Now it is time to setup the python environment for PyTorch development.
mkvirtualenv pytorch -p python3 --system-site-packages
Switch to the pytorch virtual environment and then install the packages:
pip install torch torchvision
Summary
Hopefully this guide was useful in getting your Machine Learning Box with Ubuntu set up. If you run into trouble be sure to ask the Ubuntu community for help, they are usually a friendly group and more than happy to help.