One of my resolutions for 2019 is to use docker to delve deeper into reproducible science. As explained in the Wikipedia’s article about docker:
Docker is a computer program that performs operating-system-level virtualization, also known as “containerization”.
Basically, instead of installing software in a way that very much likely depends on the Operating System (OS) you are using, you install docker (OS specific) and then you run command lines (not OS specific) to install the set of containers (software) you need. Hence, docker provides you with a reproducible software installation! Carl Boettiger gives a very nice example of a fully reproducible data analysis pipeline that uses docker.
The steps to install docker on debian are thoroughly described here
Few months ago, I thought sudo apt-get install docker
was enough, but I realized that it installs an old version of docker:
|
|
After I completed all the steps to install the most recent stable version of
docker, version 18.09
was installed.
|
|
Then I updated my gist to set up my debian machine accordingly.
As I use R very frequently, my next step is to read An Introduction to Rocker: Docker Containers for R Carl Boettiger and Dirk Eddelbuettel and also the nice tutorial by Colin Fay he mentions in one of his recent tweets.
So, if one of your 2019 goals is to learn #Docker, I've written a little introductory blog post about using Docker and #RStats for reproducibility
— Colin 🤘🌱🏃♀️ (@_ColinFay) January 6, 2019
"An Introduction to Docker for R Users" https://t.co/AI1DIYo69i pic.twitter.com/diCOTJKpL8