BITS2018-DOCKER AND REPRODUCIBILITY

Bioinformatics analyses typically integrate a large number of different tools and reference data to process the input data and derive results. Reproducing an analysis is often hard for other researchers, because many pieces of the puzzle are missing from the described methodology.
While the raw datasets are generally available, a clear workflow detailing how to reproduce the results is often missing. Indeed, a simple list of the tools used in the workflow may not be enough to guarantee reproducibility: different releases of the same tools and/or of the system libraries those tools rely on might lead to subtle reproducibility issues. The biggest obstacle to computational reproducibility is therefore creating a reliable, standalone, multiplatform and lightweight working environment in which all the computational requirements of a study are installed and frozen. Virtualisation and containerisation are the two approaches proposed to address this issue.

Virtual machines are very good at isolating system resources and entire working environments, while the philosophy of containers is to isolate only individual applications, not the entire system. Thus, containers are a lightweight, fast and scalable alternative to virtual machines when completely isolated execution is not mandatory.

Docker is a container framework for Linux that makes it easier for developers to create, deploy, and run applications using containers.
It has recently emerged as a promising approach to reproducibility in computational biology research by:

  • Saving time and expense on the human and computational resources allocated to analyses that have already been performed;
  • Boosting communication between computational biologists working on similar topics;
  • Enhancing transparency within the community;
  • Granting the community open access to computational knowledge.

During the workshop, participants will learn the core concepts of Docker and how to easily embed bioinformatics pipelines/workflows into a Docker container.

24 seats are available; seats will be assigned on a first-come, first-served basis. The registration fee is 100 euros. Registration opens on May 1.
 

CONFIRMED SPEAKERS AND TEACHERS

  • Dr. Marco Beccuti, University of Turin
  • Dr. Raffaele Calogero, University of Turin
  • Dr. Raoul Bonnal
  • Dr. Loredana Le Pera

 

DATE & VENUE

June 25 and 26 (full days), Molecular Biotechnology Center, Via Nizza 52

 

PROGRAM

 

June 25th - Theoretical Day

Introduction to Virtualization

Docker architecture

Installing Docker

Working with the Docker client

The Docker CLI

Building a Dockerfile

Tips, tricks and best practices for Docker

Using Compose and Swarm

Docker Hub

 

June 26th - Training Day

Building a bioinformatics template container for R

Deploying a simple bioinformatics pipeline embedded in Docker

 

The full programme will be available in the coming weeks.

 

CONTACTS

For any further information, send an email to the following address:

  • beccuti@di.unito.it

 

 
