9-12 May 2017
Europe/Rome timezone

The ELIXIR-­ITALY use case within INDIGO­-Datacloud: providing Galaxy based workflow environments “on demand” through cloud technologies.

Not scheduled
15m

Speaker

Dr Marco Antonio Tangaro (CNR)

Description

Galaxy is an open source, web based, workflow manager for bioinformatics analysis, adopted in many life science research environments in order to facilitate the interaction with bioinformatics tools and the handling of large quantities of biological data. Currently, Galaxy comes to end users through public servers, deployable on local servers or exploitable using commercial cloud solutions. The employment of cloud solutions is rapidly growing, allowing for the creation a ready-to-use galaxy production environments, avoiding any initial configuration issue. Nevertheless, it is a costly solution, with serious ethical drawbacks in terms of data privacy, being the data accessible by the private vendor of the service. To overcome these issues, ELIXIR-IIB, the Italian node of ELIXIR, is developing a Galaxy “on-demand” instance provider platform, automating the creation of user customized and ready to use Galaxy instances on the cloud, exploiting the INDIGO-DataCloud technologies. Each instance will be deployed as an insulated environment, insulating data from any other instance on the same platform and from the cloud service administrators, providing a research environment suitable also for clinical scenarios involving sensible human data, improving the access and exploitation of computational facilities by life scientists. In particular, FutureGatway is used to provide the web interface that allows to customize the virtual workspace, e.g. selecting between different sets of tools and different virtual hardware configurations. All the required components to automatically set up Galaxy instances are deployed using the INDIGO Orchestrator and the Infrastructure Manager. Persistent storage for for users and reference data and to install new (custom) tools and workflows is provided through OneData. Finally, automatic elasticity is provided by the CLUES INDIGO service component that enables dynamic cluster resources scaling.

Primary author

Dr Marco Antonio Tangaro (CNR)

Co-authors

Dr Federico Zambelli (CNR (ELIXIR-ITA)) Dr Giacinto Donvito (INFN) Prof. Graziano Pesole (CNR (ELIXIR-ITA))

Presentation Materials