APARSEN-EGI Community Workshop on Managing, Computing and Preserving Big Data for Research

Europe/Amsterdam
Eulerzaal (CWI small meeting room) (Amsterdam)

Science Park 125 1098 XG Amsterdam
David Giaretta (STFC and APA), Matthias Hemmje (FernUniversität in Hagen), Tiziana Ferrari (EGI.EU)
Description
Following the December 2013 workshop “EGI towards Horizon 2020”, EGI.eu and APARSEN (the Alliance Permanent Access to the Records of Science in Europe Network) invite you to an EGI Community Workshop on “Managing, computing and preserving big data for research”. The goal of the workshop is to bring together all scientific domains within the EGI community to discuss and develop requirements on e-infrastructures that foster and support the generation, analysis and use of research data.

The workshop will focus on these essential questions:
  • How can publicly funded institutions provide sustainable infrastructures to manage, preserve, analyze and give access to large research data?
  • How can generic services be developed for heterogeneous and complex datasets and diverse formats that cut across a wide range of scientific communities?
  • Which business models can be developed and deployed to provide sustainable preservation and re-use?
  • What challenges and needs arise in managing open data?
The workshop will be held in Amsterdam, 4-6 March 2014. Before the workshop, participants will be asked to fill in a questionnaire to gather their views on Long-Term Preservation (LTP) and, possibly, to present selected use cases. Besides gathering specific use cases and scenarios that support requirements analysis, the workshop aims to establish a better understanding of the communities' needs for education and training in data management planning, data curation, archiving, and preservation.

If you represent a research community and are interested in participating in the workshop, provide information about your plans and needs in the Survey by 24 February 2014.

Expressions of interest will be evaluated and selected to define a focused workshop programme tailored to the needs of the participants.

The workshop programme below is a high-level outline and will be developed further to address the themes emerging from the survey (see above).

REGISTRATION: Registration is limited to a maximum of 60 participants and will close on 28 February 2014.
 
TRAVEL INFORMATION:
  • From Amsterdam Central Station: Train to station "Amsterdam Science Park" (four trains each hour).
  • From train station Amsterdam-Amstel or Muiderpoort station: Take bus 40 to Science Park Amsterdam. The bus stop closest to the meeting location is called "Science Park Aqua".
  • From Amsterdam Airport (Schiphol): Take a train to Amsterdam Central Station and from there take a local train to station Amsterdam Science Park (there are also two direct trains every hour).
  • To find your way by public transport from any place in Amsterdam or the Netherlands, you can use the Journey Planner.
  • Travel by bus and tram in and around Amsterdam is only possible using the OV-chipkaart. You can find information here. You can buy a disposable OV-chipkaart for one trip or for a predetermined short-term use. You can find additional information here.
  • There are taxi stops near the railway stations, or you can order a taxi by telephone at (020) 677 7777. From Amsterdam Central Station, it takes about 15 minutes to get to Science Park Amsterdam by taxi.
  • The meeting location is Science Park 125; see map (the entrance is to the left of the entrance to the CWI building).
About APA
APARSEN
Expression of interest form
Use cases
Participants
  • Adam Rönnlund
  • Alan Beccati
  • Alessandro Spinuso
  • Andreas Drakos
  • Antonella Fresa
  • Arsen Hayrapetyan
  • David Giaretta
  • Diana Pasquariello
  • Diego Scardaci
  • Fulvio Marelli
  • Gabor Terstyanszky
  • Geneviève Romier
  • Gergely Sipos
  • Iban Cabrillo
  • Ingemar Häggström
  • Jiri Sitera
  • Jonas Matser
  • Karima Rafes
  • Luigi Carotenuto
  • Lukasz Dutka
  • Malgorzata Krakowian
  • Matthew Viljoen
  • Matthias Hemmje
  • Mirko Albani
  • Nuno Ferreira
  • Paolo Bouquet
  • Peter Doorn
  • Peter Solagna
  • Ruben Riestra
  • Salvatore Pinto
  • Sergio Andreozzi
  • Wouter Los
  • Yuri Demchenko
    • 13:00 - 18:20
      Session I
      • 13:00
        Introduction to the workshop aims and methodology 20m
        Speakers: Prof. Matthias Hemmje (FernUniversität in Hagen), Dr Tiziana Ferrari (EGI.EU)
        Slides
        Workshop methodology
      • 13:20
        APA Virtual Centre of Excellence and its vision 20m
        Neelie Kroes, Vice President of the European Commission: "My message today is that data is gold. We have a huge goldmine in public administration. Let's start mining it." http://europa.eu/rapid/press-release_SPEECH-11-872_en.htm
        Speaker: Dr David Giaretta (STFC and APA)
        Slides
      • 13:40
        DP Knowhow: Open Archival Information Systems (OAIS) in ISO 14721 20m
        Speaker: Dr David Giaretta (STFC and APA)
        Slides
      • 14:00
        DP Knowhow: OAIS Extensions within the Archive-Centric Information Lifecycle 20m
        Speaker: Prof. Matthias Hemmje (FernUniversität in Hagen)
        Slides
      • 14:20
        DP Knowhow: Audit and Certification in ISO Standard 16363 15m
        Speaker: Dr David Giaretta (STFC and APA)
        Slides
      • 14:35
        Coffee Break 15m
      • 14:50
        Scientific Domain Case Study 1 - ESA Long Term Data Preservation Activities, the Earth Science Case 20m
        Speakers: Fulvio Marelli (ESA), Mirko Albani (ESA)
        Slides
      • 15:10
        Overview of Innovation Project SCIDIP-ES and APA Tools 20m
        Speaker: Dr David Giaretta (STFC and APA)
        APA tools
        Metadata packaging tool
        SCIDIP-ES facts sheet
      • 15:30
        Scientific Domain Case Study 2 - EISCAT-3D 20m
        Speaker: Ingemar Häggström (EISCAT Scientific Association)
        Slides
      • 15:50
        DP-Infra roadmap for EISCAT-3D. Discussion 20m
      • 16:10
        Coffee break 15m
      • 16:25
        Scientific Domain Case Study 3 - DP-HEP and ISIS Pulsed Neutron and Muon Source 20m
        Speaker: Mr Matthew Viljoen (STFC RAL)
        Slides
      • 16:45
        DP-Infra Roadmap for DP-HEP/ISIS. Discussion 20m
      • 17:05
        Scientific Domain Case Study 4 - Space Station Data 20m
        Speaker: Luigi Carotenuto (Telespazio)
        Slides
      • 17:25
        DP-Infra Roadmap for Space Station Data. Discussion 20m
    • 09:00 - 19:00
      Session II
      Convener: David Giaretta (Alliance for Permanent Access, European Union Network of Excellence)
      • 09:00
        DP Knowhow: Introduction to Data Management Planning 20m
        Speaker: Peter Doorn (DANS)
        Slides
      • 09:20
        DP Knowhow: Introduction to Audit and Certification in ISO 16363 20m
        Speaker: Dr David Giaretta (STFC and APA)
        Slides
      • 09:40
        DP Knowhow: Introduction to Data Seal of Approval 20m
        Speaker: Peter Doorn (DANS)
        Slides
      • 10:00
        DP Knowhow: Introduction to APA Persistent Identifier Infrastructure 20m
        Speaker: Paolo Bouquet (Università degli Studi di Trento)
        Slides
      • 10:20
        Scientific Domain Case Study 4 - Seismology (VERCE) 20m
        Speaker: Alessandro Spinuso (KNMI)
        Slides
      • 10:40
        DP-Infra Roadmap for Seismology. Discussion 20m
      • 11:00
        Coffee Break 15m
      • 11:15
        Scientific Domain Case Study 5 - Center for Data Science (CDS). Paris Saclay University 20m
        Speaker: Karima Rafes (Inria Saclay)
        Slides
      • 11:35
        DP-Infra roadmap for CDS. Discussion 20m
      • 11:55
        Scientific Domain Case Study 6 - Agricultural research data (agINFRA project) 20m
        Speaker: Andreas Drakos (Agro-Know Technologies)
        Slides
      • 12:15
        DP-Infra roadmap for agricultural research data. Discussion 20m
      • 12:35
        Lunch 1h
      • 13:35
        Scientific Domain Case Study 7 - Digital Cultural Heritage 20m
        Speaker: Antonella Fresa (Promoter S.r.l.)
        Slides
      • 13:55
        DP-Infra roadmap for digital cultural heritage. Discussion 20m
      • 14:15
        DP Sustainability: Towards Value Generation through DP Infrastructures 20m
        Speaker: Dr David Giaretta (STFC and APA)
        Slides
      • 14:35
        DP Sustainability: DP Market, Drivers, Barriers, Impact, Towards Sustainable Business Model Development 20m
        Speaker: Ruben Riestra (APARSEN)
        Slides
      • 14:55
        Coffee break 20m
      • 15:15
        Scientific Domain Case Study 8 - EarthServer project 20m
        Speaker: Alan Beccati (Jacobs University Bremen gGmbH)
        Slides
      • 15:35
        DP-Infra roadmap for EarthServer 20m
      • 15:55
        Sustainability review for DP-Infra of ESA LTDP 15m
        Speaker: Mirko Albani (ESA)
      • 16:10
        Sustainability review for DP-Infra of EISCAT 15m
        Speaker: Ingemar Häggström (EISCAT Scientific Association)
        Slides
      • 16:25
        Coffee break 15m
      • 16:40
        Sustainability review for DP-Infra of DP-HEP and ISIS Pulsed Neutron and Muon Source 15m
        Speaker: Mr Matthew Viljoen (STFC RAL)
        Slides
      • 16:55
        Sustainability review for DP-Infra of Seismology 15m
        Speaker: Alessandro Spinuso (KNMI)
      • 17:10
        Sustainability review for Space Station Data 15m
        Speaker: Luigi Carotenuto (Telespazio)
        Slides
      • 17:25
        Sustainability review for agINFRA 15m
        Speaker: Andreas Drakos (Agro-Know Technologies)
      • 17:40
        Sustainability review for DCH-RP 15m
        Speaker: Antonella Fresa (Promoter S.r.l.)
    • 09:00 - 14:00
      Session III
      • 09:00
        Service requirements 20m
        Data Management, Audit & Certification, DSA, Persistent Identifiers, Metadata Packaging, Value Generation, Business Models, etc.
      • 09:20
        Training requirements 20m
        Data Management, Audit & Certification, DSA, Persistent Identifiers, Metadata Packaging, Value Generation, Business Models, etc.
      • 09:40
        Consultancy requirements 20m
      • 10:00
        Strategy education requirements 20m
      • 10:20
        Strategic Business Development and Sustainability Potentials 20m
      • 10:40
        Coffee break 20m
      • 11:00
        Coffee break 20m
      • 11:20
        Next steps 1h
        Digital cultural heritage
        EarthServer
        EISCAT-3D
        International Space Station
        ISIS Neutron Source
      • 12:20
        Lunch 1h