Topic 05. Parallel and Distributed Data Management and Analytics

Many areas of science, industry, and commerce are producing extreme-scale data that must be processed—stored, managed, analyzed—in order to extract useful knowledge. This topic seeks papers in all aspects of distributed and parallel data management and data analysis. For example, HPC in situ data analytics, cloud and grid data-intensive processing, parallel storage systems, and scalable data processing workflows are all in the scope of this topic.

Focus

  • Parallel, replicated, and highly-available distributed databases
  • Cloud and HPC storage architectures and systems
  • Scientific data analytics (Big Data or HPC based approaches)
  • Middleware for processing large-scale data
  • Programming models for parallel and distributed data analytics
  • Workflow management for data analytics
  • Coupling HPC simulations with in situ data analysis
  • Parallel data visualization
  • Distributed and parallel transaction, query processing and information retrieval
  • Internet-scale data-intensive applications
  • Sensor network data management
  • Data-intensive clouds and grids
  • Parallel data streaming and data stream mining
  • New storage hierarchies in distributed data systems
  • Parallel and distributed knowledge discovery and data mining

Committee

Chair: Bruno Raffin (INRIA, France)
Local chair: David E. Singh (Carlos III University of Madrid, Spain)
Julian Kunkel (German Climate Computing Center, Germany)
Lars Nagel (Johannes Gutenberg-Universität Mainz, Germany)
Toni Cortés (Barcelona Supercomputing Center, Spain)
Matthieu Dorier (Argonne National Laboratory, USA)
Wolfgang Frings (Jülich Supercomputing Centre, Germany)