Topic 05. Parallel and Distributed Data Management and Analytics

Many areas of science, industry, and commerce are producing extreme-scale data that must be processed—stored, managed, analyzed—in order to extract useful knowledge. This topic seeks papers in all aspects of distributed and parallel data management and data analysis. For example, HPC in situ data analytics, cloud and grid data-intensive processing, parallel storage systems, and scalable data processing workflows are all in the scope of this topic.

Focus

Parallel, replicated, and highly-available distributed databases
Cloud and HPC storage architectures and systems
Scientific data analytics (Big Data or HPC based approaches)
Middleware for processing large-scale data
Programming models for parallel and distributed data analytics
Workflow management for data analytics
Coupling HPC simulations with in situ data analysis
Parallel data visualization
Distributed and parallel transaction, query processing and information retrieval
Internet-scale data-intensive applications
Sensor network data management
Data-intensive clouds and grids
Parallel data streaming and data stream mining
New storage hierarchies in distributed data systems
Parallel and distributed knowledge discovery and data mining

Committee

Chair: Bruno Raffin (INRIA, France)
Local chair: David E. Singh (Carlos III University of Madrid, Spain)
Julian Kunkel (German Climate Computing Center, Germany)
Lars Nagel (Johannes Gutenberg-Universität Mainz, Germany)
Toni Cortés (Barcelona Supercomputing Center, Spain)
Matthieu Dorier (Argonne National Laboratory, USA)
Wolfgang Frings (Jülich Supercomputing Centre, Germany)