APROPOS (Approximate Computing for Power and Energy Optimisation) (MSCA-ITN-2020-ETN/Nº 956090). (01/11/20 – 01/11/24). PI Enrique Quintana
The Approximate Computing for Power and Energy Optimisation ETN will train 15 ESRs to tackle the challenges of future embedded and high-performance computing energy efficiency by using disruptive methodologies. APROPOS aims at decreasing energy consumption in both distributed computing and communications for cloud-based cyber-physical systems. We propose adaptive Approximate Computing to optimize energy-accuracy trade-offs.
EFLOWS4HPC (Enabling Dynamic and Intelligent Workflows in the Future EuroHPC Ecosystem) (955558). (01/01/21 – 01/01/24). PI José Flich
The eFlows4HPC project aims to deliver a workflow software stack and an additional set of services to enable the integration of HPC simulation and modelling with big data analytics and machine learning in scientific and industrial applications. The software stack will support the development of innovative adaptive workflows that use computing resources efficiently and take innovative storage solutions into account. To widen access to HPC for newcomers, the project will provide HPC Workflows as a Service (HPCWaaS), an environment for sharing, reusing, deploying and executing existing workflows on HPC systems.
The DEEPHEALTH (Deep-Learning and HPC to Boost Biomedical Applications for Health) project is funded by the EC under the topic ICT-11-2018-2019 “HPC and Big Data enabled Large-scale Test-beds and Applications” (1/1/2019-1/12/2021). The aim of DeepHealth is to offer a unified framework that is fully adapted to exploit the underlying heterogeneous HPC and Big Data architectures and is assembled with state-of-the-art techniques in Deep Learning and Computer Vision.
RECIPE (REliable power and time-ConstraInts-aware Predictive management of heterogeneous Exascale systems) provides a hierarchical runtime resource management infrastructure to optimise energy efficiency and minimise the occurrence of thermal hotspots, while enforcing the time constraints imposed by the applications, and ensuring reliability for both time-critical and throughput-oriented computation.
SELENE aims to propose a new family of safety-critical computing platforms that builds upon open source components such as RISC-V cores, GNU/Linux, and the Jailhouse hypervisor. SELENE will develop an advanced computing platform that is able to:
- adapt the system to the specific requirements of different application domains, to changing environmental conditions, and to the internal conditions of the system itself
- allow the integration of applications of different criticalities and performance demands in the same platform, guaranteeing functional and temporal isolation properties
- achieve flexible diverse redundancy by exploiting the inherent redundant capabilities of the multicore
- efficiently execute compute-intensive applications by means of specific accelerators.
TACCERE (Técnicas algorítmicas para computación de alto rendimiento consciente del consumo energético y resistente a errores) (TIN2017-82972-R). (01/01/2018-30/09/2021).
The TACCERE project will contribute, as a general objective, to the design and development of algorithmic techniques, programming interfaces and tools, computational kernels, libraries of algorithms and runtime frameworks that reduce energy consumption, increase resilience to errors, and improve productivity in the development of applications that deal with vast amounts of data and exhibit irregular parallel patterns. As a generic target platform, the project will consider a heterogeneous parallel architecture, due to the energy efficiency advantage of this type of system, together with a hybrid MPI+X programming model, where X can be replaced by any of the current multi-threaded programming models such as OpenMP, OpenACC, OpenCL or CUDA.
This project aims to turn the rCUDA technology for remote GPU virtualization into a fully finished commercial product and transfer the developments made to the industry. To achieve this objective, it is necessary to: a) complete the support within rCUDA for new application areas such as Deep Learning, among others; b) develop an ecosystem around the rCUDA technology to schedule the shared use of virtualized GPUs; and c) prepare all the libraries developed during the project so that they are compatible with the latest versions of the commercial libraries with which they interact (CUDA, InfiniBand, SLURM, etc.) and optimize their operation, so that our developments are better accepted by industry.
T-PARCCA (Tecnologías Innovadoras de Procesadores, Aceleradores y Redes, para Centros de Datos y Computación de Altas Prestaciones) (RTI2018-098156-B-C51-AR). (01/01/19 – 01/01/22). Competitive research projects. Agencia Estatal de Investigación
In this consortium, GAP members participate in two core areas: improving the performance of the system nodes and improving the interconnection networks. In the former, we work on hardware design (e.g. core microarchitecture, cache hierarchy, main memory and heterogeneous systems) and on developing system software that is aware of the underlying hardware features (e.g. scheduling and thread-to-core allocation strategies). In the latter, GAP works on improving interconnection network performance and on addressing the power problem (e.g. switching off specific links).