Proceedings of the 2015 ACM/SIGDA International Symposium on field-programmable gate arrays, 02/2015, pp. 161 - 170
Convolutional neural network (CNN) has been widely employed for image recognition because it can achieve high accuracy by emulating behavior of optic nerves in... 
Acceleration | FPGA | Roofline model | Convolutional neural network
Conference Proceeding
Journal of Parallel and Distributed Computing, ISSN 0743-7315, 11/2019, Volume 133, pp. 407 - 419
High-performance computing on heterogeneous platforms in general and those with FPGAs in particular presents a significant programming challenge. We contend... 
Roofline model | High-level synthesis | High-performance computing | FPGA | Cost model | Performance model | High-level programming | DOMAIN-SPECIFIC LANGUAGE | PERFORMANCE-MODEL | COMPUTER SCIENCE, THEORY & METHODS
Journal Article
Parallel Computing, ISSN 0167-8191, 01/2019, Volume 81, pp. 1 - 21
Expressing scientific computations in terms of BLAS, and in particular the general dense matrix-matrix multiplication (GEMM), is of fundamental importance for... 
Matrix-matrix product | HPC | Autotuning | Batched GEMM | Small matrices | Optimization | ROOFLINE | DENSE LINEAR ALGEBRA | COMPUTER SCIENCE, THEORY & METHODS | MODEL
Journal Article
Journal of Computational Physics, ISSN 0021-9991, 07/2017, Volume 340, pp. 138 - 159
Journal Article
ACM Transactions on Architecture and Code Optimization (TACO), ISSN 1544-3566, 08/2019, Volume 16, Issue 3, pp. 1 - 27
Advances in processor design have delivered performance improvements for decades. As physical limits are reached, refinements to the same basic technologies... 
Energy-aware computing | Energy-efficiency | Power optimisation | ROOFLINE | COMPUTER SCIENCE, HARDWARE & ARCHITECTURE | TIME | COMPUTER SCIENCE, THEORY & METHODS | MODEL | LOGP
Journal Article
ACM Transactions on Mathematical Software (TOMS), ISSN 0098-3500, 04/2018, Volume 44, Issue 3, pp. 1 - 32
Journal Article
Concurrency and Computation: Practice and Experience, ISSN 1532-0626, 06/2017, Volume 29, Issue 12, article e4143
Journal Article
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), ISSN 0302-9743, 2019, Volume 11416, pp. 86 - 105
Conference Proceeding
Journal of Parallel and Distributed Computing, ISSN 0743-7315, 08/2017, Volume 106, pp. 153 - 169
MOST (Method Of Splitting Tsunami) is widely used to solve shallow water equations (SWEs) for simulation of tsunami. This paper presents high-performance and... 
Roofline model | Tsunami simulation | Stream computing | FPGA | GPU | Custom hardware | COMPUTER SCIENCE, THEORY & METHODS | Models | Circuit design | Tsunamis | Digital integrated circuits | Computer science | Discovery and exploration | Outer space
Journal Article
ACM Transactions on Design Automation of Electronic Systems (TODAES), ISSN 1084-4309, 05/2017, Volume 22, Issue 3, pp. 1 - 26
Performance and energy are two major concerns for application development on heterogeneous platforms. It is challenging for application developers to fully... 
Workload partitioning | Heterogeneous platforms | Energy modeling | Performance modeling | COMPUTER SCIENCE, SOFTWARE ENGINEERING | COMPUTER SCIENCE, HARDWARE & ARCHITECTURE | MULTICORE | ROOFLINE MODEL
Journal Article
ACM Transactions on Architecture and Code Optimization (TACO), ISSN 1544-3566, 01/2019, Volume 15, Issue 4, pp. 1 - 27
Graphics Processing Units (GPUs) are vastly used for running massively parallel programs. GPU kernels exhibit different behavior at runtime and can usually be... 
kernel metrics | resource utilization | feature selection | Classification | concurrency | ROOFLINE | COMPUTER SCIENCE, HARDWARE & ARCHITECTURE | PERFORMANCE | COMPUTER SCIENCE, THEORY & METHODS | MODEL
Journal Article
Journal of Parallel and Distributed Computing, ISSN 0743-7315, 05/2018, Volume 115, pp. 56 - 66
Partial solution variant of the cyclic reduction (PSCR) method is a direct solver that can be applied to certain types of separable block tridiagonal linear... 
PSCR method | Roofline model | Fast direct solver | Partial solution technique | Separable block tridiagonal linear system | GPU computing | BOUNDARY-CONDITIONS | PERFECTLY MATCHED LAYER | CYCLIC REDUCTION ALGORITHM | COMPUTER SCIENCE, THEORY & METHODS | ABSORPTION | EQUATION | FAST POISSON SOLVERS | PARALLEL
Journal Article
The Journal of Supercomputing, ISSN 0920-8542, 12/2019, Volume 75, Issue 12, pp. 7778 - 7789
Journal Article
International Journal of High Performance Computing Applications, ISSN 1094-3420, 03/2018, Volume 32, Issue 2, pp. 220 - 230
Journal Article
Concurrency and Computation: Practice and Experience, ISSN 1532-0626, 05/2016, Volume 28, Issue 7, pp. 2295 - 2315
Memory‐bound algorithms show complex performance and energy consumption behavior on multicore processors. We choose the lattice Boltzmann method on an... 
lattice Boltzmann method | ECM performance model | energy optimization | COMPUTER SCIENCE, SOFTWARE ENGINEERING | ROOFLINE | PERFORMANCE-MODEL | COMPUTER SCIENCE, THEORY & METHODS | Energy conservation | Analysis | Reduction | Algorithms | Computer simulation | Chips | Mathematical models | Optimization | Clocks
Journal Article
IEICE Electronics Express, ISSN 1349-2543, 07/2014, Volume 11, Issue 15
A "roofline model" is a system performance and optimization guide for programmers and system engineers to apply in the design of future architectures. We... 
Embedded system | Roofline model | Working set size | ENGINEERING, ELECTRICAL & ELECTRONIC
Journal Article