Enhancing Transformer Performance and Portability through Auto-tuning Frameworks
#CUDA #LLM #AutoTuning #PerformancePortability #Package
hgpu.org?p=30329
Latest posts tagged with #PerformancePortability on Bluesky
Enhancing Transformer Performance and Portability through Auto-tuning Frameworks
#CUDA #LLM #AutoTuning #PerformancePortability #Package
hgpu.org?p=30329
HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration
#CUDA #LLM #Compilers #AI #PerformancePortability #Package
hgpu.org?p=29940
🧪Curious about high performance across GPUs? Our new paper benchmarks a parallel FSI code on CUDA, SYCL & OpenMP across top systems. See Aristotle Martin present it at #ISC2025 on June 11, 10:45 in Hamburg!
#HPC #GPUcomputing #PerformancePortability
Thesis: Acceleration as a Service (XaaS) Source Containers
#HPC #MPI #PerformancePortability #LLM #Package
hgpu.org?p=29925
Exploring SYCL for batched kernels with memory allocations
#SYCL #CUDA #PerformancePortability #Package
hgpu.org?p=29911
Concurrent Scheduling of High-Level Parallel Programs on Multi-GPU Systems
#SYCL #TaskScheduling #PerformancePortability #HPC #Package
hgpu.org?p=29823
Leveraging LLVM OpenMP GPU Offload Optimizations for Kokkos Applications
#Kokkos #CUDA #HIP #OpenMP #PerformancePortability #Package
hgpu.org?p=29747
CPU-GPU co-execution through the exploitation of hybrid technologies via SYCL
#SYCL #OpenCL #CUDA #LLVM #PerformancePortability #LoadBalancing #HybridComputing
hgpu.org?p=29717
Analyzing the Performance Portability of SYCL across CPUs, GPUs, and Hybrid Systems with Protein Database Search
#SYCL #oneAPI #Bioinformatics #Databases #HPC #PerformancePortability #Package
hgpu.org?p=29596
Performance portability via C++ PSTL, SYCL, OpenMP, and HIP: the Gaia AVU-GSR case study
#HIP #SYCL #OpenMP #CUDA #PerformancePortability #HPC #Astrophysics #Package
hgpu.org?p=29555
Kokkidio: Fast, expressive, portable code, based on Kokkos and Eigen
#GPU #Kokkos #PerformancePortability #Package
hgpu.org?p=29541