NVIDIA Adds Official Support For RHEL-Compatible Distributions Like AlmaLinux With CUDA 13.2 - https://www.phoronix.com/news/NVIDIA-Official-RHEL-Compat
@dssgabriel
PhD candidate, HPC Software Engineering @cea.fr / DAM MSc HPC & Simulation from @univparissaclay.bsky.social Architecture, microbenchmarking & SIMD sorcery. Research on distributed computing, data structures & memory layouts at exascale. RTFM πΉ
NVIDIA Adds Official Support For RHEL-Compatible Distributions Like AlmaLinux With CUDA 13.2 - https://www.phoronix.com/news/NVIDIA-Official-RHEL-Compat
Defer available in GCC and Clang
gustedt.wordpress.com/2026/02/15/d...
Join us for the HPSF Community Summit 2026 in Braunschweig, Germany, February 25-27! π
Learn whatβs new with HPSF projects, give us feedback on your use of HPSF software, meet with project communities, and tell us how to grow and improve them.
Details: hpsf.io/event/hpsf-c...
Love the GitButler GUI but miss your CLI? Have we got the solution for you!
youtu.be/Jg8L3SbgZ3o?...
#jj-vcs 0.37.0 came out yesterday! im intrigued by the new divergent change syntax, seems very neat
github.com/jj-vcs/jj/re...
Please note: Any claims of AI Exascale, AI Zettascale or beyond computing power are just baloney. Real computing power is measured in FP64. Period.
AMD embraced utter stupidity by adopting this terminology by the leather jacket man.
It's a really shame!
#CES2026 #AMD
LLVM 22 Lands NVIDIA Olympus CPU Scheduling Model - https://www.phoronix.com/news/NVIDIA-Olympus-Sched-Model
title slide of talk being given at Rust Nation UK: [Title] Rust for Foundational SW or: Safety-Critical Software in Rust
ever curious why people that work in safety-critical systems want to use Rust?
here's the title slide for the talk i'll give at @rustnationuk.bsky.social about this
Day 24 of #AoCO2025! A loop summing 0+1+2+...+n. GCC unrolls it. Clang does something jaw-dropping: the loop vanishes entirely, replaced by a direct calculation. How?!
xania.org/202512/24-cu...
youtu.be/V9dy34slaxA
Day 23 of #AoCO2025! Switch β jump table? Sometimes. Other times: arithmetic, bitmasks, or something cleverer. Compilers have more tricks than you think.
xania.org/202512/23-sw...
youtu.be/aSljdPafBAw
Day 22: String comparison against "ABCDEFG" should call memcmp, but Clang inlines it with some clever memory tricks. How does it compare 7 bytes so efficiently? xania.org/202512/22-me... youtu.be/kXmqwJoaapg #AoCO2025
Day 21: Summing integers? Compiler vectorises beautifullyβ8 at a time! Switch to floats? It refuses, doing each add individually. Same code, totally different output. Why? π€
xania.org/202512/21-ve...
youtu.be/lUTvi_96-D8
#AoCO2025
Day 20: Process 65,536 integers one at a time? Nah. The compiler vectorises it to handle 8 at once β same code, 8Γ faster! SIMD auto-vectorisation is compiler magic π
xania.org/202512/20-si...
youtu.be/d68x8TF7XJs #AoCO2025
Day 19: Recursive functions calling themselves endlessly β stack growth? Nope! The compiler turns recursion into loops. Tail call optimisation is magic β¨
xania.org/202512/19-ta...
youtu.be/J1vtP0QDLLU #AoCO2025
Day 18: Function with fast & slow paths. Inline = code bloat. Don't inline = slow fast path. Can't have bothβor can you? The compiler finds a surprising way out of this dilemma.
xania.org/202512/18-pa...
youtu.be/STZb5K5sPDs
#AoCO2025
Actually, this die configuration is not new information, it was already mentioned on this removed slide:
(Although the CPU die's CBB name is seems still new.)
Itβs safe to assume that the HPC scheduling space is going to be in a state of Flux for quite some time to comeβ¦
(I see what I did there. With consummate apologies to @vsoch.bsky.social and @tgamblin.bsky.social in advance π€£)
How have servers and the cloud evolved in the last 30 years, and what might be next? @bcantrill.bsky.social has been at the thick of the industry since the Dotcom Boom, and shares fascinating stories.
Bryan is one of my all-time favorite people to talk with - don't miss this one.
(cont'd)
Day 17: Inlining β the ultimate optimisation β¨
A function gets inlined, half vanishes. The assembly is cleaner than hand-written. How does copy-paste make code disappear?
xania.org/202512/17-in...
youtu.be/JFHfFTvMPp0
#AoCO2025
Day 16: Calling conventions matter! Pass 8 chars as separate args: stack spillage. Pack them in a struct: single register. Sometimes structs are MORE efficient than separate parameters!
xania.org/202512/16-ca...
youtu.be/Yaw8AMoP4sI
#AoCO2025
Day 15: Two nearly identical loopsβone writes to memory every iteration, the other stays in registers. Same code, wildly different performance. The culprit? Aliasing!
xania.org/202512/15-al...
youtu.be/PPJtJzT2U04
#AoCO2025
Does this mean no more dirt-cheap NRE from Slurm? Or will Slurm development no longer be coin-operated? Would love to see serious engineering effort go into modernizing Slurm, but this could go in many directions.
Day 14: Add ONE global counter to your loop and watch LICM vanishβstrlen called every iteration! Why would incrementing an unrelated variable break the optimisation? π€
xania.org/202512/14-li...
youtu.be/OwFNblEEAXo
#AoCO2025
Day 13 of Advent of Compiler Optimisations! π
Loop calling a function whose result never changes? One compiler hoists it out automatically. The other⦠doesn't. Even with hints!
xania.org/202512/13-li...
youtu.be/dIwaqJG0WDo
#AoCO2025
Cursed code:
void* f(void *p) {
return p + 1;
}
Both gcc and clang support void* arithmetic as an extension in C:
gcc.gnu.org/onlinedocs/g...
-pedantic FTW!
Godbolt: godbolt.org/z/rcrqWvMGW
#Programming
Day 12 of Advent of Compiler Optimisations! A loop that checks the same thing every time. The compiler's solution? Make the code bigger to make it faster. Wait, what? xania.org/202512/12-lo... youtu.be/-VCrYshE7iQ #AoCO2025
Day 11: A clever bit-counting loop using the "clear bottom bit" trick. Change one compiler flag and... wait, what just happened to my loop?! Pattern recognition at its finest.
xania.org/202512/11-po...
youtu.be/Hu0vu1tpZnc
#AoCO2025
Kokkos 5.0 is officially out. β¨
Details:
- Moves the project to C++20
- Retires older interfaces, reducing complexity for future work
- Ideal time for teams to review workflows
Read the full update here: hpsf.io/blog/2025/ko...
Where did you guys get the info for the facility power draw and cooling limits? π Was it publicly announced somewhere?