Jun 18 2025

Achieving Performance Parity across Architectures: A Deeper Dive into Vector Portability

As compute workloads diversify across CPUs, GPUs, NPUs, and other processors, maintaining efficiency across architecture has become one of the most pressing challenges in high-performance and embedded computing. For developers, portability is no longer just about compiling code on different platforms – it’s about ensuring effective usage of each processor’s capabilities so that performance scales with the underlying hardware.

One critical dimension of this challenge is SIMD (Single Instruction, Multiple Data) vectorization. SIMD instructions drive performance in everything from numerical simulations and media processing to deep learning inference and signal processing. However, vector portability – ensuring optimized SIMD code runs efficiently on x86, ARM v9, RISC-V, and beyond – is far from trivial. In this blog, we explore why SIMD portability is difficult, what’s required to get it right, and how VaLVe helps solve this challenge.

Read more
Mar 14 2025

Why Porting Applications across Architectures isn’t simple

Developers are often faced with the challenge of porting applications across different platforms. Whether it’s transitioning from x86 to ARM or RISC-V CPUs, migrating workloads to AI accelerators of novel architectures, the process is rarely straightforward. While porting may seem like a simple matter of recompilation or minor code adjustments, the reality is far more complex, especially in the HPC domain.

From architectural differences to performance bottlenecks, porting applications require deep expertise in hardware architecture, system integration, performance optimization, and platform-specific tuning. This blog explores why porting applications is more than just a “lift-and-shift” task and how overcoming its challenges demands a structured, expertise-driven approach.

Read more