StringWa.rs on GPUs: Databases & Bioinformatics 🦠
I’ve just shipped StringZilla v4, the first CUDA-capable release of my SIMD-first string processing library. Which in English means that it is now fast not only on CPUs, but also on GPUs! I’ve wanted to add ROCm-acceleration for AMD GPUs 🤦♂️ I’ve wanted to include a parallel multi-pattern search algorithm 🤦♂️ I’ve wanted to publish it back in December 2024, not September 2025 🤦♂️ So not everything went to plan, but “StringZilla 4 CUDA” is finally here, bringing 500+ GigaCUPS of edit-distance calculations in a free pip install-able package, and a few more tricks up its sleeve, aimed at large-scale Information Retrieval, Databases and Datalake systems, as well as Bioinformatics workloads. So in this post, we’ll cover some of the most interesting parts of this release, including: ...