Samples of good AI-generated CUDA kernels from our blog post: 🔗 https://crfm.stanford.edu/2025/05/28/fast-kernels.html 🔗 https://scalingintelligence.stanford.edu/blogs/fastkernels/
For each problem, the reference implementation is in ref.py and the generated kernel is in src.py.
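To browse the samples programmatically, something like the following sketch may help. It assumes that each problem lives in its own directory holding both ref.py and src.py; that layout is an assumption, and `find_problems` is a hypothetical helper name, not part of this repository.

```python
from pathlib import Path


def find_problems(root):
    """Return sorted directories under `root` that contain both ref.py and src.py.

    Assumption: one directory per problem. Adjust if the layout differs.
    """
    root = Path(root)
    return sorted(
        ref.parent
        for ref in root.rglob("ref.py")        # every reference file, at any depth
        if (ref.parent / "src.py").is_file()   # keep it only if a generated kernel sits beside it
    )


if __name__ == "__main__":
    # List each problem directory found under the current directory.
    for problem in find_problems("."):
        print(problem)
```

This only pairs up the two files by location. It does not import or run either one, so it works without a GPU or a CUDA toolchain.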