Step-by-step tutorial for adding new CUDA kernels to FlashInfer
npx ai-builder add skill flashinfer-ai/add-cuda-kernelGuide for benchmarking FlashInfer kernels with CUPTI timing
npx ai-builder add skill flashinfer-ai/benchmark-kernelTutorial for debugging CUDA crashes using API logging
npx ai-builder add skill flashinfer-ai/debug-cuda-crash