kforge

Autonomous kernel generation for PyTorch

Speed up your training and inference workloads without leaving PyTorch. kforge supports multiple backends, such as CUDA, ROCm, and Metal.

The kforge CLI generates optimized kernels for PyTorch code.

kforge supports hardware platforms from NVIDIA, Intel, Apple, and AMD.

kforge is produced by the team at gimletlabs.ai.
If you are interested in working with us, check out open positions at gimletlabs.ai/join_us