#1 Programming on GPUs from scratch by implementing CUDA Kernels in C++, CuPy Python and OpenAI Triton.
Simply amazing ❤️. Without even knowing it, I've been waiting for this series my whole life.
Thanks a ton, happy to hear that 🙏!
if the feedback is good on this one and I can manage it on time, I'm considering doing an applied video lecture.
Excellent please continue your CUDA series
Thanks, happy you found it useful! Surely, will do - more GPU insights and deep dives coming once I get a time slot. In the meanwhile, I think you'll like Dennis's articles on CUDA and Tensor Parallelism from scratch:
https://substack.com/@dkennetz
Simply amazing ❤️. Without even knowing it, I've been waiting for this series my whole life.
Thanks a ton, happy to hear that 🙏!
if the feedback is good on this one and I can manage it on time, I'm considering doing an applied video lecture.
Excellent please continue your CUDA series
Thanks, happy you found it useful! Surely, will do - more GPU insights and deep dives coming once I get a time slot. In the meanwhile, I think you'll like Dennis's articles on CUDA and Tensor Parallelism from scratch:
https://substack.com/@dkennetz