Posts on AI

Intro to Quantization

Published: June 20, 2024

todo umer

Making OpenAI Triton easier 🔱 😊

Published: June 12, 2024

I find writing triton kernels involves many repetitive tasks, that can be cleanly abstracted away. This allows to write triton code much more in line with how I actually think. It’s way more fun, and less mentally draining.

We’re collecting world-class triton kernels

Published: June 12, 2024

You’re an AI researcher. You try different things, so you need different gpu kernels to make those things be fast (so you have shorter iteration cycles). But maybe other researchers have already written kernels for some parts of your ideas? And you’re no gpu expert yet, so where can you see examples of really good kernels to learn from?

A Practitioner’s Guide to Triton

Published: April 20, 2024

For the cuda mode community, I gave a lecture titled “A Practitioner’s Guide to Triton”. My goal was to give the best possible intro to Triton, which to me means:

tell you when you’d use it, and when to use something else
clearly state what knowledge I assume and introduce everything else
teach by practically working through examples, only adding a manageable amount of complexity each time
guide you to places where you can learn more

Umer Adil

Posts on AI

Intro to Quantization

Making OpenAI Triton easier 🔱 😊

We’re collecting world-class triton kernels

A Practitioner’s Guide to Triton