Show HN: Andrej Karpathy’s microgpt.py to C99 microgpt.c – 4,600x faster
via news.ycombinator.com
Short excerpt below. Read at the original source.
Andrej Karpathy showed us the GPT algorithm. I wanted to see the hardware limit. The Punchline: I made it go 4,600x faster in pure C code, no dependencies and using a compiler with SIMD auto-vectorisation!!! Andrej recently released microgpt.py – a brilliant, atomic look at the core of a GPT. As a low-latency developer, I […]