Show HN: Andrej Karpathy’s microgpt.py to C99 microgpt.c – 4,600x faster

via news.ycombinator.com

Short excerpt below. Read at the original source.

Andrej Karpathy showed us the GPT algorithm. I wanted to see the hardware limit. The Punchline: I made it go 4,600x faster in pure C code, no dependencies and using a compiler with SIMD auto-vectorisation!!! Andrej recently released microgpt.py – a brilliant, atomic look at the core of a GPT. As a low-latency developer, I […]

Read at Source