Run a 1T-parameter model on a 32 GB Mac by streaming tensors from NVMe

via github.com

Article URL: https://github.com/t8/hypura
Comments URL: https://news.ycombinator.com/item?id=47504695
Points: 6
Comments: 1
