Run a 1T parameter model on a 32gb Mac by streaming tensors from NVMe
via github.com
Short excerpt below. Read at the original source.
Article URL: https://github.com/t8/hypura Comments URL: https://news.ycombinator.com/item?id=47504695 Points: 6 # Comments: 1