TurboQuant model weight compression support added to Llamacpp

via github.com

Short excerpt below. Read at the original source.

Article URL: https://github.com/TheTom/llama-cpp-turboquant/pull/45 Comments URL: https://news.ycombinator.com/item?id=47637228 Points: 6 # Comments: 2

Read at Source