TurboQuant model weight compression support added to llama.cpp
via github.com
Article URL: https://github.com/TheTom/llama-cpp-turboquant/pull/45
Comments URL: https://news.ycombinator.com/item?id=47637228
Points: 6
# Comments: 2