Cool Link
feat: TQ4_1S weight compression (Metal only, needs CUDA port) by TheTom · Pull Request #45 · TheTom/llama-cpp-turboquant · GitHub
github.com/TheTom/llama-cpp-turboquant/pull/45