Cool Link
feat: TQ4_1S weight compression (Metal only, needs CUDA port) by TheTom · Pull Request #45 · TheTom/llama-cpp-turboquant · GitHub

github.com/TheTom/llama-cpp-turboquant/pull/45