GitHub - SharpAI/SwiftLM: ⚡ Native MLX Swift LLM inference server for Apple Silicon. OpenAI-compatible API, SSD streaming for 100B+ MoE models, TurboQuant KV cache compression, + iOS iPhone app.

github.com