Discussion
TurboQuant Prompt → Diagram
COOLmanYT: no firefox support?
hhthrowaway1230: so multiple of these browser wasm demos make me re-download the models, can someone make a cdn for it or some sort u uberfast downloader? just throw some claude credits against it ty!
Rekindle8090: What? downloaded for me at 2gbps
hhthrowaway1230: Ah let me clarify, many of the in the browser demos make me download certain models even if I already have them It would be great if there was a way that I don't have to redownload them across demos so that I just have a cache. or an in browser model manager. hope this makes sense.Or indeed use some sort of huggingface model downloader (if that exist with XET)
hhthrowaway1230: also maybe a good usecase to finally have P2P web torrents :)
hhthrowaway1230: Yeah that's great but I'm in a cafe outside burning my phone data. ty!
embedding-shape: Adding a file input where users can upload files to the frontend directly from their file manager would probably work as a stop-gap measure, for the ones who want something quick that let people manage their own "cache" of model files.
varun_ch: I think this would sit best at the browser level. I’m not sure there’s a nice way for multiple websites to share a cache like that.
teamchong: firefox has webgpu already, but the subgroups extension isn't in yet. every matmul / softmax kernel here leans on subgroupShuffleXor for reductions, that's the blocker. same reason mlc webllm and friends don't run on firefox either. once mozilla ships it this should work