Discussion
Search code, repositories, users, issues, pull requests...
stingraycharles: I’m a bit confused by what you’re offering. Is it a voice assistant / AI as described on your GitHub? Or is it more general purpose / LLM ?How does the RAG fit in, a voice-to-RAG seems a bit random as a feature?I don’t mean to come across as dismissive, I’m genuinely confused as to what you’re offering.
drcongo: I came to the comments here to see if anyone had worked out what it is, so you're not alone.
Tacite: Doesn't work. " zsh: segmentation fault rcli"
DetroitThrow: Wow, this is such a cool tool, and love the blog post. Latency is killer in the STT-LLM-TTS pipeline.Before I install, is there any telemetry enabled here or is this entirely local by default?
shubham2802: Fully local - no data is collected!!
esafak: You could share your setup details, on GH if not here, to make it actionable.
alfanick: I'm not looking for STT->AI->TTS, I'm looking for truly good voice-to-text experience* on Linux (and others). Siri/iOS-Dictation is truly good when it comes to understanding the speech. Something this level on Linux (and others) would be great, yeah always listening, maybe sending the data somewhere, but give me UX - hidden latency, optimizing for first chars recognized - a good (virtual) input device.
Imustaskforhelp: To be neutral, I am just gonna link the stats of this hackernews post[0] and let public decide the rest because for context, this is same company which was mentioned in a blow-up post 12 days ago which had gotten 600 upvotes and they didn't respond back then[0]I was curious so I did some more research within the company to find more shady stuff going on like intentionally buying new domains a month prior to send that spam to not have the mail reputation of their website down. You can read my comment here[1]Just to be on the safe side here, @dang (yes pinging doesn't work but still), can you give us some average stats of who are the people who upvoted this and an internal investigation if botting was done. I can be wrong about it and I don't ever mean to harm any company but I can't in good faith understand this. Some statsSome stats I would want are: Average Karma/Words written/Date of the accounts who upvoted this post. I'd also like to know what the conclusion of internal investigation (might be) if one takes place.[There is a bit of conflicts of interest with this being a YC product but I think that I trust hackernews moderator and dang to do what's right yeah]I am just skeptical, that's all, and this is my opinion. I just want to provide some historical context into this company and I hope that I am not extrapolating too much.It's just strange to me, that's all.[0]: https://news.social-protocols.org/stats?id=47326101 (see the expected upvotes vs real upvotes and the context of this app and negative reception and everything combined)[1]: Tell HN: YC companies scrape GitHub activity, send spam emails to users: https://news.ycombinator.com/item?id=47163885[2]:https://news.ycombinator.com/reply?id=47165788
bigyabai: Don't give RunAnywhere your GitHub: https://news.ycombinator.com/item?id=47163885
john_strinlai: i knew i recognized this name from somewhere.they are a company that registers domains similar to their main one, and then uses those domains to spam people they scrape off of github without affecting their main domain reputation.edit: here is the post https://news.ycombinator.com/item?id=47163885
Imustaskforhelp: Yup. The most crazy aspect was that they had bought the domain intentionally (just 1 month prior) that whole fiasco.Maybe its just (n=2) that only we both remember this fiasco but I don't agree with that. I don't really understand how this got so so many upvotes in short frame of time especially given its history of not doing good things to say the very least... I am especially skeptical of it.Thoughts?
john_strinlai: i unfortunately dont know enough about vote patterns on hn, or what is expected/normal voting behavior.what i do know is that their name is etched into my mind under the category of "shady, never do business with them".
dsalzman: Congrats on your future acquihire!
josuediaz: This is really innovative stuff! I cant wait to see how this technology will evolve
tiku: Personally I'm so disappointed about the state of local AI. Only old models run "decent" but decent is way to slow to be usable.
Imustaskforhelp: I was writing my initial comment and I had no mention to the voting behaviour until I accidentally reloaded or something to find the upvote rise by a decent amount. Then I got suspicious and then I reloaded again to see like in 20seconds or < 1 minute and saw the vote rise so much (read my other comment)I was writing the comment at time of 18 upvotes and then it went to 24 upvote all of a sudden that I had gone suspicious.see at 2026-03-10T17:38-39:00Z timeframe within this particular graph(0)(0):https://news.social-protocols.org/stats?id=47326101
john_strinlai: josuediaz registered 4 minutes agoiharnoor 1 karma, 1 comment, in this thread.two posts pointing out their extremely unethical spam behavior both shot down to the very bottom of the post. apparently suspicious voting behavior.what the hell is going on?
Imustaskforhelp: Yeah I am wondering the same thing.I was gonna comment about this guy and iharnoor which is 7 month old account who literally only said "lets go" hereThis sort of makes me even more suspicious john especially iharnoorI wasn't responding because I was making archive link of all of this so that even messages deleted can have some basis of confirmation.
jonhohle: If I send a Portfile patch, would you consider MacPorts distribution?
coder543: > Siri/iOS-Dictation is truly good when it comes to understanding the speech.What...? It is terrible, even compared to Whisper Tiny, which was released years ago under an Apache 2.0 license so Apple could have adopted it instantly and integrated it into their devices. The bigger Whisper models are far better, and Parakeet TDT V2 (English) / V3 (Multilingual) are quite impressive and very fast.I have no idea what would make someone say that iOS dictation is good at understanding speech... it is so bad.For a company that talks so much about accessibility, it is baffling to me that Apple continues to ship such poor quality speech to text with their devices.
david_shaw: I think the title should read "RunAnywhere," not "RunAnwhere."
Imustaskforhelp: Dang has changed the title and it seems that he may have had a minor error doing it . Must have been a typo from his side changing it and that's okay! I think that Dang will update it sooner than later.Edit: just reloaded, its fixed now.
dang: The upvotes on the current post are fine - the reason you saw the submission rise in rank is that startup launch posts by YC startups get special placement on the front page (this is in the FAQ: https://news.ycombinator.com/newsfaq.html). Not every such post does, but some do.In other words, your perception wasn't wrong, but the interpretation was off. I've put "Launch HN" and "YC W26" back in the title to make that clearer - I edited them out earlier, which was my mistake.As for the booster comments, those are pretty common on launch threads and often pretty innocent - most people who aren't active HN users have no idea that it's against the rules. We do our best to communicate about that, but it's not a cardinal sin—there are far worse offenses.
Imustaskforhelp: Thanks dang but can you please explain there being two accounts who wrote something very small comment and one account being completely new and the other being 7 months old only being invoked in this case.Clearly I am not the only one here as john_strinlai here seems to have had somewhat of the same conclusion as me.Dang I know you care about this community so can you please talk more what you think about this in particular as well.I understand that YC companies get preferential treatment, Fine by me. But this feels something larger to meI have written everything that I could find in this thread from the same post being shown here 3 days ago in anywhere.ai link to now changing to github to skirt off HN rule that same link can't be posted in short period of time and everything.This feels somewhat intentional just like the spam issue, I hope you understand what I mean.(If you also feel suspicious, Can you then do a basic analysis/investigiation with all of these suspicious points in mind and everything please as well and upload the results in an anonymous way if possible?)I wish you to have a nice day and waiting for your thoughts on all of this.
john_strinlai: hi dang. while you are here -- are comments artificially ordered on this post?https://news.ycombinator.com/item?id=47326953 is grey (i.e <=0 karma). my top-level comment is at 14 karma. we posted within 15 minutes of each other. ive never seen something like that before.the two posts calling out unethical behavior have been living at the bottom of this post the entire time, until a couple of actually [flagged] comments ended up under them.(edit: 15 karma now, and still lower than a 0 karma comment posted at roughly the same time)
j45: "Apple M3 or later required. MetalRT uses Metal 3.1 GPU features available on M3, M3 Pro, M3 Max, M4, and later chips. M1/M2 support is coming soon. On M1/M2, RCLI automatically falls back to the open-source llama.cpp engine."
Tacite: Funny you mention that because on their github they just pushed an update to say that it didn't work M3 and M4.
Imustaskforhelp: Adding onto it, My comments are also ranked low. This comment on which dang replied has 4 upvotes which I think that this is at the 4th last of this post and the other comment that I made on your comment where I responded to ya has 3 upvotes.
RationPhantoms: This doesn't work on any of the methods I've tried.
jawns: Based on the demo video, the TTS sounds like it's 10 years out of date. I would not enjoy interacting with it.
pzo: FWIW this RCLI is only MIT license but their engine MetalRT is commercial. Not sure the license of their models I guess also not MIT. So IMHO this repo is misleading.Not sure why they decided to reinvent the wheel and write yet another ML engine (MetalRT) which is proprietary. I would most likely bet on CoreML since it have support for ANE (apple NPU) or MLX.Other popular repos for such tasks I would recommend:https://github.com/FluidInference/FluidAudiohttps://github.com/DePasqualeOrg/mlx-swift-audiohttps://github.com/Blaizzy/mlx-audiohttps://github.com/k2-fsa/sherpa-onnx
antipaul: Nice list.What about for on-device RAG use cases?
AmanSwar: Its kokoro TTS not ours, we have range of options.
shubham2802: Just need some few days to have our catalog of models out soon!!
fragmede: Terrible? It's fine. What's your accent that it's terrible? It even pulls last names from my address book and spells them right.
halostatue: You're welcome to add me as a co-maintainer on this if you submit it to macports/macports-ports: {macports.halostatue.ca:austin @halostatue} I maintain https://github.com/macports/macports-ports/blob/master/sysut... amongst other things regularly.