Toot

Written by Kathy Reid on 2025-01-07 at 23:11

@Crow The short answer is "sort of".

If we break down Dragon Naturally Speaking (which is now Nuance, which was purchased by Microsoft in April 2021 for $16 billion, so it's Microsoft), it has two broad functions.

One is speech recognition, for which there are freely available but not really open source options, primarily Whisper.

The second is controlling the desktop through speech, where the main one I'm aware of is Linux Voice Control, although I haven't used it. I am fairly sure it uses Whisper under the hood, as evidenced by

https://github.com/omegaui/linux-voice-control/blob/c9238684d0dee6a42befb2034b4e73e007328a9d/requirements.txt#L9

https://github.com/omegaui/linux-voice-control

Happy to answer more Q's and thanks for the flag, @nnye !

=> More informations about this toot | View the thread | More toots from KathyReid@aus.social

Mentions

=> View Crow@pagan.plus profile | View nnye@aus.social profile

Tags

Proxy Information
Original URL
gemini://mastogem.picasoft.net/toot/113789597209444539
Status Code
Success (20)
Meta
text/gemini
Capsule Response Time
245.472807 milliseconds
Gemini-to-HTML Time
0.810793 milliseconds

This content has been proxied by September (3851b).