Tux Machines

Machine Learning in Linux: Speech Note

Posted by Rianne Schestowitz on Sep 13, 2023

Speech Note is a GUI frontend for various processing engines. For speech-to-text, it uses Coqui STT, Vosk, and Whisper. Whisper is our highest-rated speech recognition tool and features in our award-winning Top 100 CLI apps study. It’s that good. Coqui STT is also highly recommended, although it’s no longer actively maintained.

For text-to-speech, Speech Note uses espeak-ng, MBROLA, Piper, RHVoice, and Coqui TTS. Machine translation is handled by Bergamot Translator.

This is free and open source software written in C++.

Read on