Tux Machines

Machine Learning in Linux: Speech Note

Posted by Rianne Schestowitz on Sep 13, 2023

=> LMDE 6 “Faye” – BETA Release | Linux Kernel 6.4 Reaches End of Life, Upgrade to Linux Kernel 6.5 Now

=> ↺ speech neural

Speech Note is a GUI frontend for various processing engines. For Speech to Text it uses Coqui STT, Vosk, and Whisper. Whisper is our highest rated speech recognition tool and features in our award-winning Top 100 CLI apps study. It’s that good. Coqui STT is also highly recommended although it’s no longer actively maintained.

For Text to Speech, Speech Note uses espeak-ng, MBROLA, Piper, RHVoice, and Coqui TTS. And the machine translation is handled by Bergamot Translator.

This is free and open source software written in C++.

Read on

=> ↺ Read on

=> gemini.tuxmachines.org

Proxy Information
Original URL
gemini://gemini.tuxmachines.org/n/2023/09/13/Machine_Learning_in_Linux_Speech_Note.gmi
Status Code
Success (20)
Meta
text/gemini;lang=en-GB
Capsule Response Time
140.373907 milliseconds
Gemini-to-HTML Time
0.463637 milliseconds

This content has been proxied by September (ba2dc).