Faintdreams, (edited) to linux
@Faintdreams@dice.camp avatar

[Edit: Thanks to everyone who answered.
I am now going to investigate Speech Note and Piper.]

Hello! I need a Linux text-to-speech program to help me proofread (proof-hear?) some fiction writing.

I'm rockin' Debian with KDE Plasma.

Thanks in advance for recommendations.

thorstenvoice,
@thorstenvoice@techhub.social avatar

@Faintdreams I can recommend using Piper. It offers over 35 languages. I made some tutorials about it on my channel.

https://youtu.be/rjq5eZoWWSo?si=pWB1gwpJhqNaHFtr

https://github.com/rhasspy/piper

Hope this helps you.
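
For proof-listening a manuscript, one way to drive Piper from a script is sketched below. It assumes the `piper` binary is on the PATH and that a voice model has already been downloaded; the `--model` and `--output_file` options follow the usage shown in the Piper README, while the model file name, the helper functions, and the output folder are only illustrative.

```python
import re
import subprocess
from pathlib import Path

# Assumption: a voice model downloaded beforehand; this file name is only an example.
MODEL = "en_US-lessac-medium.onnx"


def split_paragraphs(text: str) -> list[str]:
    """Split a manuscript on blank lines and collapse whitespace inside each paragraph."""
    chunks = re.split(r"\n\s*\n", text)
    return [" ".join(chunk.split()) for chunk in chunks if chunk.strip()]


def piper_command(model: str, wav_path: str) -> list[str]:
    """Build the piper call; the text to speak is passed on stdin."""
    return ["piper", "--model", model, "--output_file", wav_path]


def synthesize(text: str, out_dir: str = "proof_audio") -> list[Path]:
    """Write one WAV file per paragraph so an awkward sentence is easy to locate."""
    out = Path(out_dir)
    out.mkdir(exist_ok=True)
    written = []
    for number, paragraph in enumerate(split_paragraphs(text), start=1):
        wav = out / f"para_{number:03d}.wav"
        subprocess.run(
            piper_command(MODEL, str(wav)),
            input=paragraph.encode("utf-8"),
            check=True,  # raise if piper exits with an error
        )
        written.append(wav)
    return written
```

Calling `synthesize(Path("chapter1.txt").read_text())` (a hypothetical file name) would leave numbered WAV files in `proof_audio/` that any audio player can step through.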

pixelate, to accessibility
@pixelate@tweesecake.social avatar

Lol, folks. Listen to your article before you post it. Doesn't matter what voice. You'll catch things like this from macrumors.com: "In the app's settings (accessed via ChatPGT ➝ Settings… in the menu bar when the app's main window ..."

donwatkins, to random
@donwatkins@fosstodon.org avatar
chikim, to random
@chikim@mastodon.social avatar

I created samples for all 58 voices for xtts-v2. Hopefully it makes it easier for someone to choose a speaker. https://we.tl/t-9vWd1gO3EN

VE3RWJ, to AdobePhotoshop
@VE3RWJ@mastodon.radio avatar

ElevenLabs does not disappoint.

They've just released an app that will read whatever document you throw at it.

You can find out more here:

https://apps.apple.com/ca/app/elevenlabs-reader-ai-audio/id6479373050

jbzfn, to c64
@jbzfn@mastodon.social avatar

Making 80s Computers Talk | 1980s Commodore Speech Synthesizer | Kari

https://www.youtube.com/watch?v=1ip7K0CaC7Y

jedie, to fdroid German
@jedie@chaos.social avatar

Is there a better solution for […] with […] and the like?

https://www.kuketz-blog.de/android-sprachausgabe-tts-engine-bei-custom-roms-aktivieren/

Lists […] via […], but it doesn't support German yet.

[…] can't find the downloaded model file.

jedie, to android German
@jedie@chaos.social avatar

Is there anything new on this? Android: Enabling speech output (TTS engine) on custom ROMs • Kuketz IT-Security Blog
https://www.kuketz-blog.de/android-sprachausgabe-tts-engine-bei-custom-roms-aktivieren/

I can't get […] to run; it can't find the downloaded model file.

[…] still doesn't support German.

eeejay, to GNOME
@eeejay@mastodon.social avatar

A demo of a sample app. The voices used, in order, are eSpeakNG's "Andy" variant, MBROLA US2, and Piper's Amy. You can observe the differences in features such as word tracking and in quality.


skinnylatte, to accessibility
@skinnylatte@hachyderm.io avatar

Opening soon: 2 remote jobs for accessibility lead with the federal government. Open to US citizens and nationals only, with background check.

https://join.tts.gsa.gov/join/Solutions-Accessibility-Lead-April2024/

Info on how to write a federal resume: https://handbook.tts.gsa.gov/hiring-staying-or-changing-jobs/resume

pixelate, to accessibility
@pixelate@tweesecake.social avatar

Honestly, since the fast variants of the voices are a thing, I think I could really switch to the Sonata Neural Voices in NVDA full time. Now remember folks, these are AI voices. Scary, untrustworthy, AI voices that will smear your reputation all over fedi for using these voices! See, they even react to exclamation marks! Isn't that scary? :) Nah, the worst that'll happen, mainly with the HFC male and female, is that big numbers are garbled together. But every other voice does fine. I use Amy for work, and HFC for reading because those are among the most lively voices I've ever heard. And amazingly enough, we can make our own new voices. So, according to the GitHub repo's readme, some people are building more professional voices. And there are already versions of old TTS engines from the past that have been brought back to some semblance of life with this tech.

ErikJonker, to ai Dutch
@ErikJonker@mastodon.social avatar

Dangerous technology, they decided not to release it yet, but it will be a matter of time before it's available.
https://www.bloomberg.com/news/articles/2024-03-29/openai-previews-new-audio-tool-that-can-read-text-mimic-voices

ppatel, to Dragonlance
@ppatel@mstdn.social avatar

OpenAI debuts Voice Engine, which lets users generate a synthetic copy of a voice from a 15-second sample. It is available to around 100 partners, including HeyGen. In other words, it's not available to the public just yet.

https://techcrunch.com/2024/03/29/openai-custom-voice-engine-preview/

chikim, to ai
@chikim@mastodon.social avatar

Maybe we have an open source competitor for ElevenLabs? Check out their demo, in which they switch between original and synthesized audio. I can't tell them apart. lol Apparently they're going to fully open source the codebase and model weights. https://jasonppy.github.io/VoiceCraft_web/

KathyReid, to mastodon
@KathyReid@aus.social avatar

A warm welcome to @thorstenvoice - one of the best communicators about […] and […] in the world. His dataset is in use in many places.

Please make Thorsten welcome 👋

thorstenvoice,
@thorstenvoice@techhub.social avatar

Thanks @potungthul for your nice welcome 😊.

To clear up the hashtags a little bit:
Think of the components of a voice assistant / smartspeaker.

You need STT (speech-to-text) or ASR (automatic speech recognition) on the "input" side of a user request and TTS (text-to-speech) on the "output" side.

To throw in another technology: NLP (natural language processing) is used in the "middle" to really understand what the user request is all about.

cc: @KathyReid
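
The three stages described above (speech in, understanding in the middle, speech out) can be sketched in Python with stub components. Everything here is illustrative: the function names, the keyword-based intent matching, and the use of plain bytes in place of real audio are assumptions that stand in for actual STT/ASR, NLP, and TTS engines.

```python
from dataclasses import dataclass


@dataclass
class Intent:
    name: str
    reply: str


def speech_to_text(audio: bytes) -> str:
    """Input side (STT/ASR). Stub: pretend the audio bytes are already UTF-8 text."""
    return audio.decode("utf-8")


def understand(text: str) -> Intent:
    """Middle (NLP). Stub: naive keyword matching instead of a real language model."""
    lowered = text.lower()
    if "time" in lowered:
        return Intent("get_time", "It is twelve o'clock.")
    if "weather" in lowered:
        return Intent("get_weather", "It is sunny.")
    return Intent("unknown", "Sorry, I did not understand that.")


def text_to_speech(text: str) -> bytes:
    """Output side (TTS). Stub: return the reply as bytes instead of real audio."""
    return text.encode("utf-8")


def handle_request(audio: bytes) -> bytes:
    """Chain the three stages: STT/ASR -> NLP -> TTS."""
    transcript = speech_to_text(audio)
    intent = understand(transcript)
    return text_to_speech(intent.reply)
```

In a real assistant each stub would be replaced by an engine of its own, but the hand-off between the stages stays the same: text comes out of the first stage, an interpreted request out of the second, and audio out of the third.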

ranfdev, to random
@ranfdev@linuxrocks.online avatar

We need a dbus interface to get a system-wide Text To Speech provider, and Flatpak apps should be able to register themselves as TTS providers.

In GNOME Settings there should be an option to disable the current TTS provider, open its settings, or switch to another one, similar to how Android manages multiple keyboards, which you can install from the Play Store.

The same goes for Speech To Text. You should be able to install your favorite STT provider, with your preferred voice, from the store.

ranfdev,
@ranfdev@linuxrocks.online avatar

This requires integration from multiple GNOME components. I don't even know where to start, but I'd like to help.
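
As a starting point for discussion, the provider-switching behaviour proposed above could be modelled roughly like this in Python. This is not a D-Bus interface and not an existing GNOME API: the class and method names are hypothetical, and the sketch only illustrates the register, list, switch, and disable operations a real service would have to expose.

```python
from typing import Callable, Optional

# A provider is modelled as a function that turns text into audio bytes.
Provider = Callable[[str], bytes]


class TtsRegistry:
    """Hypothetical system-wide registry; a real one would live behind D-Bus."""

    def __init__(self) -> None:
        self._providers: dict[str, Provider] = {}
        self._active: Optional[str] = None

    def register(self, name: str, provider: Provider) -> None:
        """Called by an app (for example a Flatpak) to offer itself as a provider."""
        if not self._providers:
            self._active = name  # the very first provider becomes the default
        self._providers[name] = provider

    def available(self) -> list[str]:
        """Names a settings panel could show in its provider list."""
        return sorted(self._providers)

    def switch_to(self, name: str) -> None:
        if name not in self._providers:
            raise KeyError(f"unknown TTS provider: {name}")
        self._active = name

    def disable(self) -> None:
        """Turn speech output off without unregistering any provider."""
        self._active = None

    def speak(self, text: str) -> bytes:
        if self._active is None:
            raise RuntimeError("no TTS provider is enabled")
        return self._providers[self._active](text)
```

A settings panel would only need `available()`, `switch_to()`, and `disable()`, while applications would call `speak()` without knowing which engine is behind it.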

kaveinthran, to opensource

Has anyone tried the new TTS model metavoiceio/metavoice-1B-v0.1?
Context: MetaVoice has open-sourced a commercially permissive 1B base model for text-to-speech that supports voice cloning and emotional speech synthesis.
https://twitter.com/metavoiceio/status/1754983953193218193

niavy, to android French
@niavy@masto.bike avatar

OK. I need help…

My smartphone's ROM has no text-to-speech system ([…]). Geovelo prompts me to download... the […] system (Speech Recognition & Synthesis).

DON'T WANT TO. I no longer trust it.
Do you know of a free/libre TTS system for Android (for LineageOS, for example) that supports 🇨🇵🇬🇧 or even 🇪🇸🇵🇹?

Boosts VERY much appreciated 🔄.

kaveinthran, to history
accessibleandroid, to android
@accessibleandroid@mastodon.social avatar

We've just updated the list of languages with available TTS engines on Android, bringing the total number of supported languages to 83 with the new voices provided by the RHVoice and CerePlay TTS engines https://accessibleandroid.com/list-of-languages-with-available-tts-engines-on-android/

jomo, to android German
@jomo@mstdn.io avatar

I recently learned that OsmAnd has a "German (Casual)" TTS option that is less verbose than the normal one.

"Im Kreisel erste raus." ("First exit at the roundabout.") or "In 100 Metern links." ("Left in 100 meters.") is much more pleasant than being talked at nonstop.

cantences,

@twomikecharlie
Are you talking about a pre-installed engine?

britt, to disability
@britt@mstdn.games avatar

Hey Mastodon…

What text to speech readers do you find to be the most comprehensive and most worth your money?

I’ve used the free version of Speechify for years but I don’t use it all the time due to limits in reading time/speed and the voices.

I would like to use a text to speech reader in browser, on iOS, for books and textbooks.

modulux, to ai EN

Another interesting system. I need to look closer into it in order to see if it's a voice cloning approach or something else: https://github.com/yl4579/StyleTTS2

tkk13909, to RickAndMorty
@tkk13909@fosstodon.org avatar

@JoeRess I was NOT prepared to hear Morty start talking in your voice in a Code Bullet video! Timestamp: 10:02
https://youtu.be/g39AagVW0s0
