Posts

mush42, to rust
@mush42@hachyderm.io avatar

👋 Career change alert!

Looking to pivot into tech & leverage my 10+ years of programming experience

🐍 Python
🦀 Rust
</> Web Development
🌐 CMS: WordPress & Wagtail
✨ Machine Learning: Torch & TensorFlow

My passion for code shines through my open-source projects! Check them out:
https://github.com/mush42
https://github.com/blindpandas

mush42, to ai
@mush42@hachyderm.io avatar

Just a random thought.
As generative AI is being used to create a lot of content these days, what happens when the next generation of AI models is trained on that content?
When AI models are trained on AI-generated content, we'll officially enter the 8th circle of hell.

mush42, to random
@mush42@hachyderm.io avatar

The second beta of sonata-for-NVDA v3.0 is out:
https://github.com/mush42/sonata-nvda/releases/tag/v3.0-beta.2

mush42, to random
@mush42@hachyderm.io avatar

New beta release of Sonata-for-NVDA, formerly known as Piper-for-NVDA:
• NVDA 2024.1 compatibility
• Support for fast variants of Piper voices. These fast variants improve responsiveness significantly because they use streaming synthesis
• Improvements to responsiveness and speed across the board
Release page:
https://github.com/mush42/sonata-nvda/releases/tag/v3.0-beta.1
Direct download link:
https://github.com/mush42/sonata-nvda/releases/download/v3.0-beta.1/sonata_neural_voices-3.0-beta.nvda-addon
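
To illustrate why streaming synthesis helps responsiveness, here is a minimal sketch of the general idea. It is not Sonata's or Piper's actual code: `fake_synthesize`, `synthesize_all`, and `synthesize_streaming` are hypothetical names, and the "model" is a stand-in that just produces one silent byte per character. The point is only that a streaming synthesizer can hand over the first chunk of audio before the rest of the text has been processed.

```python
from typing import Iterator


def fake_synthesize(sentence: str) -> bytes:
    """Hypothetical stand-in for a TTS model: one silent byte per character."""
    return bytes(len(sentence))


def synthesize_all(sentences: list[str]) -> bytes:
    """Non-streaming: no audio is available until every sentence is done."""
    return b"".join(fake_synthesize(s) for s in sentences)


def synthesize_streaming(sentences: list[str]) -> Iterator[bytes]:
    """Streaming: yield each chunk as soon as it has been synthesized."""
    for sentence in sentences:
        yield fake_synthesize(sentence)


sentences = ["Hello.", "How are you today?"]

# With streaming, playback could begin after only the first sentence is processed.
first_chunk = next(synthesize_streaming(sentences))

# Both approaches produce the same audio in the end.
full_audio = synthesize_all(sentences)
streamed_audio = b"".join(synthesize_streaming(sentences))
```

In a real screen reader add-on the chunks would be passed to the audio player as they arrive, so the time to first sound depends on the first chunk rather than on the whole utterance.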

mush42,
@mush42@hachyderm.io avatar

Changed the name to Sonata since we plan to support additional TTS models besides Piper in the future.

mush42,
@mush42@hachyderm.io avatar

Important notice!!!
After installing this version, you will lose all of your installed voices. Please use the voice manager to reinstall them.

mush42, to random
@mush42@hachyderm.io avatar

I think an open-source auto tagger for PDFs is very possible.
It will make it easier to convert PDFs to highly structured HTML documents.

Anyone interested in tackling this challenge with me?

Adobe already took the lead:
https://news.adobe.com/news/news-details/2023/Media-Alert-Adobe-Scales-PDF-Accessibility-With-Adobe-Sensei-AI/default.aspx

scruss,
@scruss@xoxo.zone avatar

@mush42 Knowing Adobe, it will be for a very limited subset of PDFs produced by their own software. Never trust a company that has two incompatible standards for managing form data ...

(I used to be a prepress programmer, so I've experienced a whole load of really terrible PDFs. At best, they're digital marks on paper. I also remember the whole "tagged PDF" thing from the early 2000s)

mush42,
@mush42@hachyderm.io avatar

@scruss
For this project, I'd forgo parsing the PDF stream at first, and extract the semantic structure from a visual rendition instead. Then I'd use this semantic metadata to parse the PDF stream and extract the text.
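
A rough sketch of that two-stage idea, under loud assumptions: the layout regions would come from some visual model run on the rendered page, and the text spans from a PDF parser. Neither is implemented here. `Region`, `Span`, and `tag_page` are hypothetical names for illustration, regions are assumed not to overlap, and coordinates are assumed to have their origin at the top-left of the page.

```python
from dataclasses import dataclass
from html import escape


@dataclass
class Region:
    """A layout region detected on the rendered page (hypothetical detector output)."""
    tag: str  # HTML tag to emit for this region, e.g. "h1" or "p"
    x0: float
    y0: float
    x1: float
    y1: float


@dataclass
class Span:
    """A text run taken from the PDF stream, with its position on the page."""
    text: str
    x: float
    y: float


def contains(region: Region, span: Span) -> bool:
    """True if the span's anchor point lies inside the region's bounding box."""
    return region.x0 <= span.x <= region.x1 and region.y0 <= span.y <= region.y1


def tag_page(regions: list[Region], spans: list[Span]) -> str:
    """Collect the spans inside each region and wrap them in the region's tag."""
    parts = []
    # Visit regions in reading order: top to bottom, then left to right.
    for region in sorted(regions, key=lambda r: (r.y0, r.x0)):
        inside = sorted(
            (s for s in spans if contains(region, s)),
            key=lambda s: (s.y, s.x),
        )
        if inside:
            text = " ".join(s.text for s in inside)
            parts.append(f"<{region.tag}>{escape(text)}</{region.tag}>")
    return "\n".join(parts)


regions = [Region("p", 50, 100, 500, 300), Region("h1", 50, 20, 500, 60)]
spans = [Span("world", 120, 40), Span("Hello", 60, 40), Span("Body & text", 60, 120)]
html_output = tag_page(regions, spans)
```

The hard part, detecting the regions and their roles from the page image, is exactly what this sketch leaves out. It only shows how visual metadata could drive the text extraction step.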

mush42, to rust
@mush42@hachyderm.io avatar

As a visually impaired dev, my favorite sentence to hear is this:
Finished dev [unoptimized + debuginfo] target(s) in x.ys

ekuber,
@ekuber@hachyderm.io avatar

@mush42 I would love to hear about your experience, what tooling you use and what we could do to improve things

mush42,
@mush42@hachyderm.io avatar

@ekuber
For me, Rust is one of the most pleasant languages to work with.
I like the fully integrated DX, the extremely helpful compiler, and the command-line-first tooling.
IMO the formatting of compiler error messages involving complex type signatures needs to be improved. Since some types are elided, I usually need to go through the message several times and go back to the source to find out what has been elided. This is because I cannot do side-by-side viewing.

mush42, to random
@mush42@hachyderm.io avatar

A new version of Piper for NVDA is out.

What's new in v2.0

• This version introduces a separate process for the TTS. This significantly increases responsiveness
• Installing voices is now easier using the integrated voice manager, which allows you to preview and download available voices
• The TTS no longer introduces unnatural pauses at the end of the line during say all
• and many other improvements and bug fixes

Detailed release notes and download link:
https://github.com/mush42/piper-nvda/releases/tag/v2.0-beta2

ppatel,
@ppatel@mstdn.social avatar

@mush42 No. But Windows identifies it as such.

ppatel,
@ppatel@mstdn.social avatar

@mush42 Thanks for pointing me to a possible problem here. It looks like there's no way to change what the system identifies even if it's false. I appreciate the response.

mush42, to random
@mush42@hachyderm.io avatar

@accessibleandroid Please help!

After I updated Speech Recognition & Synthesis by Google to the latest version, I lost the ability to download voices.
Any suggestions?

accessibleandroid,
@accessibleandroid@mastodon.social avatar

@mush42 Are you trying to download the voice data for a specific language?

mush42,
@mush42@hachyderm.io avatar

@accessibleandroid no, all the ones I tried fail. When I press the download button, nothing happens, and TalkBack keeps losing focus and focusing the button.

mush42, to random
@mush42@hachyderm.io avatar

For those who use the Piper voices add-on for NVDA: I've just added an integrated "Voice Manager" where you can install new voices and manage installed ones. So, no manual download, no TAR archives 🤗

A new version with the voice manager will be released later this month. Stay tuned!

mush42, to random
@mush42@hachyderm.io avatar

New neural voices for NVDA are available.

Piper for NVDA:
https://github.com/mush42/piper-nvda

Marco,

@devinprater I got it working. And the German voices are funny. They all have problems pronouncing the soft ch sound as in "ich" or "echt", a sound very similar to the h sound in the English word "human". The way they speak it, they sound more like a Danish or Swedish person who had German as a second or third language in school. They also swallow some of the consonants we usually pronounce prominently. The only language in northern Europe that prominently does this is Danish. Or some Bri'ish accents where they got rid of all the ts. All the variants of German show the same problems, which is interesting. @KaraLG84 @mush42
