mush42

@mush42@hachyderm.io

Developer of Bookworm and Sonata Speech Engine.
A proud programmer for the better part of my life.
Other interests include: machine learning, accessibility, entrepreneurship, books and music...

This profile is from a federated server and may be incomplete. Browse more on the original instance.

krys, 5 days ago to random

If your profile boasts this flag 🇵🇸 without this 🇮🇱 next to it - as an anarchist I use neither - you are not advocating for “peace”.

You are advocating for the eradication of a democratic state that - like all democracies - is under constant and recently rather successful attacks by extremist fucks, but still has a lot of good going for it, and replacing it with yet another caliphate state run by a barbaric terror cult.

Am Yisrael chai.
Free Gaza from Hamas.

+ shekinahcancook, ixi

mush42, 5 days ago

@krys
Democratic state?!
Seriously?!
Israel is being ruled by a corrupted shithead, supported by a cult of extremist religious maniacs.

Tusky, 9 days ago to random

‼️ Big announcement!!!

✅ Tusky 25.0 is officially out, in F-Droid and Play Store 🎉 🎉 🎉

These are the main updates to this new version:

➡️ Support Mastodon translation API
➡️ Show post language
➡️ Improved screen transitions
➡️ Filter settings now moved to account preferences
➡️ Post stats now have a stable position
➡️ A lot of under-the-hood stability & performance improvements.

❗ Also, #Tusky 25.0 app is smaller in size despite having more features.

+ falken, fmo, ParadeGrotesque, RL_Dane +16 more

mush42, 9 days ago

@Tusky
I love Tusky. Well designed app, great UX.

mush42, 16 days ago to rust

👋 Career change alert!

Looking to pivot into tech & leverage my 10+ years of programming experience

🐍 Python
🦀 Rust
</> Web Development
🌐 CMS: WordPress & Wagtail
✨ Machine Learning: Torch & Tensorflow

My passion for code shines through my open-source projects! Check them out:
https://github.com/mush42
https://github.com/blindpandas

#rust #python #machinelearning #careeradvice #opentowork

+ datajake1999, objectinspace

Tusky, 16 days ago (edited 16 days ago) to random

It has been nearly a week since we launched the Tusky 25.0 Beta. So far, we have encountered 3 issues in it:

➡️ crash when adding too many accounts

➡️ scheduling a status creates way too many

➡️ crash when upgrading from 24 with an Akkoma account

All of them are going to be solved soon. But we want to know: Have you encountered any other problem with the beta? Please tell us!

+ chillicampari, MagicLike

mush42, 16 days ago

@Tusky
When moving the tabs from top to bottom, the account button is no longer labeled for TalkBack.

mush42, 23 days ago to ai

Just a random thought.
As Generative AI is being used for creating a lot of content these days, what happens when the next generation of AI models is trained on that content?
When AI models are trained on AI-generated content, we'll officially enter the 8th circle of hell.
#ai #generativeai #AItransparency #discussion

+ jaybird110127

mush42, 23 days ago to random

The second beta of sonata-for-NVDA v3.0 is out:
https://github.com/mush42/sonata-nvda/releases/tag/v3.0-beta.2

+ datajake1999, jaybird110127

mush42, 24 days ago to random

New beta release of Sonata-for-NVDA, formerly known as Piper-for-NVDA:
• NVDA 2024.1 compatibility
• Support for fast variants of Piper voices. These fast variants improve responsiveness significantly because they use streaming synthesis
• Improvements to responsiveness and speed across the board
Release page:
https://github.com/mush42/sonata-nvda/releases/tag/v3.0-beta.1
Direct download link:
https://github.com/mush42/sonata-nvda/releases/download/v3.0-beta.1/sonata_neural_voices-3.0-beta.nvda-addon

+ datajake1999, jaybird110127, ppatel

mush42, 24 days ago

Changed the name to Sonata since we plan to support additional TTS models besides Piper in the future.

+ ppatel

mush42, 24 days ago

Important notice!!!
After installing this version, you will lose all of your installed voices. Please use the voice manager to reinstall them.

+ ppatel

mush42, 23 days ago

@ppatel @twynn
Which voices?
Custom voices or the ones downloaded from the voice manager?
I'd appreciate it if you can provide NVDA logs.

mush42, 22 days ago

@ppatel
It seems like the server is not running.
Are you running NVDA on a 32-bit or ARM64 machine? Sonata only works on 64-bit versions of Windows.
Otherwise, check if the server generated any logs in the following file path:
[NVDA config directory]\sonata\logs\sonata-grpc.log
If not, try running the following binary from a cmd window and report the output:
[NVDA config directory]\addons\sonata_neural_voices\synthDrivers\sonata_neural_voices\bin\sonata-grpc.exe
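
The checks above could be scripted roughly as follows. This is only a sketch: the helper names are hypothetical, the NVDA config directory must be passed in by the user (it is not guessed here), and the value reported by `platform.machine()` may not reflect the real hardware when Python itself runs under emulation.

```python
import platform
from pathlib import Path


def sonata_log_path(config_dir: str) -> Path:
    """Build the server log path named in the post, relative to the given config dir."""
    return Path(config_dir) / "sonata" / "logs" / "sonata-grpc.log"


def sonata_server_path(config_dir: str) -> Path:
    """Build the server binary path named in the post, relative to the given config dir."""
    return (
        Path(config_dir)
        / "addons"
        / "sonata_neural_voices"
        / "synthDrivers"
        / "sonata_neural_voices"
        / "bin"
        / "sonata-grpc.exe"
    )


def diagnose(config_dir: str) -> dict:
    """Collect the facts asked for in the post (hypothetical helper, not part of Sonata)."""
    return {
        # Typically "AMD64" on 64-bit Intel/AMD Windows; other values hint at 32-bit or ARM.
        "machine": platform.machine(),
        "log_exists": sonata_log_path(config_dir).is_file(),
        "server_binary_exists": sonata_server_path(config_dir).is_file(),
    }


if __name__ == "__main__":
    import sys

    # Usage: python diagnose_sonata.py "<your NVDA config directory>"
    if len(sys.argv) == 2:
        for key, value in diagnose(sys.argv[1]).items():
            print(f"{key}: {value}")
```

If `log_exists` is false and `server_binary_exists` is true, running the binary from a cmd window as described above is the next step.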

mush42, 22 days ago

@ppatel
Maybe send me the NVDA log so I can diagnose why the TTS server isn't running.

accessibleandroid, 3 months ago to android

New article on Accessible Android: Sorry Google, but Geo-Restricting TalkBack’s Beta Testing Is Not the Right Decision https://accessibleandroid.com/sorry-google-but-geo-restricting-talkbacks-beta-testing-is-not-the-right-decision/ #Android #Google #TalkBack

mush42, 3 months ago

@accessibleandroid I hate that. I cannot see any reason for it.
TalkBack should not be a product as such, it should be a system service.

+ accessibleandroid

mush42, 5 months ago to random

I think an open-source auto tagger for PDFs is very possible.
It will make it easier to convert PDFs to highly structured HTML documents.

Anyone interested in tackling this challenge with me?

Adobe already took the lead:
https://news.adobe.com/news/news-details/2023/Media-Alert-Adobe-Scales-PDF-Accessibility-With-Adobe-Sensei-AI/default.aspx

+ devinprater

mush42, 5 months ago

@scruss
For this project, I'd forgo parsing the PDF stream at first, and extract the semantic structure from a visual rendition. Then I'd use this semantic metadata to parse the PDF stream and extract the text.

+ devinprater

devinprater, 5 months ago to windows

So, who knew you could open a new Word document on Windows with Control + Alt + Windows + Shift + W? Why in the world? Ugh there just needs to be a "show me all the stupid Windows hotkeys that no one knows about" command.

#windows #accessibility #blind

+ simon

mush42, 5 months ago

@devinprater
I managed to do it with one hand after the third try 😄

fasterthanlime, 5 months ago to random

if you hit "Ctrl+K" in PowerPoint, to insert a hyperlink, you get this delightful mix of 90s and 2010s design that I assume everyone at Microsoft is afraid to touch

+ vascorsd, kura, lanodan, astrid +5 more

mush42, 5 months ago

@fasterthanlime appreciate the alt text

x0, 6 months ago to random

I am now also hosting x64 builds of Bookworm (the accessible ebook reading app) for Windows, again from the tip of the source tree. These are done since the AppVeyor artifacts expire and can be hard to find. When checking back for updates ensure you pay attention to the modification date on the file, not the version number, as the version number hasn't changed in a long time. https://x0box.xyz/uploads/bookworm/

+ devinprater, datajake1999, jaybird110127, weirdwriter +1 more

mush42, 6 months ago

@x0
I wish I could finish up Piper-for-NVDA faster to get back to Bookworm.
I've got many new ideas for Bookworm begging for implementation. Most of them are shooting for the moon. But hey, there's nothing impossible under the sun.
For example, I've got an idea that's been nagging me for weeks: a model to automatically tag untagged PDFs. The details are already finalized in my head, but time, time, time.

jcsteh, 6 months ago to random

OMG. I swear I'm going to ask NV Access to back the WASAPI code out of NVDA altogether soon.

+ weirdwriter

mush42, 6 months ago

@jcsteh
I don't want to add insult to injury here.
But WASAPI is very buggy when feeding wave samples in very small chunks. The ugliest thing is the clicking sound at the start of each wave chunk.
I'm working on piper-for-NVDA, and I've recently implemented streaming output, and this is the main show stopper.

mush42, 6 months ago

@jcsteh
In the range of 150-200 ms of audio per chunk.
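
To put those chunk durations in concrete terms, here is a small arithmetic sketch. The audio format (22,050 Hz, 16-bit, mono PCM) is an assumption chosen for illustration; the post does not state the format actually used.

```python
# Assumed audio format for illustration only (not stated in the post).
SAMPLE_RATE_HZ = 22_050   # samples per second
BYTES_PER_SAMPLE = 2      # 16-bit PCM
CHANNELS = 1              # mono


def chunk_size(duration_ms: int) -> tuple[int, int]:
    """Return (samples, bytes) for one chunk of the given duration."""
    samples = SAMPLE_RATE_HZ * duration_ms // 1000
    return samples, samples * BYTES_PER_SAMPLE * CHANNELS


for ms in (150, 200):
    samples, size = chunk_size(ms)
    print(f"{ms} ms -> {samples} samples, {size} bytes")
```

Under these assumptions each chunk is only a few kilobytes, so the audio device is fed many small buffers per second.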

mush42, 6 months ago to random

https://www.bbc.co.uk/news/newsbeat-44124396

+ ppatel

mush42, 6 months ago to rust

As a visually impaired #Rust dev, my favorite sentence to hear is this:
Finished dev [unoptimized + debuginfo] target(s) in x.ys

+ devinprater

mush42, 6 months ago

@ekuber
For me, Rust is one of the most pleasant languages to work with.
I like the fully integrated DX, the extremely helpful compiler, and the command-line-first tooling.
IMO the formatting of compiler error messages involving complex type signatures needs to be improved. Since some types are elided, I usually need to go through the message several times, and go back to the source to know what has been elided. This is because I cannot do side-by-side viewing.

Brynify, 6 months ago to random

Flies are so stupid. If you land on something and get a violent protest, you would think not to land there again. But oh no no no. Not flies.

mush42, 6 months ago

@Brynify
You remind me of an episode of Breaking Bad where a fly is the plot of the whole episode.

mush42, 6 months ago to random

A new version of Piper for NVDA is out.

What's new in v2.0

• This version introduces a separate process for the TTS. This significantly increases responsiveness
• Installing voices is now easier using the integrated voice manager, which allows you to preview and download available voices
• The TTS no longer introduces unnatural pauses at the end of the line during say all
• and many other improvements and bug fixes

Detailed release notes and download link:
https://github.com/mush42/piper-nvda/releases/tag/v2.0-beta2

#NVDASR

+ devinprater, datajake1999, objectinspace, ppatel

mush42, 6 months ago

@ppatel
Does your laptop have a non-Intel processor?
