Posts - mush42 - kbin.social

This profile is from a federated server and may be incomplete. Browse more on the original instance.

mush42, 1 month ago to rust

👋 Career change alert!

Looking to pivot into tech & leverage my 10+ years of programming experience

🐍 Python
🦀 Rust
</> Web Development
🌐 CMS: WordPress & Wagtail
✨ Machine Learning: Torch & Tensorflow

My passion for code shines through my open-source projects! Check them out:
https://github.com/mush42
https://github.com/blindpandas

#rust #python #machinelearning #careeradvice #opentowork

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ datajake1999, objectinspace

mush42, 1 month ago to ai

Just a random thought.
As Generative AI is being used for creating a lot of content these days, what happens when the next generation of AI models are trained using that content.
When AI models are trained on AI generated content, we'll officially enter the 8th circle of hell
#ai #generativeai #AItransparency #discussion

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ jaybird110127

mush42, 1 month ago to random

The second beta of sonata-for-NVDA v3.0 is out:
https://github.com/mush42/sonata-nvda/releases/tag/v3.0-beta.2

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ datajake1999, jaybird110127

mush42, 1 month ago to random

New beta release of Sonata-for-NVDA, formaly known as Piper-for-NVDA
NVDA 2024.1 compatibility
Support for fast variants for Piper voices. These fast variants improves responsiveness significantly because they use streaming synthesis
Improvements to responsiveness and speed across the board
Release page:
https://github.com/mush42/sonata-nvda/releases/tag/v3.0-beta.1
Direct download link:
https://github.com/mush42/sonata-nvda/releases/download/v3.0-beta.1/sonata_neural_voices-3.0-beta.nvda-addon

reply

expand (13)

collapse (13)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ datajake1999, jaybird110127, ppatel

mush42, 1 month ago

Changed the name to Sonata since we plan to support additional TTS models besides Piper in the future.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ ppatel

mush42, 1 month ago

Important notice!!!
After installing this version, you will lose all of your installed voices. Please use the voice manager to re-install the voices again.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ ppatel

mush42, 6 months ago to random

I think an open-source auto tagger for PDFs is very possible.
It will make it easier to convert PDFs to highly structured HTML documents.

Anyone interested in tackling this challenge with me?

Adobe already took the lead:
https://news.adobe.com/news/news-details/2023/Media-Alert-Adobe-Scales-PDF-Accessibility-With-Adobe-Sensei-AI/default.aspx

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ devinprater

scruss, 6 months ago

@mush42 Knowing Adobe, it will be for a very limited subset of PDFs produced by their own software. Never trust a company that has two incompatible standards for managing form data ...

(I used to be a prepress programmer, so I've experienced a whole load of really terrible PDFs. At best, they're digital marks on paper. I also remember the whole "tagged PDF" thing from the early 2000s)

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

mush42, 6 months ago

@scruss
For this project, I'd forgo parsing the PDF stream, and extract symantic structure using a visual rendition. Then I'd use this symantic metadata to parse the PDF stream and extract text.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ devinprater

mush42, 6 months ago to random

https://www.bbc.co.uk/news/newsbeat-44124396

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ ppatel

mush42, 6 months ago to rust

As a visually impaired #Rust dev, my favorite sentence to hear is this:
Finished dev [unoptimized + debuginfo] target(s) in x.ys

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ devinprater

ekuber, 6 months ago

@mush42 I would love to hear about your experience, what tooling you use and what we could do to improve things

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

mush42, 6 months ago

@ekuber
For me, Rust is one of the most pleasant languages to work with.
I like the fully integrated DX, the extremely helpful compiler, and the commandline first tooling.
IMO the formatting of compiler error messages involving complex type signitures needs to be improved. Since some types are elided I usually need to go through the message several times, and go back to the source to know what has been elided. This is because I cannot do side-by-side viewing.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

mush42, 7 months ago to random

A new version of Piper for NVDA is out.

What's new in v2.0

• This version introduces a separate process for the TTS. This significantly increases responsiveness
• Installing voices is now easier using the integrated voice manager, which allows you to preview and download available voices
• The TTS no longer introduces unnatural pauses at the end of the line during say all
• and many other improvements and bug fixes

Detailed release notes and download link:
https://github.com/mush42/piper-nvda/releases/tag/v2.0-beta2

#NVDASR

reply

expand (4)

collapse (4)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ devinprater, datajake1999, objectinspace, ppatel

ppatel, 7 months ago

@mush42 No. But Windows identifies it as ssuch.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

ppatel, 7 months ago

@mush42 Thanks for pointing me to a possible problem here. It looks like there's no way to change what the system identifies even if it's false. I appreciate the response.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

mush42, 7 months ago to random

@accessibleandroid Please help!

After I updated Speech Recognition & Synthesis by Google to the last version, I lost the ability to download voices.
Any suggestions?

reply

expand (4)

collapse (4)

report

activity

copy /kbin url

copy original url

open original url

Loading...

accessibleandroid, 7 months ago

@mush42 Are you trying to download the voice data for a specific language?

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

mush42, 7 months ago

@accessibleandroid no, all the ones I tried fail. When I press the download button, nothing happens, and TalkBack keeps losing focus and focusing the button.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

mush42, 9 months ago to random

For those who use the Piper voices add-on for #NVDASR I've just added an integrated "Voice Manager" where you can install new voices, and manage installed ones. So, no manual download, no TAR archives 🤗

A new version with the voice manager will be released later this month. Stay tuned!

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ ppatel, objectinspace, datajake1999, pitermach

mush42, 1 year ago to random

New neural voices for NVDA are available.

Piper for #NVDASR
https://github.com/mush42/piper-nvda

reply

expand (12)

collapse (12)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ bmoore123, pitermach, devinprater

mush42, 1 year ago

@bryn

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

Marco, 1 year ago

@devinprater I got it working. And the German voices are funny. They all have problems pronuncing the open ch sound like in "ich" oder "echt", a sound much similar to the h sound in the English word "human". The way they speak it, they sound more like a Danish or Swedish person who had German as a second or third language in school. They also swallow some of the consonants we usually pronunce prominently. The only language in northern Europe that prominently does this is Danish. Or some Bri'ish accents where they got rid of all the ts. All the variants of German show the same problems, which is interesting. @KaraLG84 @mush42

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ devinprater