tedunderwoodillinois, 27 days ago

Open-source AI requires open data. There's a lot out there, but one of the obstacles is that older public-domain books have terrible OCR transcription. To that end, Pleias is releasing a billion words of public-domain text with experimental LLM-based OCR correction. https://huggingface.co/datasets/PleIAs/Post-OCR-Correction

@tedunderwoodillinois@threads.net
