hannah,
@hannah@social.alt-text.org

This is pulling data from an image of a table into a browsable table. It definitely seems promising. Far from perfect, but this is a proof of concept after all. Huge thanks to @jakobrosin for the idea of using a table.

Based on my previous tweet, this could very well be novel. If anyone who relies on a screen reader could try it out and give me any feedback, it'd be so appreciated.

https://2d-ocr.glitch.me/

miki,
@miki@dragonscave.space

@hannah @jakobrosin I don’t have a suitable image to test this RN, but have you thought of exporting the table to markdown and then putting it through an LLM to fix up the results? This is a simple enough task that ChatGPT should have no problem whatsoever, and it probably wouldn’t be easy to do any other way.
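For anyone curious what the "export the table to markdown" step might look like, here is a minimal sketch in Python. It assumes the OCR result has already been arranged into a list of rows, each a list of cell strings, with the first row acting as the header. That layout and the function name `table_to_markdown` are assumptions for illustration, not how the demo actually stores its data.

```python
def table_to_markdown(rows):
    """Render rows (lists of cell strings) as a Markdown pipe table.

    Assumption: the first row is the header. Short rows are padded
    with empty cells so every line has the same number of columns.
    """
    if not rows:
        return ""
    width = max(len(row) for row in rows)

    def format_row(row):
        # Escape pipes and flatten newlines so a cell cannot break the table.
        cells = [str(c).replace("|", "\\|").replace("\n", " ").strip() for c in row]
        cells += [""] * (width - len(cells))
        return "| " + " | ".join(cells) + " |"

    lines = [format_row(rows[0]), "| " + " | ".join(["---"] * width) + " |"]
    lines += [format_row(row) for row in rows[1:]]
    return "\n".join(lines)


if __name__ == "__main__":
    print(table_to_markdown([["Name", "Qty"], ["Apple", "3"], ["Pear"]]))
```

The resulting text could then be pasted into whatever cleanup step is preferred, whether that is an LLM or a manual pass.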

hannah,
@hannah@social.alt-text.org

@miki Hmm, I'm not a big fan of AI as it exists today, so I'm not really sure what it would be fixing. Could you expand on that?

miki,
@miki@dragonscave.space

@hannah Improper cells, merged cells etc.

hannah,
@hannah@social.alt-text.org

@miki I think there's potential there, but there's some definite lower-hanging fruit. It has coordinate data for every character, but the API is grouping them across lines. If someone more familiar with image processing wanted to step up and write a quick "find all the dividing lines in an image", that would do wonders.
