Replies

This profile is from a federated server and may be incomplete. Browse more on the original instance.

noleli, to random
@noleli@mastodon.social avatar

I have a 72-page, 36-year-old typewritten document that was scanned to a non-OCRed PDF in 2010. I’m trying to cleanly extract the text so I can convert it to markdown. All attempts at OCR have yielding extremely messy results. Is there a new generation of ML-based OCR I could try, or should I MTurk it? #dontSayAI

mdekstrand,
@mdekstrand@hci.social avatar

@noleli sound good to me! (and what problem do people have with <section>?)

mdekstrand,
@mdekstrand@hci.social avatar

@noleli @brucelawson @aardrian it looks like the primary problem is with section +expecting a document outline algo that doesn’t exist, not with <section> as a container in combo with properly-nested h1/2/3, except in so far as section may not have a purpose without said algo?

ISTM that your proposed use is closer to the container w/ proper header tags when applicable. Another option could be “<h3 class=untitle>Untitled Section</h3>

mdekstrand,
@mdekstrand@hci.social avatar

@noleli @brucelawson @aardrian I was also curious because I use <section> all the time with proper header tags, just as a nicer alternative to wrapping the section in a div, based on what I learned from MDN and not reading the background discussions. It looks like this probably isn’t creating any of the problems discussed in these pieces? (This is also what Pandoc does if you specify HTML5 output and section-divs.)

scheidegger, to random
@scheidegger@mastodon.social avatar

You maybe think I'm joking about citing "do not create the torment nexus", but I get like one of these every week, seriously

mdekstrand,
@mdekstrand@hci.social avatar

@scheidegger da fuq?

Reminds me of the paper that cited my gender bias audit of book recommenders to justify some bullshit computer vision technique to recognize gender from skeletal structure because gender recognition is “important for recommender systems”.

mdekstrand,
@mdekstrand@hci.social avatar

@scheidegger oof. no. and it isn't like your paper is that difficult to understand, esp. in its CACM version.

I've definitely seen it cited in some pretty superficial ways. Like just cite it, claim to make a WAE assumption, move forward without engaging in the deep interrogation needed to figure out what an WAE assumption would really, plausibly look like in this case, etc.

but chronic failure to engage with the substance of an argument is endemic to academia. very sadly.

mdekstrand,
@mdekstrand@hci.social avatar

@scheidegger versions of it are something I see in students a lot too 😔. Some of them, with coaching, grow past it.

But lots of what seems like keyword matching — latching onto the words and the superficial sequence & structure, without grokking and translating the fundamental substance. Which drives me batty, because my brain is all connections, all the time, and thrives on seeing where different authors are using wildly different language to make the same basic argument.

mdekstrand,
@mdekstrand@hci.social avatar

@scheidegger It can! And honestly one of the reasons I didn't seek pharmaceutical treatment until I was 38… was worried that meds would interfere with the mental connection circuitry.

(they didn't. they help me act on connections more effectively.)

mdekstrand, to random
@mdekstrand@hci.social avatar

"Do you want to get credit for this review in Web of Science?"

idk, sure.

"Do you want to go through the work of figuring out how the hell to properly create and link accounts to get that credit to show up?"

uhh not really.

mdekstrand,
@mdekstrand@hci.social avatar

@bkeegan and one would think “sign in with ORCID” would fix it, but nope.

regehr, to random
@regehr@mastodon.social avatar

which is more cursed... the warning or the code being warned about?

/home/regehr/llvm-project/llvm/include/llvm/Support/AutoConvert.h:1:1: warning: C++ style comments are not allowed in ISO C90
1 | //===- AutoConvert.h - Auto conversion between ASCII/EBCDIC ------ C++ --===//

mdekstrand,
@mdekstrand@hci.social avatar

@regehr @scheidegger ERROR: E4666: unexpected sequence in string literal. Try compiling with -Z cursed-options -Z str-is-utf-ebcdic

mdekstrand, to random
@mdekstrand@hci.social avatar

I really need to find a better liquor store, but I guess Highland Park will tide me over until I go to Scotland in March.

mdekstrand,
@mdekstrand@hci.social avatar

@bkeegan They had both of those, but not a fan of Laphroaig — way too smokey for me. I had an Ardbeg once a while ago, and remember it being ok. They also had Macallan and Oban, but priced higher than I was feeling this week. Highland Park was the only thing that looked to be in the spice-forward department.

mdekstrand, to random
@mdekstrand@hci.social avatar

All this work in AI for “customer service” but what about AI for navigating customer service? If it’s good enough for them to try to handle my problem, shouldn’t it be good enough for me to try to navigate customer service to get it solved?

mdekstrand,
@mdekstrand@hci.social avatar

There was that Google demo a few years ago to call restaurants and stuff that everyone hated (for some pretty good reasons IMO).

mattwilcox, to random
@mattwilcox@mstdn.social avatar

Oh god, cursed idea; storing your SVG UI elements as strings in local storage, that you stuff into the DOM via JS, so that you get the benefit of "caching" like they're an img src, but also the ability to target and style in CSS because they're loaded on the page as actual SVG.

Essentially loading SVG assets you often want to theme with CSS as a browser-stored package. Instead of needing to dump raw SVG in HTML, or do icon-green.svg icon-red.svg etc.

mdekstrand,
@mdekstrand@hci.social avatar

@noleli @mattwilcox what is this sorcery?

<reads web component docs>

mdekstrand,
@mdekstrand@hci.social avatar

@noleli so, i could replace my web site's secret content architecture that currently stashes ciphertext in a <pre> and includes the decrypt form in HTML with

<secret-content>CIPHERTEXT</secret-content>

and make a web component do all the work?

i am simultaneously intrigued and horrified

mdekstrand, to random
@mdekstrand@hci.social avatar

This by @kissane is really good, and towards the end covers a solid non-religious argument grounded in demonstrable harms for second-degree defederation from Threads. https://erinkissane.com/untangling-threads

mdekstrand,
@mdekstrand@hci.social avatar

The visibility-of-harassment problems are a significant weakness of the ActivityPub federation model. They also look incredibly difficult to solve.

grimalkina, to random
@grimalkina@mastodon.social avatar

To revive (jk it's never gone just buried under administrivia and the weight of healing from general occupational hazards like researcher harassment) my love for science, I've been rereading the early and famous papers that helped us understand that smoking causes lung cancer. Famous beautiful examples of causal reasoning and difficult to remember how HARD IT WAS to tell this story back then. Profoundly poignant to see people use the weapons of evidence reasoning to create real health change.

mdekstrand,
@mdekstrand@hci.social avatar

@natematias @grimalkina tobacco, eugenics, he really knew how to pick his causes, didn’t he.

  • All
  • Subscribed
  • Moderated
  • Favorites
  • JUstTest
  • tacticalgear
  • DreamBathrooms
  • InstantRegret
  • magazineikmin
  • Youngstown
  • everett
  • cisconetworking
  • slotface
  • GTA5RPClips
  • rosin
  • thenastyranch
  • kavyap
  • mdbf
  • Leos
  • modclub
  • osvaldo12
  • Durango
  • khanakhh
  • anitta
  • provamag3
  • ngwrru68w68
  • cubers
  • tester
  • ethstaker
  • megavids
  • normalnudes
  • lostlight
  • All magazines