spocko,
@spocko@mastodon.online avatar

I need some help from my Mastodon html & experts.
I'm trying to download the transcript for 4-22-2024.
It gives me an index
https://pdfs.nycourts.gov/PeopleVs.DTrump-71543/transcripts/4-22-2024/

At the top "NEXT" starts with 00001.html & is incrementally increased. https://pdfs.nycourts.gov/PeopleVs.DTrump-71543/transcripts/4-22-2024/00001.html
There are 121 pages of individual images in the middle of a page. It's NOT a single long PDF document.

  1. How do i download them all at once?
  2. How do I convert the images to a searchable PDF?

4-22-2024 Transcript for Trump Trial

kentbrew,
@kentbrew@xoxo.zone avatar

@spocko huh, interesting, these are individual image files (presented as raw data:URIs in the HTML source) and not PDFs. It's almost like they wanted to make it as hard as possible to do what you want to do.

  • All
  • Subscribed
  • Moderated
  • Favorites
  • random
  • tacticalgear
  • thenastyranch
  • ethstaker
  • everett
  • Durango
  • rosin
  • InstantRegret
  • DreamBathrooms
  • magazineikmin
  • Youngstown
  • mdbf
  • slotface
  • GTA5RPClips
  • kavyap
  • JUstTest
  • tester
  • cubers
  • cisconetworking
  • ngwrru68w68
  • khanakhh
  • normalnudes
  • provamag3
  • Leos
  • modclub
  • osvaldo12
  • megavids
  • anitta
  • lostlight
  • All magazines