wragge, to history
@wragge@hcommons.social avatar

I've written a little post about the National Library of Australia's collection of archived websites in Pandora and the new section that helps you to work with the data.

Want to find websites from Australian elections back to 1996? Just go to Pandora. Want all the urls in a spreadsheet? Just run my new notebook.

@histodons https://updates.timsherratt.org/2024/05/07/using-pandoras-collection.html

edsu, to random
@edsu@social.coop avatar

I just noticed that browsertrix-crawler got its (voluminous) docs moved from the repo README.md to a nice new website:

https://crawler.docs.browsertrix.com/

If you haven't used it before browsertrix-crawler is an amazing tool that lets you create standardized using a browser-based crawler on your computer using Docker.

It's kinda like wget but it actually renders the pages, and lets you write site specific, customized behaviors for archiving especially difficult to collect content.

edsu, to random
@edsu@social.coop avatar

UNT are always pushing the boundaries of what can be done in digital libraries and in particular. On a call yesterday I learned from Lauren Ko that they have started to archive web resources cited in student's dissertations.

Each archived page is bundled into a dissertation specific WACZ file (a ZIP with metadata), which can then "played back" on the dissertation's web page using @webrecorder's ReplayWebPage web component.

Here's an example: https://digital.library.unt.edu/ark:/67531/metadc2179336/

A list of web resources that have been archived for the dissertation, including a page from the Texas Historical Association website: "Village Creek, Battle of".
Viewing the archived web page for "Village Creek, Battle of" that was originally published at www.tshaonline.org, but is now made available from the UNT Digital Library.

jkramersmyth, to random
@jkramersmyth@digipres.club avatar

NYT adds a web-based Flash player to their archive website so visitors can run old Flash-based interactive news pieces.

https://eagereyes.org/blog/2024/nytimes-web-flash-player

#webarchives #DigitalPreservation #flash #archives

shawnmjones, to twitter
@shawnmjones@hachyderm.io avatar

just my PhD advisor Michael L. Nelson. He was covering how the wife of the US Speaker of the House had an anti-+ website and just took it down, but it’s still in . Due to the , we can view the tweets by URL, but cannot view the whole thread. He’s asked us to him in hopes of: (1) getting the word out about the shadowban, and (2) overcoming the shadowban (unlikely).

The URLs: https://gist.github.com/phonedude/e4970c74660d91622cb14e77d865d64e

A screenshot of four posts from a Twitter thread that read “This Post is unavailable. Learn more.”
A screenshot of a tweet from @phonedude_mln, somehow half blocked. The tweet above it in the thread reads “This Post is unavailable. Learn more” but the tweet under that reads: But the problematic text (pictured) is not in the website, it’s in the: “OPERATING AGREEMENT of ONWARD CHRISTIAN COUNSELING SERVICES, LLC" Which is a PDF linked from the web site. Underneath this is an image of text that reads: Marriage and Sexuality. We believe the term "marriage" has only one meaning and that is marriage sanctioned by God which joins one man and one woman in a single, exclusive union, as delineated in Scripture. We believe that God intends sexual intimacy to only occur between a man and a woman who are married to each other. We believe that God has commanded that no intimate sexual activity be engaged in outside of a marriage between a man and a woman. We believe and the Bible teaches that any form of sexual immorality, such as adultery, fornication, homosexuality, bisexual conduct, bestiality, incest, pornography or any attempt to change one's sex, or disagreement with one's biological sex, is sinful and offensive to God. We believe that in order to preserve the function and integrity of the Company, and to provide a biblical role model to the clients and the community, it is imperative that all persons employed by the Company in any capacity should abide by and agree to this statement on marriage and sexuality and conduct themselves accordingly. Because we believe in ...
A screenshot of a tweet from @phonedude_mln Michael L. Nelson that reads: The original @HuffPost article has a copy of the PDF hosted in scribd. There appears to be an original (?) copy of the pdf still available at: https://6ea37b86-6bd5-478a-93e4-d7e6f6a280b0.filesusr.com/ugd/a430fd_5bf898e6d0c044cfb4a8b4c5f387c63f.pdf "fileusr. com" is apparently a hosting service for Wix.

  • All
  • Subscribed
  • Moderated
  • Favorites
  • JUstTest
  • cubers
  • DreamBathrooms
  • ngwrru68w68
  • Durango
  • osvaldo12
  • magazineikmin
  • mdbf
  • Youngstown
  • slotface
  • rosin
  • everett
  • kavyap
  • anitta
  • normalnudes
  • thenastyranch
  • khanakhh
  • cisconetworking
  • modclub
  • GTA5RPClips
  • InstantRegret
  • tacticalgear
  • provamag3
  • ethstaker
  • tester
  • Leos
  • megavids
  • lostlight
  • All magazines