mhucka, to conservative
@mhucka@fediscience.org

Good grief, I only just noticed that the Wayback Machine browser extension adds not just a menubar item – it also adds a contextual menu. At least in Safari on macOS, if you right-click on a page, you have access to it there.

mhucka, to conservative
@mhucka@fediscience.org

Occasional reminder that the Internet Archive provides a number of tools and browser plugins to let you send pages to the Wayback Machine (as well as check if a given page has been saved):

https://help.archive.org/help/save-pages-in-the-wayback-machine/
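For anyone who would rather script this than click through a browser plugin, here is a minimal sketch in Python. It assumes the Internet Archive's public availability endpoint (`https://archive.org/wayback/available`) and the `https://web.archive.org/save/` URL prefix; neither is spelled out in the post above, and the function names are made up for illustration. Rate limits and error handling are left out.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumed endpoints (not named in the post above).
AVAILABILITY_ENDPOINT = "https://archive.org/wayback/available"
SAVE_PREFIX = "https://web.archive.org/save/"


def availability_url(page_url, timestamp=None):
    """Build the query URL that asks whether page_url has a capture.

    timestamp is optional, in YYYYMMDD[hhmmss] form, and asks for the
    capture closest to that moment.
    """
    params = {"url": page_url}
    if timestamp:
        params["timestamp"] = timestamp
    return AVAILABILITY_ENDPOINT + "?" + urlencode(params)


def save_url(page_url):
    """Build the URL that, when visited, requests a new capture of page_url."""
    return SAVE_PREFIX + page_url


def closest_snapshot(response_text):
    """Return the closest capture's URL from an availability response, or None."""
    data = json.loads(response_text)
    closest = data.get("archived_snapshots", {}).get("closest")
    if closest and closest.get("available"):
        return closest.get("url")
    return None


def check_saved(page_url):
    """Hypothetical helper: query the endpoint over the network."""
    with urlopen(availability_url(page_url), timeout=10) as resp:
        return closest_snapshot(resp.read().decode("utf-8"))
```

Calling `check_saved("example.com")` should return the URL of the nearest capture if one exists, or `None` if the page has never been saved; opening `save_url("https://example.com/")` in a browser is the manual way to request a fresh capture.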

g3om4c, to ai
@g3om4c@code4lib.social

Harvard Library Innovation Lab: WARC-GPT: An Open-Source Tool for Exploring Web Archives Using AI

"...an open-source, highly-customizable Retrieval Augmented Generation tool the web archiving community can use to explore the intersection between web archiving and AI. WARC-GPT allows for creating custom chatbots that use a set of files as their knowledge base, letting users explore collections through conversation." 👏

https://lil.law.harvard.edu/blog/2024/02/12/warc-gpt-an-open-source-tool-for-exploring-web-archives-with-ai/

detroit_yeet, to twitter

Remember this post of mine, a walkthrough of how to use Nitter, archive.org, and archive.ph to fully archive tweets (including tweets higher and lower in the thread)? https://kolektiva.social/ Well, since I made that post, two things have happened:

  • Archive.ph stopped saving archive.org captures of nitter.net

  • Nitter shut down

It's now just about impossible to properly archive tweets. You can still right-click and save the page, and you can still use archiveweb.page, but both are much less user-friendly and much harder to share.

molly0xfff, to random
@molly0xfff@hachyderm.io

new feature coming to @web3isgreat :blobfoxcute:

cgenin, to random French

"Web archives in context". That will be the theme of the Web Archiving Conference, to be held on 25 and 26 April 2024 at the […] as part of the General Assembly of IIPC @netpreserve. The call for proposals is open until 24 September. https://webcorpora.hypotheses.org/1229

cgenin,

@mimo @netpreserve When we talk about web archiving, it actually covers the whole public internet. So we also try to collect part of social media, but apps often make collection complicated. I have tried to describe my work on archiving the literary web, for example, in this article:
https://journals.openedition.org/bssg/271

anj, to random
@anj@digipres.club

As well as being a nice intro to Legal Deposit in the UK, this video has also led to us getting a big bump in nominations! https://www.youtube.com/watch?v=ZNVuIU6UUiM

tournesol, to archive
@tournesol@peculiar.florist

Do you know of any instances for downloading archives of web pages?
I wanted to self-host https://archivebox.io/ but I can't get it installed right now and I don't feel like racking my brain over it any further

shawnmjones, to twitter
@shawnmjones@hachyderm.io

Elon is planning to effectively kill social cards on Twitter. Social cards were a big part of my dissertation work. I published a few papers about generating them via , , and because they make for nice bits of document and . Now Musk wants them gone to force journalists to write articles directly on Twitter.

Ref (paywall): https://fortune.com/2023/08/21/elon-musk-plans-remove-headlines-news-articles-link-shared-on-x-twitter/
Ref (article about paywalled article): https://9to5mac.com/2023/08/21/twitter-to-hide-news-headlines/

shawnmjones,
@shawnmjones@hachyderm.io

In 2020, we developed a special tool, MementoEmbed, for generating/extracting metadata from archived web pages. We presented this tool at the Web Archiving and Digital Libraries Workshop (WADL2020).

We found out that , , , and others could not reliably create cards for archived web pages. We use MementoEmbed’s cards in with our tool Raintale to create a of this .

Ref: https://arxiv.org/abs/2008.00137

webrecorder, to random

A quick update on WACZ, new tools and integrations, and a glimpse of the path ahead:
https://webrecorder.net/2023/05/03/an-update-on-wacz.html

shawnmjones, to machinelearning
@shawnmjones@hachyderm.io

- Hi. I'm Shawn Jones. I'm a cat dad and an ISTI Postdoc Fellow at Los Alamos National Laboratory.

I'm interested in and much more.

I recently moved from mastodon.social to hachyderm.io. I'm a Computer Scientist who moves between software engineering and research. I may post about code, papers, and conferences.

The attached image helps my colleagues differentiate my cats. 🙂 🐈‍⬛ 🐈
