Carunga

@Carunga@feddit.de

This profile is from a federated server and may be incomplete. Browse more on the original instance.

ajayiyer, (edited ) to academicchatter
@ajayiyer@mastodon.social avatar

Dear @linux and @academicchatter folks:

Please suggest libre/open source tools that allow for the extraction of text and images from scientific pdf documents?

P.S: I'm on a linux machine. Would like something terminal friendly, if possible!

Carunga,

Try Zotero. It is a complete literature databas but it’s PDF reader is very good at extracting images and text. Works on all OS, web and mobile. Native Linux client has been very smooth for me. Oh, terminal it doesn’t do though. If you want to extract a large amount in an automated way, its probably not the right tool.

Carunga,

Cool. Hope to print one later. Last week I have been searching for exactly this and could not find it. Did you think about an option for adding some simple dividers inside the box?

Carunga,

What do you guys think about Eternal Terminal? I quite like it. You can scroll!

Carunga,

I usw Garuda with KDE and like it lot, even though I do not game.

Carunga,

My setup is pretty much option 1, I have no issues with it. You can easly mount NFS shares as docker volumes (I m docking that for jellyfin and nextclould) but you need to get the permissons right. But I am no expert, just a hobbiest not smart enough for a better solution :)

Carunga,

Borg is running on a headless server. Everything is dockerized, so I did the same with Borg. Advantages are that the setup is easy to setup, backup the config and move it to a different server. At first I did not realize that the mount of the backup only exists in the container and that this is making things a little harder.

Carunga,

Agreed, the dedupe feels like magic!

Carunga, (edited )

That does make sense. Could work for me as well. I was just not aware I can mount a repository from a remote host that was created by a different Borg instance on another server and just browse the files like they were local on my notebook.

Carunga,

Thanks for your answer and taking the time! Borgmatic search I did not know. That is an amazing tool. You are right about the mounting. My way of dealing with that is a NFS share I mount RW so I can restore to that and than copy whereever. This might it be ideal for very large restores though. Initially I thought I could borgmount to the NFS share and then access the filesystem via NFS. But this does nof work I suppose as Borg only lives inside the container. Generally I do like having Borg and Borgmatic containerized as almost everything else I selfhost but it adds complexity restoring. Anyways great project, it is just so powerful and in many ways elegant. Really enjoy using it!

Carunga,

This was the first idea. I cannot explain very well why this does not work. But I think the issue is that the borg mount magic lives inside the container so the filesystem cannot be seen from the host. You can mount an empty directory and copy the files you want to access from the host into it. Problem with that is that you are stuck with the tooling provided by the container.

Carunga,

Wow, just had a very short look,this looks like an amazing rabbit hole to get into. Do you run this yourself. Did you find the setup difficult? Is the,WiFi range compareable to a comercial access point?

Carunga,

Thanks. Yeah people say not so nice stuffed a out unifi and Ideally don’t want to,be pushed in somebodies cloud. Mikrotik looks good. Will do some more reading about them. Your comments were really helpful!

Carunga, (edited )

Nice hints. Never really heard about the banana pie routers. Great to hear from someone with extended knowledge! You gave lots to read and think about. Setting up a mesh network requires some work but seems doable. Have you used it? Does it work for you once setup smoothly? Sounds great all in all but not sure if I can motivate myself for the extra effort (and the negative feedback for breaking the internet). Do you use the metal cases from banana pie and their WiFi antennas or are there better options? It will be in the living room and a not to techy look is required.

Carunga,

Has been using Ubuntu for a while but kept destroying it. I aim at a stable base with modern applications.

Carunga,

Nice idea. I`d love to love nix but I think it is too involved for me. Maybe I have to try again but I need results tonight, so this might be for another time.

Carunga,

Thanks for your suggestions. Silverblue might be a good idea. I am more in the Debian based camp but maybe it is time for a change. I think it gets major updates as often as fedora “normal”. This might not be ideal for us though.

Carunga,

Wow, that looks very promising. Will keep that in mind. Thanks for the tip.

Carunga,

Thanks, will look into it.

Carunga,

Need Tod check that out. Never heard of it. I myself run KDE as part of the mighty garuda Linux. Pure is not the word I would use to describe garuda though.

Carunga,

Turns out she needs a bit of proprietary software (pixum for photo books) that I could not install on EndlessOS. So I had to change course and installed pop os. So far I am pleasantly suprised. Even though I thought I would not like it, their take on GNOME makes sense to me. Tiling is fun.

Carunga,

Did not know this. Immutable Ubuntu plus flatpack sounds awesome. Not sure what to think about their plans to fork cosmic from GNOME.

corytheboyd, to selfhosted
corytheboyd avatar

I’m looking for a self hosted solution to this problem:

I want to create a full text search index from a collection of PDF manuals (text, not images, I don’t care about OCR here). There is a UI to search for text matches in documents, and clicking a search hit opens the PDF scrolled to where the search hit is (bonus points if the search hit is hilighted)

Carunga,

U r right, it does highlight in the pdf. It did not on mobile for me.

  • All
  • Subscribed
  • Moderated
  • Favorites
  • JUstTest
  • kavyap
  • DreamBathrooms
  • thenastyranch
  • magazineikmin
  • InstantRegret
  • Durango
  • Youngstown
  • everett
  • slotface
  • rosin
  • cubers
  • mdbf
  • ngwrru68w68
  • anitta
  • GTA5RPClips
  • cisconetworking
  • osvaldo12
  • ethstaker
  • Leos
  • khanakhh
  • normalnudes
  • tester
  • modclub
  • tacticalgear
  • megavids
  • provamag3
  • lostlight
  • All magazines