Researchers discover an AI placebo effect where task performance improves when people believe an AI helps them.
"The results also pose a significant challenge for research on HCI, since expectations would influence the outcome unless placebo control studies were used.
‘These results suggest that many studies in the field may have been skewed in favor of AI systems,’ concludes Welsch."
@helma I was surprised by this too. It may not be a majority view in the Netherlands, but at the same time it is quite likely that precisely the people who would come to the Dam feel, more often than average, resistance to the current Speaker of the House in this role. Because of what he stands for politically, and because of what he has said (in particular his statements about depopulation).
He may be the Speaker, but his participation could certainly also explain a low turnout. An omission by the NOS, in my view.
Chatbots in the Dutch news today (unusually, they also made an English version):
Chatbots recommend disinformation and fear mongering, tech companies tighten restrictions - https://nos.nl/l/2519047
Envisioning Information Access Systems: What Makes for Good Tools and a Healthy Web?
Must-read article in ACM Transactions on the Web on challenges in information access and whether LLMs might play a role (or not!), by Chirag Shah and @emilymbender
"Information access is not merely an application to be solved by the so-called ‘AI’ techniques du jour. Rather, it is a key human activity, with impacts on both individuals and society."
I was shocked by the news that Israel is using AI over a large database of suspected Hamas supporters to select targets to kill. How many innocent people will be killed? Errors in the data, errors in the software, and all the collateral damage; who comes up with this and happily builds the tech?! Immoral.
"This is super cool, but I can't actually get it to work reliably enough to roll out to our customers."
"The core problem is that GenAI models are not information retrieval systems," she added. "They are synthesizing systems, with no ability to discern from the data it's trained on unless significant guardrails are put in place."
In an age of LLMs, is it time to reconsider human-edited web directories?
Back in the early-to-mid '90s, one of the main ways of finding anything on the web was to browse through a web directory.
These directories generally had a list of categories on their front page. News/Sport/Entertainment/Arts/Technology/Fashion/etc.
Each of those categories had subcategories, and sub-subcategories that you clicked through until you got to a list of websites. These lists were maintained by actual humans.
Typically, these directories also had a limited web search that would crawl through the pages of websites listed in the directory.
Lycos, Excite, and of course Yahoo all offered web directories of this sort.
(EDIT: I initially also mentioned AltaVista. It did offer a web directory by the late '90s, but this was something it tacked on much later.)
By the late '90s, the standard narrative goes, the web got too big to index websites manually.
Google promised the world its algorithms would weed out the spam automatically.
And for a time, it worked.
But then SEO and SEM became a multi-billion-dollar industry. The spambots proliferated. Google itself began promoting its own content and advertisers above search results.
And now with LLMs, the industrial-scale spamming of the web is likely to grow exponentially.
My question is, if a lot of the web is turning to crap, do we even want to search the entire web anymore?
Do we really want to search every single website on the web?
Or just those that aren't filled with LLM-generated SEO spam?
Or just those that don't feature 200 tracking scripts, and passive-aggressive privacy warnings, and paywalls, and popovers, and newsletters, and increasingly obnoxious banner ads, and dark patterns to prevent you cancelling your "free trial" subscription?
At some point, does it become more desirable to go back to search engines that only crawl pages on human-curated lists of trustworthy, quality websites?
And is it time to begin considering what a modern version of those early web directories might look like?
Social networks are fluid. They come, they go. For commercial social networks, success is defined by: "do they earn enough money to make investors happy?" There’s no metric of success for non-commercial ones. They simply exist as long as at least two users are using them to communicate.
(..)
The lesson is simple: you are living in a small niche. We all do. Your experience is not representative of anything but your own. And it’s fine.
The sad state of my quest for a Usenet NNTP GUI client for Linux.
Pan is awesome, but the binaries in my Debian Bullseye-based distro, Crostini, are ancient and buggy. The Pan project distributes no .deb or other packages, and building from source requires recent versions of tools not in Bullseye.
Very few other GUI options are available. Even fewer with .deb or other binaries.
Here, the authors used a prompt instructing ChatGPT to repeat a word forever, which eventually made the model emit different text that can be linked back to its training data; example words include "company" and "poem".
No doubt this "attack" (in the authors' words) will soon be blocked, but who knows what other prompt formulations trigger the same behaviour?
Important lesson: your hardware configuration matters, also in the cloud. Choosing a high quality machine with sufficient local storage makes a difference.
Remarkable: only 2 competitors in the benchmark can complete the join query over 50 GB of data.
#DuckDB's investments in improving their external-memory algorithms pay off: advanced group-by query #5 is more than an order of magnitude faster than anyone else.
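The general idea behind such external-memory (out-of-core) group-by algorithms can be sketched in miniature. This is an illustrative toy in plain Python, not DuckDB's actual implementation: rows are first hash-partitioned to disk by key, then each partition is aggregated independently, so peak memory is bounded by the largest partition rather than by the whole input.

```python
import csv
import os
import tempfile
from collections import defaultdict

def external_group_by_sum(rows, num_partitions=8):
    """Toy two-phase external-memory group-by-sum over (key, value) pairs.

    Phase 1 spills each row to one of `num_partitions` on-disk files,
    chosen by hashing the key; phase 2 aggregates one partition at a
    time, so no in-memory hash table ever has to hold every group.
    """
    tmpdir = tempfile.mkdtemp()
    # Phase 1: hash-partition rows to disk.
    files = [open(os.path.join(tmpdir, f"part{i}.csv"), "w", newline="")
             for i in range(num_partitions)]
    writers = [csv.writer(f) for f in files]
    for key, value in rows:
        writers[hash(key) % num_partitions].writerow([key, value])
    for f in files:
        f.close()
    # Phase 2: aggregate each partition independently. All rows with the
    # same key landed in the same partition, so results never overlap.
    result = {}
    for i in range(num_partitions):
        sums = defaultdict(float)
        with open(os.path.join(tmpdir, f"part{i}.csv"), newline="") as f:
            for key, value in csv.reader(f):
                sums[key] += float(value)
        result.update(sums)
    return result
```

Real engines like DuckDB do this with far more sophistication (vectorized hashing, adaptive spilling, parallelism), but the partition-then-aggregate structure is the core of why large group-bys can exceed RAM.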
Pretty alarming NYT article on GM’s Cruise self-driving cars: insufficient prioritization of safety, even though
> “vehicles were supported by a vast operations staff, with 1.5 workers per vehicle. The workers intervened to assist the company’s vehicles every 2.5 to five miles”
@loke@avandeursen imagine a conventional taxi company, with many employees as drivers. There, one driver sits behind the wheel of each car. The "self-driving" car company with remote-controlled cars apparently needs 1.5 workers per vehicle? I'd say that is 50% more expensive.
(Seems safe to assume both companies have similar other costs/overheads?)
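The back-of-the-envelope comparison above, spelled out (the 1.5 figure is the NYT number quoted in the thread; "all other costs comparable" is the post's hypothesis, not an established fact):

```python
# Labor per vehicle, relative to a conventional taxi.
taxi_drivers_per_vehicle = 1.0     # conventional taxi: one driver per car
cruise_workers_per_vehicle = 1.5   # NYT figure: 1.5 support workers per vehicle

extra_labor = (cruise_workers_per_vehicle - taxi_drivers_per_vehicle) / taxi_drivers_per_vehicle
print(f"{extra_labor:.0%} more labor per vehicle")  # prints "50% more labor per vehicle"
```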
what git jargon do you find confusing? thinking of writing a blog post that explains some of git's weirder terminology: "detached HEAD state", "fast-forward", "index/staging area/staged", "ahead of 'origin/main' by 1 commit", etc
(really only looking for terms that you personally find confusing, not terms that you think someone else might be confused about)
I installed a locally hosted LLM using @simon's excellent llm (https://github.com/simonw/llm) tool. It's kind of wild that I just...have this power on my laptop?
The internet was meant to be free. Yet, it no longer is: a few powerful commercial players ("Big Tech") control what we find when we search the internet.
On 29 September, people and organisations join forces in activities to restore internet search to what it should be: diverse, open and transparent.
Some users do not like how these accounts flood their home feed. Here is a suggestion. You can now remove the contents of a List from your home feed. Create a new list, add the curated account, and go to settings (slider bar icon at top right) and toggle "Hide these posts from home". Your home feed will no longer receive boosted posts from the curated account, but you can still view them by clicking on the list.