stefan, to mastodon
@stefan@stefanbohacek.online avatar

Does anyone have Elasticsearch enabled on their single-user instance? What specs does this require?

vyr, to random

here's the final iteration of my Mastodon advanced search patch: https://github.com/VyrCossont/mastodon/pull/8

this enables full-text search for posts you haven't interacted with, as well as full-text search for accounts, and includes several advanced filtering operators and parser fixes.

michael, to random
@michael@thms.uk avatar

I've noticed a lot of chatter about setting up Elasticsearch for Mastodon 4.2's new full text search over the last few days, including what hardware is required, how difficult it is, etc.

So I thought I’d write down my experience, including the hardware I'm running Elasticsearch on for my single user instance:

https://blog.thms.uk/2023/09/mastodon-elasticsearch?utm_source=mastodon

kikobar, to mastodon
@kikobar@acc4e.com avatar

I tried to enable Elasticsearch on my solo instance, and it triggered a huge workload to index around 17 million documents.

This process would have taken several days to complete with the current resources allocated to the instance.

I opted for terminating the process and disabling Elasticsearch, but I'd like to hear other experiences:

  • Is this only temporary behavior (i.e., does it get easier on the server after the initial indexing)?

  • Is there any periodic 'cleanup' process? My server ingests many GB of toots every day; will I end up with a huge Elasticsearch database?

  • Am I missing something valuable by not implementing Elasticsearch?

Looking forward to hearing your thoughts.

MagicLike, (edited ) to mastodon
@MagicLike@mstdn.social avatar

uhhm how do I find

"./bin/elasticsearch"

Edit: Fixed, see https://tech.lgbt/@wakame/111722476567184109

:boost_requested:

@askfedi

FoW, to random Korean
@FoW@netsphere.one avatar

I came to check out an Elastic engineers' event.

fell, to random
@fell@ma.fellr.net avatar

I need to ask about stuff: Ever since the v4.2.5 update my Elasticsearch is broken. It just says:

status: 503 org.elasticsearch.cluster.block.ClusterBlockException: blocked by: [SERVICE_UNAVAILABLE/1/state not recovered / initialized]  

I already deleted the data directory, but when I tried to redeploy search according to the documentation, it gave me the same error message.

Any ideas what could've gone wrong?
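
A first diagnostic step for an error like this is often to ask Elasticsearch itself for its cluster health. The sketch below is only an illustration, not a fix for this specific problem: it assumes Elasticsearch listens on http://localhost:9200 without TLS or authentication (newer releases enable both by default), and the function names are made up for this example. `_cluster/health` is the standard health endpoint, and `status` and `unassigned_shards` are fields of its JSON response.

```python
import json
from urllib.error import HTTPError
from urllib.request import urlopen


def summarize_health(body: str) -> str:
    """Turn a _cluster/health JSON body into a one-line summary."""
    health = json.loads(body)
    return f"{health['status']}: {health['unassigned_shards']} unassigned shards"


def check_cluster(base_url: str = "http://localhost:9200") -> str:
    """Fetch cluster health; report HTTP or connection errors as text."""
    try:
        with urlopen(f"{base_url}/_cluster/health", timeout=5) as resp:
            return summarize_health(resp.read().decode("utf-8"))
    except HTTPError as err:
        # A cluster that has not recovered its state may answer with 503 here too.
        return f"HTTP {err.code} from Elasticsearch"
    except OSError as err:
        # Covers refused connections, DNS failures, and timeouts.
        return f"could not connect: {err}"
```

Running `print(check_cluster())` on the server shows whether the node answers at all. A 503 from this endpoint as well would suggest the node never finished starting, in which case the Elasticsearch log is the next place to look.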

fell, to mastodon
@fell@ma.fellr.net avatar

elasticsearch.service: A process of this unit has been killed by the OOM killer.

Seriously, who designed this?! God I wish Mastodon would use something else for search.

And yes, I know I can configure memory limits, but I shouldn't have to.

mgorny, to random

Gentoo mailing list archives have been broken for almost 3 months now. While marc.info is advocated as a stop-gap solution, it doesn't cover all our mailing lists (I don't think any of the third-party archives do). We really need someone to fix this.

https://infra-status.gentoo.org/notice/20230404-archives

toni, to random
@toni@pancake.place avatar

Very much interested in deploying Mastodon 4.2 full text search functionality on my small instance.

Does anyone know from experience how many additional resources it takes? Is it better to dedicate a separate VM to it?

rolle, to random
@rolle@mementomori.social avatar

Just enabled the default search on my instance. Needs 16GB of RAM but it's totally worth it.

BeAware, to mastodon
@BeAware@social.beaware.live avatar

Hmmm 🤔 memory leak or I just don't understand how things work. It seemingly can handle all the jobs quite easily, yet after a couple of days of my instance running without restarting, RAM usage grows quite high: 80% with Elasticsearch enabled on a 16 GB machine. Is that normal, or does it mean there's a memory leak somewhere? Maybe I should spend a week figuring out how to run another instance and load balance, but I feel this shouldn't be necessary for a single user instance...🤷🏻‍♂️

rimu, to PostgreSQL
@rimu@mastodon.nzoss.nz avatar

Added full-text search to PieFed today.

After reading a massive tome about Elasticsearch earlier this week I realised it was complete overkill and just used the full-text capabilities of PostgreSQL instead.

Currently PieFed has 46,000 posts and results are fast. It'll be interesting to see how well it copes when there are more posts. Anyone want to make a guess when it'll bog down?

https://piefed.social/post/48889

Marcus, to mastodon
@Marcus@k8s.social avatar

Will the 4.2 update now require Elasticsearch, or will it still be an optional feature?

taylorparizo, to homelab

I'm planning to write an updated homelab guide on my blog this year but I think I'm about to rebuild some parts for a new purpose 😅​

It might be time to try out OpenCTI given what I do in my lab should be representative of what I do during < dayjob >. That also means I need to tear down Wazuh and configure an ELK stack instead (resource constraint).

wood, to mastodon
@wood@federated.press avatar

Masto admins:

I'm currently running the Elasticsearch update after upgrading to Mastodon 4.2.0, and it was running really fast until it got to "PublicStatusesIndex" — now it's still importing documents, but it's really slow. It says 406 docs/s, w/ 54M to go, and the ETA keeps getting longer. Anyone else experience this?
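
For a rough sense of scale, the figures quoted in this post can be turned into a time estimate. The sketch below assumes the rate stays constant at 406 docs/s, which the post says it does not, so the result is better read as a lower bound. The function name is made up for illustration.

```python
def eta_hours(remaining_docs: int, docs_per_second: float) -> float:
    """Hours needed to index remaining_docs at a constant rate."""
    if docs_per_second <= 0:
        raise ValueError("docs_per_second must be positive")
    return remaining_docs / docs_per_second / 3600


# Figures quoted in the post: 54M documents left at 406 docs/s.
hours = eta_hours(54_000_000, 406)
print(f"{hours:.1f} hours, about {hours / 24:.1f} days")
```

At that rate the remaining import works out to roughly a day and a half, and longer if the rate keeps dropping.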

FoW, (edited ) to random Korean
@FoW@netsphere.one avatar

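Mastodon 4.2.0 beta3?
It includes code changes related to the search improvements. Anyone who patched the code to use Nori will need to compare the changes and merge them.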

PCzanik, to random
@PCzanik@fosstodon.org avatar

Why use a http()-based destination in syslog-ng? Because you gain extreme performance and an architecture that is easier to maintain.

https://www.syslog-ng.com/community/b/blog/posts/why-use-a-http--based-destination-in-syslog-ng

Just think of , / @OpenSearchProject , & Co.

mackuba, to random
@mackuba@martianbase.net avatar

Q for people who have used Elasticsearch, esp. for a Mastodon instance: how should I configure it to use less memory (while still having enough)? Right now it seems to eat as much as it wants (~4 GB)...

It's a single-user instance, so the total data size it reports is 40 MB now.
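
One common way to cap Elasticsearch's memory is to pin the JVM heap with the `-Xms` and `-Xmx` options, set to the same value. The sketch below only renders the text of such an options file. It assumes a reasonably recent Elasticsearch that reads extra JVM options from files ending in `.options` under `config/jvm.options.d/`; the file name `heap.options` and the 512 MB figure are illustrative assumptions, not recommendations for any particular instance.

```python
def heap_options(heap_mb: int) -> str:
    """Render JVM options that pin the minimum and maximum heap to heap_mb."""
    if heap_mb <= 0:
        raise ValueError("heap_mb must be positive")
    # Equal -Xms and -Xmx stop the heap from being resized at runtime.
    return f"-Xms{heap_mb}m\n-Xmx{heap_mb}m\n"


# Hypothetical target: config/jvm.options.d/heap.options
print(heap_options(512), end="")
```

The heap is only part of the process's footprint, so total memory use will still sit somewhat above the configured value.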

chris, to mastodon
@chris@mstdn.chrisalemany.ca avatar

Can confirm. The new full-text search works great, and is awesome! I just searched for "footiMac", which I know only I use, and it returned results very, very quickly. I can go way back, including to its very first mention back in January!

This ability is so very important for the usability and attractiveness of Mastodon!

https://mstdn.chrisalemany.ca/@chris/109705623945932203

chris, to random
@chris@mstdn.chrisalemany.ca avatar

So I'm deploying Elasticsearch on my server right now. It's importing the "accountsIndex". It says ###/561466. Does that mean 561,466 accounts have interacted with my server in some way? If so, that's pretty wild. But also, if Mastodon ever got big... that number would likely go up exponentially and my little server would?? 🔥🤯🤪

Chocobozzz, to fediverse
@Chocobozzz@framapiaf.org avatar

Sepia Search is your companion for finding videos, playlists and channels from the PeerTube federation.

We're currently testing Meilisearch in place of Elasticsearch to index the data.

As it's difficult to know if the results are relevant in all languages, we're looking for public feedback!

So don't hesitate to test this new engine on https://meilisearch.sepiasearch.org/ and let us know what you think :)

Natanox, to random
@Natanox@chaos.social avatar

Does Elasticsearch have to be reachable from the internet, or just the service it's used with (e.g. Mastodon, Nextcloud)?

wood, to ruby
@wood@hachyderm.io avatar

Great to see more people catching on to ClickHouseDB. We’re using ClickHouse at
@honeybadger to power our upcoming logging/observability tool (Honeybadger Insights).

We’re also benchmarking a replacement backend for Elasticsearch. Looks like quite a performance gain so far!

Will hopefully have more to share soon, but in the meantime we discussed this on the latest episode of @FounderQuest. Give it a listen:

https://share.transistor.fm/s/d67618cc?t=15m15s

hywan, to random
@hywan@fosstodon.org avatar

ParadeDB, https://www.paradedb.com/.

It’s a modern ElasticSearch alternative built on Postgres and Tantivy, in Rust.
