(I'm taking bets that I can build this based on SQLite if I set my mind to it I learned a lot of creative database stunts at #postgres meetups and now from #clickhouse and I'm telling you folks a database is a framework to be used to build platforms not to just passively sit there in a corner hidden behind a cache databases aren't some mimimi :)
Great to see more people catching on to ClickHouseDB. We’re using ClickHouse at @honeybadger to power our upcoming logging/observability tool (Honeybadger Insights).
We’re also benchmarking a replacement backend for #Elasticsearch. Looks like quite a performance gain so far!
Will hopefully have more to share soon, but in the meantime we discussed this on the latest episode of @FounderQuest. Give it a listen:
Apparently, #MySQL quickly becomes slower once you reach a dozen of million records and search through them. Even with indexes.
So I looked into #Clickhouse. The same query on the same dataset only takes about 1/10 of the time. Impressive! 😯
Next week @cloudhiker will get a serious speed boost!
Until a truly performant (= fast, low memory footprint) two dimensional storage ("table") type (*) emerges, what are the options for managing big data in #perl?
@ChristosArgyrop@Perl Hey Christos, I’m partial to #ClickHouse as you can see from my bio. In my security work I use it for network traffic metrics and observability. I mainly use JavaScript, Python, and Common Lisp to interact with ClickHouse. For R there’s https://github.com/IMSMWU/RClickHouse
Also, feel free to join our Slack where you can ask any and all questions from our community of users https://clickhousedb.slack.com
@julioj@Perl Thanks Julio. Looking at various 2D solutions to utilize and #clickhouse is definitely among those considered for a SQL form of solution (bonus there is a R interface - many analytics stuff are written in R).
Will take a look!
Hey #Clickhouse people, do you have any good talks or articles to share on schema design for generic tagged metrics storage? Most discussion of schema design is around analytics of a specific type of data, but what if you're building, say, a metrics platform?