klmr

@klmr@mastodon.social

Bioinformatician & software engineer
#genomics #bioinformatics #fair #code #rstats #cpp #python (he/him)

This profile is from a federated server and may be incomplete. Browse more on the original instance.

klmr, 4 days ago to random

I practice the Boy Scout Rule of programming to manage technical debt: “always leave the area of code you are working on a bit cleaner than you found it”.

But unfortunately this conflicts massively (!) with small, atomic branches/merge requests. How do other teams manage this?

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ gaborcsardi

hrbrmstr, 17 days ago to random
First, I declare @klmr to officially be a tier-1 cybersecurity professional (Konrad was/is already a brilliant human).

Second 🚨 ALL #RStats R DATA FILES YOU DO NOT GENERATE / CONTROL SHOULD BE CONSIDERED TOXIC SUBSTANCES 🚨

The "fix" in R 4.4.0 for the "CVE” (it should not have been a CVE) is woefully insufficient.

I highly suggest running
$ gzip -cdS rda FILENAME.rda | strings  
from the terminal on any R data files you do not generate/control before loading them.
reply

expand (5)

collapse (5)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ Binder, DataAngler

klmr, 17 days ago

@hrbrmstr 😊

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

brodriguesco, 18 days ago to random

A vulnerability in #Rstats has been discovered https://nvd.nist.gov/vuln/detail/CVE-2024-27322

reply

expand (24)

collapse (24)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ urswilke, Mehrad

klmr, 18 days ago

@brodriguesco (how) is this fixed in R 4.4? I don’t find anything relevant in the news.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

klmr, 18 days ago

@joranelias @Lluis_Revilla @brodriguesco Great find, that will be it.

… unfortunately it’s still trivial to perform arbitrary code execution upon deserialisation even in R 4.4 😠

Now I need to find out how to disclose this. I’m not even sure responsible disclosure makes sense here since I’m sure others will either have found this already or will very soon find it.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

klmr, 18 days ago

@Lluis_Revilla Thanks, that’s what I was missing. I’ll see if I can find my old Bugzilla account info.

(As mentioned in another comment I disagree that deserialisation code execution bugs are “bogus CVEs” @bagder is rightly complaining about!]. In fact, they are amongst the most-exploited vulnerabilities.)

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

klmr, 18 days ago

@joranelias @Lluis_Revilla @brodriguesco … it’s not going great. 😟 #rstats (details filed on Bugzilla)

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

klmr, 17 days ago

@joranelias @Lluis_Revilla @brodriguesco (I completely forgot to mention that the report was created together with @idavydov)

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

klmr, 17 days ago

@hrbrmstr @joranelias @Lluis_Revilla @brodriguesco @idavydov Right, it’s as much “expected behaviour” as in CVE-2024-27322, and as in other serialisation engines (e.g. Python pickle, .net BinaryFormatter, etc.). Which are all systems that are very hard to use correctly, and cause frequent direct vulnerabilities. Whether that makes the serialisation frameworks themselves a vulnerability… 🤷

(I did not register a CVE; for me this is an issue of awareness and documentation.)

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

klmr, 17 days ago

@hrbrmstr Yeah, it’s private.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

klmr, 1 month ago to stackoverflow

Huh. Apparently #StackOverflow staff/moderators can and do stealth-edit user comments now. Who the fuck thought this was OK?!

That’s a huge breach of trust. Maybe it’s time to stop using the website entirely.

(“stealth edit” = edit without raising a notification to the user, and without making it visible that the comment was edited by anybody else.)

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ gracicot

mjg59, 1 month ago to random

Yo I've got a PhD in genetics from Cambridge and on the off-chance you need it I give you permission to say that Dawkins is a hack

reply

expand (14)

collapse (14)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ Craigp, janeadams, MrAptronym, GhostOnTheHalfShell +11 more

klmr, 1 month ago

@FSMaxB As a geneticist I disagree about the books not contributing to the field: they changed the thinking around the functional unit of inheritance for a lot of geneticists. I consider them highly influential. The individual ideas were not his own but the way he expressed them and combined them in The Extended Phenotype was original and important — not just for popularisation but for science itself.

It goes without saying that this is regardless of his current behaviour.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ BoydStephenSmithJr

gvwilson, 1 month ago to random

Should I teach bash, fish, or Nushell to data scientists who want to go beyond the basics of shell scripting? There seems to be a clear spectrum from "ubiquitous but m'gawd" to "this is the future but m'gawd in a different way".

reply

expand (9)

collapse (9)

report

activity

copy /kbin url

copy original url

open original url

Loading...

klmr, 1 month ago

@gvwilson I’d also recommend teaching Bash but otherwise I would lean heavily towards zsh: still POSIX sh compatible but a lot saner than Bash. And it is the default shell on macOS, and very widely available beyond that, and comes with extensive documentation.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

coolbutuseless, 2 months ago to random

Survey: what new bit of syntax would you like in base #RStats?

What should the following bits of syntax do?

===

++ And --

+=

//

?

{{ }}

[[[ ]]]

<<==

(?: X)

reply

expand (9)

collapse (9)

report

activity

copy /kbin url

copy original url

open original url

Loading...

klmr, 2 months ago

@Mehrad @coolbutuseless ‘box’ allows you to do that (but you will need to convert your entire project to using ‘box’, since the purpose of this package is not to merely provide function documentation capabilities but rather to provide a sane module system): https://github.com/klmr/box

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

Sheril, 3 months ago to food

Ever wonder how pickling works? https://youtu.be/gw6YpN2oRog?si=Bv8DW9o6iqcBaW29

Four years ago our PBS team created this fun explainer on all different types of pickles! And it's still one of my favorite episodes. #food #science

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ CultureDesk, Wraithe

klmr, 3 months ago

@Sheril That’s a great video but cucumbers don’t shrink much during brining (and pretty much not at all for pickling with vinegar). Instead, smaller varieties or young cucumbers are used. Large cucumbers stay large.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

gaborcsardi, 3 months ago to random

This is how httr2 and other packages use the base |> pipe in examples, and still support older R, including a clean R CMD check:
https://github.com/tidyverse/purrr/commit/426acdd50424b8cd6029d237c4d4e81d94ec42a6#diff-611496f412cac947be720d17a0ee6d7463221d14731fbc18244756271e8f5189

Ie. you need a configure (+ configure.win) file that creates an Rd macro on older R, that rewrites the examples with |>.
You'll also need Biarch: true in DESCRIPTION.

Clean R CMD check from R 3.6.x to R-4.4.x: https://github.com/r-lib/httr2/actions/runs/7548766508

#rstats

reply

expand (8)

collapse (8)

report

activity

copy /kbin url

copy original url

open original url

Loading...

klmr, 3 months ago

@gaborcsardi I was sorely tempted to do that with the lambda syntax in ‘box’ (which uses many anonymous functions) but making the build process even more complex scared me off. Maybe I’ll reconsider.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

klmr, 3 months ago

@gaborcsardi (My use-case would be more involved since it would have to rewrite the actual package source code, not just the Rd files; but the principle should be similar.)

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

olafurw, 3 months ago to random

I WRITE JOKES IN CAPITALS.
THIS ONE WAS WRITTEN IN OSLO.

reply

expand (7)

collapse (7)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ MichDdev, mishellbaker, uastronomer, littledetritus +39 more

klmr, 3 months ago

@olafurw

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

klmr, 3 months ago

@CuriosityCat Chill, I am not criticising the joke. My reply was itself a joke, based on the (vaguely funny) coincidental juxtaposition of the two posts on my timeline. Ólafur got it.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

klmr, 5 months ago to random

#rstats folks: I am trying to remember why we are using 9000 as the last component in development version numbers, and I am drawing a blank. Why not just use x.y.z.1, x.y.z.2, etc?

Surely it’s the mere presence of that last components which signals an unstable development build, not the magnitude, right? Am I overlooking something?

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ Mehrad

ct_bergstrom, 6 months ago to random

Menopause in chimps: An interesting challenge to the grandmother hypothesis, the idea that menopause, previously documented only in humans and a few cetacean specie, is an adaptation to by which older females help raise their daughters' offspring.

https://www.science.org/doi/10.1126/science.add5473

reply

expand (9)

collapse (9)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ jaztrophysicist

klmr, 6 months ago

@ct_bergstrom Not just chimps but probably most mammals: https://www.cell.com/cell/fulltext/S0092-8674(23)01080-2

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

briandconnelly, 6 months ago to random

This weekend's fun (though niche) programming project: {xdgbasedir}, an #rstats implementation of X Desktop Group Base Directory Specification.https://github.com/briandconnelly/xdgbasedir

reply

expand (3)

collapse (3)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ hrbrmstr

klmr, 6 months ago

@briandconnelly FYI, R has something very similar built in, as tools::R_user_dir()

https://stat.ethz.ch/R-manual/R-devel/library/tools/html/userdir.html

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

baldur, 7 months ago to random

Occasionally I stop to think about how much of the modern software development infrastructure and community is run at a massive loss: Stack Overflow, npm, Github Copilot (probably Github itself), VS Code.

Also how much of it is owned and run by Microsoft.

So much of it could disappear at a short notice if just one CEO changes his mind about his company’s marketing strategy.

reply

expand (37)

collapse (37)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ dgoldsmith, TerryHancock, Taffer, bitprophet +14 more

klmr, 7 months ago

@baldur According to what employees there at the time told me, Stack Overflow was profitable within a few years of launch.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

klmr, 7 months ago to random

#rstats WTF of the day:

reply

expand (10)

collapse (10)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ gaborcsardi

klmr, 7 months ago

@hrbrmstr But it only works with NA_character_, not NA, NA_real_, TRUE or any other reserved names. — And the reason is that NA_character_ (unlike all the others) is a character literal.

» NA = 1
Error in NA = 1 : invalid (do_set) left-hand side to assignment

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

klmr, 7 months ago

@gaborcsardi I think R should not allow string literals in place of names, full stop. This change is even worth breaking a few packages on CRAN, IMHO, because the current behaviour is plain bananas and causes plenty of confusion.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

klmr, 7 months ago

@gaborcsardi Yes, backticks came later. And there’s still some ancient core R code which uses "-quoted names, but that could obviously be fixed when deprecating/removing the syntax. But I don’t think there’s appetite for it.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...