An interesting concurrency operation that @tavianator mentioned is "acquire a lock and immediately release it". I've seen this in another context before.
I sort of wonder whether this is trying to be some other serialization operation, or what the simplest version of this is.
If you find yourself acquiring a lock just because the condition variable API requires you to have a lock, and not using it anywhere else, you should probably reconsider what you're doing.
When people advertise software as "written in X", I usually take that as a slightly negative signal (you don't have something more interesting than the language you used?). The main exception is Go, where it means there's a good chance they ship a simple static binary.
@shachaf Software or libraries? For libraries I do care. If they're written in C, they can be consumed from any language with relatively little effort. If they're written in X and I'm also using X, likewise.
I liked @ptrschmdtnlsn's explanation for what's going on with Fibonacci vs. Galois LFSRs: Say you want to compute Fibonacci numbers (or some other linear recurrence). You have an infinite array of mutable cells, F[i], zero-initialized except F[0] = F[1] = 1
Apparently even asm(""); is treated as some sort of compiler barrier by clang, causing it to spill registers to memory. https://godbolt.org/z/G3c778WqE
Is there a good overview reference for concurrent memory reclamation -- potential issues (ABA, use-after-free), and approaches people use (GC, epochs, hazard pointers, RCU-style read locks, etc.) and tradeoffs between them?
@shachaf yeah it's weird. There is power in making a language etc. that makes maximum sense to the hardware and then forcing everyone into that paradigm. There is also power in writing custom functionality for your specific need. Folks tout the efficiency of databases, but you will be hard-pressed to find a game that uses one for data or assets (outside of MMOs), and there's a real reason for that regarding efficiency.
In practice, is it reasonable to sometimes do an atomic exchange/store on 16 bits of a 32-bit value, and sometimes do CAS on the whole value? I assume it's not incorrect, but will the store buffer get mad at me and use some awful slow path? What about if the former case is rare?
@shachaf The trace of a rank 1 operator f = u v^T is Tr(u v^T) = v^T u. Since v^T is the "input part" and u is the "output part" of f this provides another way to think about how the trace connects the output of an operator back to its own input. Here I'm just using v^T as "abstract matrix notation" for a linear functional, so no inner product or basis is assumed.
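Spelling out the one-line computation in coordinates (the post deliberately avoids picking a basis, but the result $v^{\mathsf T} u$ is basis-independent, so any basis gives the same answer):

```latex
\operatorname{Tr}(u v^{\mathsf T})
  = \sum_i (u v^{\mathsf T})_{ii}
  = \sum_i u_i v_i
  = v^{\mathsf T} u
```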
@shachaf You can define F(n) as the number of tilings of a 1xn board (n-tilings) with 1x1 tiles (1-tiles) and 1x2 tiles (2-tiles). Then an (m+n)-tiling comes in two cases, depending on whether a 2-tile bridges the m part and the n part. If there's a 2-tile bridge, then there's an (m-1)-length remainder in the m part and an (n-1)-length remainder in the n part, hence the F(m-1) F(n-1) term. Otherwise it's a clean split and you get the F(m) F(n) term.