Submodules are useful, but I hate how chonky they feel in use. I especially hate how submodules need constant care after every sync. Need to run 'git sumo' all the time 🙄
I've been using @datadryad to structure my supplementary materials (including some code) for a manuscript. Things I don't like about DataDryad: (1) flat hierarchy / no organization of uploaded datasets, without a way to sort them beyond the last one you edited - Good luck keeping multiple datasets per projects straight. (2) Flat hierarchy for uploaded files - no folders allowed (3) No introspection into uploaded archive files (like .zip) - good luck remembering what you packed into that file.
Okay, simple finding. GitHub has pretty strict file size quotas: 100 MB. You can use Git LFS to go beyond that, but then you only have ~2GB of LFS storage across your whole Pro GitHub account. So my hope to use Github->Zenodo archiving for any file, is a bit premature for >100 MB files.
We are happy to announce that the call for participation for the first ever distribits meeting is now online at https://distribits.live
This meeting is organized by the folks behind git-annex and DataLad. We aim to bring together enthusiasts of tools and workflows in the domain of distributed data.
We are looking forward to April 2024 and to meeting people from everywhere, online and in Düsseldorf.