georgios, to random
agata, to random

Insightful thoughts on and analysis workflows from @joelnitta at the {targets} workshop today @unioslo

Lesson under development now in @thecarpentries - check out the full lesson plan in a pre-alpha stage here: https://joelnitta.github.io/targets-workshop/

ERDonnachie, to random

The latest Excel blunder from Austria is a lesson in why we need professional data people.

There's a widespread expectation that anyone can take Excel and use it to do critical things with data.

But data people know that we need the right tools to make data processing verifiable (usually with code) and enable us to check that everything is as it should be (with unit tests or assertions).

And most of all, we need more at every level.

https://www.theregister.com/2023/06/06/austria_election_excel_blunder/

odr_k4tana, to random

I freakin love how evolved to the point where I do not need for-loops anymore.

steffilazerte, to rstats
@steffilazerte@fosstodon.org avatar

I'm brainstorming for @rOpenSci Coworking themes to use in the future.

Some examples of things we've already done...

  • Start Writing that Package!
  • Getting Started with targets!
  • Working with New R Users
  • Setting Up Continuous Integration
  • Checking Data with naniar, visdat, assertr, and skimr!
  • Working with Taxonomic Lists

Any suggestions for future themes? Anything you might like to see?

@rstats

LeafyEricScott, to random
@LeafyEricScott@fosstodon.org avatar

Having learned programming mostly in , I realize that I have a very fuzzy mental model of what "compiling" code even means. Can someone point me to an explanation of what it means to "compile" or "build from source" for semi- or non-experts?

maelle, to random French
@maelle@mastodon.social avatar

New blog post! Three useful (to me) patterns

☑️ utils::modifyList()
☑️ rlang::%||%
☑️ Base R Set Operations

https://masalmon.eu/2023/06/06/basic-patterns/

joranelias, to programming
@joranelias@mastodon.social avatar

Is it weird that one of the main sources of minor friction I experience when experimenting in coming from is the way it prints/formats objects at the command line. I’m sure some of it familiarity, but I find things slower to parse, visually.

juliasilge, to programming
@juliasilge@fosstodon.org avatar

New releases of vetiver 🏺 are out, for both and , and I'm excited to outline some of the new features, including support for deploying a vetiver model to AWS SageMaker:

https://posit.co/blog/vetiver-sagemaker/

johnabs, to random
@johnabs@qoto.org avatar

So, really cool new tool I’ve learning as I’m working with lisp for my startup: symex.el for structural navigation and editing is VERY efficient, somewhat vim-based, and I really like it a lot. As an added bonus, some of its dependencies (e.g. lispy) are very useful for multi-lining s-exps (aka symexs) and formatting them, and it plays nicely with sly, which is even better!

Additionally, for all you people out there (who are asking why I included the tag on a post about Lisp), maybe take a quick look at this. I’m currently using it myself, and I’ve found it’s pretty good for most basic things, and you may like it if you give it a try (or maybe not, it’s not yet as full featured as R and its various packages yet, but it does benefit from some things I don’t think you can get easily from R). Also, here’s a super quick demo thingy(?).

I know other tools leveraging tree-sitter try to achieve similar functionality, but when the code is already in an AST format, it really eliminates the guesswork and makes the experience seamless!

nowosad, to random
@nowosad@fosstodon.org avatar

⚠️ rgdal, rgeos, and maptools won’t be available on CRAN after October 2023. ⚠️

What are the consequences of this change? What do you need to do as a user and as a developer?

Read my blog post at https://geocompx.org/post/2023/rgdal-retirement/

datascience, to random

Want to check the google trends for a topic? Use {gtrendsR} directly from within your favorite language: https://github.com/PMassicotte/gtrendsR @philmassicotte

peter_ellis, to random

I had fun simulating confounder, collider and mediator variables with to re-create a great diagram by Wysocki et al that helps understand and regression modelling strategy: http://freerangestats.info/blog/2023/06/04/causality-sims

smach, to rstats
@smach@fosstodon.org avatar

Video: Intro to Version Control using git and RStudio - presentation by Ryan Johnson, data science advisor at @Posit, at @NHSrCommunity
https://youtu.be/qNMOPWT8jSo

@rstats

joranelias, to random
@joranelias@mastodon.social avatar

I still haven’t seen anything to disprove the best description of R vs Python for data/stats that I’ve seen:

Python is an elegant, well-designed language with a confusing, oddly designed data DSL bolted onto it & R is an elegant, well designed data DSL with a confusing, oddly designed programming language built around it.

jonthegeek, to random
@jonthegeek@fosstodon.org avatar

Is there a web API that you'd love to use with R, if only it weren't so painful? Or perhaps you're using one, but aren't sure if it would make sense as a package. Please let me know here! https://forms.gle/CJz12TzzHkGsnQma9

jonocarroll, to random
@jonocarroll@fosstodon.org avatar

I'm spoilt by making simple things simple. Having to add a dropdown to filter a table in Excel requires a hidden tab and some VBA code. Why, oh, why, can't I just use UNIQUE() in the dropdown???

</rant>

thoughtfulnz, to random
@thoughtfulnz@mastodon.nz avatar

I update some data repos when I run a daily analysis, so I thought I'd share I have a GitHub credential manager installed on my laptop so I don't need to enter passwords, and after the script works out the data needs updating I run a system("cd '/pathTo/data/';git add * ;git commit -m 'data update';git push") command

ojala, to random
@ojala@mastodon.nz avatar

deleted_by_author

  • Loading...
  • datamaps,

    @ojala
    another base solution:

    do.call(rbind, lapply(1:n, (x) data.frame(x, x:n, M[x:n, x])))

    and the data.table translation:

    rbindlist(lapply(1:n, (x) data.table(x, x:n, M[x, x:n])))

    that are actually much faster (unless I botched the benchmark, I havn't slept very well tonight)

    debruine, to random

    The glossary package is now on CRAN!

    Glossary is a lightweight solution for making glossaries in educational materials written in quarto or R Markdown. This package provides functions to link terms in text to their definitions in an external glossary file, as well as create a glossary table of all linked terms at the end of a section.

    https://debruine.github.io/glossary/

    smach, to rstats
    @smach@fosstodon.org avatar

    The {paws} 📦 helps you access more than 150 AWS services in R, including
    Machine Learning
    Translation
    Natural Language Processing
    Databases
    File Storage
    By Dyfan Jones & others
    https://paws-r.github.io/
    @rstats

    robinlovelace, to programming
    @robinlovelace@fosstodon.org avatar

    Just published: new blog post about the jtstats and 📦 s. Also some thoughts on how organisations can improve not only availability but also accessibility of their valuable open data 🚀 https://datasciencecampus.ons.gov.uk/guest-blog-enhancing-open-access-data-analysis-introducing-the-journey-time-statistics-r-and-python-packages/

    DataAngler, to statistics
    @DataAngler@vis.social avatar

    I am increasingly being asked to build structural equation models (in ) and am looking for recs about a textbook for someone who is familiar with linear regression but does not have background in IRT. My director says things like, "constrain the variances to 1" and I'm like, sure...but why?

    gabornyeki, to random

    R libraries implement performance-critical code in C++. But memory bugs in C++ code crash R, such as in this screenshot, even though R is designed to be memory safe.

    fixest is an excellent library but a semi-frequent offender. And I've had this happen with other libraries too.

    As someone who is using R because I am not prepared to debug C++, this can pretty much break a library for me. I hope that eventually Rust can take over C++'s role in

    deboraha, to random
    @deboraha@aus.social avatar

    Hey people! I have a dataset where I’ve asked people to rank a set of 10 items from 1 to 10, and I want to compare those rankings between two groups. Sounds super simple, but I can’t work out what to use! I’ve found the nParLD package, but that’s not quite right as it’s not longitudinal data. Help?

  • All
  • Subscribed
  • Moderated
  • Favorites
  • JUstTest
  • kavyap
  • DreamBathrooms
  • thenastyranch
  • magazineikmin
  • osvaldo12
  • khanakhh
  • Youngstown
  • mdbf
  • slotface
  • rosin
  • everett
  • ngwrru68w68
  • Durango
  • anitta
  • InstantRegret
  • GTA5RPClips
  • cubers
  • ethstaker
  • normalnudes
  • tacticalgear
  • cisconetworking
  • tester
  • Leos
  • modclub
  • megavids
  • provamag3
  • lostlight
  • All magazines