@jonthegeek@fosstodon.org avatar

jonthegeek

@jonthegeek@fosstodon.org

Executive Director of https://fosstodon.org/@DSLC (formerly https://fosstodon.org/@R4DSCommunity). Also run https://fosstodon.org/@DSLCvids & #TidyTuesday.

Interested in (almost) all things #RStats.

Views are mine.

This profile is from a federated server and may be incomplete. Browse more on the original instance.

jonthegeek, to random
@jonthegeek@fosstodon.org avatar

Does anyone know of a good tutorial on working with PDFs in ? I got something working with 📦{pdftools} but it feels super hacky. Is that just the way it is?
I see that I might be able to make it less hacky with 📦{tabulizer}, but I think I'd rather be hacky than deal with 💣{rJava}.

jonthegeek, to random
@jonthegeek@fosstodon.org avatar

🧵1/2
I'm writing a book about working with the web from (tentative title: Web APIs with R), and I need your help! I'm working on a chapter entitled "How can I scrape web pages?" I've identified the learning objectives I want to cover (included as a reply to this toot).
Do those LOs make sense to you?
Is there anything missing from those LOs?
Are there things on that list that don't sound like things you'd want to do?

bkeegan, to random
@bkeegan@hci.social avatar

I will be teaching a new class on Web Data Science focused on scraping data from web pages and APIs.

Recommendations for resources, books, and other resources are welcome!

jonthegeek,
@jonthegeek@fosstodon.org avatar

@bkeegan I'll be about a year to late, but I'm writing the version of a book for this course: https://jonthegeek.github.io/wapir/
I'd love to keep track of what you cover in the class!

jonthegeek, to random
@jonthegeek@fosstodon.org avatar

I'm so angry about this raid in Kansas (https://kansasreflector.com/2023/08/11/police-stage-chilling-raid-on-marion-county-newspaper-seizing-computers-records-and-cellphones/)

Every one of these cops + the judge who signed the warrant should be in jail. This needs to be stopped hard before they all start doing it.

How long til they raid Democratic candidate offices to shut down campaigns, or Democrat-leaning precincts to shut down votes?

coolbutuseless, to random
@coolbutuseless@fosstodon.org avatar

{remedy} AddIns -> AlignEqual()

jonthegeek,
@jonthegeek@fosstodon.org avatar

@scheidegger @mjskay @coolbutuseless It's also an interesting idea for a way to enrich github/r-uni data before training an LLM. I wouldn't be surprised if a smallish project could beat copilot by parsing things more intelligently...

jonthegeek, to random
@jonthegeek@fosstodon.org avatar

Is there a standard (or at least respected) list of form-validation regex patterns out there somewhere (things like email addresses, urls, US ZIP codes)? It feels like a thing that would be nice to standardize, if possible.

lwpembleton, to random
@lwpembleton@genomic.social avatar

🤯 Running into a puzzling paradox with my .qmd file. The code chunk sails smoothly when running manually, but throws an error in a calculation within the chunk when I try to render it. 🧐 Is there a significant contrast in code execution during rendering that could lead to errors within an internal function in the chunk? 🤔

jonthegeek,
@jonthegeek@fosstodon.org avatar

@lwpembleton What is the error?

jonthegeek,
@jonthegeek@fosstodon.org avatar

@lwpembleton my guess was something about paths, but that doesn't seem to be what this is. Hmm.

Cmastication, to random
@Cmastication@mastodon.social avatar

Going to see Lyle Lovett. I figure I should dust off my boots. But I didn’t press my jeans. You know, I’m not from Texas :)

jonthegeek,
@jonthegeek@fosstodon.org avatar

@Cmastication... 🎶 But Texas wants you anyway 🎶

rmhogervorst, to random
@rmhogervorst@fosstodon.org avatar

I just learned about the {httr2} package because I wanted to wrap some api in . What a wonderful package!

jonthegeek,
@jonthegeek@fosstodon.org avatar

@rmhogervorst Depending how far you want to go, you might be interested in https://jonthegeek.github.io/beekeeper/

I have a long way to go, but I'm starting to make progress, and hopefully will have a first working (if limited) version within the next few weeks!

jonthegeek, to random
@jonthegeek@fosstodon.org avatar

@rpodcast for the new anonymous function syntax, "" is the word "function"
function(x) {x + 1}
becomes
(x) {x + 1}

I couldn't see that for a long time and it was so confusing, then suddenly straightforward.

jonthegeek,
@jonthegeek@fosstodon.org avatar

@gaborcsardi @rpodcast realizing that it's exactly function really helped me remember how to construct them, though (where parentheses and braces go). I couldn't remember it and then realized it was exactly a thing I already know, just shorter.

jonthegeek,
@jonthegeek@fosstodon.org avatar

@jonny (x) <- {x + 1} already means "print x then assign x + 1 to it". They wanted something that wasn't already parsable.

jonthegeek, to languagelearning
@jonthegeek@fosstodon.org avatar

My current read: Five Lines of Code by Christian Clausen (https://livebook.manning.com/book/five-lines-of-code). So far it's really good, but it's definitely taking some cognitive load to grok the OOP and translate it to S3 (kinda half functional/half OOP).

andrie, to rust
@andrie@fosstodon.org avatar

CRAN has published a policy for using code in an package. https://cran.r-project.org/web/packages/using_rust.html

jonthegeek,
@jonthegeek@fosstodon.org avatar

@andrie As someone who's been thinking about learning specifically for use in packages (but who has not yet done so and thus isn't sure about what things in this post mean)... how chilling is this? Does this make it really hard to use Rust, slightly hard, or is this just best practice anyway?

HeavenlyPossum, to random
@HeavenlyPossum@kolektiva.social avatar

We’re doing the medieval peasant discourse again. We’re doing it!

Ok. Whenever we talk about “feudalism” or “the Middle Ages” or “medieval” we’re generalizing about millions of people in diverse communities and circumstances that spanned centuries. I’m necessarily going to be making huge generalizations about past societies that paper over important distinctions.

That said, we can still interrogate ways in which a medieval European peasant might have experienced life in ways that weren’t as bad as we popularly imagine them to have been, or might even have been better than comparable experiences people today have.

1/many

jonthegeek,
@jonthegeek@fosstodon.org avatar

@HeavenlyPossum https://en.wikipedia.org/wiki/Life_expectancy?wprov=sfla1
A lot of that is infant mortality, but I think "our babies die far less often" is a way to make the "which is better, really?" decision pretty easy.

jonthegeek, to random
@jonthegeek@fosstodon.org avatar

Now it's "An Ode to Permissionless Data Science" by Hamdan Azhar

jonthegeek,
@jonthegeek@fosstodon.org avatar

It seems like data science is being reduced to a set of tools. Resumes all have boxes full of lists of technologies.

jonthegeek, to random
@jonthegeek@fosstodon.org avatar

Back from the afternoon break we have "CaRtography: Creating Accurate and Beautiful Maps in R" by Ayanthi Gunawardana

jonthegeek,
@jonthegeek@fosstodon.org avatar

Her family is from Sri Lanka (which is often left off maps), so that's part of the inspiration.

jonthegeek,
@jonthegeek@fosstodon.org avatar

Terminology:

  • Geographic Information System (GIS):
    framework for gathering, managing, and analyzing data.
    (https://www.esri.com/en-us/what-is-gis/overview)

  • Cartography:
    the science or practice of drawing maps

  • Shapefile:
    Vector data format for GIS software.

As GIS becomes more open source, other formats are becoming more popular, like Geopackage, GeoJSON, KML

jonthegeek,
@jonthegeek@fosstodon.org avatar

Best practices:

  • who is your audience?
  • always normalize data
  • remember color blindness
    (more, there was a lot on that slide)
hrbrmstr, to random
@hrbrmstr@mastodon.social avatar

Had the privilege of meeting @jonthegeek IRL at the NY R Conference!

jonthegeek,
@jonthegeek@fosstodon.org avatar

@hrbrmstr the pleasure was mine! Great to meet you!

jonthegeek, to random
@jonthegeek@fosstodon.org avatar

Now it's @jaredlander "Building an R Package with LLMs"

jonthegeek,
@jonthegeek@fosstodon.org avatar

When he had it generate tests, the tests didn't work and weren't necessarily good tests. Oops! It had a good start still, though.

jonthegeek,
@jonthegeek@fosstodon.org avatar

Main take-away: chatgpt can be useful to get things done quick, but check it! Jared trusts it more than I do! Remember that chatgpt isn't designed to be correct; it's designed to SEEM correct. There's a huge, dangerous difference.

📦 https://github.com/jaredlander/fileTypeAnalyzer

Chat: https://chat.openai.com/share/1d2b4efd-42d4-483d-b593-11363e395415

  • All
  • Subscribed
  • Moderated
  • Favorites
  • JUstTest
  • Leos
  • DreamBathrooms
  • thenastyranch
  • magazineikmin
  • osvaldo12
  • ethstaker
  • Youngstown
  • mdbf
  • slotface
  • rosin
  • ngwrru68w68
  • kavyap
  • cubers
  • anitta
  • modclub
  • InstantRegret
  • khanakhh
  • GTA5RPClips
  • everett
  • Durango
  • tacticalgear
  • provamag3
  • tester
  • cisconetworking
  • normalnudes
  • megavids
  • lostlight
  • All magazines