@smach@masto.machlis.com
smach

Director, editorial data & analytics at Foundry (an IDG company). Author of Practical R for Mass Communication & Journalism. A bit obsessed with both #RStats and generative AI. Learning #Python.

Write about R & #GenAI for #InfoWorld. She.

First joined Mastodon (at Fosstodon) on Oct 27, 2022.

Website: https://www.machlis.com
Apps: https://apps.machlis.com

Other interests: #Photography #DigitialDarkroom #Running #Bicycling #Crochet Learning #ASL

Was @sharon000 on Twitter (not currently active)

smach, to ai

“AI Use Cases for R Enthusiasts” - Upcoming Workshop for Ukraine this Thursday, May 9, noon ET/6 pm CET/9 am PT with Dr. Albert Rapp (@rappa753, but not too active here). Donate 20 euro/20 USD for this live 2-hour session or, if you can’t make it, for access to recordings & materials.
Rapp says you'll leave with “fresh ideas and practical strategies for using AI.”
https://sites.google.com/view/dariia-mykhailyshyna/main/r-workshops-for-ukraine
@rstats

smach, to python

Ari Lamstein says his "Visualizing the Impact of Covid-19 on US Counties" blog post may be of interest if you want to "learn how to build data apps in Python, as the entire project is released under a permissive license (MIT), and is publicly available on GitHub."
Post: https://arilamstein.com/blog/2024/05/04/visualizing-the-impact-of-covid-19-on-us-counties/
Streamlit app: https://census-explorer.streamlit.app/
GitHub repo: https://github.com/arilamstein/censusdis-streamlit/tree/main

@python

smach,

@failedLyndonLaRouchite @python It's definitely not a new discovery. I think the code is probably of more interest.

smach, to LLMs

The TinyChart-3B LLM answers questions about data visualizations. It can also generate underlying data from a dataviz and Python code to re-create a similar chart.

Demo on Hugging Face: https://huggingface.co/spaces/mPLUG/TinyChart-3B

Code: https://github.com/X-PLUG/mPLUG-DocOwl/tree/main/TinyChart

Paper: https://arxiv.org/abs/2404.16635 (8 authors from the Alibaba Group and Renmin University of China)

smach,

@hrbrmstr @eliocamp That's what I get for posting before testing it myself beyond the examples (it's been a busy weekend, out-of-town family were visiting). I thought the Alibaba group made it worth sharing. Lesson learned!

smach, to rstats

{tidycensus} 📦 creator Kyle Walker: "Want all 8.13 million US Census blocks available for your project? It's a one-liner in R thanks to the tigris and purrr packages:

us_blocks <- purrr::map_dfr(c(state.abb, "DC"), ~tigris::blocks(state = .x, year = 2023))

Downloading will take time; set options(tigris_use_cache = TRUE) beforehand to build a local cache of block shapefiles that you can access without having to download."

@rstats

smach, to rstats

The {summarytools} 📦 aims to:
“Provide a coherent set of easy-to-use descriptive functions [like] those in commercial statistical software suites such as SAS, SPSS, and Stata
“Offer flexibility in terms of output format & content
“Integrate well with commonly used software & tools for reporting”
Results can be displayed in console or rendered/saved as HTML, plain text, or R Markdown. By Dominic Comtois
https://htmlpreview.github.io/?https://github.com/dcomtois/summarytools/blob/master/doc/introduction.html
@rstats

Posit, to python
@Posit@fosstodon.org

Introducing Shiny for Python templates! 🐍

We're thrilled to offer opinionated boilerplate Shiny Templates code that allows you to hit the ground running, whether you need a quick, simple app or a quick, complicated one.

• Read more on the Shiny blog: https://shiny.posit.co/blog/posts/introducing-shiny-templates/
• Watch Carson Sievert quickly get an app running locally with Shiny Templates on YouTube: https://www.youtube.com/watch?v=zQJiMRStYE8

smach,

@Posit I'd love to see a chatbot template in that Shiny Python template gallery.

smach, to rstats

File import/export in R is simple and elegant with the {rio} #rstats 📦. It uses just 2 main functions for dozens of file types: import() and export(). Whether .zip, .xlsx, Google sheets, json, .rds, .csv or more, rio handles file-extension checks and selecting the right functions.
http://gesistsa.github.io/rio/
There's also a convert() function.
One of my favorite R packages!
By Thomas J. Leeper, Chung-hong Chan, David Schoch & Jason Becker
@rstats
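
For readers who haven't tried it, the pattern described above might look like the following minimal sketch. It assumes the rio package is installed; the file paths are hypothetical temp files made up for illustration, and the built-in mtcars data set stands in for real data.

```r
# Assumes rio is installed: install.packages("rio")
library(rio)

# Hypothetical file paths in a temporary directory, for illustration only
csv_path <- file.path(tempdir(), "cars.csv")
rds_path <- file.path(tempdir(), "cars.rds")

# export() chooses the writer from the file extension
export(head(mtcars), csv_path)

# import() chooses the reader the same way
cars_csv <- import(csv_path)

# convert() reads one format and writes another in a single call
convert(csv_path, rds_path)
cars_rds <- import(rds_path)
```

The same calls should work for other supported extensions, though some formats may need extra packages installed.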

smach,

@schochastics @jmcastagnetto @rstats @chainsawriot I hadn’t considered the risk that values might be read incorrectly when importing a CSV file, beyond the obvious “my dates are numbers!” or “this row doesn’t have the correct number of columns” kinds of issues, which are usually easy to spot or check for. That's a problem!

smach,

@schochastics @jmcastagnetto @rstats @chainsawriot In my R book for journalists (Practical R for Mass Communication and Journalism), I decided to start off with "here are some super cool things you can do with very little code!" to get people enthusiastic and engaged. In general, I like to start with well formatted data, making very clear to people that this is unlikely to be what they encounter in the real world. I like to ease into it, but I 💯 admit that other approaches are valid 2/2

smach, to rstats

The {styler} 📦 “formats your code according to the tidyverse style guide (or your custom style guide) so you can direct your attention to the content of your code. It helps to keep the coding style consistent across projects and facilitate collaboration.” By Lorenz Walther & Kirill Müller

https://styler.r-lib.org/

@rstats
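
As a rough illustration of what that looks like in practice, here is a minimal sketch that assumes the styler package is installed. The messy input string is made up for the example.

```r
# Assumes styler is installed: install.packages("styler")
library(styler)

# A deliberately messy, made-up line of code
messy <- "x=c(1,2 ,3)"

# style_text() returns the restyled code (tidyverse style by default)
styled <- style_text(messy)
print(styled)
```

style_text() works on strings; for scripts on disk, the package also offers style_file().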

smach, to LLMs

“But this doesn’t save any time!” 3 useful questions when trying :

  • Is there another way to get results I want? Don't give up right away.
  • Does AI make this task less or more annoying? Sometimes supervising drudge work feels better even if it's not faster; other times you'd still rather do it yourself.
  • Are results likely to improve as LLMs get better? If so, add a calendar reminder to try again in a few months. Or, keep a list of things you want to re-try post GPT-5 class models.

smach, to rstats

“Another base R gem: toString()” - @josi

@rstats
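
A quick sketch of what toString() does, using a made-up vector; it needs only base R.

```r
# toString() collapses a vector into one comma-separated string
langs <- c("R", "Python", "SQL")
joined <- toString(langs)
print(joined)  # "R, Python, SQL"

# Equivalent base R call using paste()
same <- paste(langs, collapse = ", ")
```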

smach, to ai

“Star Trek's Holodeck is no longer just science fiction. Using AI, engineers have created a tool that can generate 3D environments, prompted by everyday language.” This was designed to train robots, not entertain us humans. But Star Trek fans can easily envision other uses.

https://www.sciencedaily.com/releases/2024/04/240411130301.htm

smach, to random

From {tidycensus} creator Kyle Walker: “My webinar Analyzing 2020 Decennial US Census Data in R is now on YouTube!
In the 3-hour webinar, you'll learn about:

📈 Available datasets in the 2020 US Census, and how to access and use them in R;
📈 How to explore decennial US Census data with tidyverse tools;
📈 Using interactive maps to explore US Census data;
📈 Advanced topics like working with detailed DHC-A data and analyzing change over time” 1/2

https://youtu.be/JQRS5wYtPlY?si=ex4SHu7Xm3BmxdTM

smach,

From {tidycensus} creator Kyle Walker cont.: “All the webinar resources are available on GitHub here, along with a @Posit Cloud link where you can run the code without installing R yourselves:” 2/2

https://github.com/walkerke/umich-workshop-2024

smach, to random

I'm not sure why Claude's Haiku LLM started responding in Spanish to questions in English in a RAG application I'm building when all of the context was in English. Odd.

smach, to random

NASA has a useful interactive map with solar eclipse time & totality info by zip code if you’re in the US and looking for info

https://eclipse-explorer.smce.nasa.gov/

mekkaokereke, to random
@mekkaokereke@hachyderm.io

H/T to Kelsey Trainor, for pointing out that the women's college basketball finals today will get north of 14 million viewers... but will still not be aired in a primetime viewing slot.

One of the most watched sports events of the past decade! More than NBA finals. More than MLB world series. More than NHL Stanley cup finals. And they can't get a primetime slot?

This didn't just suddenly happen. The audience has been building for years. Last year was a huge indicator.

smach,

@CLMilne @mekkaokereke I'm torn about it. I'm ticked off that the networks didn't think the women's NCAA FINAL was worth prime time. Of course it was. On the other side, though, I'm happy that so many young girls who might not have been able to stay up past 11 on a school night were able to watch it in the afternoon. Not the reason for the timing, but a happy byproduct.

underdarkGIS, to llm
@underdarkGIS@fosstodon.org

🤔 Where do we stand re ? Still no magic bullet to solve our problems?

I must be missing something.

Any pointers, ?

hrbrmstr, to random
@hrbrmstr@mastodon.social

Bonus Drop (2024-03-24): Method To The Madness

Today's Drop discusses 3 resources, madonctl, csvlens, and DuckDB, and how they can be combined for data FUN. madonctl is a Mastodon client for the terminal, csvlens is a CLI tool for viewing CSV files, and DuckDB is a powerful database system that is quite magical.

http://dailydrop.hrbrmstr.dev/2024/03/24/bonus-drop-46-2024-03-24-method-to-the-madness/

smach,

@jmcastagnetto @hrbrmstr csvkit is another you might find interesting to play with 😀
https://csvkit.readthedocs.io/en/latest/

smach, to rstats

The {packageRank} 📦 “helps put package download counts into context. It does so via two core functions, cranDownloads() and packageRank(), a set of filters that reduce download count inflation, and a host of other assorted functions.” By Peter Li.

https://github.com/lindbrook/packageRank

@rstats

smach, to ai

I doubt it's coincidence that “GPT-5 is on the way!” news cropped up after some key industry analysts praised Anthropic's Claude Opus as better than GPT-4. Large language models at this scale may be new, but tech vendor strategies are not.

smach, to rstats

The {tidyHeatmap} 📦 “introduces tidy principles to the creation of information-rich heatmaps.”
“For plotting, you simply pipe the input data frame into heatmap, specifying:
The rows, cols relative column names (mandatory)
The value column name (mandatory)
The annotations column name(s)”
By Stefano Mangiola
https://github.com/stemangiola/tidyHeatmap
@rstats
