📢 Master the Art of List Subsetting in R! 🚀 Or: Lists...again
📝 Lists in R are versatile data structures, capable of holding various elements like vectors, matrices, and even other lists. But what makes them truly magical is the ability to extract specific data efficiently through subsetting. 🎯
Though a great fan of free/libre and open systems, my professional life also revolves around topics like #oracle and #aws - so expect quite some toots on those.
Ariana Mendible is sharing a great talk, "Small Town Police Accountability: A Data Science Toolkit" here at #SciPy2023. Her group has created a great package to help researchers parse the output of FOIA (Freedom Of Information Act) requests, using #OCR, #NLP, and #Python. The library, called SToPA, is available on GitHub: https://github.com/qsideinstitute/SToPA
More on the talk here: https://cfp.scipy.org/2023/talk/AXPZZG/
So is #python the preferred skill set for #datascience nowadays? Is #rstats still an employable skill? I see more an more job ads for data scientists that list python (+ML) as a must but no mention of R...
Before the nostalgia becomes too rose-tinted, remember that everything actually wasn't always honky-dory with Twitter's relationship with the research community.
My blog posts about Twitter's data grant initiatives from
This is a great article by Michael Levinger about the applications of explainable AI for identifying fraud and preventing it. Explainable AI methods help to make black-box models more interpretable and visible. That includes methods such as:
✅ Feature Importance
✅ LIME and SHAP methods
✅ Rule-based Models
✅ Data Visualization
Looking for a recommendation(website,Substack, any other material...) where I can improve my SQL knowledge. I am looking for something that I can read(theory) and practice(exercisea). I really enjoy learning python in Substack but until now I have not found something similar for SQL.
There is no better way to learn a topic than using a real-life example. The Introduction to NFL Analytics with R is a new book by Bradley J.Congelio, focusing on NFL analytics using R, as the name implies. The book covers the following topics:
✅ Introduction to NFL analytics with R
✅ Working with NFL data
✅ Data visualisation applications ❤️
✅ Analysis and modeling of NFL data
The Office of the Chief Statistician at #OMB has an opening for a #Statistician to help oversee government-wide statistical policy development and implementation.
It’s DC-based and GS-12/13 with a pay of at least $94,199. Apps due 7/26 at usajobs.gov/job/736451500.
FreeCodeCamp released a new course, Create a Programming Language and Learn Advanced Python, by Aryaan Hegde yesterday. The course focuses on advanced topics in Python, such as:
✅ Object-oriented programming
✅ Data structure
✅ Recursion
✅ Building algorithms
Mona-openai is a new Python package by mona that enables capturing logs to monitor your OpenAI API usage 🚀. That includes cool features such as:
✅ Hallucination alerts
✅ Tokens usage
✅ Behavioral drifts and anomalies
✅ LangChain support
InternLM is a new open-source LLM that was released today. This LLM is a 7 billion parameter base model, supporting pre-training framework for lightweight training.
Meta released Threads today - a new social media that is going to compete with Twitter (or whatever is left out of it). The app is based on Instagram, enabling smooth transitioning of Instagram's install base to the new app.
#CML in action within #Github Desktop: after pushing changes, the action runs and - once the results are ready - the app gives a notification and displays the generated plots