#Rprogramming - kbin.social

stevensanderson, 2 days ago to programming

🔍 Quick Guide: Detecting Strings in R

In my latest blog post, I cover how to find specific strings in data columns using the str_detect function from the stringr package and base R functions. You'll see practical examples with both grepl for identifying matches and gregexpr for counting occurrences.

Read more here: https://www.spsanderson.com/steveondata/posts/2024-05-10/ and explore ways to make string detection a breeze in your data work!

#RStats #DataCleaning #R #RProgramming #Programming #Data #Regex
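
A minimal base R sketch of the two base functions named in the post, assuming a made-up data frame `df` with a `fruit` column (both names are hypothetical). With the stringr package installed, `stringr::str_detect(df$fruit, "apple")` should return the same logical vector as the `grepl()` call.

```r
# Hypothetical sample data; the column name `fruit` is made up for illustration.
df <- data.frame(
  fruit = c("apple pie", "banana", "pineapple", "cherry"),
  stringsAsFactors = FALSE
)

# grepl() returns one TRUE/FALSE per element: does the pattern occur at all?
has_apple <- grepl("apple", df$fruit, fixed = TRUE)

# gregexpr() returns the start position of every match in each element
# (or -1 when there is none), so counting positions > 0 counts occurrences.
count_matches <- function(pattern, x) {
  vapply(gregexpr(pattern, x, fixed = TRUE),
         function(m) sum(m > 0),
         integer(1))
}
p_count <- count_matches("p", df$fruit)

print(df[has_apple, , drop = FALSE])
print(p_count)
```

`fixed = TRUE` treats the pattern as a literal string; drop it to match a regular expression instead.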


stevensanderson, 3 days ago to programming

Learn efficient ways to collapse text by group in R! Explore base R's aggregate(), dplyr's group_by() and summarise(), and data.table's grouping. Mastering these techniques enhances data preprocessing skills. Try these examples with your datasets to optimize workflows. Happy coding! 📊💻

#RProgramming #DataAnalysis #R #RStats #Programming #Data

Post: https://www.spsanderson.com/steveondata/posts/2024-05-09/
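
A short sketch of the base R route with aggregate(), assuming a hypothetical data frame with `group` and `text` columns. The dplyr and data.table versions follow the same idea: group, then paste() each group with `collapse`.

```r
# Hypothetical sample data: one text value per row, several rows per group.
df <- data.frame(
  group = c("a", "a", "b", "b", "b"),
  text  = c("red", "blue", "one", "two", "three"),
  stringsAsFactors = FALSE
)

# For each group, paste all text values into one comma-separated string.
collapsed <- aggregate(
  text ~ group,
  data = df,
  FUN  = function(x) paste(x, collapse = ", ")
)

print(collapsed)
```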


stevensanderson, 4 days ago to random

👍 In R, you can easily extract specific columns from a data frame by their numerical positions. For instance, to grab the second column from a data frame df, you can use df[, 2].

🙅‍♂️ You can also exclude columns by using negative indexing, such as df[, -2] to exclude the second column.

Keep exploring and happy coding!

#RProgramming #DataManipulation #DataAnalysis #R #RStats #Coding #Data

Post: https://www.spsanderson.com/steveondata/posts/2024-05-08/
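
A small sketch of positional indexing on a hypothetical three-column data frame. One detail worth knowing: selecting a single column with `df[, 2]` simplifies the result to a plain vector unless `drop = FALSE` is given.

```r
# Hypothetical sample data.
df <- data.frame(
  id    = 1:3,
  name  = c("a", "b", "c"),
  score = c(90, 85, 70),
  stringsAsFactors = FALSE
)

second_vec <- df[, 2]                # second column, simplified to a vector
second_df  <- df[, 2, drop = FALSE]  # second column, kept as a data frame
without_2  <- df[, -2]               # every column except the second

print(second_vec)
print(names(without_2))
```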


stevensanderson, 5 days ago to programming

Counting NAs across columns in #R? Sure, you can do that!

My post today uses #BaseR, #dplyr, and #datatable to accomplish this.

#R #Rstats #RProgramming #Coding #Programming #Data #DataScience

Post: https://www.spsanderson.com/steveondata/posts/2024-05-07/
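
A base R sketch of the idea, assuming a hypothetical data frame with mixed column types. is.na() turns the data frame into a logical matrix, and colSums() counts the TRUE values in each column.

```r
# Hypothetical sample data with missing values in two columns.
df <- data.frame(
  x = c(1, NA, 3, NA),
  y = c("a", "b", NA, "d"),
  z = c(TRUE, FALSE, TRUE, TRUE),
  stringsAsFactors = FALSE
)

# Number of NA values in each column, returned as a named vector.
na_counts <- colSums(is.na(df))

print(na_counts)
```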


stevensanderson, 6 days ago to random

Today I am writing about the AIC functions available in my #R #Package TidyDensity.

There are many of them, with many more on the way. Some of them are a little temperamental, but not to worry, that will all be addressed.

My approach is different from that of fitdistrplus, which is an amazing package. I am trying to remove the need to supply a start list, which fitdistrplus sometimes requires.

Post: https://www.spsanderson.com/steveondata/posts/2024-05-06/

#R #RStats #RProgramming #Statistics #Coding #Data


stevensanderson, 8 days ago to programming

Working on the next release of TidyDensity.

#R #RProgramming #Programming #RStats #Coding


stevensanderson, 8 days ago

@michaelten here you go https://github.com/spsanderson/TidyDensity #R #RStats #rprogramming #Programming #Coding #statistics


stevensanderson, 9 days ago to programming

Want a simple form of #MCMC analysis in #R? Well, I've got you covered.

My #R #Package TidyDensity has a function called tidy_mcmc_sampling() that is pretty straightforward. It takes a raw vector and performs the calculation you give it over a default of 2,000 samples.

I hope you find it useful.

#R #RStats #RProgramming #Programming #Statistics #Sampling

Post: https://www.spsanderson.com/steveondata/posts/2024-05-03/
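
To make the idea concrete, here is a base R sketch of "apply a calculation to a vector over 2,000 samples". It is an illustration only: the helper `resample_stat()` is hypothetical, and resampling with replacement is an assumption about the general approach, not a description of how tidy_mcmc_sampling() is implemented.

```r
set.seed(123)

# Hypothetical helper, not part of TidyDensity: draw `n_sims` resamples of `x`
# (with replacement) and apply `fn` to each resample.
resample_stat <- function(x, fn = mean, n_sims = 2000) {
  vapply(seq_len(n_sims),
         function(i) fn(sample(x, size = length(x), replace = TRUE)),
         numeric(1))
}

x <- rnorm(100, mean = 10, sd = 2)
sim_means <- resample_stat(x, fn = mean)

# Running (cumulative) mean of the simulated statistic.
running_mean <- cumsum(sim_means) / seq_along(sim_means)

print(summary(sim_means))
```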


stevensanderson, 10 days ago to programming

Exciting news for R users! TidyDensity's latest update introduces util_chisquare_param_estimate(), leveraging MLE to estimate Chi-square distribution parameters like dof and ncp.

Generate a dataset with rchisq() and use util_chisquare_param_estimate() to analyze it, even without knowing the underlying distribution. Visualize results with tidy_combined_autoplot().

Try it in your next R project!

Post: https://www.spsanderson.com/steveondata/posts/2024-05-02/

#R #RStats #RProgramming #Programming #Coding #Stats
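
A rough base R sketch of maximum likelihood estimation for the chi-square parameters, using optim() on the negative log-likelihood. This is not the code inside util_chisquare_param_estimate(); the starting values and bounds are assumptions chosen for illustration, and the two parameters can be hard to separate from data alone.

```r
set.seed(42)
x <- rchisq(1000, df = 5)

# Negative log-likelihood of a (possibly non-central) chi-square sample.
# par[1] = degrees of freedom, par[2] = non-centrality parameter.
nll <- function(par, data) {
  -sum(dchisq(data, df = par[1], ncp = par[2], log = TRUE))
}

# Assumed starting point: split the sample mean between df and ncp.
start <- c(df = max(mean(x) - 1, 0.5), ncp = 1)

# Box constraints keep df positive and ncp non-negative.
fit <- optim(start, nll, data = x,
             method = "L-BFGS-B",
             lower = c(0.01, 0))

est <- fit$par
print(est)
```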


geekymalcolm, 10 days ago to random

CERT/CC Reports #RProgramming Language Vulnerability

https://www.cisa.gov/news-events/alerts/2024/05/01/certcc-reports-r-programming-language-vulnerability


stevensanderson, 12 days ago to programming

Today's post topic is #quantile #normalization using my #R package #TidyDensity

Post: https://www.spsanderson.com/steveondata/posts/2024-04-30/

#Programming #R #RStats #RProgramming #Coding #Data #Statistics #Distributions


stevensanderson, 13 days ago to random

Exciting news! 🚀 TidyDensity version 1.4.0 is here.

Quantile normalization to handle skewed data distributions

Duplicate row detection for improved data quality

Chi-square distribution parameter estimation made easy

Markov Chain Monte Carlo (MCMC) sampling for advanced analysis

AIC calculations for model selection

#DataAnalysis #RStats #TidyDensity #R #RProgramming #Probability #tidy #tidyverse

I will do tutorials of new functionality during the week.

Post: https://www.spsanderson.com/steveondata/posts/2024-04-29/


stevensanderson, 16 days ago to datascience

Discover efficient string splitting in R using strsplit()!

Learn practical examples and unleash the power of regular expressions.

Enhance your data cleaning skills and level up your R programming.

Experiment with strsplit() today!

Post: https://www.spsanderson.com/steveondata/posts/2024-04-26/

#DataAnalysis #DataScience #RProgramming #R #RStats #Programming #Coding
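
A few strsplit() calls as a sketch, using made-up input strings. strsplit() always returns a list with one element per input string, so `[[1]]` pulls out the pieces for a single string.

```r
# Split on a literal comma (fixed = TRUE disables regex interpretation).
parts_literal <- strsplit("a,b,c", ",", fixed = TRUE)[[1]]

# Split on a regular expression: either a comma or a semicolon.
parts_regex <- strsplit("a,b;c", "[,;]")[[1]]

# Split on one or more whitespace characters.
parts_space <- strsplit("one two  three", "[[:space:]]+")[[1]]

print(parts_literal)
print(parts_regex)
print(parts_space)
```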


stevensanderson, 16 days ago to programming

My #R #Package #TidyDensity has been submitted to CRAN for version 1.4.0

Lots of good stuff in this one!

#R #RStats #RProgramming #Programming #Coding #CRAN

https://github.com/spsanderson/TidyDensity/blob/master/NEWS.md


jumpingrivers, 17 days ago to datascience

📣 Exciting news, everyone! 🌟 Make sure to head over to this week's blog post, "What's new in R 4.4.0?" by Russ Hyde, and dive into the world of the latest R release 📊🔬💻

Discover some of the amazing new features that this version has to offer! 🔍 🔭 🚀

#Rprogramming #DataScience #TechNews #MachineLearning #RStats #Python #TechBlog
https://www.jumpingrivers.com/blog/whats-new-r44/


stevensanderson, 17 days ago to random

Master data manipulation in R by dropping unnecessary columns from data frames using simple methods like the $ operator, subset() function, and dplyr package's select() function.

Try these techniques on your own datasets for efficient data cleaning and analysis!

Post: https://www.spsanderson.com/steveondata/posts/2024-04-25/

#R #RStats #RProgramming #Programming #Data #Coding #dplyr
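
A sketch of the methods listed in the post on a hypothetical data frame. The dplyr call is left as a comment so that the example runs with base R alone.

```r
# Hypothetical sample data.
df <- data.frame(
  id    = 1:3,
  name  = c("a", "b", "c"),
  score = c(90, 85, 70),
  stringsAsFactors = FALSE
)

# 1. $ operator: assigning NULL removes the column.
df1 <- df
df1$score <- NULL

# 2. subset(): a minus sign inside `select` drops the named columns.
df2 <- subset(df, select = -c(name, score))

# 3. dplyr equivalent (requires the dplyr package):
# df3 <- dplyr::select(df, -score)

print(names(df1))
print(names(df2))
```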


stevensanderson, 18 days ago to programming

Today I wrote a short blog post on getting the top N records by group using #dplyr, #datatable, and #BaseR

Link: https://www.spsanderson.com/steveondata/posts/2024-04-24/

#R #RStats #RProgramming #Programming #Coding #Data #tidyverse
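
One possible base R version, sketched on hypothetical `group` and `value` columns: sort within groups, number the rows of each group with ave(), and keep the first `n`. The linked post may take a different route.

```r
# Hypothetical sample data.
df <- data.frame(
  group = c("a", "a", "a", "b", "b", "b"),
  value = c(5, 9, 1, 7, 3, 8),
  stringsAsFactors = FALSE
)
n <- 2

# Sort by group, then by descending value.
ordered <- df[order(df$group, -df$value), ]

# Number the rows within each group (1 = largest value) and keep the first n.
rank_in_group <- ave(ordered$value, ordered$group, FUN = seq_along)
top_n <- ordered[rank_in_group <= n, ]
rownames(top_n) <- NULL

print(top_n)
```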


stevensanderson, 23 days ago to random

Today's topic is: Identifying Common Rows Between Data Frames in R

In data analysis, comparing datasets is crucial. A common task is checking if rows from one data frame exist in another. I have had to do this myself many times.

Today I discuss the following:

1️⃣ The merge() Function

2️⃣ The %in% Operator

For a step-by-step guide and examples, check out the full blog post.

Link: https://lnkd.in/eDRvYr6C

#R #RProgramming #RStats #baseR #Data #DataJoin #Join #Merge
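
A sketch of both approaches on two hypothetical data frames. merge() with no `by` argument joins on every shared column, so it keeps only rows that match completely. %in% works on vectors, so it either checks one key column or a pasted per-row key (the `|` separator is an assumption that the data never contains that character).

```r
# Hypothetical sample data: only the row (3, "c") appears in both.
df1 <- data.frame(id = c(1, 2, 3, 4), name = c("a", "b", "c", "d"),
                  stringsAsFactors = FALSE)
df2 <- data.frame(id = c(3, 4, 5), name = c("c", "x", "e"),
                  stringsAsFactors = FALSE)

# 1. merge(): inner join on all shared columns.
common <- merge(df1, df2)

# 2a. %in% on a single key column: which df1 ids also occur in df2?
id_match <- df1$id %in% df2$id

# 2b. %in% on whole rows: build one key string per row, then compare.
key1 <- do.call(paste, c(df1, sep = "|"))
key2 <- do.call(paste, c(df2, sep = "|"))
row_match <- key1 %in% key2

print(common)
print(df1[row_match, ])
```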


stevensanderson, 24 days ago to programming

🔍 How to Extract Last Row in Data Frame in R

Base R
Use nrow(my_df) to get the total rows.
Extract the last row with indexing: my_df[nrow(my_df), ].

dplyr
Use slice_tail(my_df, n = 1) to get the last row (base R's tail(my_df, 1) also works).

data.table
Convert to data.table: my_dt <- as.data.table(my_df).
Get last row using .N: my_dt[.N].

Now you know three ways to extract the last row. Try it yourself! 📊

#RProgramming #DataFrames #CodingTips #R #RStats #Programming #Coding #Data #datatable #dplyr #baseR

Post: https://www.spsanderson.com/steveondata/posts/2024-04-18/
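
The base R steps as a runnable sketch on a hypothetical `my_df`. The data.table lines are left as comments so that the example needs no extra packages.

```r
# Hypothetical sample data.
my_df <- data.frame(
  id    = 1:4,
  value = c("w", "x", "y", "z"),
  stringsAsFactors = FALSE
)

# Index with the row count to pick the last row.
last_by_index <- my_df[nrow(my_df), ]

# tail() with n = 1 returns the same row.
last_by_tail <- tail(my_df, 1)

# data.table equivalent (requires the data.table package):
# my_dt <- data.table::as.data.table(my_df)
# my_dt[.N]

print(last_by_index)
```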


stevensanderson, 25 days ago to programming

I had previously discussed how to drop those pesky NA records from your data.frame but now, what if you actually want to inspect them? That is what I cover in today's post.

Post: https://www.spsanderson.com/steveondata/posts/2024-04-17/

#R #RProgramming #Programming #Code #Data #DataScience
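
A base R sketch for pulling out the rows that contain NA values instead of dropping them, on a hypothetical data frame. The linked post may use other functions; complete.cases() is just one common option.

```r
# Hypothetical sample data with NA values in rows 2 and 3.
df <- data.frame(
  id = 1:4,
  x  = c(1, NA, 3, 4),
  y  = c("a", "b", NA, "d"),
  stringsAsFactors = FALSE
)

# Rows with at least one NA in any column.
na_rows <- df[!complete.cases(df), ]

# Rows where one specific column is NA.
na_in_x <- df[is.na(df$x), ]

print(na_rows)
```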


stevensanderson, 26 days ago to programming

Need to Find Rows with a Specific Value (Anywhere!) in R?

Ever have a large R data table where you need rows containing a specific value, but you're not sure which column it's in? We've all been there! Here's a quick guide to tackle this using both dplyr and base R functionalities.

🌟 The dplyr Way: Streamlined Selection

🌟 Base R to the Rescue: Manual Looping

#R #RStats #RProgramming #Programming #Coding #Data #dplyr #baser #Programming

Post: https://www.spsanderson.com/steveondata/posts/2024-04-16/
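
A vectorized base R sketch of the same task, assuming a hypothetical data frame and target value. It compares every cell with the target at once, so it is an alternative to the manual loop mentioned in the post, not a copy of it.

```r
# Hypothetical sample data: "y" appears in rows 1 and 3, in different columns.
df <- data.frame(
  a = c("x", "p", "z"),
  b = c("y", "q", "r"),
  c = c("m", "n", "y"),
  stringsAsFactors = FALSE
)
target <- "y"

# Compare every cell with the target; the result is a logical matrix.
hits <- df == target
hits[is.na(hits)] <- FALSE  # treat NA cells as non-matches

# Keep rows with at least one matching cell.
rows_with_target <- df[rowSums(hits) > 0, ]

print(rows_with_target)
```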


stevensanderson, 27 days ago (edited 27 days ago) to programming

Estimating the degrees of freedom 'k' and the non-centrality 'ncp' parameters of the chi-square distribution from just a vector of numbers? I think I am there. Here is a post on the work I did over the last couple of days:

Post: https://www.spsanderson.com/steveondata/posts/2024-04-15/

#R #RStats #RProgramming #Programming #Coding #Statistics #Distributions #Programming


stevensanderson, 29 days ago to github

Thoughts anyone, what do you think?

#R #RStats #RProgramming #chisquare #Distributions #parameter #estimation #GitHub #issue #Coding #Programming #HardProblem

Link: https://github.com/spsanderson/TidyDensity/issues/414

@r_constanzo
@ramikrispin
@bentoh
@jromanowska
@barubary
@RConsortium


stevensanderson, 28 days ago

@ramikrispin I think this is it. The Mega Test Script creates 1000 different combinations of the rchisq() data and runs them all using different approaches.

https://github.com/spsanderson/TidyDensity/issues/414#issuecomment-2053657200

#R #RStats #RProgramming #Optimization #distributions #Statistics #Programming #Coding


stevensanderson, 1 month ago to RegEx

I decided to make a blog post out of a problem I worked on a day or two ago. Thankfully, @embiggenData also pointed me to another solution, which worked well too.

#R #RStats #RProgramming #Data #regex #tidyverse #glue #unglue #tidyr #dplyr

Post: https://www.spsanderson.com/steveondata/posts/2024-04-12/
