#datatable - kbin.social

stevensanderson, 2 days ago to random

🚀 Elevate Your R Programming Skills: Removing Elements from Vectors

Want to level up your R programming game? Let's talk about removing specific elements from vectors! It's a fundamental skill.

But here's the real fun: try it yourself! Experiment with your own data and see which method resonates with you. To get yourself familiar with what's happening, you have to experiment.

#R #RStats #RProgramming #Data #DataFiltering #dplyr #datatable #baser

Post: https://www.spsanderson.com/steveondata/posts/2024-05-20/

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

stevensanderson, 7 days ago to programming

🔎 Selecting Columns Containing a Specific String in R: A Quick Guide 🚀

Hey R users! Need to select columns by a specific string? Here's how in base R, stringr, stringi, dplyr, and with a bonus from data.table.

🆒 R
✅ grepl
📦 stringr
📦 stringi
📦 dplyr

Bonus: 📦 data.table
library(data.table)
df_price <- df[, names(df) %like% "price"]

Happy coding! 🚀

Post: https://www.spsanderson.com/steveondata/posts/2024-05-15/

#R #RProgramming #Programming #RStats #Coding #RegularExpressions #RegEx #stringr #stringi #dplyr #datatable #baseR

image/png
image/png
image/png

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

stevensanderson, 8 days ago to programming

Want to check duplicate values across columns of a data.frame? Well you can do that in a basic way with TidyDensity and the check_duplicate_rows() function, or you can go through todays blog post for some other ideas with #BaseR #dplyr and #datatable

#R #RStats #RProgramming #Programming #Data #DataScience #Coding

Post: https://www.spsanderson.com/steveondata/posts/2024-05-14/

image/png

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

stevensanderson, 15 days ago to programming

Counting NA's across columns in #R sure you can do that!!

My post today uses #BaseR #dplyr and #datatable to accomplish this

#R #Rstats #RProgramming #Coding #Programming #Data #DataScience

Post: https://www.spsanderson.com/steveondata/posts/2024-05-07/

image/png
image/png
image/png

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

stevensanderson, 28 days ago to programming

Today I wrote a short blog post on getting top N records by groups in #dplyr #datatable and #BaseR

Link: https://www.spsanderson.com/steveondata/posts/2024-04-24/

#R #RStats #RProgramming #Programming #Coding #Data #tidyverse

image/png

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

stevensanderson, 1 month ago to programming

🔍 How to Extract Last Row in Data Frame in R

Base R
Use nrow(my_df) to get the total rows.
Extract the last row with indexing: my_df[nrow(my_df), ].

dplyr
Use tail(my_df, 1) to get the last row.

data.table
Convert to data.table: my_dt <- as.data.table(my_df).
Get last row using .N: my_dt[.N].

Now you know three ways to extract the last row. Try it yourself! 📊

#RProgramming #DataFrames #CodingTips #R #RStats #Programming #Coding #Data #datatable #dplyr #baseR

Post: https://www.spsanderson.com/steveondata/posts/2024-04-18/

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

stevensanderson, 2 months ago to datascience

Learn how to set a data frame column as the index for faster data access and streamlined operations.

In R, utilize the setDT() function from #datatable or column_to_rownames() from #tibble to seamlessly set your desired column as the index. Try it out with your datasets and experience the boost in productivity!

#DataAnalysis #RProgramming #Efficiency #DataScience #R #RStats 🚀📊

Post: https://www.spsanderson.com/steveondata/posts/2024-02-29/

reply

expand (1)

collapse (1)

report

activity

copy /kbin url

copy original url

open original url

Loading...

stevensanderson, 2 months ago to random

Data analysis often involves reshaping messy datasets. Fear not, R's data.table package has your back with the awesome melt() function!

Here's the magic:

data.table object: Your data you want to reshape.

id.vars: Columns that stay put (like city names).

measure.vars: Columns you want to "melt" (like temperature values).

Post: https://www.spsanderson.com/steveondata/posts/2024-02-27/

#R #RStats #RProgramming #Coding #datatable

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

stevensanderson, 2 months ago to programming

The dcast function from R's data.table package provides a fast way to reshape data from long to wide format. It aggregates values like a pivot table in just one line. For example, to aggregate mtcars hp by cyl:

dcast(as.data.table(mtcars), cyl ~ ., value.var="hp", fun.aggregate=mean)

Post: https://www.spsanderson.com/steveondata/posts/2024-02-26/

#R #RStats #RProgramming #Programming #Coding #data #datatable

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

stevensanderson, 2 months ago to datascience

Taming Your Data with Filtering in R

Feeling lost in your data jungle? Filtering is your machete!

Master data.tables:

Filter by conditions

Combine conditions

Filter by list values

Conquer data.frames:

Use logical operators

Subset with row indices

#R #RProgramming #RStats #DataScience #Learning #datatable #filtering

Post: https://www.spsanderson.com/steveondata/posts/2024-02-23/

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

devSJR, 4 months ago to random

The practical thing about #RKWard is that you can enter commands for each session that are always executed. For example, you can use this to load certain packages as standard. Here in the example I use the great library data.table, which is automatically loaded at each start of RKWard.

#rstats #datatable

reply

expand (4)

collapse (4)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ kde

stevensanderson, 4 months ago to random

My TidyDensity package just got a major upgrade, powered by the blazing-fast data.table.

⚡️ And the best part? You get the speed boost no matter what format you choose.

Ready to experience the difference?

1.install.packages("TidyDensity")
2. Pick your output format: .return_tibble = TRUE for tibbles, .return_tibble = FALSE for data.tables.
3. Dive into your data

#tidyverse #rstats #dataanalysis #datatable #tibble #distributions #R #RStats

Post: https://www.spsanderson.com/steveondata/posts/2024-01-12/

image/png

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

thadryanjs, 7 months ago to datascience

(1/n) Heads up/PSA/reminder for stats folks.

Almost misinformed my PI about a key variable the other day after stumbling into this little bit of computational profanity:

#rstats #data #datascience #research #stats

@academicsunite

reply

expand (6)

collapse (6)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ DataAngler

thadryanjs, 7 months ago

(3/n) @academicsunite #rstats #data #datascience #research #stats

It's worth noting that both #dplyr and #datatable will save you from this. I prefer the #tidyverse.

image/png

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

stevensanderson, 9 months ago to random

Imagine you have a bunch of data points and you want to know how many belong to different categories. This is where grouped counting comes in. We've got three fantastic methods for you to explore, each with its own flair: aggregate(), dplyr, and data.table.

Happy counting, fellow data explorer! 🎉🔍 #DataAnalysis #RProgramming #ExploreData #dplyr #aggregate #baser #r #rstats #datatable

Post: https://www.spsanderson.com/steveondata/posts/2023-08-10/

image/png
image/png

reply

expand (1)

collapse (1)

report

activity

copy /kbin url

copy original url

open original url

Loading...

stevensanderson, 9 months ago to opensource

Group percentages in R with #baser #dplyr and #datatable
#R #RStats #opensource

https://www.spsanderson.com/steveondata/posts/2023-07-24/

reply

expand (3)

collapse (3)

report

activity

copy /kbin url

copy original url

open original url

Loading...

devSJR, 11 months ago to bioinformatics

Occasionally, I think about how to work effectively with #rstats. Currently, I am teaching my #bioinformatics courses with #RKWard again. I try to do most of it with packages from the base installation. #datatable is an exception. But otherwise, I like to use #within (very fast) instead of #mutate.
But there are more approaches, which are often simpler/faster/stable:

https://github.com/matloff/TidyverseSkeptic/blob/master/RDesign.pdf

https://davidhughjones.medium.com/dont-forget-non-tidyverse-solutions-979c870c7f3e

reply

expand (6)

collapse (6)

report

activity

copy /kbin url

copy original url

open original url

Loading...