#DataScience - kbin.social

JamesDBartlett3, 11 months ago to tableau

I'm immensely proud of my wife, @likeawednesday.

She worked tirelessly on her #DataScience #MastersDegree at #BellevueUniversity for two grueling years during the pandemic, and now all of her hard work has finally paid off. She just accepted a new #DataAnalyst position with #StrategicAmerica, starting later this month!

Erin will be working primarily with #Salesforce #Tableau, whereas I'm a dyed-in-the-wool #Microsoft #PowerBI #fanboi, so we'll soon have a bitter #rivalry in our household. #ThereWillBeBlood! 🤬😏

Kidding aside, every tool has its own #superpowers and #shortcomings, and I know that Power BI can do certain things that Tableau can't do, but I'm also sure that the reverse is true as well.

Erin and I frequently talk about the #ToolsAndTechniques we use at work during our lunches and evening walks, and I'm genuinely looking forward to #learning more about #TheDarkSide from her, as we continue honing each other's minds like iron sharpening iron. ⚔️


raoulvanoosten, 9 months ago to statistics

I managed to simulate data and use it to calculate power for an upcoming experiment. However, power is highly variable because there is quite a lot of variation in the real data. In some cases I need only 16 blocks for 90% power, whereas in others not even 20 blocks are enough. How would you proceed?

I used the simr package with a generalized linear model to calculate power.

#rstats #statistics #datascience


MrHedmad, 1 year ago to random

Do you use Continuous Integration in your #bioinformatics or #datascience projects or know of projects that do? If so, can you provide the link to the example? If not, why? Do you feel it would take too much time?


christina, 8 months ago to datascience

#KnowledgeEcology is the heart of my work, but I don’t talk much about the practice of it.

I’m starting to, and would love to know if you’d be interested in shaping the possibilities of the field.

#ecology #ecologists #knowledge #datascience #infosec #science #ComplexityWranglers


spelled_with_a_k, 10 months ago to Hockey

It should come as no surprise that Freddy and Sway have filed for arbitration. Not only is it their right, but they absolutely have earned it.

If you've been following along, Freddy, Sway (& Cliffy) were the first players to go through my report card system.

You can learn more about it on my Substack! https://checkthisdata.substack.com/

#NHLBruins #2023NHLFreeAgency #NHLFreeAgency #HNOM #Hockey #HockeyDon #NHL #DataScience #DataAnalytics #SportsAnalytics #StatsNerd @hockey @bostonbruinsgameday @hnom


ramikrispin, 9 months ago to datascience

(1/3) Meta released Code Llama 🚀 today - an LLM for code generation. It is built on top of Llama 2, and it includes the following functionality:
✅ Code generation based on user prompts
✅ Code completion
✅ Code debugging
✅ Support for languages such as Python, C++, Java, PHP, TypeScript (JavaScript), C#, and Bash

#DataScience #Python #llm #llama #nlp #deeplearning #MachineLearning


ramikrispin, 8 months ago to datascience

What is your favorite hexagon? 😎

#rstats #DataScience


ramikrispin, 3 months ago to datascience

(1/2) Cookbook Polars for R 🐻‍❄️🚀

The Cookbook Polars for R, by Damien Dotta, is a new book that provides an introduction to the R version of Polars with practical examples. In addition, the book provides a side-by-side comparison, when applicable, with other data packages in R, such as base R, dplyr, and data.table.

#rstats #data #DataScience #polar


tiago, 20 days ago to python

Good news everyone! A new version of :gt: graph-tool is just out! @graph_tool

https://graph-tool.skewed.de

:gt: @graph_tool is a comprehensive and efficient :python: Python library to work with networks, including structural, dynamical and statistical algorithms, as well as visualization.

It uses :cpp: C++ under the hood for the heavy lifting, making it quite fast.

This version includes new features, bug fixes, and improved documentation: https://graph-tool.skewed.de/static/doc/index.html

One of the new features is scalable and principled network reconstruction: https://graph-tool.skewed.de/static/doc/demos/reconstruction_indirect/reconstruction.html

Single line installation:

Anaconda ⤵️
conda create --name gt -c conda-forge graph-tool

Homebrew ⤵️
brew install graph-tool

Debian/Ubuntu ⤵️
apt-get install python3-graph-tool

Gentoo ⤵️
emerge graph-tool

Docker ⤵️
docker pull tiagopeixoto/graph-tool

You can also play with it in Colab: https://colab.research.google.com/github/count0/colab-gt/blob/master/colab-gt.ipynb

@networkscience
@datascience
@python
#networks #python #datascience


robinlovelace, 29 days ago (edited 29 days ago) to foss

Request for help from anyone with #rstats package development experience or knowledge of time data, especially if you've worked with .ical files before: checks are failing in the {calendar} package, preventing an update on CRAN, and I'm not sure why 🤷. Thanks to new contributors for reviving this package after a ~5 year dev hiatus! Please spread the word @rOpenSci and anyone in this #foss for #DataScience (or at least dates) space! Details: https://github.com/ATFutures/calendar/issues/50


ramikrispin, 1 month ago to python

(1/2) Moirai - Salesforce's Foundation Forecasting Model 🚀

Salesforce recently released Moirai - a new #Python 🐍 library with a foundation model for time series forecasting applications. According to the release blog, the model comes with universal forecasting capabilities and can handle multiple scenarios and different frequencies.

#data #DataScience #llm #timeseries #forecasting #machinelearning #deeplearning


ramikrispin, 3 months ago to python

(1/4) Setting A Dockerized Python Environment — The Hard Way

I created a (relatively) short tutorial on setting up a dockerized 🐳 Python 🐍 environment from the command line (CLI). Generally, I don't recommend setting up a Python development workflow via the CLI. There are better tools for working with Python and Docker, such as VScode with the Dev Containers extension. 🧵👇🏼

🔗: https://medium.com/p/e62531bca7a0

#python #docker #datascience #mlops


ramikrispin, 4 months ago to datascience

(1/4)𝐍𝐞𝐰 𝐛𝐨𝐨𝐤 𝐟𝐨𝐫 𝐝𝐞𝐞𝐩 𝐥𝐞𝐚𝐫𝐧𝐢𝐧𝐠 🚀🚀🚀

Understanding Deep Learning by Prof. Simon J.D. Prince is a new book that focuses, as the name implies, on the foundations of deep learning.
🧶🧵👇🏼

Images credit: from the book

#DataScience #python #deeplearning #machinelearning #neuralnetwork


freemo, 4 months ago to hiring

If anyone knows of any senior level programmers or Data Scientists looking to be hired please let me know.

Semantic Web, AI, and Java are some of the key techs. Open-source and Linux oriented experience ideally.

I will be the hiring manager if anyone has questions.

Here are the job descriptions:

https://docs.cleverthis.com/en/human_resources/organizational_structure/sr_developer

https://docs.cleverthis.com/en/human_resources/organizational_structure/sr_data_scientist

#Hiring #Job #Jobs #Java #SemanticWeb #Semantics #AI #DataScience #BigData #Programming


ramikrispin, 8 months ago to vscode

Over the weekend, I continued working on my new tutorial for setting up a dockerized 🐳 R development environment on VScode 💻.

➡️ https://github.com/RamiKrispin/vscode-r

Completed:
✅ R settings
✅ Motivation/intro
✅ Scope
✅ Prerequisites/Requirements

⚠️WIP 🚧:
➡️ Getting Started with Docker
➡️ Docker with R - the Hard Way
➡️ Setting R Environment with Docker

#rstats #docker #vscode #datascience


DataAngler, 8 months ago to datascience

This is an excellent tutorial from @debruine on how to create a power simulation in #Rstats and call the simulation n times with parameters in a dataframe. That is so useful for checking how power is affected by different aspects of study design.

But I get messages in R that purrr::pmap_dfr is superseded, so are we supposed to switch to a different set of functions for passing a dataframe of parameters to a function?

#powersimulation #datascience #datasimulation
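
For anyone who has not seen the pattern described above, here is a minimal sketch of running a power simulation once per row of a parameter grid. It is plain Python with the standard library only, not the R code from the tutorial, and it does not address the purrr message. Every name in it (simulate_once, estimate_power, the grid values) and the two-group z-test approximation with a 1.96 cutoff are assumptions made purely for illustration.

```python
import random
import statistics
from itertools import product


def simulate_once(rng, n, effect):
    """Simulate one two-group experiment; return True if it looks significant."""
    control = [rng.gauss(0.0, 1.0) for _ in range(n)]
    treated = [rng.gauss(effect, 1.0) for _ in range(n)]
    # Standard error of the difference in means (unequal-variance form).
    se = (statistics.variance(control) / n + statistics.variance(treated) / n) ** 0.5
    z = (statistics.mean(treated) - statistics.mean(control)) / se
    # Assumption: normal approximation with a two-sided 5% cutoff.
    return abs(z) > 1.96


def estimate_power(n, effect, reps=500, seed=1):
    """Power is the share of simulated experiments that come out significant."""
    rng = random.Random(seed)  # fixed seed so reruns give the same estimate
    hits = sum(simulate_once(rng, n, effect) for _ in range(reps))
    return hits / reps


# One row per design, like a dataframe of parameters (values are made up).
grid = [{"n": n, "effect": e} for n, e in product([10, 100], [0.2, 0.5])]

# Call the simulation once per row and keep the parameters next to the result.
results = [{**params, "power": estimate_power(**params)} for params in grid]

for row in results:
    print(row)
```

Each entry in results keeps its parameters alongside the estimated power, so the rows can be compared or plotted to see how a design choice moves power.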


DataGeekB, 4 months ago to datascience

Who here does the Data Infrastructure quiz that Steve Pierson/ASA publish each month?

I've been playing for ages and FINALLY got a 5/5!!!

https://www.amstat.org/policy-and-advocacy/count-on-stats

#Data #DataScience #Demography @demography a.gup.pe


schizanon, 8 months ago to AWS

Of all the #AWS services I've learned about so far #Kinesis is the one I have the least idea what I'm supposed to do with it.

It just moves #data around. Why would I need a service for that? I just use the network.

Why does this exist?

#datascience


adityadahiya, 4 months ago to datascience

#TidyTuesday Week 52. A stream-plot: Different licenses of #rstats packages in the last 2 decades - rising popularity of the MIT license!
Analysis & Code🔗: http://tinyurl.com/tidy-r-pkgs
Data: Mark Padgham & @noamross
#DataScience #DataViz #ggplot2 #DataVisualization


brodriguesco, 7 months ago to datascience

I'm really happy to be speaking at the NHS-R conference on Wednesday 11th October 2023! I'll talk about how to build data science projects that are reproducible! To sign up: https://nhsrcommunity.com/events/

#RStats #datascience


ramikrispin, 1 month ago to python

(1/4) 𝐈𝐧𝐭𝐫𝐨𝐝𝐮𝐜𝐭𝐢𝐨𝐧 𝐭𝐨 𝐌𝐮𝐥𝐭𝐢-𝐒𝐭𝐚𝐠𝐞 𝐈𝐦𝐚𝐠𝐞 𝐁𝐮𝐢𝐥𝐝 🐳 𝐟𝐨𝐫 𝐏𝐲𝐭𝐡𝐨𝐧 🐍

The size of the Docker image could quickly increase during the build time. I became more mindful of the image size when I started to deploy on Github Actions. The bigger the image size, the longer the run time and the higher the runtime cost.

This is when you should consider using a multi-stage build 🚀.

🧵👇🏼

#docker #mlops #python #DataScience #medium


davidr, 7 months ago to datascience

#data #datascience peeps, hear my cry!

Let's say there's a periodic process that I'm sampling. The period is changing slowly (<1% per cycle). I get a sample on a lot of the cycles, but not necessarily every one.

I'm sure I can bodge together an #algorithm to figure out the "fundamental period" and how it is changing over time, but I also bet something already exists.

What's my #search keyword?
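
One way the "bodge" could look, as a rough sketch rather than an established method: give each sample an integer cycle number (allowing for skipped cycles), then fit sample time as a quadratic in cycle number, so the linear term is the starting period and the quadratic term captures the slow drift. The function names, the synthetic data, and the assumptions that at least one pair of adjacent cycles was sampled and that gaps are short are all mine, not from the question.

```python
def assign_cycles(times):
    """Give each sample an integer cycle index, allowing for skipped cycles."""
    gaps = [later - earlier for earlier, later in zip(times, times[1:])]
    # Assumption: the smallest gap spans exactly one cycle.
    period = min(gaps)
    cycles = [0]
    for gap in gaps:
        steps = max(1, round(gap / period))  # how many cycles this gap spans
        cycles.append(cycles[-1] + steps)
        period = gap / steps  # track the slowly changing period
    return cycles


def fit_quadratic(cycles, times):
    """Least-squares fit of t = a + b*n + c*n**2 via the normal equations."""
    sums = [sum(n ** p for n in cycles) for p in range(5)]
    rows = [[sums[i + j] for j in range(3)] + [sum(t * n ** i for n, t in zip(cycles, times))]
            for i in range(3)]
    # Gaussian elimination with partial pivoting on the 3x4 augmented matrix.
    for col in range(3):
        pivot = max(range(col, 3), key=lambda r: abs(rows[r][col]))
        rows[col], rows[pivot] = rows[pivot], rows[col]
        for r in range(col + 1, 3):
            factor = rows[r][col] / rows[col][col]
            for k in range(col, 4):
                rows[r][k] -= factor * rows[col][k]
    coeffs = [0.0, 0.0, 0.0]
    for r in (2, 1, 0):
        rest = sum(rows[r][k] * coeffs[k] for k in range(r + 1, 3))
        coeffs[r] = (rows[r][3] - rest) / rows[r][r]
    return coeffs  # a, b, c


# Synthetic example (made-up numbers): period starts near 10 and grows slowly.
true_cycles = [n for n in range(40) if n not in {3, 7, 8, 15, 22, 23, 30}]
times = [5.0 + 10.0 * n + 0.025 * n * n for n in true_cycles]

cycles = assign_cycles(times)
a, b, c = fit_quadratic(cycles, times)
# Under this model the spacing between cycle n and n + 1 is b + c * (2 * n + 1).
print(cycles == true_cycles, b, c)
```

With noisy samples or long runs of missed cycles, the rounding step in assign_cycles can pick the wrong cycle count, so treat this as a starting point only.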


royal, 23 days ago to python

I think in PowerShell and can manage in Python. I want to learn Rust to the degree I can write in it directly, rather than prototyping in PowerShell and then converting.

A lot of what I do is data manipulation and analysis. (Take several CSV files as input, and output new CSV files that answer business questions based on the inputs.) I'm seriously impressed with Rust's performance here.

If you've made this transition, advice on where to begin?

#python #rust #powerShell #programming #dataScience


Sevoris, 5 months ago to datascience

Question for #datascience and #dataanalytics folks of Mastodon - how do you deal with time-series data in #Python #Pandas and what would you prefer to use instead?

I’m starting to get fed up with how half-baked the implementation is, and it’s feeling like a time drain.


ramikrispin, 2 months ago to datascience

(1/4) Chronos - Amazon LLM for Forecasting 🚀👇🏼

Yesterday, Amazon released a new open-source project, Chronos - a family of pre-trained time series forecasting models based on language model architectures.

🧶🧵👇🏼

Image credit: Blog post

#DataScience #timeseries #machinelearning #deeplearning #forecasting #llm
