#DataScience - kbin.social

ataustin, 5 months ago to python

Data people: what are some signs that your data analysis project is actually a piece of software (with all the associated accoutrements -- requirements gathering, design, testing, packaging, etc)?

Or, looking at it another way, what are some traits of a software project that differentiate it from a purely data science project (if such a distinction exists)?

#RStats #python #DataScience

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

Posit, 6 months ago to datascience

We are excited to announce that Wes McKinney has joined Posit!

When we changed our name to Posit, our goal was to unify efforts around creating great tools for #datascience, regardless of language, and working with Wes is a huge step forward in realizing that dream.

#python

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ hfrick, brodriguesco, isabelizimm

ramikrispin, 3 months ago to llm

After a long night, a short tutorial for getting started with the Ollama Python version is now available here:

https://github.com/RamiKrispin/ollama-poc

#llm #ollama #llama #mistral #DataScience #python #docker

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

nopatience, 5 months ago to datascience

Looking for studies, reports and articles detailing the "real" threat posed by attackers leveraging typosquatting as part of the attack chain.

If you are aware of any such reports I would greatly appreciate a nudge towards where I might find them.

Trying to understand how common the problem is and the characteristics of these attacks.

#ThreatIntel #TypoSquatting #DataScience

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

ramikrispin, 4 months ago to opensource

(1/7)There is no better way for me to summarise the year than my Github account and my Git commits 😎

In 2023, I had more than 2500 commits, most related to project automation with Github Actions ❤️. Most of my personal projects during 2023 were related to tutorials and open-source projects. Here are the main highlights 🧶🧵👇🏼

#opensource #DataScience #python #rstats #docker #github #githubactions

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

ramikrispin, 4 months ago to datascience

Still looking for reasons to learn Docker?

Source: Github Blog

#docker #datascience #dataengineering #MLOps

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

ramikrispin, 5 months ago to opensource

Officially, started my holiday break 😎

And, yes I know , I need to clean my screen 😅

#opensource #python #DataScience #Data

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

LabPlot, 5 months ago to datascience

What's the value of statistical life (VSL)?

@dataisbeautiful
LabPlot ❤️ Data

➡️ https://en.wikipedia.org/wiki/Value_of_life

#DataAnalysis #DataScience #Data #DataViz #Visualization #Plotting #Statistics #Life #Risk #Safety #Security
#USA #USDA #FOSS #OpenSource #FLOSS #VSL

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

ramikrispin, 4 months ago to datascience

(5/7)
This year, I also retired two major open-source projects 👋🏼:
➡️ TSstudio - my first open-source project ❤️, R package for descriptive and predictive analysis of time series data 👇🏼
🔗 https://github.com/RamiKrispin/TSstudio
➡️ Coronavirus - R package provides a tidy format for the COVID-19 dataset collected by the Center for Systems Science and Engineering (CSSE) at Johns Hopkins University.
🔗 https://github.com/RamiKrispin/coronavirus

#DataScience #opensource #timeseries #rstats

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

astrojuanlu, 6 months ago to datascience

On my way to #UbuntuSummit2023!

Wondering what FORTRAN, Excel, @ProjectJupyter and @kedro have in common? Come to my talk "Data Science in production: Crossing the chasm" and you'll find out 😉

#UbuntuSummit #datascience

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

ramikrispin, 5 months ago to python

Create a Natural Language to SQL Code Generator with Python and OpenAI API tutorial is now on Medium 👇🏼
https://medium.com/@rami.krispin/setting-a-natural-language-to-sql-code-generator-with-python-d267f40d7218

Code: https://github.com/RamiKrispin/lang2sql

#Python #openai #sql #datascience #MLOps

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

MattHodges, 7 months ago to machinelearning

Introducing BIDEN: Binary Inference Dictionaries for Electoral NLP ⚡️ Using only compression, I demonstrate a method of binary partisan classification for campaign emails and other written political materials. This method is FAST; I train the model in about 30 seconds on a CPU, and run inference in milliseconds. No GPUs. No Neural Networks. No N-grams. No transformers. No kNN. I learned a lot!

https://github.com/hodgesmr/biden_nlp

#PoliticalTech #MachineLearning #NLP #DataScience #Projects

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ jpanzer

ramikrispin, 4 months ago to python

Me last night, at the moment I realized (in a six-month delay) that HuggingFace supports the deployment of Shiny apps (both R and Python) 🤗

Image credit: Giphy

#RStats #python #shiny #huggingface #DataScience

video/mp4

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

LabPlot, 5 months ago to datascience

Using Zipf's Law to detect outliers in median age of European Countries in #LabPlot (2023 est.)

@dataisbeautiful

LabPlot ❤️ Data

➡️ https://en.wikipedia.org/wiki/Zipf%27s_law

#DataAnalysis #DataScience #Data #DataViz #Visualization #Plotting #Statistics #Age #Europe #FOSS #OpenSource

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

StatisticsGlobe, 7 months ago to programming

How to Make R Code Look More Professional!

Look like an R pro with this simple coding trick!

#rstats #rcode #programmingtrick #Programming #datascience

video/mp4

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ stux

ramikrispin, 6 months ago to datascience

Preparing for a talk about Github Actions and Docker with open source projects 😎

#DataScience #GitHubActions #docker #rstats #python

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

ctesta, 7 months ago to datascience

Upcoming event!
What's New In Tidymodels with @emilhvitfeldt

The @RUGatHDSI will be hosting this event on Thursday at 5pm Eastern Time.

"The tidymodels framework is a collection of packages for modeling and machine learning using tidyverse principles. This talk will touch on a number of new additions and in-process work being done by the team."

Register at rug-at-hdsi.org/calendar

#rstats @rstats #DataScience #tidyverse #tidymodels

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

ramikrispin, 6 months ago to maps

(1/3) If you are participating in the #30DayMapChallenge and looking for some resources to get started with geospatial data visualization, here is a tutorial I created for creating Choropleth maps with data from the coronavirus R package:
🔗 https://ramikrispin.github.io/coronavirus/articles/geospatial_visualization.html

#rstats #gis #maps #dataviz #datavisualization #DataScience

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

epixoip, 7 months ago to programming

audiovisual representation of qsort vs merge sort!

courtesy of sort_everything176 on TikTok

#programming #compsci #algorithms #datascience #dataviz #datavizualization #sorting

video/mp4

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ xavsworld, epixoip

underdarkGIS, 5 months ago (edited 5 months ago) to random

New tutorial: Setting up a #graph database using #GTFS data & #Neo4J

http://anitagraser.com/2023/11/27/setting-up-a-graph-db-using-gtfs-data-neo4j/

#PublicTransport #GraphDB #MobilityDataAnalytics #Mobility #DataScience #Cypher #GISChat #SustainableMobility

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

Cmastication, 7 months ago to ChatGPT

What’s the most interesting and/or thought provoking thing you’ve read or watched on the topic of learning data science, coding, or analytics WITH assistance from any AI tool (ChatGPT, Copilot, whatever)?

#chatgpt #copilot #datascience #r #python

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

stevensanderson, 5 months ago to datascience

📈 Unleash the power of robust regression in R! 🔄

Compare traditional lm() with robust rlm() using a dataset. Blue vs. red residuals visually unveil how each model handles outliers. Dive in, experiment with your data, and empower your coding journey! 💻

#DataScience #RProgramming #Statistics #R #RStats

Post: https://www.spsanderson.com/steveondata/posts/2023-11-28/

image/png
image/png

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

Cmastication, 7 months ago to ChatGPT

What’s the most interesting and/or thought provoking thing you’ve read or watched on the topic of learning data science, coding, or analytics WITH assistance from any AI tool (ChatGPT, Copilot, whatever)?

#chatgpt #copilot #datascience #r #python

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

ramikrispin, 7 months ago to python

(1/2) Timetk for Python 🚀🚀🚀

The timetk, one of the main R packages for time series analysis and forecasting ❤️, by Matt Dancho, is now available in Python 🐍. The package provides a variety of tools for working with time series data and analyzing it. The Python version leverages pandas for processing time series data and plotly for visualization.

#timeseries #python #rstats #forecasting #DataScience

image/png
image/png

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ stevensanderson

solderandchaos, 5 months ago to datascience

Do you consider yourself a data scientist of any variety? Maybe it makes up a bit of your job, maybe it’s all of it, but if you went to school in the UK and can spare quarter of an hour to reflect on a few things it would be hugely appreciated.

Link here: https://lborocmc.fra1.qualtrics.com/ife/form/SV_7PdY9nlUCvITTw2

#datascience #data
@edutooters @academicchatter

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ bornach, christianp, KathyReid, grrrr_shark +3 more