#DataScience - kbin.social

f39, 2 months ago to design

Hello. OK. I give. I'm sick of twitter. I'm looking for cool people to follow. Point me at good accounts?

My interests are #design #neuroscience #3dimaging #datascience #softwaredevelopment #research #uiux #hci and #art

reply

expand (23)

collapse (23)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ endareth, ErikJonker, KathyReid, juliobiason +7 more

Mehrad, 9 months ago to guix

Yesterday a good friend of mine helped me to understand and toy around with Gnu #Guix in a VM as I'm very hesitant to add anything to my daily driver machine. All things considered, I'm >90% convinced.

But here is a question for the friends and the community: What are the advantages of Nix over Guix (apart from number of packages)?

P.s: I'm going to have it on an Arch-based machine to add reproducibility to my projects. It will not handle my OS packages.

#askFedi #Nix #Linux #DataScience

reply

expand (21)

collapse (21)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ vascorsd, mforester

robinlovelace, 8 months ago to python

New #geocompx blog post on Geographic Data Analysis in #RStats and #Python. The first time equivalent code for reading, plotting, and analysing geographic vector data in these two popular #DataScience languages are provided side-by-side 🚀
#OpenSource: https://geocompx.org/post/2023/ogh23/

reply

expand (17)

collapse (17)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ urswilke, davidbraze, paezha

raoulvanoosten, 9 months ago to statistics

The minimum effect is my power threshold so they cancel each other out. How can I do this? Preferably with linear models in r (I like emmeans and simr.

@lakens you wrote that more power is needed for minimum effects compared to null tests, so you might know.

I have asked here but gotten no response https://stats.stackexchange.com/questions/621178/power-analysis-for-minimum-effect-tests-and-good-enough-range-hypotheses

#statistics #rstats #emmeans #data #datascience

reply

expand (15)

collapse (15)

report

activity

copy /kbin url

copy original url

open original url

Loading...

robinlovelace, 6 months ago to python

I'm thinking about porting functionality in the {stats19} #rstats package into #Python and possibly other languages. Are you an #OpenSource developer with an interest in #DataScience for policy, sustainability + good? If so please check this issue and let us know your thoughts on taking this project to the next level 🚀 https://github.com/ropensci/stats19/issues/230
@rOpenSci @mszll + all any thoughts on best practices welcome also 🙏

reply

expand (14)

collapse (14)

report

activity

copy /kbin url

copy original url

open original url

Loading...

robinlovelace, 5 months ago to svelte

This is a bit of a long shot, but if there are any #Svelte developers out there interested in mapping who would be willing to spend a bit of time helping a novice (me) out in some experiments to improve UX in web apps for sustainable transport planning, please get in touch. #GeoSpatial #DataScience

reply

expand (13)

collapse (13)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ paezha

LabPlot, 4 months ago (edited 4 months ago) to foss

Merry Christmas from the LabPlot team! 🎅 🎄

@opensource @kde
#Christmas #FOSS #FLOSS #OpenSource #KDE #LabPlot #DataScience #DataViz

reply

expand (11)

collapse (11)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ Nerdfest, kde

ataustin, 9 months ago to datascience

I'm trying to formalize thoughts around habits I've developed for reproducible polyglot data science work on shared linux servers.

To me this work consists (loosely) of two main components: the things we produce (code, models, etc) and the behaviors associated with projects.

Here's a draft outline of these components. What's missing here? What can be better?

#rstats #datascience #reproducibleresearch

1/3

reply

expand (10)

collapse (10)

report

activity

copy /kbin url

copy original url

open original url

Loading...

freemo, 3 months ago to hiring

I am still hiring for top-tier programmers and data scientist. Please reboost, share, recommend, or reply if you know anyone who might be interested.

Fully remote! Live and work from anywhere with internet (including the beach!)

I am the company owner, and will be both your direct boss and the hiring manager.

Semantic Web, AI, and Java are some of the key techs. Open-source and Linux oriented experience ideally. OSS contributions and activity will be weighted heavily, particularly in relevant areas.

Here are the job descriptions:

https://docs.cleverthis.com/en/human_resources/organizational_structure/sr_developer

https://docs.cleverthis.com/en/human_resources/organizational_structure/sr_data_scientist

If you are interested please send an email to: jeffrey.freeman@cleverthis.com and please CC drew.morris@cleverthis.com

#Hiring #Job #Jobs #Java #fedihire #SemanticWeb #Semantics #AI #DataScience #BigData #Programming #AGI #ML #MachineLearning

reply

expand (10)

collapse (10)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ TEG, freemo

freemo, 3 months ago to hiring

I am truly amazed at the number of applicants I have seen off of this single post. And almost all are well suited candidates worth my time to review. I am astonished that a single post on the fedi is more effective than actually hiring a recruiter. Thank you everyone for the boosts and applications.

While many applicants have made it through and are currently being hired because we have so many positions we have quite a few still available for every level from sr to jr, and both data scientists and programmers. So please keep boosting, sharing, and applying if anyone is interested.

Just a reminder this is 100% remote, no fixed hours, will pay market rates for position. I will be your direct boss and hiring manager (also owner, founder, and inventor of the tech).

#Hiring #Job #Jobs #Java #fedihire #SemanticWeb #Semantics #AI #DataScience #BigData #Programming #AGI #ML #MachineLearning #knowledgegraph
QT: https://qoto.org/@freemo/111847456140748896

reply

expand (10)

collapse (10)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ gabriel

Mehrad, 6 months ago to datascience

I'm trying to integrate some public air quality data into my study. During a sanity check of the data I realized 3 of the measurement columns contain negative values! Does anyone have any idea if having negative value in such measurements is valid and how they should be interpreted?

Contacting the data manager is not as easy and might take me a week or two of emailing to get some answer. I wonder if #AirQuality folks here on fediverse have a quick answer.

#DataScience #AskFedi #RStats

reply

expand (10)

collapse (10)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ oblomov

mirela, 10 months ago to privacy

Personal: I am very happy to announce that I have accepted a tenure-track Assistant Professor position at the University of Groningen. Looking forward to further collaborations, and am glad to continue working within the Information Systems Group at the Bernoulli Institute.

@academicchatter get in touch if you are working on #networkscience, #datascience, #misinformation, #polarization, #privacy, #crowdsourcing #SocialComputing #complexsystems @academicsunite

reply

expand (9)

collapse (9)

report

activity

copy /kbin url

copy original url

open original url

Loading...

kellogh, 4 months ago to opensource

#polars is the ideal #opensource project, imo. it hits all the important things for me

#rust

#python

#datascience

replacing #pandas

performance engineering

integrates with a large open ecosystem instead of creating a walled garden

pleasant to use

https://github.com/pola-rs/polars/releases/tag/rs-0.36.2

reply

expand (8)

collapse (8)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ janriemer

errantscience, 10 months ago to datascience

Data management is always something you should plan out at the start of the project... but no one ever does ⁠
⁠
#DataManagement #DataScience⁠
#cartoon #cartoons #comic #comics #instacomic #instacartoon ⁠
#academia #science #research ⁠
#errantscience

reply

expand (8)

collapse (8)

report

activity

copy /kbin url

copy original url

open original url

Loading...

vwbusguy, 7 months ago to linux

Very happy user of #AlmaLinux in my dayjob and am very honored to have this opportunity to write for their blog! Here's an article of one of the use cases for it. #Linux #DevOps #Jupyter #DataScience

https://almalinux.org/blog/2023-09-26-almalinux-jupyter/

reply

expand (8)

collapse (8)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ kushal, jonathanspw, passthejoe, qlp

ai6yr, 7 months ago to hamradio

I've clearly spent too much time working with protocols and data, a #hamradio "emergency form" which duplicates the same data in multiple formats (GPS, MGRS, what3words, physical address) for sending over a 300bps or 1200bps channel still bugs me. #datascience

reply

expand (8)

collapse (8)

report

activity

copy /kbin url

copy original url

open original url

Loading...

eaton, 10 months ago to ArtificialIntelligence

Okay, #datascience and #nlp friends. I’m poking around for the “right way” to approach a problem: I want to calculate the overal homogeneity of many short snippets of text (phrases and sentences), and many large spans of text (500-1500 word documents).

reply

expand (7)

collapse (7)

report

activity

copy /kbin url

copy original url

open original url

Loading...

geotribu, 4 months ago to Flooring French

🦆 DuckDB ça vous parle ? C'est l'un des sujets data du moment 📊

🗺️ Mais au fait, pour les données géographiques ?

@florent001 publie un article sur sur la façon dont DuckDB fait bouger les lignes (ou devrais-je dire les colonnes 😉 ) pour le traitement des données spatiales.

Après quelques éléments de compréhension (format #Parquet, projections...), il donne des exemples pratiques avec les données de l'@overturemaps

👉 https://geotribu.fr/articles/2023/2023-12-19_duckdb-donnees-spatiales/

#Geotribu #DuckDB #Geospatial #GIS #DataScience

reply

expand (7)

collapse (7)

report

activity

copy /kbin url

copy original url

open original url

Loading...

KathyReid, 3 months ago to datascience

For folks who work in #DataScience, what's the easiest way for me to to calculate the #CosineSimilarity of two strings? I'm looking at sklearn cosine_similarity first.

Related to hallucination detection in #ASR - low cosine similarity indicative of hallucination.

reply

expand (7)

collapse (7)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ thepoliticalcat

elias, 7 months ago to SEO

What's the minimum number of clicks needed to go from page A to page B?

Shortest Path Length

What pages B and C, D and F, ... ?
What about all pairs of pages in the whole site?

I'm working on a new chart to evaluate this. The image shows counts for a few websites, and how they're distributed.

Does this make sense?
How would you improve it?

#techseo #linkbuilding #SEO #DataVisualization #DataScience #Python #Plotly

reply

expand (7)

collapse (7)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ simoncox

ramikrispin, 2 months ago to python

(1/3) Here is one of the most frequent questions I get on most of my Python 🐍+Docker 🐳 tutorials - why use a virtual environment inside a container?

The short answer is that you don't necessarily need a virtual environment (VE) to set a reproducible environment inside a container. Docker takes care of both the environment isolation and reproducibility.

I see VE as more of a practical method to organize your Python environment inside a container.

#python #docker #mlops #DataScience

reply

expand (6)

collapse (6)

report

activity

copy /kbin url

copy original url

open original url

Loading...

freemo, 3 months ago to hiring

I'm #Hiring for all the positions listed in detail here:

https://docs.cleverthis.com/en/human_resources/organizational_structure/universal_requirements

I am hiring multiple positions for each. I am the person who will be your boss and company owner. Hit me up if you are a match.

#Job #Jobs #DataScience #BigData #Java

reply

expand (6)

collapse (6)

report

activity

copy /kbin url

copy original url

open original url

Loading...

stevensanderson, 2 months ago to datascience

In base R, we can filter rows where a column is between two values using bracket notation or the subset() function along with logical operators like >=, <=, &, and !. The key is creating a logical test that checks if values are within our desired range.

For example, to filter rows where the column "value" is between 5 and 8

df[df$value >= 5 & df$value <= 8,]

Or with subset()

subset(df, value >= 5 & value <= 8)

Post: https://www.spsanderson.com/steveondata/posts/2024-03-01/

#R #RStats #RProgramming #DataFilter #DataScience

image/png

reply

expand (6)

collapse (6)

report

activity

copy /kbin url

copy original url

open original url

Loading...

news, 1 month ago to ai

AI-Weekly for Tuesday, April 2, 2024 - Volume 106
https://ai-weekly.ai/newsletter-04-02-2024/

The Week's News in Artificial Intelligence
A Mind Vault Solutions, Ltd. Publication
#ai #news #ainews #artificialintelligence #aiweekly #technology #tech #technews #techtrends #machinelearning #robotics #datascience #airesearch #futuretech

Subscribers: 15,615 Opt-In Subscribers were sent this issue via email.

reply

expand (6)

collapse (6)

report

activity

copy /kbin url

copy original url

open original url

Loading...

ramikrispin, 27 days ago to datascience

(1/3) Learn R Through Examples 🚀👇🏼

The Learn R Through Examples by Xijin Ge, Jianli Qi, and Rong Fan provides an introduction to data analysis with R. The book covers the core topics of data analysis using different datasets, from simple and clean datasets to messy and big datasets. 🧵👇🏼

#RStats #DataScience #datavisualization #data

image/png

reply

expand (6)

collapse (6)

report

activity

copy /kbin url

copy original url

open original url

Loading...