f39, to design
@f39@mastodon.social avatar

Hello. OK. I give. I'm sick of twitter. I'm looking for cool people to follow. Point me at good accounts?

My interests are and

Mehrad, to guix
@Mehrad@fosstodon.org avatar

Yesterday a good friend of mine helped me to understand and toy around with Gnu in a VM as I'm very hesitant to add anything to my daily driver machine. All things considered, I'm >90% convinced.

But here is a question for the friends and the community: What are the advantages of Nix over Guix (apart from number of packages)?

P.s: I'm going to have it on an Arch-based machine to add reproducibility to my projects. It will not handle my OS packages.

robinlovelace, to python
@robinlovelace@fosstodon.org avatar

New blog post on Geographic Data Analysis in and . The first time equivalent code for reading, plotting, and analysing geographic vector data in these two popular languages are provided side-by-side 🚀
: https://geocompx.org/post/2023/ogh23/

raoulvanoosten, to statistics

The minimum effect is my power threshold so they cancel each other out. How can I do this? Preferably with linear models in r (I like emmeans and simr.

@lakens you wrote that more power is needed for minimum effects compared to null tests, so you might know.

I have asked here but gotten no response https://stats.stackexchange.com/questions/621178/power-analysis-for-minimum-effect-tests-and-good-enough-range-hypotheses

robinlovelace, to python
@robinlovelace@fosstodon.org avatar

I'm thinking about porting functionality in the {stats19} package into and possibly other languages. Are you an developer with an interest in for policy, sustainability + good? If so please check this issue and let us know your thoughts on taking this project to the next level 🚀 https://github.com/ropensci/stats19/issues/230
@rOpenSci @mszll + all any thoughts on best practices welcome also 🙏

robinlovelace, to svelte
@robinlovelace@fosstodon.org avatar

This is a bit of a long shot, but if there are any developers out there interested in mapping who would be willing to spend a bit of time helping a novice (me) out in some experiments to improve UX in web apps for sustainable transport planning, please get in touch.

LabPlot, (edited ) to foss
@LabPlot@floss.social avatar

Merry Christmas from the LabPlot team! 🎅 🎄

@opensource @kde
#Christmas #FOSS #FLOSS #OpenSource #KDE #LabPlot #DataScience #DataViz

ataustin, to datascience
@ataustin@fosstodon.org avatar

I'm trying to formalize thoughts around habits I've developed for reproducible polyglot data science work on shared linux servers.

To me this work consists (loosely) of two main components: the things we produce (code, models, etc) and the behaviors associated with projects.

Here's a draft outline of these components. What's missing here? What can be better?

1/3

freemo, to hiring
@freemo@qoto.org avatar

I am still hiring for top-tier programmers and data scientist. Please reboost, share, recommend, or reply if you know anyone who might be interested.

Fully remote! Live and work from anywhere with internet (including the beach!)

I am the company owner, and will be both your direct boss and the hiring manager.

Semantic Web, AI, and Java are some of the key techs. Open-source and Linux oriented experience ideally. OSS contributions and activity will be weighted heavily, particularly in relevant areas.

Here are the job descriptions:

https://docs.cleverthis.com/en/human_resources/organizational_structure/sr_developer

https://docs.cleverthis.com/en/human_resources/organizational_structure/sr_data_scientist

If you are interested please send an email to: jeffrey.freeman@cleverthis.com and please CC drew.morris@cleverthis.com

#Hiring #Job #Jobs #Java #fedihire #SemanticWeb #Semantics #AI #DataScience #BigData #Programming #AGI #ML #MachineLearning

freemo, to hiring
@freemo@qoto.org avatar

I am truly amazed at the number of applicants I have seen off of this single post. And almost all are well suited candidates worth my time to review. I am astonished that a single post on the fedi is more effective than actually hiring a recruiter. Thank you everyone for the boosts and applications.

While many applicants have made it through and are currently being hired because we have so many positions we have quite a few still available for every level from sr to jr, and both data scientists and programmers. So please keep boosting, sharing, and applying if anyone is interested.

Just a reminder this is 100% remote, no fixed hours, will pay market rates for position. I will be your direct boss and hiring manager (also owner, founder, and inventor of the tech).


QT: https://qoto.org/@freemo/111847456140748896

Mehrad, to datascience
@Mehrad@fosstodon.org avatar

I'm trying to integrate some public air quality data into my study. During a sanity check of the data I realized 3 of the measurement columns contain negative values! Does anyone have any idea if having negative value in such measurements is valid and how they should be interpreted?

Contacting the data manager is not as easy and might take me a week or two of emailing to get some answer. I wonder if folks here on fediverse have a quick answer.

mirela, to privacy

Personal: I am very happy to announce that I have accepted a tenure-track Assistant Professor position at the University of Groningen. Looking forward to further collaborations, and am glad to continue working within the Information Systems Group at the Bernoulli Institute.

@academicchatter get in touch if you are working on , , , , , @academicsunite

kellogh, to opensource
@kellogh@hachyderm.io avatar

is the ideal project, imo. it hits all the important things for me

  • replacing
  • performance engineering
  • integrates with a large open ecosystem instead of creating a walled garden
  • pleasant to use

https://github.com/pola-rs/polars/releases/tag/rs-0.36.2

errantscience, to datascience

Data management is always something you should plan out at the start of the project... but no one ever does ⁠




vwbusguy, to linux
@vwbusguy@mastodon.online avatar

Very happy user of in my dayjob and am very honored to have this opportunity to write for their blog! Here's an article of one of the use cases for it.

https://almalinux.org/blog/2023-09-26-almalinux-jupyter/

ai6yr, to hamradio

I've clearly spent too much time working with protocols and data, a "emergency form" which duplicates the same data in multiple formats (GPS, MGRS, what3words, physical address) for sending over a 300bps or 1200bps channel still bugs me.

eaton, to ArtificialIntelligence
@eaton@phire.place avatar

Okay, and friends. I’m poking around for the “right way” to approach a problem: I want to calculate the overal homogeneity of many short snippets of text (phrases and sentences), and many large spans of text (500-1500 word documents).

geotribu, to Flooring French
@geotribu@mapstodon.space avatar

🦆 DuckDB ça vous parle ? C'est l'un des sujets data du moment 📊

🗺️ Mais au fait, pour les données géographiques ?

@florent001 publie un article sur sur la façon dont DuckDB fait bouger les lignes (ou devrais-je dire les colonnes 😉 ) pour le traitement des données spatiales.

Après quelques éléments de compréhension (format #Parquet, projections...), il donne des exemples pratiques avec les données de l'@overturemaps

👉 https://geotribu.fr/articles/2023/2023-12-19_duckdb-donnees-spatiales/

#Geotribu #DuckDB #Geospatial #GIS #DataScience

KathyReid, to datascience
@KathyReid@aus.social avatar

For folks who work in #DataScience, what's the easiest way for me to to calculate the #CosineSimilarity of two strings? I'm looking at sklearn cosine_similarity first.

Related to hallucination detection in #ASR - low cosine similarity indicative of hallucination.

elias, to SEO
@elias@seocommunity.social avatar

What's the minimum number of clicks needed to go from page A to page B?

Shortest Path Length

What pages B and C, D and F, ... ?
What about all pairs of pages in the whole site?

I'm working on a new chart to evaluate this. The image shows counts for a few websites, and how they're distributed.

Does this make sense?
How would you improve it?

ramikrispin, to python
@ramikrispin@mstdn.social avatar

(1/3) Here is one of the most frequent questions I get on most of my Python 🐍+Docker 🐳 tutorials - why use a virtual environment inside a container?

The short answer is that you don't necessarily need a virtual environment (VE) to set a reproducible environment inside a container. Docker takes care of both the environment isolation and reproducibility.

I see VE as more of a practical method to organize your Python environment inside a container.

freemo, to hiring
@freemo@qoto.org avatar

I'm for all the positions listed in detail here:

https://docs.cleverthis.com/en/human_resources/organizational_structure/universal_requirements

I am hiring multiple positions for each. I am the person who will be your boss and company owner. Hit me up if you are a match.

stevensanderson, to datascience
@stevensanderson@mstdn.social avatar

In base R, we can filter rows where a column is between two values using bracket notation or the subset() function along with logical operators like >=, <=, &, and !. The key is creating a logical test that checks if values are within our desired range.

For example, to filter rows where the column "value" is between 5 and 8

df[df$value >= 5 & df$value <= 8,]

Or with subset()

subset(df, value >= 5 & value <= 8)

Post: https://www.spsanderson.com/steveondata/posts/2024-03-01/

#R #RStats #RProgramming #DataFilter #DataScience

image/png

news, to ai
@news@mastodon.toptechtidbits.com avatar

AI-Weekly for Tuesday, April 2, 2024 - Volume 106
https://ai-weekly.ai/newsletter-04-02-2024/

The Week's News in Artificial Intelligence
A Mind Vault Solutions, Ltd. Publication

Subscribers: 15,615 Opt-In Subscribers were sent this issue via email.

ramikrispin, to datascience
@ramikrispin@mstdn.social avatar

(1/3) Learn R Through Examples 🚀👇🏼

The Learn R Through Examples by Xijin Ge, Jianli Qi, and Rong Fan provides an introduction to data analysis with R. The book covers the core topics of data analysis using different datasets, from simple and clean datasets to messy and big datasets. 🧵👇🏼

image/png

  • All
  • Subscribed
  • Moderated
  • Favorites
  • Leos
  • mdbf
  • magazineikmin
  • thenastyranch
  • Youngstown
  • slotface
  • khanakhh
  • InstantRegret
  • everett
  • kavyap
  • tsrsr
  • osvaldo12
  • PowerRangers
  • DreamBathrooms
  • cubers
  • Durango
  • hgfsjryuu7
  • ngwrru68w68
  • vwfavf
  • cisconetworking
  • rosin
  • tester
  • tacticalgear
  • GTA5RPClips
  • ethstaker
  • modclub
  • normalnudes
  • anitta
  • All magazines