I helped my wife with some pre-course application last night (she's applying for a #DataScience bootcamp).
Later, she said it was nice to "see me use my brain". It wasn't an insult: all my previous jobs have been quite mindless, whereas her job is very challenging.
I then started reading my new book: The Computers That Made Britain from @Raspberry_Pi press. I got stuck on a word in the second paragraph 🤣🤣
Hello Mastodon world! I'm dyedbird. Husband, father and a brother from Maryland. Transitioned to #datascience about 1.5 years ago from another career and have been enjoying it so far. Want to learn more about #linux, #python3, #ai
When you are starting a new #datascience project but you have to use Windows and aren't allowed Docker. And it crashes (see Windows) and you forget to switch back to your environment to install the rest of the packages and mess up #anaconda base:
(Alt text here, otherwise it deletes the GIF. Ellen Ripley from Aliens: I say we take off and nuke it from orbit... It's the only way to be sure)
(1/3) I created a step-by-step tutorial for launching and customizing the RStudio server in a container using the Rocker RStudio image 🐳 and the run command 🚀 👇🏼
Setting up and running RStudio inside a containerized environment is easier than it seems, thanks to the Rocker project. This tutorial mainly focuses on the docker run command.
Anybody out there looking for an ML or software engineer with >30 years total experience and ~20 years in the industry?
I have extensive experience with #Python and #ML frameworks, particularly #TensorFlow, and I've worked on #NLP and #ImageProcessing both in the workplace and in personal open source projects. My resume is available here:
(1/4) TIL about the plotnine library: the grammar of graphics in Python 🚀
I had never heard of the plotnine library until I came across the Posit Plotnine contest (see the link below). Plotnine is a Python implementation of the grammar of graphics, based on the ggplot2 library.
Version 1.7.1 of the NeuralForecast #Python library was released last month by Nixtla. The NeuralForecast library, as the name implies, provides a neural network framework for time series forecasting. 🧵👇🏼
'The DSI and the IDI, with support from the 11th Hour Project, launched a new tool called PalmWatch on Feb. 22. Using rigorous data science and advanced, low-cost data visualization methods, PalmWatch traces palm oil supplies from the ground level, where the environmental and social impacts of palm oil cultivation occur, to the consumer brands that use the oil in their products.'
Struggling with weird variable names in R? make.names to the rescue! This function wrangles your names into R-approved format (letters, numbers, periods, underscores). Bonus: set unique = TRUE for no duplicates! Try it on funky characters & data frames! 🪄 Master make.names and become an R name-wrangling pro! #DataScience #R #RStats #RProgramming #Coding #Programming
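For readers who think in Python, here is a rough sketch of the renaming rules make.names applies. It is not R's implementation: the `make_names` helper and the `R_RESERVED` set are hypothetical names made up for illustration, it only handles ASCII letters, and the de-duplication step is a simplified stand-in for R's make.unique, so edge cases can differ from what R returns.

```python
import re

# Reserved words in R; make.names() appends a "." to names that match one.
# (Illustrative list, assumed to cover the common cases.)
R_RESERVED = {
    "if", "else", "repeat", "while", "function", "for", "next", "break",
    "in", "TRUE", "FALSE", "NULL", "Inf", "NaN", "NA",
}


def make_names(names, unique=False):
    """Approximate R's make.names() for ASCII input (hypothetical helper)."""
    cleaned = []
    for name in names:
        # A valid name starts with a letter, or with a period that is
        # not followed by a digit. Otherwise prepend "X".
        if not re.match(r"[A-Za-z]|\.(?![0-9])", name):
            name = "X" + name
        # Anything other than letters, digits, periods, and underscores
        # becomes a period.
        name = re.sub(r"[^A-Za-z0-9._]", ".", name)
        # Reserved words get a trailing period.
        if name in R_RESERVED:
            name += "."
        cleaned.append(name)

    if not unique:
        return cleaned

    # Simplified de-duplication: the first occurrence is kept as-is,
    # later ones get ".1", ".2", ... appended.
    seen, counts, result = set(), {}, []
    for name in cleaned:
        candidate = name
        while candidate in seen:
            counts[name] = counts.get(name, 0) + 1
            candidate = f"{name}.{counts[name]}"
        seen.add(candidate)
        result.append(candidate)
    return result


print(make_names(["my var", "1abc", "ok_name", "if"]))
print(make_names(["a", "a", "a"], unique=True))
```

The first call shows the three rules (invalid characters, leading digit, reserved word); the second shows what unique = TRUE is meant to guarantee.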
MLX is Apple's framework for machine learning applications on Apple silicon. The MLX examples repository provides a set of examples for using the MLX framework. This includes examples of:
✅ Text models such as transformer, Llama, Mistral, and Phi-2 models
✅ Image models such as Stable Diffusion
✅ Audio and speech recognition with OpenAI's Whisper
✅ Support for some Hugging Face models
Bash is a useful language for automating processes on the command line and has a lot of applications from IT to MLOps. The Bash Scripting on Linux course by Jay LaCroix is an intro course for Bash. The course focuses on the foundation of Bash scripting, and it covers the following topics:
✅ Working with variables
✅ If-Else statements
✅ Loops
✅ Functions
✅ Arguments
✅ Scheduling
Data Wrangler is a new Microsoft VS Code extension for exploratory data analysis. It supports Python 🐍 and Pandas 🐼 DataFrame objects and is integrated into VS Code Jupyter Notebooks. Here are some of the functionalities of Data Wrangler:
✅ Data review
✅ Column filtering
✅ Summary statistics
✅ Data cleaning and transformation
✅ Handling missing values
✅ Creating new fields
(1/3) Modeling Short Time Series with Prior Knowledge in PyMC 🚀
Yesterday, I shared an article by Tim Radtke about forecasting with insufficient time series data using a Bayesian approach in R. Here is the Python version 🧵👇🏼
Meta released Llama 3 today, the next generation of the Llama model. Llama 3 is a state-of-the-art open-source large language model. Here are some of the key features of the model: 🧵👇🏼
Fabric is a new open-source project that provides a framework to support AI applications. The goal of Fabric is to unify communication with AI agents (e.g., LLMs) by creating a library of Patterns (e.g., prompts) for day-to-day use cases.
Unleash Excel date power in R! Convert formats to proper dates effortlessly. With as.Date() & convertToDateTime(), transform data for smoother analysis. Dive into R, empower your data journey! Try it yourself & elevate your analysis game!
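The arithmetic behind that conversion, sketched in Python. Excel stores a date as a count of days, and with the default Windows date system the usual origin is 1899-12-30 (the value commonly passed as `origin` to as.Date() in R). This assumes the post means convertToDateTime() from the openxlsx package; `excel_serial_to_datetime` is a hypothetical helper name, and the sketch is only reliable for dates after 1900-03-01 because of Excel's historical leap-year quirk.

```python
from datetime import datetime, timedelta

# Assumed origin for Excel's 1900 date system (Windows default).
EXCEL_EPOCH = datetime(1899, 12, 30)


def excel_serial_to_datetime(serial):
    """Convert an Excel serial number to a datetime (hypothetical helper).

    The integer part counts days since the origin; the fractional part
    is the time of day.
    """
    return EXCEL_EPOCH + timedelta(days=serial)


print(excel_serial_to_datetime(45000))    # 2023-03-15 00:00:00
print(excel_serial_to_datetime(45000.5))  # 2023-03-15 12:00:00
```

Workbooks saved with the 1904 date system use a different origin, so check which one your file uses before trusting the result.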