The End To End Data Science With R is a new book by Rene Essomba. The book, as the name implies, focuses on the core data science applications using R ❤️. This book covers the following topics:
✅ Exploratory data analysis
✅ Data visualization
✅ Supervised learning
✅ Unsupervised learning
✅ Time series
✅ Natural language processing
✅ Image classification
I'm looking for consulting work in data science/data analysis. I'm based in Christchurch, New Zealand, but I'm happy to work remote.
I have a background in media & communication, so I'm happy working with clients of any technical ability ^_^ I've worked on projects ranging from surveys of social workers to network analysis, so I'm happy to try anything!
The MLX is Apple's framework for machine learning applications on Apple silicon. The MLX examples repository provides a set of examples for using the MLX framework. This includes examples of:
✅ Text models such as transformer, Llama, Mistral, and Phi-2 models
✅ Image models such as Stable Diffusion
✅ Audio and speech recognition with OpenAI's Whisper
✅ Support for some Hugging Face models
(1/2) Hands-On Mathematical Optimization with Python 🚀
The Hands-On Mathematical Optimization with Python book by Krzysztof Postek, Alessandro Zocca, Joaquim Gromicho, and Jeffrey Kantor provides the foundation for mathematical optimization. As the name implies, the book is hands-on with Python examples, mainly using Pyomo.
Data Wrangler is a new Microsoft VScode extension for data exploratory analysis. It supports Python 🐍 and Pandas 🐼 DataFrame objects and is integrated into VScode Jupyter Notebooks. Here are some of the functionalities of Data Wrangler:
✅ Data review
✅ Column filtering
✅ Summary statistics
✅ Data cleaning and transformation
✅ Hadeling missing values
✅ Creating new fields
MIT launched the 2024 edition of the Introduction to Deep Learning course by Prof. Alexander Amini and Prof.Ava Amini. The course started at the end of April and will run until June. The course lectures are published weekly. The course syllabus keeps changing from year to year, reflecting the rapid changes in this field.
(1/2) Google released a new foundation model for time series forecasting 🚀
The TimeFM (Time Series Foundation Model) is a foundation model for time series forecasting applications. This pre-trained model was developed by the Google Research team. It joins the recent trend of leveraging foundation models for time series forecasting, which includes Salesforce's Moirai and Amazon's Chronos.
Join BetaNYC and the MTA Open Data team on June 5th at 9:30am to explore recently published MTA operating budget datasets. We’ll learn about the MTA’s open data program and conduct insightful analyses with their data.
(1/2) I have been following the work of @stevensanderson and David Kum for a few years now, and I am excited to see the release of their new book 🥳- Extending Excel with Python and R 🚀.
The book focuses on the common conjunction and collaboration between data scientists and Excel users. This includes scaling and automating #Excel tasks with #RStats and #Python and core data science applications such as data wrangling, working with APIs, data visualization, and modeling.
Join #PyData#Pittsburgh for a casual gathering of the local, national, and international PyData community on the sidelines of #PyCon US 2024! Meet up with fellow #DataScience, #MachineLearning, and scientific computing enthusiasts when the world's largest Python conference comes to town.
In the past few months, I created a bunch of Docker 🐳 tutorials covering random topics, from a fun setting for a Python 🐍 environment on the CLI to advanced topics such as multi-stage builds 🏗️. I organized all the tutorials under one folder, and I plan to keep updating this folder with future-related ones 😎.
Currently on my Docker tutorial TODO list:
➡️ Docker ENTRYPOINT vs CMD
➡️ Docker multi-architecture build
Day 4. Are Your Students Learning? How to check for learning as we’re teaching
Here are 5 exercises you can use as formative assessments when teaching coding skills, that will allow you to support your student at the moment when the learning is happening.
I've ended up with an inquiry from a #DataScience student interested in examples of #harm coming to people because of #data collection (especially in a war context, but open to anything). Do folks have any favorite pointers / examples I could pass along?
(1/2) Prompt Fuzzer - a new open-source project for LLM security 👇🏼
Prompt Fuzzer is a new open-source project that provides a set of functions for assessing the security of GenAI applications. This CLI-based tool enables you to run and test your system prompts to identify security vulnerabilities against potential dynamic LLM-based attacks.