Smaug-72B-v0.1: The New Open-Source LLM Roaring to the Top of the Leaderboard (huggingface.co)
Abacus.ai:...
From the abstract: “Recent research, such as BitNet, is paving the way for a new era of 1-bit Large Language Models (LLMs). In this work, we introduce a 1-bit LLM variant, namely BitNet b1.58, in which every single parameter (or weight) of the LLM is ternary {-1, 0, 1}.”...
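The abstract's ternary weights can be illustrated with a short sketch. This is a hedged, pure-Python illustration of the absmean-style rounding the BitNet b1.58 paper describes (scale each weight by the mean absolute value, then round and clip to {-1, 0, 1}); the function name and the exact epsilon are illustrative, not from any released BitNet code.

```python
# Illustrative sketch of ternary ("1.58-bit") weight quantization as
# described in the BitNet b1.58 abstract: every weight becomes -1, 0, or +1.
# absmean scaling follows the paper's description; names are hypothetical.

def absmean_quantize(weights, eps=1e-8):
    """Scale weights by their mean absolute value, then round to {-1, 0, 1}."""
    scale = sum(abs(w) for w in weights) / len(weights) + eps
    ternary = [max(-1, min(1, round(w / scale))) for w in weights]
    return ternary, scale

q, s = absmean_quantize([0.4, -1.2, 0.05, 0.9, -0.3])
print(q)  # every entry is -1, 0, or +1
```

The payoff claimed in the paper is that matrix multiplies against {-1, 0, 1} weights reduce to additions and subtractions, with the per-tensor scale applied once at the end.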
Based on DeepSeek Coder, the current SOTA 33B model. It allegedly has GPT-3.5 levels of performance; I'll be excited to test it once I've made exllamav2 quants, and will try to update with my findings as a copilot model
here is the refiner: huggingface.co/…/stable-diffusion-xl-refiner-1.0
Context: Falcon is a popular free LLM; this is their biggest model yet, and they claim it's now the best open model on the market.
A short journey to long-context models....
cross-posted from: https://lemmy.world/post/135600...
I’ve been using TheBloke’s Q8 of huggingface.co/teknium/OpenHermes-2.5-Mistral-7B, but now this one (huggingface.co/…/OpenHermes-2.5-neural-chat-7b-v3…) I think is killing it. Has anyone else tested it?
Description:...
This release is trained on a curated filtered subset of most of our GPT-4 augmented data....
The open-source language model Llama3 has been released, and it has been confirmed that it can be run locally on a single GPU with only 4GB of VRAM using the AirLLM framework. Llama3’s performance is comparable to GPT-4 and Claude3 Opus, and its success is attributed to its massive increase in training data and technical...
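The 4GB-VRAM claim rests on layer-by-layer inference: rather than holding every transformer layer in memory at once, load one layer from disk, apply it to the activations, and free it before loading the next. A hedged sketch of that idea, with toy stand-ins (the names and the dict of "layer files" are hypothetical, not AirLLM's actual API):

```python
# Sketch of layered inference: only one layer's weights are resident
# at a time, so peak memory is bounded by the largest single layer.

def run_layered(hidden, layer_files, load_layer, free_layer):
    """Apply each layer in sequence, keeping only one in memory at a time."""
    for path in layer_files:
        layer = load_layer(path)   # bring one layer's weights into memory
        hidden = layer(hidden)     # apply it to the current activations
        free_layer(layer)          # release before loading the next layer
    return hidden

# Toy stand-ins: each "file" holds a scale factor applied to the activation.
weights = {"layer0.bin": 2.0, "layer1.bin": 0.5, "layer2.bin": 3.0}
load = lambda path: (lambda h: h * weights[path])
out = run_layered(1.0, ["layer0.bin", "layer1.bin", "layer2.bin"], load, lambda l: None)
print(out)  # 3.0
```

The trade-off is throughput: every forward pass re-reads weights from disk, so this buys memory headroom at the cost of latency.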
This one is based on Llama 2. The first one worked very well for rule and structure following with Guidance, so I’m highly intrigued to see if this lives up to its predecessor.
A very capable chat model built on top of the new Mistral MoE model, trained on the SlimOrca dataset for 1 epoch, using QLoRA.
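For anyone unfamiliar with the QLoRA mention: the base weights stay frozen (and, in QLoRA, quantized to 4-bit), and only a low-rank adapter B·A of rank r is trained, scaled by alpha/r. A hedged pure-Python sketch of that merge step (matrix sizes and names are illustrative; real training uses a library like peft):

```python
# Sketch of the LoRA update at the heart of QLoRA: the effective weight is
# W + (alpha / r) * B @ A, where W is frozen and only A, B are trained.
# Plain lists-of-lists keep the example dependency-free.

def matmul(A, B):
    """Naive matrix multiply on lists of lists."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

def lora_merge(W, A, B, alpha, r):
    """Return W + (alpha / r) * B @ A, the effective weight after merging."""
    delta = matmul(B, A)           # low-rank update, shape matches W
    s = alpha / r
    return [[w + s * d for w, d in zip(wr, dr)] for wr, dr in zip(W, delta)]

W = [[1.0, 0.0], [0.0, 1.0]]       # frozen 2x2 base weight
A = [[1.0, 2.0]]                   # r x d_in, rank r = 1
B = [[0.5], [1.0]]                 # d_out x r
print(lora_merge(W, A, B, alpha=2, r=1))
```

Because only A and B carry gradients, the optimizer state is tiny compared to full fine-tuning, which is why a single epoch over SlimOrca is feasible on modest hardware.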
We find GPT-4 judgments correlate strongly with humans, with human agreement with GPT-4 typically similar or higher than inter-human annotator agreement....
The model is trained on his own Orca-style dataset, as well as some Airoboros, apparently to increase creativity...
These are the full weights; quants are already incoming from TheBloke. Will update this post when they’re fully uploaded...
This is Llama 2 13B with some additional attention heads from original-flavor Llama 33B frankensteined on....
cross-posted from: lemmy.world/post/1760388...
From: https://old.reddit.com/r/LocalLLaMA/comments/14hy369/wizardlm33bv10uncensored/...