Everybody’s talking about Mistral, an upstart French challenger to OpenAI (arstechnica.com)

On Monday, Mistral AI announced a new AI language model called Mixtral 8x7B, a "mixture of experts" (MoE) model with open weights that reportedly matches OpenAI's GPT-3.5 in performance, a claim others have made in the past but one that is being taken seriously by AI heavyweights such as OpenAI's Andrej Karpathy...

Mixtral 8x7B can process a 32K token context window and works in French, German, Spanish, Italian, and English. (arstechnica.com)

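The "mixture of experts" design is what lets Mixtral punch above its active-parameter count: for each token, a small router picks 2 of 8 expert feed-forward networks and mixes their outputs by the router's softmax weights. A minimal sketch of that routing pattern in PyTorch (illustrative only, not Mistral's implementation; the layer sizes are placeholders):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SparseMoE(nn.Module):
    """Top-k mixture-of-experts layer: route each token to k of n experts."""

    def __init__(self, dim=512, hidden=2048, n_experts=8, top_k=2):
        super().__init__()
        self.router = nn.Linear(dim, n_experts, bias=False)
        self.experts = nn.ModuleList([
            nn.Sequential(nn.Linear(dim, hidden), nn.SiLU(), nn.Linear(hidden, dim))
            for _ in range(n_experts)
        ])
        self.top_k = top_k

    def forward(self, x):                        # x: (tokens, dim)
        scores = self.router(x)                  # (tokens, n_experts)
        weights, idx = scores.topk(self.top_k, dim=-1)
        weights = F.softmax(weights, dim=-1)     # renormalize over chosen experts
        out = torch.zeros_like(x)
        for slot in range(self.top_k):
            for e, expert in enumerate(self.experts):
                mask = idx[:, slot] == e         # tokens sent to expert e in this slot
                if mask.any():
                    w = weights[mask, slot].unsqueeze(-1)
                    out[mask] += w * expert(x[mask])
        return out
```

Because only 2 of the 8 expert MLPs run for any given token, total parameters and per-token compute decouple: Mixtral's roughly 47B parameters serve at roughly the per-token cost of a ~13B dense model.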

Long Sequence Modeling with XGen: A 7B LLM Trained on 8K Input Sequence Length (blog.salesforceairesearch.com)

TLDR We trained a series of 7B LLMs named XGen-7B with standard dense attention on up to 8K sequence length, for up to 1.5T tokens. We also fine-tune the models on public-domain instructional data. The main takeaways are: * On standard NLP benchmarks, XGen achieves comparable or better results...

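The "standard dense attention" detail is the notable part: dense attention computes an L x L score matrix per head, so raising the training length from the usual 2K to 8K multiplies that cost by 16. A rough back-of-envelope (the head count here assumes a typical 7B-class configuration, not necessarily XGen's exact one):

```python
# Memory for one layer's attention score matrices in fp16.
# Dense attention scales quadratically with sequence length L.
def attn_scores_gib(seq_len, n_heads=32, bytes_per=2):
    return seq_len ** 2 * n_heads * bytes_per / 2 ** 30

for L in (2048, 8192):
    print(f"L={L:5d}: {attn_scores_gib(L):.2f} GiB per layer")
# L= 2048: 0.25 GiB per layer
# L= 8192: 4.00 GiB per layer
```

Fused kernels like FlashAttention avoid materializing the full matrix, but the compute still scales quadratically, which is a big part of why most 7B models of this era stopped at 2K.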

OC Long Context Lengths (And Low Resource Friendly)

Thought I'd ask and see if y'all are familiar with any upcoming models or techniques that I'm not. I'm aware of the MPT-7B-StoryWriter model and the RWKV models that support up to 8192 tokens, but that's about it as far as "long" context lengths go. I also want to run this in a VM with limited resources. The most I will be...
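
For the low-resource angle, the number that grows with context length at inference time is the KV cache. A quick sanity check of whether a given context fits in a VM's RAM (the model shape below assumes a generic 7B-class transformer; swap in the real config of whatever you run):

```python
# Per token, each layer caches a key and a value vector per head:
#   2 (K and V) * n_layers * n_heads * head_dim * bytes_per_element
def kv_cache_gib(context_len, n_layers=32, n_heads=32, head_dim=128, bytes_per=2):
    return 2 * n_layers * n_heads * head_dim * bytes_per * context_len / 2 ** 30

for ctx in (2048, 8192, 32768, 65536):
    print(f"{ctx:6d} tokens -> {kv_cache_gib(ctx):4.1f} GiB KV cache (fp16)")
#   2048 tokens ->  1.0 GiB  ...  65536 tokens -> 32.0 GiB
```

This is also part of why RWKV is attractive here: as a recurrent model it carries a constant-size state instead of a cache that grows with the context.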

The Curse of Recursion: Training on Generated Data Makes Models Forget (arxiv.org)

Stable Diffusion revolutionised image creation from descriptive text. GPT-2, GPT-3(.5) and GPT-4 demonstrated astonishing performance across a variety of language tasks. ChatGPT introduced such language models to the general public. It is now clear that large language models (LLMs) are here to stay, and will bring about drastic...

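The failure mode the paper describes can be reproduced in a toy setting: fit a model to data, sample from the fit, refit on the samples, and repeat. A minimal sketch with a one-dimensional Gaussian standing in for the language model (not the paper's code):

```python
import numpy as np

rng = np.random.default_rng(0)
mu, sigma = 0.0, 1.0   # generation 0: the real data distribution
n = 100                # samples available to each generation

for gen in range(1, 21):
    synthetic = rng.normal(mu, sigma, n)           # sample from the current model
    mu, sigma = synthetic.mean(), synthetic.std()  # refit on generated data only
    print(f"gen {gen:2d}: mu={mu:+.3f}  sigma={sigma:.3f}")

# sigma follows a multiplicative random walk whose expected log-step is
# negative (finite-sample noise plus the downward-biased ML variance
# estimate), so over enough generations the fitted distribution's tails
# vanish. The LLM analogue: rare, low-probability text disappears first.
```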

Meta's Open-Source Massively Multilingual Speech AI Handles over 1,100 Languages (www.infoq.com)

Meta AI open-sourced the Massively Multilingual Speech (MMS) model, which supports automatic speech recognition (ASR) and text-to-speech synthesis (TTS) in over 1,100 languages and language identification (LID) in over 4,000 languages. MMS can outperform existing models while covering nearly 10x as many languages.
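
The checkpoints are on Hugging Face, so trying ASR takes a few lines of transformers code. A sketch following the transformers MMS documentation (the 1B "all" checkpoint plus a per-language adapter; verify the model id against the current hub listing):

```python
import torch
from transformers import AutoProcessor, Wav2Vec2ForCTC

model_id = "facebook/mms-1b-all"
processor = AutoProcessor.from_pretrained(model_id)
model = Wav2Vec2ForCTC.from_pretrained(model_id)

# Switch languages by loading the matching adapter and tokenizer vocab;
# MMS uses ISO 639-3 codes ("fra" = French).
processor.tokenizer.set_target_lang("fra")
model.load_adapter("fra")

def transcribe(waveform):
    """waveform: 16 kHz mono float array (e.g. loaded with soundfile)."""
    inputs = processor(waveform, sampling_rate=16_000, return_tensors="pt")
    with torch.no_grad():
        logits = model(**inputs).logits
    ids = torch.argmax(logits, dim=-1)[0]
    return processor.decode(ids)
```

LID and TTS ship as separate checkpoints under the same facebook/mms-* namespace.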
