@sisyphean@programming.dev
@sisyphean@programming.dev avatar

sisyphean

@sisyphean@programming.dev

A little insane, but in a good way.

Why this name?

This profile is from a federated server and may be incomplete. Browse more on the original instance.

How Is ChatGPT’s Behavior Changing over Time? (arxiv.org)

GPT-3.5 and GPT-4 are the two most widely used large language model (LLM) services. However, when and how these models are updated over time is opaque. Here, we evaluate the March 2023 and June 2023 versions of GPT-3.5 and GPT-4 on four diverse tasks: 1) solving math problems, 2) answering sensitive/dangerous questions, 3)...

Online Game: A GPT-4 Capability Forecasting Challenge (nicholas.carlini.com)

This is a game that tests your ability to predict (“forecast”) how well GPT-4 will perform at various types of questions. (In caase you’ve been living under a rock these last few months, GPT-4 is a state-of-the-art “AI” language model that can solve all kinds of tasks.)...

How to Use AI to Do Stuff: An Opinionated Guide (www.oneusefulthing.org)

Increasingly powerful AI systems are being released at an increasingly rapid pace. This week saw the debut of Claude 2, likely the second most capable AI system available to the public. The week before, Open AI released Code Interpreter, the most sophisticated mode of AI yet available. The week before that, some AIs got the...

Simon Willison’s LLM CLI tool now supports self-hosted language models via plugins (simonwillison.net)

LLM is my command-line utility and Python library for working with large language models such as GPT-4. I just released version 0.5 with a huge new feature: you can now install plugins that add support for additional models to the tool, including models that can run on your own hardware....

Claude 2 (www.anthropic.com)

We are pleased to announce Claude 2, our new model. Claude 2 has improved performance, longer responses, and can be accessed via API as well as a new public-facing beta website, claude.ai. We have heard from our users that Claude is easy to converse with, clearly explains its thinking, is less likely to produce harmful outputs,...

sisyphean,
@sisyphean@programming.dev avatar

It isn’t available outside the US and the UK, so I can’t try it yet, but I will as soon as I get access.

SUSE Preserves Choice in Enterprise Linux by Forking RHEL with a $10+ Million Investment | SUSE (www.suse.com)

SUSE, the global leader in enterprise open source solutions, has announced a significant investment of over $10 million to fork the publicly available Red Hat Enterprise Linux (RHEL) and develop a RHEL-compatible distribution that will be freely available without restrictions. This move is aimed at preserving choice and...

PoisonGPT: How we hid a lobotomized LLM on Hugging Face to spread fake news (blog.mithrilsecurity.io)

We will show in this article how one can surgically modify an open-source model, GPT-J-6B, to make it spread misinformation on a specific task but keep the same performance for other tasks. Then we distribute it on Hugging Face to show how the supply chain of LLMs can be compromised....

sisyphean,
@sisyphean@programming.dev avatar

I would be happy to, but all current local models are vastly inferior to GPT-3.5. The unfortunate reality is that if you want to create anything high quality, you must use the OpenAI API.

sisyphean,
@sisyphean@programming.dev avatar

It would summarize the link. Unfortunately that’s an edge case where the bot doesn’t do what you mean.

  • All
  • Subscribed
  • Moderated
  • Favorites
  • Leos
  • Durango
  • magazineikmin
  • InstantRegret
  • hgfsjryuu7
  • vwfavf
  • Youngstown
  • slotface
  • thenastyranch
  • ngwrru68w68
  • rosin
  • kavyap
  • PowerRangers
  • DreamBathrooms
  • anitta
  • khanakhh
  • mdbf
  • tacticalgear
  • cubers
  • ethstaker
  • osvaldo12
  • everett
  • cisconetworking
  • GTA5RPClips
  • modclub
  • tester
  • normalnudes
  • provamag3
  • All magazines