simon,
@simon@simonwillison.net avatar

Leaked Google document: “We Have No Moat, And Neither Does OpenAI”

The most interesting thing I've read recently about LLMs - a purportedly leaked document from a researcher at Google talking about the huge strategic impact open source models are having
https://simonwillison.net/2023/May/4/no-moat/

miki,
@miki@dragonscave.space avatar

@simon OpenAI could get a moat if they were willing to invest more in the ChatGPT plugin ecosystem, especially if they added some kind of (embeddings-based) long-term memory.
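
A minimal sketch of that embeddings-based memory idea, for concreteness: store past snippets alongside embedding vectors, then retrieve the nearest ones to prepend to a new prompt. The embed() function here is a placeholder standing in for a real embedding model, not an actual API.

```python
# Sketch of embeddings-based long-term memory: store snippets with their
# embedding vectors, retrieve nearest neighbours by cosine similarity.
import numpy as np

def embed(text: str) -> np.ndarray:
    # Placeholder: a real system would call an embedding model here.
    rng = np.random.default_rng(abs(hash(text)) % (2**32))
    v = rng.normal(size=384)
    return v / np.linalg.norm(v)

memory: list[tuple[str, np.ndarray]] = []

def remember(text: str) -> None:
    memory.append((text, embed(text)))

def recall(query: str, k: int = 3) -> list[str]:
    # Vectors are unit-length, so a dot product is cosine similarity.
    q = embed(query)
    ranked = sorted(memory, key=lambda item: -float(item[1] @ q))
    return [text for text, _ in ranked[:k]]

remember("User prefers answers in Python.")
remember("User is building a Mastodon client.")
print(recall("Which language should code examples use?", k=1))
```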

joapen,
@joapen@masto.ai avatar

@simon very interesting post, Simon - thanks for sharing

yogsototh,

@simon I am so glad, because I read it costs about $80k to train a full model. I had expected, on the contrary, that open source could never reach the same quality. That really is a relief.

shajith,

@simon Excellent doc there. I keep thinking Google should respond to Meta’s stroke of luck with LLaMA by shipping an LLM browser API and local model support in Chrome.

MudMan,

@simon We don't talk enough about how one of the big bugbears at the start of the ML explosion was the assumption that these models would be stuck under corporate control forever because the tech would be proprietary and expensive to run.

That says nothing about the likelihood of the other risks, but I admit I was on board with that one, and it didn't quite materialize.

grandfunk,
@grandfunk@fosstodon.org avatar

@simon enjoyed this and your blog generally. Keep it up.

jimgar,

@simon Hey Simon, I’ve been holding off on using ChatGPT, Bard, etc., even though I think they could be useful, because I can see (especially with ChatGPT) the horrible unethical behaviour of these companies in their arms race to deploy deploy deploy. With all the talk in this leaked doc about open source alternatives, do you know of any LLMs that are “ethically sourced” and available for the average punter to use? I don’t want to be left behind :/

simon,
@simon@simonwillison.net avatar

@jimgar the ethics of this stuff is incredibly complicated

I'm very optimistic about the models being trained on the RedPajama data - there's one out already and evidently more to follow very shortly https://simonwillison.net/tags/redpajama/

simon,
@simon@simonwillison.net avatar

Claude is another option - one of the most promising closed alternatives to ChatGPT. Anthropic have a distinctive approach to AI safety which they call "constitutional AI" https://www.anthropic.com/index/introducing-claude

jimgar,

@simon thank you so much, I’ll give these a look. Everywhere I look in tech it’s one ethical nightmare after another 😵‍💫

resing,
@resing@social.coop avatar

@simon what's your take on the copyrighted material included in RedPajama through CommonCrawl? It seems to me that one could train a model on only text that has been shared freely and that might be more ethical. cc @jimgar

simon,
@simon@simonwillison.net avatar

@resing @jimgar I'm not convinced it's possible to train a usable LLM without including copyrighted material in the raw pretraining data

As such, I personally think it's a necessary evil to avoid a monopoly on LLM technology belonging to organizations that are willing to train against crawler data

resing,
@resing@social.coop avatar

@simon @jimgar not sure I follow. Are you saying that crawler data, which includes copyrighted material, shouldn’t be used by commercial companies, and that LLMs are inherently flawed because of that? If so, I’m not saying you’re wrong, just trying to understand.

simon,
@simon@simonwillison.net avatar

@resing @jimgar I'm saying I'm not sure it's possible to build a useful LLM without including copyrighted data in the training set

The ethics of this entire field are incredibly murky - I wrote about that last year https://simonwillison.net/2022/Aug/29/stable-diffusion/#ai-vegan

jimgar,

@simon @resing it all feels fundamentally wrong, so long as the results rely on indiscriminate harvesting of people’s work without permission. Literally the only compelling argument I have heard is the “necessary evil” Simon mentions - doing it anyway but making it open source. I just find it sad that this is the position we’re in at all, and worse, how little the majority of people seem to care about provenance and permissions, full stop.

simon,
@simon@simonwillison.net avatar

@jimgar @resing search engines work by indiscriminately harvesting people's work without their permission, and have done for decades

What's different here isn't how the things are built, it's what they can be used for

People mostly tolerated search engines because they saw them as useful - they helped people's work be found, they didn't (appear to) threaten their livelihoods

simon,
@simon@simonwillison.net avatar

@jimgar @resing note that I'm not saying that search engines were morally/ethically pure here either!

The ethics around this are deeply complicated - there are no easy or obvious answers

resing,
@resing@social.coop avatar

@simon @jimgar the legal issue might be resolved soon. If @binarybits is right, Stable Diffusion could lose the copyright lawsuits filed against it. I buy his argument. If that's the case, LLMs trained only on material that permits that use might really take off https://arstechnica.com/tech-policy/2023/04/stable-diffusion-copyright-lawsuits-could-be-a-legal-earthquake-for-ai/

ppatel,
@ppatel@mstdn.social avatar

@simon That document was the best reading of this week by far.

eichin,
@eichin@mastodon.mit.edu avatar

@simon
For an anonymous doc, isn't "Having read through it, it looks real to me" a point in favor of it being LLM-written? (Not quite a "tell" but a cause to go Hmmmm.)

erica_sea55,
@erica_sea55@mastodon.social avatar

@simon oh wow, this is incredible, thanks!

numist,
@numist@xoxo.zone avatar

@simon tbh it's nice to see groups of researchers taking the lead on AI. it's not fun to imagine what the world would have been like had the Internet been the product of a race between two corporations

movonw,
@movonw@chaos.social avatar

@simon bazaar strikes back! 💥

stablehorde,
@stablehorde@sigmoid.social avatar

@simon and yet Google is instead tightening their grip harder!

jeancf,

@simon
LoRA is clearly a great tool but, to use an open source analogy, it feels like applying a kernel patch downstream: it gets the job done but at some point, if it is generic enough, it needs to be upstreamed. And that part is not possible with LoRA. To integrate the modification into the model, a full retraining is inevitable.
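
As an aside, a minimal sketch of the low-rank update LoRA trains, with illustrative dimensions: the adapter learns two small matrices against a frozen base weight, so the learned change is a purely additive term.

```python
# Sketch of LoRA: train low-rank factors B and A against a frozen weight W.
import numpy as np

d, k, r = 768, 768, 8                # layer dimensions; rank r << d, k
W = np.random.randn(d, k)            # frozen pretrained weight
A = np.random.randn(r, k) * 0.01     # trainable down-projection
B = np.zeros((d, r))                 # trainable up-projection, zero-init

def forward(x: np.ndarray) -> np.ndarray:
    # Fine-tuning updates only A and B; W never changes.
    return x @ W.T + x @ (B @ A).T

# Folding the learned delta into the same base checkpoint is simple addition.
W_merged = W + B @ A
```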

piccolbo,

@simon The reading list alone is gold.

overbyte,
@overbyte@gamepad.club avatar

@simon This actually solves one of my fundamental problems with current LLM tools like ChatGPT and Copilot: that you basically have to stream all of your content/code to Microsoft to use their tool. This seems to indicate that running an open source server would be entirely feasible.

If the models are also trained using only correctly licensed material (rather than Microsoft buying GitHub and ignoring the licences for the model), then we have a full house
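
For illustration, running a model locally can be as simple as pointing the llama-cpp-python bindings at a downloaded checkpoint - the model path and prompt below are placeholders, and which models you may run depends on their licences.

```python
# Sketch: local inference with llama-cpp-python, no third-party API involved.
from llama_cpp import Llama

# Placeholder path: any GGML-format checkpoint you are licensed to run.
llm = Llama(model_path="./models/7B/ggml-model-q4_0.bin")

response = llm(
    "Q: Write a Python function that reverses a string. A:",
    max_tokens=128,
    stop=["Q:"],  # stop generating when the model starts a new question
)
print(response["choices"][0]["text"])
```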

frijolito,

@simon I’m not understanding why this is a surprise: the larger companies are milking the models they have, since they’re clearly providing a ROI, while the open source communities are getting excited about innovating on the underlying components

simon,
@simon@simonwillison.net avatar

@frijolito until recently I thought that the cost involved in training a model would mean the open source community would always be several steps behind OpenAI and Google - apparently at least one person inside Google doesn't think that's true

nelson,

@simon thank you for highlighting this and summarizing some interesting points. I really appreciate the insight you're giving into current AI developments.

matt,

@simon Does all of the work on top of LLaMA actually count? After all, that model was leaked out of Facebook.

simon,
@simon@simonwillison.net avatar

@matt it proved that it was all possible to run on end-user hardware - and the openly licensed trained-from-scratch LLaMA alternatives are already starting to emerge https://simonwillison.net/2023/May/3/openllama/

matt,

@simon Oh damn, I hadn't seen that post yet. Things are definitely heating up.

matt,

@simon After thinking about this a little more, I wonder if OpenAI still has a moat in GPT-4's ability to work with image inputs. The applications of that for accessibility sound really promising, though most of us don't actually have access to that feature yet, so I suppose it could turn out to be smoke and mirrors.

simon,
@simon@simonwillison.net avatar

@matt they still haven't shipped that! Meanwhile there are already open models that can do that surprisingly well: https://simonwillison.net/2023/Apr/19/llava-large-language-and-vision-assistant/

matt,

@simon Wow, yeah, that is impressive. Can't wait to see what could be done with a model like that but fine-tuned for accessibility (e.g. render the UI in this image as something like an accessibility tree).

adamchainz,
@adamchainz@fosstodon.org avatar

@simon wow, open source wins again. Thanks for excerpting!

luis_in_brief,
@luis_in_brief@social.coop avatar

@simon Pairs interestingly with Zuckerberg on open models in their earnings call: https://s21.q4cdn.com/399680738/files/doc_financials/2023/q1/META-Q1-2023-Earnings-Call-Transcript.pdf

adr,
@adr@mastodon.social avatar

@simon holy shit this is terrific. and I mean just your blog post. Gonna dig into that document.
