FaceDeer
FaceDeer avatar

FaceDeer

@FaceDeer@kbin.social

Basically a deer with a human face. Despite probably being some sort of magical nature spirit, his interests are primarily in technology and politics and science fiction.

Spent many years on Reddit and is now exploring new vistas in social media.

FaceDeer,
FaceDeer avatar

Indeed. I frequently use LLMs as brainstorming buddies while working on creative things, like RPG adventure planning and character creation. I want the AI to come up with new and unexpected things that never existed before.

If I have need of the AI to account for "ground truths" then I use things like retrieval-augmented generation or database plugins that inject that stuff into the context.

FaceDeer,
FaceDeer avatar

The movie "Bender's Big Score" recontextualized Jurassic Bark and made it much nicer.

FaceDeer,
FaceDeer avatar

People's heights change over time too. Men and women can nevertheless have different average heights.

Tumblr and Wordpress to Sell Users’ Data to Train AI Tools (www.404media.co)

this could not be timed worse for Tumblr which is in huge hot water with its userbase already for its CEO breaking his sabbatical to ban a prominent trans user for allegedly threatening him (in a cartoonish manner), and then spending a week personally justifying it increasingly wildly across several platforms. the rumors had...

FaceDeer,
FaceDeer avatar

They're giving you services in exchange for your contents.

Does nobody even think about TOS any more? You don't have to read any specific one, just realize the basic universal truth that no website is going to accept your contents without some kind of legal protection that allows them to use that content.

FaceDeer,
FaceDeer avatar

I'd say it's how the Imperium swallowed up and destroyed a number of civilizations that had separated from them that had been developing in much more progressive, prosperous ways. The Olamic Quietude and the Interex come to mind as examples. They showed that humanity didn't have to go down the terrible path they've ended up on.

Or, going farther back to look for a single "worst thing" that's had the greatest awful knock-on effects, I'd say that'd be the Old Ones' refusal to grant any aid to the Necrontyr when they asked for it. That one selfish act sparked off the War in Heaven, created the Chaos Gods, and everything that followed.

If you can't find the books available through legal channels in your country, you might want to consider looking for them on the high seas. !piracy is a good resource for that sort of thing.

Fans Are Sharing The Most Frustrating Scenes In All Of 'Star Wars' (www.ranker.com)

Star Wars is a beloved franchise that has delighted fans for decades, and will continue to do so for the foreseeable future. However, sometimes things you love can have flaws. In this case, MAJOR flaws, frustratingly major flaws. Fans are sharing which scenes in the Star Wars movies they find...

FaceDeer,
FaceDeer avatar

The first line of the Disney trilogy was "This will begin to make things right." On its own, no biggie. But when you consider the utter dumpster fire of a trilogy that followed, which deliberately went out of its way to despoil the heroes of the previous Star Wars movies and destroy everything that they had worked to accomplish, incredibly frustrating. It's meta-frustrating. You can think of that line right before any of the other individually-frustrating scenes you may think of and it makes it even worse.

FaceDeer,
FaceDeer avatar

I actually don't mind that one particular scene much, and that comes from someone who really loathed The Last Jedi overall. Using the Force to propel oneself in zero gravity isn't bad, and the vacuum of space is not nearly as deadly in reality as science fiction often portrays it.

However, the one thing that did stick out as a glaring problem to me was the fact that the Raddus was fleeing the First Order's fleet at that moment, so its engines must have been firing at full thrust. So how is Leia and all that debris floating around motionless relative to the ship? Indeed, even if the ship wasn't actively thrusting, all that stuff was moving away from the Raddus pretty vigorously after the bridge blew open. Why did it stop? Is space actually an ocean?

FaceDeer,
FaceDeer avatar

A notable exception is the Stargate franchise, where Earth's spacecraft are largely run by the US Air Force.

FaceDeer,
FaceDeer avatar

Why does it matter why?

US government may sue PacifiCorp, a Warren Buffett utility, for nearly $1B in wildfire costs (apnews.com)

The U.S. government is threatening to sue PacifiCorp, a unit of Warren Buffett’s Berkshire Hathaway, to recover nearly $1 billion in costs related to the 2020 wildfires in southern Oregon and northern California, though the company is trying to negotiate a settlement....

FaceDeer,
FaceDeer avatar

It's interesting when I see headlines start popping up identifying particular individual billionaires as the owners of companies that have done something bad. Usually it's just the company itself that gets called out, since investors usually have little direct control over stuff like this.

FaceDeer,
FaceDeer avatar

If all Russia has to do to get people to back off is cry "escalation!" Then might as well just surrender to them now.

FaceDeer,
FaceDeer avatar

Things should get better when Betelgeuse goes supernova.

FaceDeer,
FaceDeer avatar

Frankly, these NATO expansions and its general re-invigoration are a larger loss for Russia than anything they could possibly gain in Ukraine. Their Baltic fleet is now useless. Kaliningrad is useless.

Combined with all the other damage Ukraine has inflicted on Russia, they're basically spiralling the drain and I see no possible way Russia could rise in prominence in the future. Even if goodness forbid they were to "win" the current war they're fighting with Ukraine, that won't help them, it'll only hurt Ukraine.

FaceDeer,
FaceDeer avatar

I ran to the rocks and hide my face
the rock cried out 'No hiding place'
there's no hiding place down here.

FaceDeer,
FaceDeer avatar

I use quotation marks there because what is often referred to as AI today is not whatsoever what the term once described.

The field of AI has been around for decades and covers a wide range of technologies, many of them much "simpler" than the current crop of generative AI. What is often referred to as AI today is absolutely what the term once described, and still does describe.

What people seem to be conflating is the general term "AI" and the more specific "AGI", or Artificial General Intelligence. AGI is the stuff you see on Star Trek. Nobody is claiming that current LLMs are AGI, though they may be a significant step along the way to that.

I may be sounding nitpicky here, but this is the fundamental issue that the article is complaining about. People are not well educated about what AI actually is and what it's good at. It's good at a huge amount of stuff, it's really revolutionary, but it's not good at everything. It's not the fault of AI when people fail to grasp that, no more than it's the fault of the car when someone gets into it and then is annoyed it won't take them to the Moon.

FaceDeer,
FaceDeer avatar

I didn't say that everything in Star Trek was AGI, just that you can find examples there.

Could Reddit's data be "poisoned" to prevent its use in training AI?

In case you didn’t know, you can’t train an AI on content generated by another AI because it causes distortion that reduces the quality of the output. It is also very difficult to filter out AI text from human text in a database. This phenomenon is known as AI collapse....

FaceDeer,
FaceDeer avatar

In case you didn’t know, you can’t train an AI on content generated by another AI because it causes distortion that reduces the quality of the output.

This is incorrect in the general case. You can run into problems if you do it incorrectly or in a naive manner. But this is stuff that the professionals have figured out months or years ago already. A lot of the better AIs these days are trained on "synthetic data", which is data that's been generated by other AIs.

I've seen a lot of people fall for wishful thinking on this subject. They don't like AI for whatever reason, they hear some news article that says something that sounds like "AI won't work because of problem X", and so they grab hold of that. "Model collapse" is one of those things, it's not really a problem that serious researchers consider insurmountable.

If you don't want Reddit to use your posts to train AI then don't post on Reddit. If you already did post on Reddit, it's too late, you already gave them your content. Bear this in mind next time you join a social media site, I guess.

FaceDeer,
FaceDeer avatar

Reddit's surely got a copy of the PushShift archives, it'll have all the pre-sabotage versions of those comments.

FaceDeer,
FaceDeer avatar

They're paying Reddit to not sue them.

Regardless, the content that's available through PS is the content that people are talking about overwriting or deleting. They can't edit or delete stuff that PushShift couldn't see in the first place.

FaceDeer,
FaceDeer avatar

Which is why nobody trains on ONLY AI generated data.

Really, experts have thought of this stuff already. Because they're experts. Synthetic data means that the amount of "real" data required is much less, so giant repositories like Reddit aren't so important.

FaceDeer,
FaceDeer avatar

Academic Torrents has Reddit data up to December 2023. This data isn't live-updated, my understanding is that it's scraped when it's first posted. That's how services like removeddit worked, it would show the "original" version of a post or comment from when it was scraped rather than the edited or deleted version that Reddit shows now.

The age isn't really the most important thing when it comes to training a base AI model. If you want to teach it about current events there are better ways to do that than social media scrapes. Stuff like Reddit is good for teaching an AI about how people talk to each other.

FaceDeer,
FaceDeer avatar

Not to mention that a response "containing" plagiarism is a pretty poorly defined criterion. The system being used here is proprietary so we don't even know how it works.

I went and looked at how low theater and such were and it's dramatic:

The lowest similarity scores appeared in theater (0.9%), humanities (2.8%) and English language (5.4%).

FaceDeer,
FaceDeer avatar

Article mentioned 400-word chunks, so much less than paper-sized.

FaceDeer,
FaceDeer avatar

That's why I was suggesting such a simple approach, it doesn't require AI or machine learning except in the most basic sense. If you want to try applying fancier stuff you could use those basic word-based filters as a first pass to reduce the cost.

  • All
  • Subscribed
  • Moderated
  • Favorites
  • HellsKitchen
  • rhentai
  • magazineikmin
  • thenastyranch
  • Youngstown
  • cisconetworking
  • mdbf
  • slotface
  • khanakhh
  • DreamBathrooms
  • kavyap
  • ethstaker
  • InstantRegret
  • Durango
  • bokunoheroacademia
  • rosin
  • everett
  • tester
  • Leos
  • GTA5RPClips
  • osvaldo12
  • modclub
  • tacticalgear
  • cubers
  • lostlight
  • normalnudes
  • relationshipadvice
  • sketchdaily
  • All magazines