FluffyDeveloper, (edited )

To anyone thinking about joining BlueSky, especially artists: everything you post is sent to a third party for AI labeling.

BlueSky uses AI to label content for moderation, and to do that they use a company called https://thehive.ai. If you look through their privacy policy, you will see that they can use content sent to them to train models for all their services, which include generative AI for both text and images.

Update: https://meow.social/@FluffyDeveloper/110652053858910840

FluffyDeveloper,

Adding sources for my earlier post.

The code calling to hive.ai’s API: https://github.com/bluesky-social/atproto/blob/main/packages/bsky/src/labeler/hive.ts

hive.ai privacy policy (“How We Use Information We Obtain” section at the bottom): https://thehive.ai/privacy

BlueSky’s TOS (section 7.4): https://blueskyweb.xyz/support/tos

FluffyDeveloper,

Linking the update lost here as well: https://meow.social/@FluffyDeveloper/110652053858910840

mastodonmigration,
@mastodonmigration@mastodon.online avatar

@FluffyDeveloper Note when you sign up for you agree to license all your content to them for this purpose.

TNLNYC,
@TNLNYC@mastodon.social avatar

@FluffyDeveloper

Let me simplify your post :)

To anyone thinking about joining blueSky, don't. It's so much better here on Mastodon

AbandonedAmerica,
@AbandonedAmerica@mastodon.social avatar

@FluffyDeveloper thanks for the heads up. I wasn't sure if I was going to create an account there anyway but I'm definitely not now

zleap,
@zleap@qoto.org avatar

@FluffyDeveloper

Thank you for the heads up, what we need now is another Creative commons category where we can say yes or no to our creations being used to train AI.

einalex,
@einalex@chaos.social avatar

@FluffyDeveloper
People: I'm just going to join to take a look.

People: I'm just going to join to see why everyone else is joining.

People: I realize it's bad, but I can't leave because everyone else is here. I can't even adopt a second network because it's just too much work.

The cycle continues.

jerry,

@einalex I especially like it when people come here to complain about how awful it is over there, then go back because they like it there better
@FluffyDeveloper

compuguy,
@compuguy@istoleyour.pw avatar

@jerry
@einalex @FluffyDeveloper Something tells me things are rotten in Denmark, er, Bluesky then....🤔

einalex,
@einalex@chaos.social avatar

@jerry @FluffyDeveloper I wish people realized how they're digging their own grave.

gmate8,
@gmate8@mastodon.online avatar

@FluffyDeveloper not to mention this: https://mastodon.online/@gmate8/110635004231188201

Seems like on Bluesky no one cares about this ://

mttaggart,

@FluffyDeveloper Notion also uses this company for their AI features, for anyone curious.

twilliability,
@twilliability@genart.social avatar

deleted_by_author

  • Loading...
  • FluffyDeveloper,

    @twilliability @sushee They should tell their users though. One thing is to have your content scraped by some random company, another is for the company you give your data to to give it to someone else who will use it for training.

    micahdraws,
    @micahdraws@dice.camp avatar

    @FluffyDeveloper This may be misleading -- The text of Hive's privacy policy is pretty boilerplate standard across many websites that take on user or third-party content

    Hive is also used by at least one artist website that's aggressively anti-AI

    FluffyDeveloper,

    @micahdraws would you trust dorsey & co. to forbid hive from using users’ data to train models?

    Given the kind of people they are and what they say, I think it’s very safe to assume the worst.

    micahdraws,
    @micahdraws@dice.camp avatar

    @FluffyDeveloper Dorsey doesn't own Hive. He's not part of Hive's operation and that's not the point anyway.

    Hive privacy policy looks like a boilerplate used by nearly every website. Unless I'm missing something, here's nothing that says they're going to use it for nefarious purposes any more than any other social media platform.

    FluffyDeveloper,

    @micahdraws They specifically say they will use content given to them to train their models.

    It’s not nefarious, but the point of the fediverse is to not be tied to private companies with this kind of behavior. Otherwise we may just as well all go back to Twitter.

    micahdraws,
    @micahdraws@dice.camp avatar

    @FluffyDeveloper Where does it say this? I've been reading Hive's privacy policy and can't find anything that says they specifically use it, so if it's there, I'm overlooking it.

    FluffyDeveloper,

    @micahdraws 4th bullet point: “Maintaining, operating, improving and developing the Services […]”

    That’s just boilerplate for model training. These documents rarely use technical language and are much more generic so they have more wiggle room.

    micahdraws,
    @micahdraws@dice.camp avatar

    @FluffyDeveloper That's boilerplate for every website. Twitter says that. Tumblr says that. Meta says that.

    There is nothing there that is any more damning than most other websites on the internet.

    micahdraws,
    @micahdraws@dice.camp avatar

    @FluffyDeveloper That's boilerplate for every website. Twitter says that. Tumblr says that. Meta says that.

    There is nothing there that is any more damning than most other websites on the internet.

    I'm not saying people have to support Bluesky. But this pushes misinformation, or at least not fully researched or resolved information.

    FluffyDeveloper,

    @micahdraws yes but most websites don’t provide generative AI services x3

    Besides, users should always be told where their data is stored, sent, and used for. Maybe there is nothing to it and Hive throw everything away after labeling it, but there is no way to know for sure and seeing the kind of people involved in BlueSky it is more than fair to expect the worse.

    It’s not misinformation, there is a literal line of code with a commit attached to it and a privacy policy that is very vague.

    chiefgyk3d,
    @chiefgyk3d@social.chiefgyk3d.com avatar

    @FluffyDeveloper @hyc yeah and to be honest it’s not that great of an experience there are no hash tags, it’s still on one server basically. Just rinse and repeat of

    FluffyDeveloper,

    @chiefgyk3d @hyc Wait, no hashtags? o.O

    chiefgyk3d,
    @chiefgyk3d@social.chiefgyk3d.com avatar

    @FluffyDeveloper @hyc yeah from what I’ve seen they don’t seem to work and no one uses them notice the lack of hashtags in all the screen shots I just made from I was researching different decentralized alternatives to

    image/png
    image/png
    image/png

    Kathrin,
    @Kathrin@trouth.eu avatar

    @FluffyDeveloper

    Why would anyone join yet another Twitter anyway? It's not like it was so great under Dorsey.

    gabboman,

    @FluffyDeveloper aaand they will ask in a few months "why did no one told us before?"

    brandonhorst,
    @brandonhorst@techhub.social avatar

    @FluffyDeveloper @ff00aa Lol, everything you post here is used to train generative models too, unless your account is marked private (maybe). It’s just not mastodon themselves doing it. Everything on the public internet is being vacuumed up.

    FluffyDeveloper,

    @brandonhorst @ff00aa The difference is that with BlueSky the behaviour is baked into the network itself, and they never reveal it to users.

    brandonhorst,
    @brandonhorst@techhub.social avatar

    @FluffyDeveloper @ff00aa That is a valid and important distinction

    NatureMC,
    @NatureMC@mastodon.online avatar

    deleted_by_author

  • Loading...
  • jherazob,

    @NatureMC
    Also , artists here should know
    @FluffyDeveloper

    murm,

    @FluffyDeveloper where is the link to their privacy policy? I can't find it with a simple search and I'm curious.

    FluffyDeveloper,

    @murm Here you go :)

    Look under "How We Use Information We Obtain”, 4th bullet point.

    https://thehive.ai/privacy

    murm,

    @FluffyDeveloper no i mean bluesky's, sorry

    FluffyDeveloper,

    @murm No need to, here is the actual code calling on hive.ai API.

    https://github.com/bluesky-social/atproto/blob/main/packages/bsky/src/labeler/hive.ts#L11

    murm,

    @FluffyDeveloper Frankly if bluesky doing this without stating as such in their own privacy policy that's twice as disgusting tbh. Thank you so much for pointing this out either way. I'll be sure to use this info to warn others

    FluffyDeveloper,

    @murm I cannot find their privacy policy either, nor any terms of service, even on their company site.

    Maybe it's visible during signup?

    murm,

    @FluffyDeveloper imagine if they just don't have one because "ITS IN BETA"

    God.

    FluffyDeveloper,

    @murm I'd think they would have to have one regardless of the readiness of the software, especially for European users to follow the GDPR.

    russss,
    @russss@chaos.social avatar

    @FluffyDeveloper @murm it's here:

    https://blueskyweb.xyz/support/privacy-policy

    And the applicable part appears to be this bit (anything you post on Bluesky is completely public anyway, so it I guess it doesn't really matter whether they send it somewhere or whether someone harvests it directly):

    SteffoSpieler,
    @SteffoSpieler@fellies.social avatar

    @FluffyDeveloper oh wait what? That's... not good!

    Tbh, I still don't really know what BlueSky is, but I've seen that name often.

    FluffyDeveloper,

    @SteffoSpieler It's another social media network created by the same people who backed Twitter and Nostr. Instead of going with ActivityPub however, they decided to make their own VC-backed protocol called AT.

    The protocol itself is not too bad, but its controlled by private companies and BlueSky has decided to basically avoid moderation and offload everything to AI labelling.

    In short: it's decentralised Twitter with venture capitalists trying to control their own fediverse :/

    digslogic94,

    @FluffyDeveloper Darn... even in other socials artists are exposed to AI... the net sucks nowadays :c

    FluffyDeveloper,

    @digslogic94 yeah :(

    Nothing stops people from scraping Mastodon as well, but at least it’s not a built-in function x3

    19,

    @FluffyDeveloper @digslogic94 this is what I was wondering - how many AI trainers have simply subscribed to a fediverse endpoint to vacuum up all the posts from everyone.

  • All
  • Subscribed
  • Moderated
  • Favorites
  • ai
  • rosin
  • thenastyranch
  • anitta
  • normalnudes
  • GTA5RPClips
  • DreamBathrooms
  • mdbf
  • magazineikmin
  • Youngstown
  • ngwrru68w68
  • slotface
  • InstantRegret
  • kavyap
  • cubers
  • tester
  • cisconetworking
  • provamag3
  • modclub
  • everett
  • osvaldo12
  • khanakhh
  • Durango
  • Leos
  • megavids
  • ethstaker
  • tacticalgear
  • JUstTest
  • lostlight
  • All magazines