Is Lemmy protected against our data being scraped for AI?
The opposite; the API to simply take comments and posts in bulk is free and open.
Can an instance close the API or limit it?
In theory, yes, but instances don’t ship with the ability to do that. The Lemmy code base would need to change if such a thing were to be seriously implemented.
I’m no federation expert, so I can’t really comment on whether doing something like requiring API keys would be feasible, unfortunately.
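For anyone wondering what "free and open" looks like in practice, here is a rough sketch rather than a definitive client. It assumes the instance exposes the usual unauthenticated `/api/v3/post/list` endpoint with `sort`, `limit` and `page` parameters; `lemmy.example` and the helper names (`build_url`, `fetch_json`, `iter_posts`) are made up for illustration.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen


def build_url(instance, endpoint, page, limit=50):
    """Build a list URL for an instance; endpoint is 'post' or 'comment'."""
    query = urlencode({"sort": "New", "limit": limit, "page": page})
    return f"https://{instance}/api/v3/{endpoint}/list?{query}"


def fetch_json(url):
    """Plain unauthenticated GET; no API key is involved."""
    with urlopen(url, timeout=30) as resp:
        return json.load(resp)


def iter_posts(instance, max_pages=2, fetch=fetch_json):
    """Yield posts page by page until a page comes back empty.

    `fetch` is injectable so the paging logic can be tried offline.
    """
    for page in range(1, max_pages + 1):
        data = fetch(build_url(instance, "post", page))
        posts = data.get("posts", [])
        if not posts:
            break
        for item in posts:
            # Assumed response shape: each entry nests the post under "post".
            post = item.get("post", {})
            yield {
                "id": post.get("id"),
                "name": post.get("name"),
                "body": post.get("body"),
            }
```

Something like `for p in iter_posts("lemmy.example"): print(p["name"])` would walk the newest posts. Nothing in there needs a login, which is the whole point being made above.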
Well, they already made it very clear to everyone back in May that the content created by the community does not belong to the community. Anyone still using that dump deserves to be explored.
Anyone still using that dump deserves to be explored.
( ͡° ͜ʖ ͡°)
Nice! Someone owes me 5€ now.
User content from Reddit? It’s barely worth looking at for free…
This is the best summary I could come up with:
Reddit will let “an unnamed large AI company” have access to its user-generated content platform in a new licensing deal, according to Bloomberg yesterday.
The deal, “worth about $60 million on an annualized basis,” the outlet writes, could still change as the company’s plans to go public are still in the works.
The news also follows an October story that Reddit had threatened to cut off Google and Bing’s search crawlers if it couldn’t make a training data deal with AI companies.
Last year, it successfully stonewalled its way out of the biggest protest in its history after changes to its third-party API access pricing caused developers of the most popular Reddit apps to shut down.
As Bloomberg writes, Reddit’s year-over-year revenue was up by 20 percent by the end of 2023, but it was still $200 million shy of a $1 billion target it had set two years prior.
The company was reportedly advised to seek a $5 billion valuation when it opens up for public investment, which is expected to happen in March.
The original article contains 346 words, the summary contains 175 words. Saved 49%. I’m a bot and I’m open source!
Glad I edited all my comments to say fuck u/spez
Wow, I bet the writing focused communities will love this!
Brilliant. A.I. does the heavy lifting, takes the data for free, then resells access to it, while those of us who contributed for the last decade don’t get a dime.
The only surprising part of this is that it took as long as it did.
And finally their logo makes sense.
I’m glad I deleted my content on the way out.
How would I go about doing that? I’d like to wipe my shit from over there before I outright delete my account.
I don’t see why anyone would need this deal anyway… most of it is already available, and most of the new stuff probably is too, even without API access.
I also expect the fediverse to be crawled and used for training. That’s just the thing about publicly available stuff: it gets used, whether we like it or not… Long live Lemmy.
Well, they can (and will) still scrape us if they want. Just nobody’s making a buck off of it.
That’s going to be a lot more work since comments and posts are decentralized here. You can probably easily get some of it but it will be hard to get all of it.
It’s actually even easier than that. Instead of setting up a tool to make requests to the API, you can just set up a bridge that will dump everything right into your database. The wonders of federation.
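To make the "bridge" idea concrete, here is a minimal sketch of just the storage half. It assumes federated activities arrive as ActivityPub JSON, that communities relay them wrapped in `Announce`, and that posts show up as `Page` objects and comments as `Note` objects. The table layout and the `open_db`, `unwrap` and `store_activity` names are hypothetical, and a real bridge would also need an actor, an inbox endpoint and HTTP signature checks, none of which is shown.

```python
import sqlite3


def open_db(path=":memory:"):
    """Open (or create) a tiny archive table for federated content."""
    db = sqlite3.connect(path)
    db.execute(
        "CREATE TABLE IF NOT EXISTS items "
        "(id TEXT PRIMARY KEY, kind TEXT, author TEXT, content TEXT)"
    )
    return db


def unwrap(activity):
    """Peel off Announce wrappers to reach the relayed inner activity."""
    while (
        isinstance(activity, dict)
        and activity.get("type") == "Announce"
        and isinstance(activity.get("object"), dict)
    ):
        activity = activity["object"]
    return activity


def store_activity(db, activity):
    """Store a Create of a post (Page) or comment (Note); ignore the rest."""
    inner = unwrap(activity)
    if not isinstance(inner, dict) or inner.get("type") != "Create":
        return False
    obj = inner.get("object")
    if not isinstance(obj, dict) or obj.get("type") not in ("Page", "Note"):
        return False
    db.execute(
        "INSERT OR IGNORE INTO items (id, kind, author, content) "
        "VALUES (?, ?, ?, ?)",
        (
            obj.get("id"),
            obj["type"],
            obj.get("attributedTo"),
            # Comments carry "content"; posts may only have a title in "name".
            obj.get("content") or obj.get("name"),
        ),
    )
    db.commit()
    return True
```

Once something is subscribed to a community, every new post and comment gets pushed to it, so there is no crawling to do at all. The function above just files whatever lands in the inbox.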
All better than that piggyboy getting free money
yet
The reality though is I can train LLMs off Lemmy data all I want and I don’t have to pay ANYONE a dime…
Time to delete my old accounts, I guess. Is there a bot that will go through and delete all posts and comments too? That would be helpful.
I used PowerDeleteSuite back in June.
It’s private and, unlike Redact, not paid. I’d consider editing the comments instead of deleting them, to spread the word about why you’re leaving.
That’s what I did. I turned all my comments into Lemmy advertisements, and also an obscene sentence telling u/spez to kill himself (I’m not proud of it at this juncture, but it felt good at the time).
Or to poison the dataset
Here comes a new wave of users, I guess
Kinda thought they’d manage to go a bit longer than the few months they did