- cross-posted to:
- news@lemmy.world
- cross-posted to:
- news@lemmy.world
Reddit says Microsoft’s Bing, Anthropic, and Perplexity have scraped its data without permission. “It has been a real pain in the ass to block these companies.”
If this gains any traction, I hope every news site, blog, etc sues Reddit for profiting off their material that fills the entire site. The comment section wouldn’t exist if there wasn’t something to comment on in the first place. Digg and Reddit didn’t even have comments at first. It was ALL content from other sources. Even “reddit original content” is" original content from a creator posted to Reddit". Reddits “value” is ephemeral.
Without these agreements, we don’t have any say or knowledge of how our data is displayed and what it’s used for, which has put us in a position now of blocking folks who haven’t been willing to come to terms with how we’d like our data to be used or not used
It’s the users’ data, not yours, you rent seeking fuck
“our” data
Yeah this.
Channeling Joe Pesci from “Casino”: “You only have that farking data because WE made that possible!”
“GET THIS THROUGH YOUR HEAD!”
is there even anything of value on reddit?
Yes, absolutely. Any time I need to buy a product I don’t know much about, I look for an enthusiast community with a FAQ. Most of the active, high-quality communities are on Reddit.
I would like decentralized services to replace that, but that’s a slow process, if it happens at all.
Reddit remains as valuable as ever. It’s amusing that you think it imploded a year ago just because a small number of users migrated here
It sort of did, thousands of useful comments were turned to gibberish, the mobile web site turned to shot, and the mobile app stopped properly working for communitys with specific content warnings.
Completely. Lemmy is far too small to have the value Reddit does.
I left Reddit due to their API bullshit, but I so miss all of the hobby communities I was part of, that has like-minded members, and a plethora of resources. It’s not easy to impossible to start communities such as reeftanks, homesteading, literature, bookcirclejerk, etc. on a platform as small as Lemmy. And beyond starting one, the quality and quantity will never match Reddit’s because Lemmy just doesn’t have the same reach.
Lemmy is great if you like Linux, like Star Trek, or are trans, but other than that, it’s missing so, so many demographics that make a wholistic platform.
trans
I feel this so hard, the sheer number of openly LGBTQ+ people here really skews the demographics of the site. I’m not saying it’s a problem, just saying that LGBTQ+ people are dramatically over-represented here. It’s an interesting contributor to lemmy culture, and I wonder how much that impacts homogeneity here (e.g. upvotes and downvotes for certain types of content).
But yeah, it’s missing a lot of demographics.
That said, I’m really into Linux (been using for >15 years), so that’s cool I guess.
As a cis straight man I’m taking this as a learning opportunity until the demographics level out. An inherently inclusive bias will be more helpful early on than more niche communities anyways.
Sure. Again, I’m not saying it’s bad, just that the bias seems to exist.
There are certainly worse biases that exist, such as very little representation from people on the right side of the spectrum, so hate against half the population seems to get a pass and downvotes silence constructive comments/posts just due to political bias. That’s incredibly frustrating, and I think the high focus on supporting LGBTQ+ people goes along with that (i.e. the message that conservatives “hate” LGBTQ+ people, which is only true for the more extreme end of conservatism).
That said, I do like the support LGBTQ+ people get, I just wish the demographics were a bit more diverse without sacrificing the culture. I live and work in a conservative area, but my company has built a pretty inclusive culture (at least for the area), so I think it’s totally possible.
Oh man I don’t miss that at all. Moderating out a pervasive delusion isn’t bias, any more than we’re biased in favor of a round Earth. On Reddit there were constant “enlightened centrists” who kept making appeals to moderation.
There’s nothing of value to be gained from conservatives. The “good” ones who don’t say the homophobia out loud are still voting for politicians who do. If it was just the extreme end, then Trump wouldn’t be their nominee. Hate is their normal now.
“If there’s a Nazi at the table and 10 other people sitting there talking to him, then you got a table with 11 Nazis.”
This is exactly what I’m talking about: casually dismissing half of the population based on little more than association. That drives division and pushes people into echo chambers.
Understood. I am disagreeing with you. If that wasn’t obvious, then I fear you may have missed my point.
Half of America supporting fascism is reason to create somewhere - anywhere - where that shit is shut down. You’re free to go associate with freeze peach Nazis on X, Facebook, Nostr, wherever. I don’t want any part of that and prefer a server that moderates them out. Paradox of tolerance and all that.
If you all believed the Earth was flat, then I would prefer the “echo chamber” of people saying “no, we checked, it’s round”. There simply being a lot of believers doesn’t imply an idea has merit, and we don’t have infinity time for BS.
A lot of older posts are still relevant to specific hobbies. I will look up information on paper, some guitar information, but most posts from the last two years are not worth looking at.
There is also so much regurgitated LLM shit.
An absolutely prodigious back catalog of high quality images, interviews, and explainers. A treasure trove of historical content that’s been heavily indexed and participant-weighted for relevancy. And the bulk of it predates the infestation of AI, so its valuable just as sampling data of original human content for further iterative development of ChatGPT and other LLMs.
Reddit CEO can shove reddit up his ass sideways. The whole thing.
He can put his dick in /dontputyourdickinthat
Aside: I give Lemmy serious props for not reproducing some of these communities btw.
We shouldn’t accept this behavior or other companies will follow!
To commemorate Steve “Greedy Pigboy” Huffman’s assertiveness, I’ve made some memes. Enjoy.
I’ve said once and I’ll say it again. Either the information on your site is free to all or to none. You can’t have some people/entities pay and some not!
You can. We didn’t need to like it but they can. Besides, isn’t that how many magazines work? Pay for articles and such
Not really, the people who write the articles are actually employed by those magazine companies, and everyone who wants to get one, needs to pay for one.
Reddit says “blablabla” . Reddit is just trying to stop its communistic website from losing money.
Reddit says “we will give you access” once you pay us and feather our pockets.
Fuck spez, what a cunt. Delete all your comments etc, and let the AI rot in retarded posts.
I dont think you know what the word communistic means.
Communistic = Things I don’t like.
For example, Flat tires = Communist
I woke up one day and my leg started being communist. I haven’t been able to get rid of it since.
Don’t you have a communist repair kit in your car?
Do you mean a capitalist repair kit? Sometimes you just need to plug the communist tire with a bit of capitalism.
I has a communist once. Not fun.
Fuck Spez
Bunker boy
I tried to get us all to start using “King Steven the Turd”, but that never caught on.
spez is close enough to spaz that it’s good enough already as-is:-D
It’s insulting to turds.
Yah, I’ll take a good shit over Spez any day.
Spez gets a ton of online hate and it is still somehow not enough hate.
Turns out online hate doesn’t stop millions of dollars from coming in or make people less of assholes. Stop using reddit, discourage people you care about from using it by offering alternatives.
“People who fecklessly farm other people’s data upset at other companies are farming their data.”
Remember when Reddit was pro net neutrality?
this has nothing to do with net neutrality
Giving preferential treatment to one service provider over another is 100% a net neutrality issue.
i have bad news about activitypub federation.
I was thinking about this the other day. Because Lemmy instances keep defederating from each other, I don’t really experience Lemmy. I experience a fragment of Lemmy as determined by the admins of the instance I’m connected to.
Even if I run my own instance, I guess there’s nothing stopping instances from defederating from me (or just refusing to federate to begin with because my instance is too small to bother with).
Is there even a way to experience all of Lemmy, including spam and things some people don’t agree with?
Net neutrality is about internet access and connectivity, not about what websites can do
(but yes they’re still hypocritical)
Net Neutrality is an ISP thing, this is a search engine thing. Maybe they’re related in some conceptual sense, but in terms of the actual definition of that term, they’re really not.
Spez has a bunker to build!
Scraping isn’t illegal, they can’t do anything
Anything legally.
Don’t look up the eBay stalking acandal
It’s also just indexing not some data harvesting too.
Well Reddit should just sue these companies and see if these companies are actually breaking any laws. Holding sizeable chunk of the internet hostage also sounds like something the EU and US might want to look in to as it very much sounds like anti-competitive conduct or market manipulation.
Also if these companies want to have greater ownership over the content generated by their users they should also be much more liable for the content posted to their sites. I mean when something like the Section 230 was written they probably did not take this in to account. If these companies want to start selling user generated content then they should simply lose the immunity from liability.
Reddit would lose badly that’s why they don’t sue. US’ 9th circuit ruled that scraping Linkedin is legal and Bing is not even scraping but indexing the data. Easiest case ever.
It’s almost impossible to block web scraping especially someone with Microsoft or Perplexity resources.
Its clearly an attempt to blackmail indexers into license deal as paying something to reddit could be actually cheaper than battling anti robots.
While I don’t disagree with the general idea, Section 230 would introduce an uncontrollable risk into running any website with user-generated content and would essentially shut them down.
If the site isn’t selling data, they wouldn’t lose 230 protection. So that would only be a risk for the companies selling their users’ data, not your regular forum or something.
That gets really murky though. For example:
- news sites w/ comment sections - they’re profiting from ads and subscriptions, so how much of that has to do with the comments?
- ecommerce - reviews on Amazon and eBay could be considered advertising for the product. Who’s liable, the ecommerce site, the merchant, or the poster?
- product websites - how much are posted “reviews” considered advertising for the product? There may not be direct sales on the website, but surely someone’s review would impact sales elsewhere
- for-profit services with a discussion forum - these would be on a separate site from the revenue-generating service, but still associated with the brand and thus likely contributing to advertisements for the product
It’s a lot more obvious for social media sites like Facebook since user-generated content is the service, but there are a lot of for-profit entities where user-generated content is highly relevant, but not the core service. Would those sites be essentially forced to either moderate or eliminate user interaction?
There’s a lot of complexity here.
they should also be much more liable for the content posted to their sites.
why do people insist on making me defend reddit.
fuck spez little piggy greedy soy boy
Hey ! Leave us soy boys out of it, we hate Spez as much as you do!
soy is good, what the hell kind of comment is this