DuckDuckGo, Bing, Mojeek, and other search engines are not returning full Reddit results any more.
Cool, I don’t go to any reddit links anymore anyways. Saves me the hassle of skipping over them.
Personally, I really wish it was as easy to search for Lemmy posts with a search engine as it is with Reddit. Idk, maybe I’m doing it wrong.
Kagi does it
I use kagi; love it. As an embedded systems developer I’m more productive with it.
Even back when I used Reddit, I had such a burning hate against Reddit results that I blacklisted them. So this is actually improving things for me, as I use DDG by default.
As such I hope that this decision becomes another nail in each of their (Google and Reddit’s) coffins.
Welp, they did it. They fucking broke reddit. Again.
As a Lemmy and Duck Duck Go user, this is a desired feature!
Still couldn’t get me to use it, I use DDG which can switch between search engines and search sites very quickly with it’s ! syntax (Everyone goes on about privacy, but this is pretty much it’s best feature). Google results are consistently the worst for me if I’m hitting multiple search engines
Good.
thats fucked
deleted by creator
I’m not understanding what stops a search engine from scraping a publicly accessible website. ?
robots.txt, I guess? Yes, you can just ignore it, but you shouldn’t, if you develop a responsible web scraper.
Doesn’t seem legal that a robots.txt could pick and choose who scrapes. Seems like legally it would have to be all or nothing. Here’s hoping one of the search engines ignores it and makes it a legal case.
You’d probably feel differently if it were your service. Should you be able to control who scrapes your sites or should that be all or nothing?
For the record, I fucking hate what the internet is becoming. I naively believed that even if shit got cordoned off into the walled gardens that are mobile phone apps, the web would remain as open as it was. This is a terrible sign of things to come.
No, I wouldn’t feel differently. In fact letting search engines scrape and point to your content is what leads people to your site. It’s free advertising. If you’re going to let one search engine in, you should let them all in. If you want to be public, be public. Otherwise put up a login firewall and go private.
It’s not just search engines. Lots of people on Mastodon were using robots.txt to block ChatGPT (and any other LLM company they knew of) from scraping their sites/blogs.
I disagree, to a point. I want to be able to control my services to the greatest extent possible, including picking who scrapes me.
On the other hand, orgs as large as Google doing this poses a real threat to how the internet works right now which I hate.
Actually currently it contains this:
User-agent: * Disallow: /
Well, that actually is a blanket ban for everyone, so something else must be at play here.
https://merj.com/blog/investigating-reddits-robots-txt-cloaking-strategy
Reddit is serving different file to google
We believe in the open internet, but we do not believe in the misuse of public content.
That’s real rich, coming from Reddit.
Also, rate limiting. A publicly accessible website doesn’t mean that it will allow scrapers to read millions of pages each week. They can easily identify and block scrapers because of the pattern of their activity. I don’t know if Reddit has rate-limiting, but I wouldn’t be surprised if they implement one.
I tried using Google to search reddit the other day and it didn’t work.
Removed by mod
Great, neither Google search or reddit work anymore. They deserve each other.
I have not actually been able to use any Reddit results for awhile. It might be that I force old[.]reddit[.]com and Reddit has finally cracked down on that?
After seeing this news I just created this lemmy account. I hope people make the right decision and move on to lemmy.
Welcome and don’t feel shy to contribute!
Same here
Then welcome to you too! There’s a nice selection of apps if you haven’t tried them, since Lemmy has no financial incentive to limit access to the content.
Welcome.
It’s pretty good here.
And will continue to get better.
Welcome!
welcome, but maybe consider not using the world instance. it is pretty saturated and the point is to spread users out across many instances instead of having one monolithic one
Heya jack, welcome aboard!
no mention of brave search, didn’t read the article yet though