Honestly the fact that Cloudflare doesn't attempt to play judge on what sites they protect seems like a strength to me. No one likes neonazis and such but I'd rather not have a company arbitrarily decide whether my website deserves to stay online or not.
Sure, my opinions may align with those of the company today but there's no guarantee that it'll still be that way tomorrow, be it because of a change in leadership or a presidential order. If Cloudflare were to aggressively reject nazis then they could do the same with e.g. good information about vaccines so the less precedent there is for Cloudflare refusing customers and playing judge the better imo.
Ah, bad phrasing on my part, since English isn't my first language. Should have just said "business" instead of "private company". Sorry for the confusion.It's actually publicly traded. Ticker is NET. It went public in late 2019.
I agree! A proper labyrinth needs a Minotaur meting out doom.Please tell me the computers literally explode in showers of sparks. If all the horrors of sci-fi have to come true we should at least get a little bit of the cool stuff!
And when the product is access to consumers' eyeballs?Yes, you get it. Businesses should not be in the business of judging their customers, but businesses should provide the goods and/or services that customers pay to receive. Internet companies should be like McDonald's: Regardless of race, ethnicity, sex, gender identification, religion, political beliefs, what sports teams you support, etc., if you pay for a Bic Mac, you get a Big Mac.
I can certainly understand being against Neo-Nazis and other race supremacists, but once you deny one group something, it becomes easier to deny other groups something as well. For example, it can easily be imagined that Cloudflare could be put under pressure to censor any content related to DEI. Free expression should not be subject to whoever is in power or the current cultural zeitgeist.
This is a take only a stright white Christian cis white male could have.Yes, you get it. Businesses should not be in the business of judging their customers, but businesses should provide the goods and/or services that customers pay to receive. Internet companies should be like McDonald's: Regardless of race, ethnicity, sex, gender identification, religion, political beliefs, what sports teams you support, etc., if you pay for a Bic Mac, you get a Big Mac.
I can certainly understand being against Neo-Nazis and other race supremacists, but once you deny one group something, it becomes easier to deny other groups something as well. For example, it can easily be imagined that Cloudflare could be put under pressure to censor any content related to DEI. Free expression should not be subject to whoever is in power or the current cultural zeitgeist.
I don't think that it is fair to say that the web site is luring the crawler - the crawler found the site, the crawler disregarded the site's "no crawler's sign", the crawler spammed the site to suck it's content for the crawler's own profit with no remuneration to the site for their creations.we reported on "Nepenthes," software that similarly lures AI crawlers into mazes
Got a link to a good how-to? I can serve 100G a month without worries!I have been running Nepenthes since January 26th, even going so far as to set up an Apache load-balancer to allow 4x Nepenthes processes to handle the load. I am up to 57,019,966 hits in that time, and crack a grin every time I check the stats. That's 167 GiB of pseudo-gibberish returned. The only way it could be sweeter is if I used an LLM to churn the data.
The author is working on a no-SQL option for Nepenthes, which I highly recommend. It is around 50x more CPU efficient compared to the SQL version, and only takes 5x more RAM (43MB to 240MB) for my 496,467 word corpus.
I don't know about that one, but there is a Chomskybot that creates long texts strongly resembling Chomsky's linguistic writing, but which are gibberish. Perhaps this could be modified to resemble most any writer, and used to lead AI dredgers (I use that word intentionally) astray.Related to this: maybe 6 or 7 years ago Stanford computer engineering developed a ‘search engine’ that worked in the background, constantly stringing three random words together as search phrases, to confuse and fill up any personal data mining files used for targeting advertising with literal junk. Anyone remember what’s its called?
What an absurd timeline.
Cloudflare has obviously never witnessed my father in law trying to surf the web...article said:"No real human would go four links deep into a maze of AI-generated nonsense," Cloudflare explains. "Any visitor that does is very likely to be a bot, so this gives us a brand-new tool to identify and fingerprint bad bots."
...Why? Why should I care if a service that's wasting my server's time, forcing me to spend my own money to provide free information, gets accurate data? If their tools provide false information, that's a them problem, not a me problem.
As we know half the AI output is tainted by shit, we also know the public at large will think it's the truth. We can ignore it and they won't know that they should.Fuck these leeches.
I did that. There was a site with a Moscow IP that was spamming one of my web forms with injection attacks. They were hitting me with about five per second. When I detected them, I wrote a quick filter that sent them the Monty Python Spam Song, 65863 verses of it. They quickly went away.Dang. I was hoping they used mashups of Monty Python.
Maybe we can start a petition.
Chromskybot has the capacity to produce irrelevant strings for about 10% of the time until the heat death of the universe. That should be sufficient to screw the AI developers.I don't know about that one, but there is a Chomskybot that creates long texts strongly resembling Chomsky's linguistic writing, but which are gibberish. Perhaps this could be modified to resemble most any writer, and used to lead AI dredgers (I use that word intentionally) astray.
Note 1: Before someone says Chomsky himself wrote gibberish, I'm a linguist, and have read much of Chomsky's writing on linguistics (not on politics). It is not always easy to follow, but it is definitely not gibberish.
Note 2: Chomsky recently suffered a stroke, and apparently can no longer communicate.
Or the bot meets the James T. Kirk algorithm AI.Please tell me the computers literally explode in showers of sparks. If all the horrors of sci-fi have to come true we should at least get a little bit of the cool stuff!
Cloudflare also block smaller browsers, like privacy specialed forks and the like.Emphasis is mine; this is what I'm responding to.
Cloudflare is used by a large number of DDoS-for-hire services. You have to pay Cloudflare to protect yourself from the companies that cause the problem. They also have a long history of hosting Nazis. After years of complaints, their CEO finally kicked one neoNazi website off, but dozens (if not hundreds) remain.
https://www.propublica.org/article/how-cloudflare-helps-serve-up-hate-on-the-web
https://krebsonsecurity.com/2016/08/inside-the-attack-that-almost-broke-the-internet/
Note in the second link, at one point, the chat logs show the attackers are amused that Spamhaus is a Cloudflare customer... and Spamhaus lists a lot of Cloudflare IPs as spam cannons.
Cloudflare does a lot of really great technical work. I want to give my full-throated support to Cloudflare. I really, really, do. But so long as they do minimal-to-no curation of their customers, I always have to put an asterisk on that support.
So, that's why Cloudflare gets a ton of hate. They sell DDoS protection to you... and the companies who DDoS you. They make it possible for Nazis to have their websites. They do good work... and also provide that good work to some of the most vile scum on the planet. It's not hard to be against Nazis. If someone at Cloudflare is reading this, kick them off and I will pick up the phone calls from your sales people. Otherwise, I'll continue to pick up and ask "Sounds great, ready to sign -- oh wait, by the way, do you still platform Nazis? Uh-huh, yeah, no, I'm not going to let you weasel out of this. Once you drop them I'll sign. I hope you're tracking revenue lost to your support of Nazis internally. Have a great weekend!"
The google crawler respects the robot.txt files. This is against crawlers that ignore it. So not indexing context explicitly marked not to be indexed is fine...Like it or not, you're never going to be able to allow the google crawler access to your content while also blocking AI crawlers, simply because google is training AI as well. We're going to have to fight these plagiarism tools in court. If you are concerned with blocking access short-term, then my advice to you is to login gate content and aggressively IP ban.
I’m sure google has a crawler that respects a boundary or two, that they can point to and say “See?”The google crawler respects the robot.txt files. This is against crawlers that ignore it. So not indexing context explicitly marked not to be indexed is fine...
https://en.wikipedia.org/wiki/Dead_Internet_theorySo how I'm seeing it is, we are now creating an alterweb adjacent to the one we use, designed only for bots. Looking at the sorry state of the current internet makes me wonder if this is also a fake internet, one designed for capitalistic purposes and manipulated by algorithms to a finite end. So then are we in a parallel internet already? And what is the real internet?
Yes, the very same. That relatively minor inconvenience (to a human, on occasion) is great for throwing a spanner in the works of someone trying to do a DDoS attack.This is the site that 'checks my browser' for 5 seconds every time I connect to a site hosted / protected by it, right?
We live in a time when there's not enough money for healthcare or housing, but UNLIMITED FUNDS to throw at AI. . ..Now we know why AI farms need such massive energy sources. They're being pitted against each other.
Drastic measures are needed. Drastic things like completely ending the use of fossil fuels immediately, for exampleSure, but we're now at the stage where even the defenses cannot avoid being inherently damaging. The pigs are increasingly successful in dragging us into the mud fights they love so much. And everything gets covered in shit. The only way out may require REALLY drastic measures.
That’s because the rich and powerful believe AI will be able to free them from paying workers anything at allWe live in a time when there's not enough money for healthcare or housing, but UNLIMITED FUNDS to throw at AI. . ..
We're just touching surface of what AI can do for humanity and we're already trying to poison it. Sad.
CloudFlare can play judge tomorrow if they like. You are always at the whim of a private entity; so long as they aren’t discriminatory (in the legal sense of the word) they don’t have to do business with you.Honestly the fact that Cloudflare doesn't attempt to play judge on what sites they protect seems like a strength to me. No one likes neonazis and such but I'd rather not have a company arbitrarily decide whether my website deserves to stay online or not.
Sure, my opinions may align with those of the company today but there's no guarantee that it'll still be that way tomorrow, be it because of a change in leadership or a presidential order. If Cloudflare were to aggressively reject nazis then they could do the same with e.g. good information about vaccines so the less precedent there is for Cloudflare refusing customers and playing judge the better imo.
Suggesting it is an American ethos plays into the commonly-held misperception that the First Amendment is relevant to business relationships between non-governmental entities. It seems more rooted in mythical constructs of sanctified “neutral grounds” or “honor among thieves,” which would have been imports from older cultures.CloudFlare can play judge tomorrow if they like. You are always at the whim of a private entity; so long as they aren’t discriminatory (in the legal sense of the word) they don’t have to do business with you.
And your argument is a very American one. A lot of other countries — ie Germany — have laws about content and don’t seem to be having the issue you are concerned about. While I appreciate the protections of the 1st Amendment, is it also possible for the Paradox of Tolerance to be at play?