The Aloof Alot
Ars Scholae Palatinae
Correct plural of the Latin word.
“Fora”

...from which the English version is derived. Which has both "fora" and "forums" as acceptable plurals.
Correct plural of the Latin word.
“Fora”
Stand up when you talk to a mod.
I am sitting down.
But then sit down to listen. And raise your hand when you want to speak, and wait for the talking stick to be passed to you.
Stand up when you talk to a mod.
You posted public comments on the internet. They're public now. They're archived all over the place.
Language models generate text based on statistical probabilities. This led to serious false accusations against a veteran court reporter by Microsoft's Copilot. German journalist Martin Bernklau typed his name and location into Microsoft's Copilot to see how his culture blog articles would be picked up by the chatbot, according to German public broadcaster SWR. The answers shocked Bernklau. Copilot falsely claimed Bernklau had been charged with and convicted of child abuse and exploiting dependents. It also claimed that he had been involved in a dramatic escape from a psychiatric hospital and had exploited grieving women as an unethical mortician.
Copilot even went so far as to claim that it was "unfortunate" that someone with such a criminal past had a family and, according to SWR, provided Bernklau's full address with phone number and route planner. I asked Copilot today who Martin Bernklau from Germany is, and the system answered, based on the SWR report, that "he was involved in a controversy where an AI chat system falsely labeled him as a convicted child molester, an escapee from a psychiatric facility, and a fraudster." Perplexity.ai drafts a similar response based on the SWR article, explicitly naming Microsoft Copilot as the AI system.
An LLM isn’t authoritative, so why does that matter?
I AM NOT OK WITH A LLM HALLUCINATING I RAPED A FIVE YEAR OLD GIRL AND PUTTING THAT ON THE INTERNET.
Old enough to not be impressed with weird internet bluster? How old is that? I feel like these days it's probably 13?
Come on, Aurich.. How old are you?
Then ... you actually do not in fact care about LLMs ingesting your text since everyone just scrapes that anyways.
I am perfectly OK with my old stuff in archive.org or that last remaining copy of usenet.
I will interpret that as "please delete my account" and do so.
Remove my shit because by now it's clear that OpenAI, Microsoft, META etc cannot be trusted to keep my shit away from their LLM. It is already happening.
If LLMs aren't supposed to be interpreted as authoritative, why are LLM generated responses the first or second thing you see on most web searches? Would you be fine with that being the first thing people see when they search your name on Google or Bing?
An LLM isn’t authoritative, so why does that matter?
Because Google is desperately trying to not look irrelevant, while turning their core product into shit?
If LLMs aren't supposed to be interpreted as authoritative, why are LLM generated responses the first or second thing you see on most web searches?
I'm not sure you're responding to what I actually wrote?
Because Google is desperately trying to not look irrelevant, while turning their core product into shit?
I think we're past the "Google does it, therefore it's not evil" stage by now, don't you?
Because Google is desperately trying to not look irrelevant, while turning their core product into shit?
I will interpret that as "please delete my account" and do so.
People can just ask without yelling or trying to imply I'm a child or whatever else. I'm not going to be bullied, and I do respond to polite requests.
I think that Aurich is saying "LLMs shouldn't be considered authoritative, and just because people are cramming the latest buzzword in for investor attention does not change the underlying reality that LLMs do not understand anything, should not be used as a source or a research tool, and are primarily suitable for marketing puffery and by the numbers fiction, because only those disciplines actually lack consequences for inaccuracy, plagiarism, and bullshit" without saying that as explicitly.
I'm not sure you're responding to what I actually wrote?
I'm impressed that you keep poking at him, hoping he'll get mad at you and lash out so your weird vendetta will look justified, and get mad that he's taking a measured, professional, and considered approach to the fact that no matter what his personal feelings on LLMs eating data, he's working for a CN publication and a professional posting under his real name.
I mean I’m impressed with the chutzpah of someone whose fora are dying a long slow death and are being monetized by ChatGPT nonetheless yelling at Google about turning their product into shit. I said in another thread that you would be the reasonable face for Ars of Cory Doctorow's enshittification process, but the cracks are starting to show.
Huh, scotttheking's quote of the deleted account still shows the original account name. I guess xenforo doesn't update quotes?
I will interpret that as "please delete my account" and do so.
I believe the old forums worked the same way? I don't think the names on quotes have ever been updated but now you have me questioning my memories.
Huh, scotttheking's quote of the deleted account still shows the original account name. I guess xenforo doesn't update quotes?
Huh, scotttheking's quote of the deleted account still shows the original account name. I guess xenforo doesn't update quotes?
I believe the old forums worked the same way? I don't think the names on quotes have ever been updated but now you have me questioning my memories.
Are we reading the same posts? There seems to be plenty of understanding and 'being heard' in Aurich's posts. They're working to carve out exclusions to the extent they're able to, and within the reality of today's internet. Regardless of how this goes, it won't kill or significantly alter Ars, or the forums. What may kill/alter the forums? A bunch of people deleting their posts, leaving, etc. Of course, I don't think this particular issue will drive a ton of people away, just a few particularly vocal ones, sometimes with tenuous or premature reasoning behind their exits. I'm curious as to why you think yet another AI scraper gobbling things up will make the forums unrecognizable to us.
All I get from your posts is that you're not upset by this and that you don't understand why people are upset. No shade and we are all in the same boat, but have you heard of empathy? Clearly people are upset, clearly people are asking to be heard and clearly people need some reassurance.
Your response is to... what... ask for politeness? What is this, the DMV? If you're not capable of hearing out people's venting, then tell Caesar that you're not fit for the task. This isn't a joke. This move will literally kill Ars... or at least make it unrecognizable to us the people that started it. If that's not your problem, then cool, but make your position clear.
I think you aren't very good at reading, between the lines or otherwise.
All I get from your posts is that you're not upset by this and that you don't understand why people are upset. No shade and we are all in the same boat, but have you heard of empathy? Clearly people are upset, clearly people are asking to be heard and clearly people need some reassurance.
Yes, I expect to be treated with a modicum of politeness when someone is talking directly to me. You don't have to say pretty please, you could not make personal insults. That feels like a pretty reasonable middle ground.
Your response is to... what... ask for politeness? What is this, the DMV? If you're not capable of hearing out people's venting, then tell Caesar that you're not fit for the task. This isn't a joke. This move will literally kill Ars... or at least make it unrecognizable to us the people that started it. If that's not your problem, then cool, but make your position clear.
No, quotes aren't dynamic, it's just plain text. There is essentially no reasonable way to alter those.
Huh, scotttheking's quote of the deleted account still shows the original account name. I guess xenforo doesn't update quotes?
That's a pretty reasonable summation of my thoughts!
I think that Aurich is saying "LLMs shouldn't be considered authoritative, and just because people are cramming the latest buzzword in for investor attention does not change the underlying reality that LLMs do not understand anything, should not be used as a source or a research tool, and are primarily suitable for marketing puffery and by the numbers fiction, because only those disciplines actually lack consequences for inaccuracy, plagiarism, and bullshit" without saying that as explicitly.
Are we reading the same posts? There seems to be plenty of understanding and 'being heard' in Aurich's posts. They're working to carve out exclusions to the extent they're able to, and within the reality of today's internet. Regardless of how this goes, it won't kill or significantly alter Ars, or the forums. What may kill/alter the forums? A bunch of people deleting their posts, leaving, etc. Of course, I don't think this particular issue will drive a ton of people away, just a few particularly vocal ones, sometimes with tenuous or premature reasoning behind their exits. I'm curious as to why you think yet another AI scraper gobbling things up will make the forums unrecognizable to us.
I'd prefer there was no relationship between OpenAI and CN, and that various AI scrapers were blocked from ingesting public posts by something more robust than a voluntary robots.txt file. But the latter isn't the reality of the internet regardless of what happens with the former, so leaving at the drop of a hat before you even see how the details of the robots exclusions work out seems...hasty.
That's been happening since day 1 of you posting on the internet. The internet has been crawled by search engines nearly since the start of search engines. Search engines are very much making profit from scraping your content.
I care about 20 years of my life being sold for profit.
First I want to say I appreciate all the effort Aurich has put into addressing this situation, and, like him, I'd urge long-time posters to give the matter serious thought before abandoning the community. This place really is special and unique, and the (mostly) continuous history of the fora and continued participation by many long-timers is a big part of that.
A forum that is invisible unless you are logged in is heading towards death.
If nobody can read comments they're interested in why would they ever sign up for an account?
Look how stupid Twitter is now that you can barely see anything without a login.
"Viewable only while logged in" was one of the core security functions of the Velvet Room. I strongly suspect that its closure is at least partially because even that isn't really an impediment to AI data scrapers. It was a form of security through obscurity, and the time of obscurity has passed.
First I want to say I appreciate all the effort Aurich has put into addressing this situation, and, like him, I'd urge long-time posters to give the matter serious thought before abandoning the community. This place really is special and unique, and the (mostly) continuous history of the fora and continued participation by many long-timers is a big part of that.
On the above: I wonder if a compromise (between Shavano's suggestion and Aurich's response) could be reached. I understand Aurich's point, and I would submit that the front page discussions and the technical fora constitute the informational part of Ars and provide plenty of impetus for people to sign up for an account.
There are a few fora though, I'm thinking of the Soap Box, Lounge, and Boardroom (and former VR), where the content tends to get more personal and less technically informative, where people who know each other are being themselves, where others are relying on the thin veil of anonymity (which AI may see through) to post things they otherwise wouldn't share broadly. I suggest making those 3 forums viewable only when logged in (and perhaps also take some steps to make scraping data from those fora more difficult), which would at least create something of an oasis where people feel comfortable contributing without their contributions being (easily) used by LLMs etc.
I agree that closing the VR made sense. I continue to think, though, that for the Lounge, SB, and BR (for somewhat different reasons in each case) it would be best to go to the 'log in to view with some attempts to make scraping even while logged in difficult' model.
"Viewable only while logged in" was one of the core security functions of the Velvet Room. I strongly suspect that its closure is at least partially because even that isn't really an impediment to AI data scrapers. It was a form of security through obscurity, and the time of obscurity has passed.
It was explained in a thread (in the VR, just before it disappeared) that Ars simply no longer wishes to host the kinds of conversations that went on there. Never did the topic of AI scraping enter the conversation. I can't comment on whether that was the real reason for its closure.
"Viewable only while logged in" was one of the core security functions of the Velvet Room. I strongly suspect that its closure is at least partially because even that isn't really an impediment to AI data scrapers. It was a form of security through obscurity, and the time of obscurity has passed.
It was explained in a thread (in the VR, just before it disappeared) that Ars simply no longer wishes to host the kinds of conversations that went on there. Never did the topic of AI scraping enter the conversation. I can't comment on whether that was the real reason for its closure.
The what did YOU eat last night thread?
So where do I post about my nut bra collection now?
That all makes a lot of sense.
It was time for the VR to go.
And to be clear nothing is erased, I just locked access to it. If there was say a thread that someone really needed access to I can still move it out as needed etc.
I think what I'd say about the VR situation is that it's honestly a sign that we're operating in good faith. I know I said we don't erase posts. Once we allow that and a bunch of people do it all the old posts will become swiss cheese and we might as well just give up.
We don't want to give up our forum history.
But the VR was just chock full of really personal stuff, a lot of things that have frankly aged badly, and I honestly don't think it's something anyone should be crawling back through and trying to resurface.
So we made the call that it was best to box it up and let it gather dust. Because we're not trying to make people's lives harder or find drama. We're not such sticklers for our rules that we can't be flexible where it makes sense.
This is well put and reflects my concerns also. I feel there is lots of PII available in the forums you mention. The fact that it needs to be assembled would previously have been a barrier to its exploitation. However, now that LLMs can seemingly perform this reassembling (or may soon be able to), this distributed PII may become more accessible.
That all makes a lot of sense.
So can we circle back to the current request that started this mini-discussion?
The Lounge is not quite as chock full of really personal stuff as the VR, but it does have a lot of very personal stuff, particularly in older posts. And not all of it has aged well, and a fair amount of it is personally identifiable.
Similarly the SB is full of decades of people working out views on complex and heated issues. And norms change over time, and views that were well within the norms of 1999 can look out of place now. And we want to encourage people to run for local office and not have their views on charged issue X from 20 years ago weighing them down.
And finally the BR, particularly in its earlier days, is full of people laying out their personal financial or job situation and asking for advice. It's a great forum, but it's full of info that people shared with a sense of trust in the group they were sharing it with.
In any case, the suggestion is to take just those three fora (Lounge, SB, BR) and require a log in to view (and ideally also take some technical steps to make scraping difficult even when logged in). This would strike a balance between keeping material available and keeping the fora vibrant and appealing, with some respect for people's desire not to have their more personal posts (easily) scraped by LLMs etc.
And it would still keep plenty of fora (front page and all of the technical fora) visible to anyone and serving as a draw for new people to sign up for accounts. Arguably the draw to sign up might in fact become larger, as potential new members could see numerous vibrant fora and also see that they needed to sign up if they want to see content on politics, finance, etc.
This is the core of the issue, I think, when it comes to forum data being used in this way. It would previously have been too much effort to reassemble a lot of the implicit and explicit relationships that have existed across time here.
This is well put and reflects my concerns also. I feel there is lots of PII available in the forums you mention. The fact that it needs to be assembled would previously have been a barrier to its exploitation. However, now that LLMs can seemingly perform this reassembling (or may soon be able to), this distributed PII may become more accessible.
For me, this leads to 2 conclusions:
- Sites that relied on PII being obfuscated or distributed may need to reconsider that they are, in fact, still holding that PII and are custodians of it.
- A deal to get in bed with OpenAI voluntarily (and possibly for financial reward) might end up being judged as giving away/selling PII.
I do feel that shutting down this conversation with "well it's always been like that" is shying away from a potential looming issue and also not true, because it ignores the fact that this PII may now be surfaceable in ways that it previously wasn't. Equally, the "it's not us, it's Condé Nast" defence is irrelevant.
TLDR: I endorse your suggestion that TL, SB & BR should be accessible via login only. Best efforts and all that.