Bots are currently scraping the internet for LLM training data at unprecedented rates[1][2][3], driving up costs and destabilizing public-facing websites. I want to talk about how this has been particularly difficult for wikis, and has gotten much worse in the last few months.
I really want a tutorial on how to do this. I think it’s a great way to practice self-agrandizement by making myself the pretend king of a pretend country.