Recently, I blocked all the LLM user agents on my Wix site using the robots.txt file. Since my site is content-based, I can't afford platforms like Perplexity and Claude taking my content and serving it to their user base; it took me hours to write and create it.
These crawlers are notorious for crawling websites despite being blocked. Even after I blocked their agents, platforms like ChatGPT, Claude, and Perplexity are still taking my content (so much for LLM companies respecting copyright).
I urge the Wix team to address this issue and add a proper firewall feature against these AI crawlers. They are not only ignoring robots.txt but also straight-up taking the content even when site owners have explicitly opted out.
A Cloudflare integration could be a good choice as well, but I am afraid that after a year or so, they will start selling data too.
If anyone wants to test it, this is my website - https://www.utilitytools.online/
And this is the current robots.txt file - https://www.utilitytools.online/robots.txt
I asked the chat platforms about a recent page of mine, and they fetched it even though their agents are blocked in robots.txt.
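
For anyone who wants to rule out a mistake in the rules themselves before blaming the crawlers, here is a minimal sketch that uses Python's standard urllib.robotparser to check which user agents a robots.txt disallows. The sample file content, the agent tokens (GPTBot, ClaudeBot, PerplexityBot), the example.com URL, and the helper name blocked_agents are all assumptions for illustration; they are not copied from my actual robots.txt. Keep in mind this only shows what the file asks crawlers to do, not whether they obey it.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content for illustration only;
# this is NOT the actual file from the site linked above.
SAMPLE_ROBOTS = """\
User-agent: GPTBot
Disallow: /

User-agent: ClaudeBot
Disallow: /

User-agent: PerplexityBot
Disallow: /

User-agent: *
Allow: /
"""


def blocked_agents(robots_text, agents, url):
    """Return {agent: True/False}, where True means robots.txt disallows the URL."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network access is needed here.
    parser.parse(robots_text.splitlines())
    return {agent: not parser.can_fetch(agent, url) for agent in agents}


# Agent tokens below are assumed examples; check each vendor's docs for the real ones.
result = blocked_agents(
    SAMPLE_ROBOTS,
    ["GPTBot", "ClaudeBot", "PerplexityBot", "Googlebot"],
    "https://example.com/some-page",
)

for agent, is_blocked in result.items():
    print(f"{agent}: {'blocked' if is_blocked else 'allowed'}")
```

To run the same check against a live file, the parser's set_url() and read() methods can fetch it over the network instead of parsing a string. If an agent shows as blocked here and the page is still fetched, the rules are fine and the crawler is simply not honoring them, which is why a firewall-level block is needed.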

