wsj.com
While publishers contend with how AI is changing search, they are also seeking ways to protect their copyright material. The large language models that underpin the new generation of chatbots are trained on data hoovered up from the open web, including news articles.
This was always the central transaction of Google. It can display portions of your site (or maybe even a fully cached version) and in return site owners get traffic. The deal is off. Now it's all crawling/scraping but keeping most of the traffic for themselves.
« Previous post / Next post »
Hi! You're reading a single post on a weblog by Paul Bausch where I share recommended links, my photos, and occasional thoughts.