
I think people haven't fully clocked yet how foundational this shift is for the wider internet.

For decades now, we've had a social contract: Google scrapes our websites and surfaces some of our content, we make it easier for them, Google sends people to our websites, and in exchange Google gets content (links) to stuff with ads (plus some tracking on top).

This created an ecosystem where people are incentivised to create better websites. You create good content, Google surfaces it, you get eyeballs, and it's a decent deal overall.

Sure, this dynamic is contingent on Google fighting SEO spam, which they seemingly stopped doing lately, but there was still hope that this was temporary: if search got bad enough, they'd start bleeding users to competitors and get their act together.

Now Google throws the contract out of the window. Whatever you write or record or snap or shoot, they ingest everything (because we have no choice) and feed it into their Moloch, and out comes unattributed, unverifiable, ad-ridden slop. There is no compensation, no recompense.

Google is midjourneying the entire internet.



I think it's even worse than that. I think we'll see a shift in incentives for writing content. If websites are used as the source for AI engines that people rely on, those websites are effectively becoming the source of truth for everything.

Right now I can tell when a website is blog-spam SEO garbage, but I'm not sure that'll be possible once everything is "laundered" into a homogeneous-looking result.

I think there's going to be incentive to create huge quantities of "data" because having the highest volume makes you the source of truth for a topic. So, rich companies and people will be buying compute like crazy so they can generate tons of AI spam so another AI surfaces their viewpoint as "fact".

The future is going to suck.


Ugh yeah, I hadn't even realized what a serious vulnerability it will be when large corporations and foreign governments start to flood the web with their preferred "facts". Google and Microsoft will gladly slurp up all of that training data. Modern Google doesn't even bother to exclude blatant spam domains from SERPs, there's no way they are going to combat this problem effectively.

This really is looking more and more like the death of the open web. Curated and closed communities will be the only places left to get authentic information online.


Do the AI features apply to content from sites that block 'Google-Extended' (the robots.txt token Google uses to control whether content can be used to train its LLMs)?[1]

I'm not as worried if that distinction still applies – I can just block 'Google-Extended' and let the normal Googlebot through. If not, though, that feels like the beginning of the end of people wanting to be indexed, eventually leading to Google eating itself.

[1] https://developers.google.com/search/docs/crawling-indexing/...
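For anyone wanting to try that distinction, the opt-out described above is a robots.txt change along these lines (a sketch based on Google's published token names; check the linked docs for current behavior, since 'Google-Extended' is a control token honored by Google's crawlers rather than a separate user agent):

```
# Opt out of content being used to train Google's LLMs.
User-agent: Google-Extended
Disallow: /

# Still allow normal search crawling and indexing.
User-agent: Googlebot
Allow: /
```

Whether blocking Google-Extended actually keeps your content out of the AI search features, as opposed to just model training, is exactly the open question here.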


I have felt for about six months that the web as we've known it is dead. The contract you speak of is on its last legs.

The problem is, with GPT-style AI, I think this was inevitable; Google has just decided to ride the wave rather than fall into obscurity.

Also, Google has YouTube, which shouldn't be underestimated as a source of information. There is a lot of good, credible information on YouTube, with even better signals (sub counts) than the traditional web.

This is Google's ace in the hole, so to speak: the web can die, and they can still feed their AIs with YouTube. Who else can?


> Google throws the contract out of the window

History shows what happens after one party defects on a contract.

Will new human-authored content move beyond the reach of search engines, outside of expensive licensing of social media feeds?


> unattributed

Plenty of AI models cite their sources.



