{"id":254691,"date":"2024-08-15T21:28:17","date_gmt":"2024-08-15T21:28:17","guid":{"rendered":"https:\/\/michigandigitalnews.com\/index.php\/2024\/08\/15\/allow-ai-scraping-from-google-or-lose-search-visibility\/"},"modified":"2025-06-25T17:12:19","modified_gmt":"2025-06-25T17:12:19","slug":"allow-ai-scraping-from-google-or-lose-search-visibility","status":"publish","type":"post","link":"https:\/\/michigandigitalnews.com\/index.php\/2024\/08\/15\/allow-ai-scraping-from-google-or-lose-search-visibility\/","title":{"rendered":"Allow AI scraping from Google or lose search visibility"},"content":{"rendered":"<p> [ad_1]<br \/>\n<\/p>\n<div>\n<p>As the US government weighs its options following <a data-i13n=\"cpos:1;pos:1\" href=\"https:\/\/www.engadget.com\/big-tech\/google-is-a-monopolist-in-search-us-judge-rules-in-antitrust-case-193358356.html\" data-ylk=\"slk:a landmark \u201cmonopolist\u201d ruling against Google;cpos:1;pos:1;elm:context_link;itc:0;sec:content-canvas\" class=\"link \">a landmark \u201cmonopolist\u201d ruling against Google<\/a> last week, online publications increasingly face a bleak future. (And this time, it\u2019s not just because of severely diminished ad revenue.) <em>Bloomberg<\/em> <a data-i13n=\"elm:context_link;elmt:doNotAffiliate;cpos:2;pos:1\" class=\"link \" href=\"https:\/\/www.bloomberg.com\/news\/articles\/2024-08-15\/google-s-search-dominance-leaves-sites-little-choice-on-ai-scraping\" rel=\"nofollow noopener\" target=\"_blank\" data-ylk=\"slk:reports;elm:context_link;elmt:doNotAffiliate;cpos:2;pos:1;itc:0;sec:content-canvas\">reports<\/a> that their choice now boils down to allowing Google to use their published content to produce <a data-i13n=\"cpos:3;pos:1\" href=\"https:\/\/www.engadget.com\/google-search-will-now-show-ai-generated-answers-to-millions-by-default-174512845.html\" data-ylk=\"slk:inline AI-generated search \u201canswers\u201d;cpos:3;pos:1;elm:context_link;itc:0;sec:content-canvas\" class=\"link \">inline AI-generated search \u201canswers\u201d<\/a> or losing visibility in the company\u2019s search engine.<\/p>\n<p>The crux of the problem lies in the Googlebot, the crawler that scours and indexes the live web to produce the results you see when you enter search terms. If publishers block Google from using their content for the AI-produced answers you now see littered at the top of many search results, they also lose the privilege of including their web pages in the standard web results.<\/p>\n<p>The catch-22 has led publications, rival search engines and AI startups to pin their hopes on the Justice Department. On Tuesday, <em>The New York Times<\/em> <a data-i13n=\"elm:context_link;elmt:doNotAffiliate;cpos:4;pos:1\" class=\"link \" href=\"https:\/\/www.nytimes.com\/2024\/08\/13\/technology\/google-monopoly-antitrust-justice-department.html\" rel=\"nofollow noopener\" target=\"_blank\" data-ylk=\"slk:reported;elm:context_link;elmt:doNotAffiliate;cpos:4;pos:1;itc:0;sec:content-canvas\">reported<\/a> that the DOJ is considering asking a federal judge to break up parts of the company (spinning off sections like Chrome or Android). Other options it\u2019s reportedly weighing include forcing Google to share search data with competitors or relinquishing its default search-engine deals, like the <a data-i13n=\"cpos:5;pos:1\" href=\"https:\/\/www.engadget.com\/google-reportedly-pays-apple-36-percent-of-ad-search-revenues-from-safari-191730783.html\" data-ylk=\"slk:$18 billion one it inked with Apple;cpos:5;pos:1;elm:context_link;itc:0;sec:content-canvas\" class=\"link \">$18 billion one it inked with Apple<\/a>.<\/p>\n<p>Google uses a separate crawler for its <a data-i13n=\"cpos:6;pos:1\" href=\"https:\/\/www.engadget.com\/google-rebrands-its-bard-ai-chatbot-as-gemini-which-now-has-its-own-android-app-151303210.html\" data-ylk=\"slk:Gemini (formerly Bard) chatbot;cpos:6;pos:1;elm:context_link;itc:0;sec:content-canvas\" class=\"link \">Gemini (formerly Bard) chatbot<\/a>. But its main crawler covers both AI Overviews and standard searches, leaving web publishers with little (if any) leverage. If you let Google scrape your content for AI Overview answers, readers may consider that the end of the matter without bothering to visit your site (meaning zero revenue from those potential readers). But if you block the Googlebot, you lose search visibility, which likely means significantly less short-term income and a colossal loss of long-term competitive standing.<\/p>\n<p><em>iFixit<\/em> CEO Kyle Wiens told <em>Bloomberg<\/em>, \u201cI can block ClaudeBot [Anthropic\u2019s crawler for its <a data-i13n=\"cpos:7;pos:1\" href=\"https:\/\/www.engadget.com\/anthropics-newest-claude-chatbot-beats-openais-gpt-4o-in-some-benchmarks-170135962.html\" data-ylk=\"slk:Claude chatbot;cpos:7;pos:1;elm:context_link;itc:0;sec:content-canvas\" class=\"link \">Claude chatbot<\/a>] from indexing us without harming our business. But if I block Googlebot, we lose traffic and customers.\u201d<\/p>\n<figure class=\"caas-figure\">\n<div class=\"caas-figure-with-pb\" style=\"max-height: 535px\">\n<div>\n<div class=\"caas-img-container caas-img-loader\" style=\"padding-bottom:56%\"><img decoding=\"async\" class=\"caas-img caas-lazy has-preview\" alt=\"A sample Google search query with an AI Overview answer.\" src=\"https:\/\/s.yimg.com\/ny\/api\/res\/1.2\/NRVvCQXFInEh6Y5hdcSpaQ--\/YXBwaWQ9aGlnaGxhbmRlcjt3PTk2MDtoPTUzNQ--\/https:\/\/s.yimg.com\/os\/creatr-uploaded-images\/2024-08\/531965f0-5b41-11ef-97f7-4d10d890077b\"\/><img decoding=\"async\" alt=\"A sample Google search query with an AI Overview answer.\" src=\"https:\/\/s.yimg.com\/ny\/api\/res\/1.2\/NRVvCQXFInEh6Y5hdcSpaQ--\/YXBwaWQ9aGlnaGxhbmRlcjt3PTk2MDtoPTUzNQ--\/https:\/\/s.yimg.com\/os\/creatr-uploaded-images\/2024-08\/531965f0-5b41-11ef-97f7-4d10d890077b\" class=\"caas-img\"\/><\/div>\n<\/div>\n<\/div>\n<p><figcaption class=\"caption-collapse\"><span class=\"caption-credit\"> Google<\/span><\/figcaption><\/p>\n<\/figure>\n<p>Another problem with combining the two is that it gives Google an immeasurable advantage over smaller AI startups. The company gets a plethora of free training data from publishers eager to remain visible in search. In contrast, AI companies are forced to pay publishers for access to their data \u2014 and, even then, it wouldn\u2019t add up to the motherlode Google gets (essentially) for free.<\/p>\n<p>From that perspective, it isn\u2019t surprising to read that, according to <em>Bloomberg<\/em>, Google is spurning publishers that try to negotiate content deals. (Reddit <a data-i13n=\"cpos:8;pos:1\" href=\"https:\/\/www.engadget.com\/reddit-is-licensing-its-content-to-google-to-help-train-its-ai-models-200013007.html\" data-ylk=\"slk:has been the lone exception;cpos:8;pos:1;elm:context_link;itc:0;sec:content-canvas\" class=\"link \">has been the lone exception<\/a>.) Why waste money on content deals when they get all the training data they want in exchange for the search results most publishers need to survive?<\/p>\n<p>\u201cNow you have a bunch of tech companies that are paying for content, they\u2019re paying for access to that because they need it to be able to compete in any kind of serious way,\u201d Alex Rosenberg, CEO of AI startup Tako Inc., told <em>Bloomberg<\/em>. \u201cWhereas for Google, they don\u2019t really have to do that.\u201d<\/p>\n<p>It comes down to leverage, which Google wields over desperate publishers. On top of the industry\u2019s existing financial troubles (online ad revenue has fallen off a cliff over the past eight years), <em>AdWeek<\/em> <a data-i13n=\"elm:context_link;elmt:doNotAffiliate;cpos:9;pos:1\" class=\"link \" href=\"https:\/\/www.adweek.com\/programmatic\/googles-gen-ai-search-threatens-publishers-with-2b-annual-ad-revenue-loss\/\" rel=\"nofollow noopener\" target=\"_blank\" data-ylk=\"slk:reported;elm:context_link;elmt:doNotAffiliate;cpos:9;pos:1;itc:0;sec:content-canvas\">reported<\/a> in March that Google\u2019s AI-generated search answers could lead to a 20 to 60 percent drop in organic search traffic.<\/p>\n<p>The ball is now in the Justice Department\u2019s court to figure out where Google \u2014 and, to an extent, the entire web \u2014 goes from here. <em>Bloomberg<\/em>\u2019s full story is <a data-i13n=\"elm:context_link;elmt:doNotAffiliate;cpos:10;pos:1\" class=\"link \" href=\"https:\/\/www.bloomberg.com\/news\/articles\/2024-08-15\/google-s-search-dominance-leaves-sites-little-choice-on-ai-scraping\" rel=\"nofollow noopener\" target=\"_blank\" data-ylk=\"slk:worth a read;elm:context_link;elmt:doNotAffiliate;cpos:10;pos:1;itc:0;sec:content-canvas\">worth a read<\/a>.<\/p>\n<\/div>\n<p>[ad_2]<br \/>\n<br \/><a href=\"https:\/\/www.engadget.com\/ai\/online-publishers-face-a-dilemma-allow-ai-scraping-from-google-or-lose-search-visibility-202246891.html?src=rss\">Source link <\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>[ad_1] As the US government weighs its options following a landmark \u201cmonopolist\u201d ruling against Google last week, online publications increasingly face a bleak future. (And<\/p>\n","protected":false},"author":1,"featured_media":254692,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"_uf_show_specific_survey":0,"_uf_disable_surveys":false,"footnotes":""},"categories":[159],"tags":[],"_links":{"self":[{"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/posts\/254691"}],"collection":[{"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/comments?post=254691"}],"version-history":[{"count":0,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/posts\/254691\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/media\/254692"}],"wp:attachment":[{"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/media?parent=254691"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/categories?post=254691"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/tags?post=254691"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}