{"id":238576,"date":"2024-07-01T19:37:32","date_gmt":"2024-07-01T19:37:32","guid":{"rendered":"https:\/\/michigandigitalnews.com\/index.php\/2024\/07\/01\/amazon-probes-perplexity-ai-for-alleged-content-scraping\/"},"modified":"2025-06-25T17:15:43","modified_gmt":"2025-06-25T17:15:43","slug":"amazon-probes-perplexity-ai-for-alleged-content-scraping","status":"publish","type":"post","link":"https:\/\/michigandigitalnews.com\/index.php\/2024\/07\/01\/amazon-probes-perplexity-ai-for-alleged-content-scraping\/","title":{"rendered":"Amazon probes Perplexity AI for alleged content scraping"},"content":{"rendered":"<p><img decoding=\"async\" src=\"https:\/\/readwrite.com\/wp-content\/uploads\/2024\/07\/Amazon-probes-Perplexity-AI-for-alleged-content-scraping-900x600.png\" \/><\/p>\n<div>\n<p>Amazon is reviewing allegations that <a href=\"https:\/\/readwrite.com\/perplexity-ai-launches-new-tool-to-turn-research-into-content\/\">Perplexity AI<\/a>, an <a href=\"https:\/\/readwrite.com\/category\/ai\/\">artificial intelligence<\/a> startup, has been scraping content from major news websites without permission.<\/p>\n<p>An Amazon spokesperson said on Friday (June 28) that the company is looking into several reports from <a href=\"https:\/\/www.wired.com\/story\/aws-perplexity-bot-scraping-investigation\/\" target=\"_blank\" rel=\"noopener\">WIRED<\/a> and <a href=\"https:\/\/www.forbes.com\/sites\/randalllane\/2024\/06\/11\/why-perplexitys-cynical-theft-represents-everything-that-could-go-wrong-with-ai\/\" target=\"_blank\" rel=\"noopener\">Forbes<\/a> claiming that Perplexity has been accessing content from websites that explicitly prohibit such scraping practices. Perplexity operates using servers provided by <a href=\"https:\/\/readwrite.com\/aws-embracing-the-power-of-generative-ai\/\">Amazon Web Services<\/a> (AWS).<\/p>\n<p>The representative also noted that all AWS clients are required to adhere to the instructions in the robots.txt file. 
These files are generally used by websites to tell bots and web crawlers not to scrape their data, whether for generative AI tools or other uses.<\/p>\n<p>\u201cAWS\u2019s terms of service prohibit abusive and illegal activities and our customers are responsible for complying with those terms. We routinely receive reports of alleged abuse from a variety of sources and engage our customers to understand those reports,\u201d the representative said.<\/p>\n<p>Forbes\u2019 editor and chief content officer, Randall Lane, accused Perplexity of \u201ccynical theft,\u201d saying the company created \u201cknockoff stories\u201d that contain \u201ceerily similar wording\u201d and \u201centirely lifted fragments\u201d from its articles.<\/p>\n<p>He added: \u201cMore egregiously, the post, which looked and read like a piece of journalism, didn\u2019t mention Forbes at all, other than a line at the bottom of every few paragraphs that mentioned \u2018sources,\u2019 and a very small icon that looked to be the \u2018F\u2019 from the Forbes logo \u2013 if you squinted.\u201d<\/p>\n<h2>Has Perplexity AI plagiarized content?<\/h2>\n<p>Perplexity, the San Francisco-based AI search startup once celebrated by top tech investors like <a href=\"https:\/\/readwrite.com\/perplexity-an-ai-based-answer-engine-wins-backing-from-jeff-bezos\/\">Amazon\u2019s Jeff Bezos<\/a>, has recently faced scrutiny over plagiarism accusations.<\/p>\n<p>Aravind Srinivas, CEO of Perplexity, denied allegations that his company was \u201cignoring the Robot Exclusions Protocol and then lying about it.\u201d Srinivas acknowledged to <a href=\"https:\/\/www.fastcompany.com\/91144894\/perplexity-ai-ceo-aravind-srinivas-on-plagiarism-accusations\" target=\"_blank\" rel=\"noopener\">Fast Company<\/a> that Perplexity does use third-party web crawlers in addition to its own, and confirmed that the bot identified by WIRED was among them.<\/p>\n<p>However, he added, \u201cIt was 
accurately pointed out by Forbes that they preferred a more prominent highlighting of the source.\u201d Srinivas added that sources are now highlighted more prominently.<\/p>\n<p>ReadWrite has reached out to Amazon and Perplexity for comment.<\/p>\n<p><em>Featured image: Canva \/ Perplexity AI<\/em><\/p>\n<\/p><\/div>\n<p><a href=\"https:\/\/readwrite.com\/amazon-perplexity-ai-alleged-content-scraping\/\">Source link<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Amazon is reviewing allegations that Perplexity AI, an artificial intelligence startup, has been scraping content from major news websites without permission. An Amazon spokesperson<\/p>\n","protected":false},"author":1,"featured_media":238577,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"_uf_show_specific_survey":0,"_uf_disable_surveys":false,"footnotes":""},"categories":[152],"tags":[],"_links":{"self":[{"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/posts\/238576"}],"collection":[{"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/comments?post=238576"}],"version-history":[{"count":0,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/posts\/238576\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/media\/238577"}],"wp:attachment":[{"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\
/v2\/media?parent=238576"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/categories?post=238576"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/tags?post=238576"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}