{"id":220621,"date":"2024-04-04T23:52:45","date_gmt":"2024-04-04T23:52:45","guid":{"rendered":"https:\/\/michigandigitalnews.com\/index.php\/2024\/04\/04\/openai-may-have-violated-youtube-terms-of-service-ceo-says\/"},"modified":"2025-06-25T17:19:17","modified_gmt":"2025-06-25T17:19:17","slug":"openai-may-have-violated-youtube-terms-of-service-ceo-says","status":"publish","type":"post","link":"https:\/\/michigandigitalnews.com\/index.php\/2024\/04\/04\/openai-may-have-violated-youtube-terms-of-service-ceo-says\/","title":{"rendered":"OpenAI may have violated YouTube terms of service, CEO says"},"content":{"rendered":"<p> [ad_1]<br \/>\n<br \/><img decoding=\"async\" src=\"https:\/\/fortune.com\/img-assets\/wp-content\/uploads\/2024\/04\/GettyImages-1933965269-e1712263856214.jpg?w=2048\" \/><\/p>\n<p>It\u2019s well known that <a href=\"https:\/\/fortune.com\/asia\/2024\/03\/28\/openai-trillion-dollar-valuation-megacap-microsoft-google-china-genai\/\" target=\"_self\" rel=\"noopener\" class=\"sc-76811d68-0 jyYcOa\">OpenAI<\/a> scrapes <a href=\"https:\/\/fortune.com\/2024\/04\/04\/ai-training-costs-how-much-is-too-much-openai-gpt-anthropic-microsoft\/\" target=\"_self\" rel=\"noopener\" class=\"sc-76811d68-0 jyYcOa\">vast amounts of data<\/a>, some of it copyrighted, from the internet to produce the uncannily human-like experience of ChatGPT. The legality of that is still a live question, as <a href=\"https:\/\/fortune.com\/2024\/01\/02\/new-york-times-openai-microsoft-copyright-lawsuit\/\" target=\"_self\" rel=\"noopener\" class=\"sc-76811d68-0 jyYcOa\">lawsuits<\/a> from the <em>New York Times<\/em> and others attest. But how does it train its new video AI program, <a href=\"https:\/\/fortune.com\/2024\/02\/22\/openai-sora-terrifies-the-public\/\" target=\"_self\" rel=\"noopener\" class=\"sc-76811d68-0 jyYcOa\">Sora<\/a>?<\/p>\n<div>\n<p>If Sora used content from <a href=\"https:\/\/fortune.com\/company\/youtube\/\" target=\"_blank\" rel=\"noopener\" class=\"sc-76811d68-0 jyYcOa\">YouTube<\/a> it would be a \u201cclear violation\u201d of its terms of service, YouTube CEO Neal Mohan told Bloomberg.\u00a0<\/p>\n<p>Mohan was referring to long-standing questions about where AI companies get the content they use to train the model that power their services. While Mohan was sure to say he didn\u2019t know whether OpenAI\u2019s had used YouTube content to develop Sora, he said that would be a problem, if so.\u00a0<\/p>\n<p>\u201cFrom a creator\u2019s perspective, when a creator uploads their hard work to our platform, they have certain expectations,\u201d Mohan said. \u201cOne of those expectations is that the terms of service are going to be abided by.\u201d\u00a0<\/p>\n<p>Something like having their content scraped from the platform and used by a third party would be a \u201cclear violation of our [terms of service],\u201d Mohan said.\u00a0<\/p>\n<p>Downloading videos or transcripts would be an infringement on terms. \u201cThose are the rules of the road in terms of content on our platform,\u201d Mohan said.<\/p>\n<p>A spokesperson for YouTube confirmed its terms of service \u201cprohibit unauthorized scraping or downloading of YouTube content,\u201d without elaborating on Mohan\u2019s comments. OpenAI did not immediately respond to a request for comment.\u00a0<\/p>\n<p>OpenAI admitted that it had used copyrighted data to train its AI models, saying it was \u201c<a href=\"https:\/\/fortune.com\/2024\/01\/09\/openai-copyright-impossible-new-york-times\/\" target=\"_self\" rel=\"noopener\" class=\"sc-76811d68-0 jyYcOa\">impossible<\/a>\u201d to build the technology without it. The admission came from a filing OpenAI submitted to the British House of Lords when the U.K. government was considering a new law that would limit how AI companies could use copyrighted material.\u00a0<\/p>\n<p>More recently, the launch of Sora drew further scrutiny when OpenAI CTO Mira Murati was unable to answer a question about what type of content was used to train the program, and specifically if any from YouTube was. \u201cI\u2019m actually not sure about that,\u201d Murati <a href=\"https:\/\/www.wsj.com\/video\/series\/joanna-stern-personal-technology\/openai-made-me-crazy-videosthen-the-cto-answered-most-of-my-questions\/C2188768-D570-4456-8574-9941D4F9D7E2\" target=\"_blank\" rel=\"noopener\" class=\"sc-76811d68-0 jyYcOa\">told<\/a> the <em>Wall Street Journal<\/em>.\u00a0<\/p>\n<p>Murati then added that any data used was publicly available or licensed. Mohan hinted at this interview telling Bloomberg they should ask OpenAI if it had used YouTube data. \u201cI guess they were asked,\u201d Mohan seemed to remember midsentence, cutting himself off.\u00a0\u00a0<\/p>\n<p>Further complicating the matter is that YouTube and Google\u2019s parent company, <a href=\"https:\/\/fortune.com\/company\/alphabet\/\" target=\"_blank\" rel=\"noopener\" class=\"sc-76811d68-0 jyYcOa\">Alphabet<\/a>, is developing its own suite of <a href=\"https:\/\/fortune.com\/2023\/05\/16\/alphabet-adds-115-billion-value-new-ai-tools\/\" target=\"_self\" rel=\"noopener\" class=\"sc-76811d68-0 jyYcOa\">AI tools<\/a>, making it likely that Alphabet is even more concerned a potential rival might be using its content in a way that violates its terms of service.\u00a0<\/p>\n<p>\u201cGoogle wants that data for its own models,\u201d Igor Jablokov, founder and CEO of AI startup Pyron, told <em>Fortune<\/em>.\u00a0<\/p>\n<p>The AI arms race has already kicked off a gold rush for data. Big AI players like Alphabet, <a href=\"https:\/\/fortune.com\/company\/microsoft\/\" target=\"_blank\" rel=\"noopener\" class=\"sc-76811d68-0 jyYcOa\">Microsoft<\/a>, <a href=\"https:\/\/fortune.com\/company\/amazon-com\/\" target=\"_blank\" rel=\"noopener\" class=\"sc-76811d68-0 jyYcOa\">Amazon<\/a>, and Meta will want to make sure rivals don\u2019t take the data they\u2019ve accumulated. \u201cThey\u2019ll all put up walled gardens as terms and conditions,\u201d says Jablokov, whose previous voice-recognition startup was instrumental in Amazon\u2019s subsequent creation of Alexa.\u00a0<\/p>\n<p>For example, Reddit recently entered into a $60 million a year <a href=\"https:\/\/www.reuters.com\/technology\/reddit-ai-content-licensing-deal-with-google-sources-say-2024-02-22\/\" target=\"_blank\" rel=\"noopener\" class=\"sc-76811d68-0 jyYcOa\">licensing agreement<\/a> with Google that would see its content used to train the latter\u2019s AI tools. Media companies have also struck similar deals with AI developers. The Associated Press has a deal with OpenAI that allows its archives to be used for training purposes. Meanwhile, German media company Axel Springer, which owns <em>Business Insider<\/em> and Politico, has a similar deal that also provides attribution in answers given by ChatGPT.<\/p>\n<\/div>\n<div data-cy=\"subscriptionPlea\">Subscribe to the Eye on AI newsletter to stay abreast of how AI is shaping the future of business. <a href=\"https:\/\/www.fortune.com\/newsletters\/eye-on-ai?&amp;itm_source=fortune&amp;itm_medium=article_tout&amp;itm_campaign=eye_on_ai\" target=\"_self\" rel=\"noopener\" class=\"sc-76811d68-0 jyYcOa\">Sign up<\/a> for free.<\/div>\n<p>[ad_2]<br \/>\n<br \/><a href=\"https:\/\/fortune.com\/2024\/04\/04\/openai-youtube-clear-violation-terms-service-ai-sora-training\/\">Source link <\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>[ad_1] It\u2019s well known that OpenAI scrapes vast amounts of data, some of it copyrighted, from the internet to produce the uncannily human-like experience of<\/p>\n","protected":false},"author":1,"featured_media":220622,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"_uf_show_specific_survey":0,"_uf_disable_surveys":false,"footnotes":""},"categories":[149],"tags":[],"_links":{"self":[{"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/posts\/220621"}],"collection":[{"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/comments?post=220621"}],"version-history":[{"count":1,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/posts\/220621\/revisions"}],"predecessor-version":[{"id":330721,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/posts\/220621\/revisions\/330721"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/media\/220622"}],"wp:attachment":[{"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/media?parent=220621"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/categories?post=220621"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/tags?post=220621"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}