{"id":210309,"date":"2024-03-05T07:24:27","date_gmt":"2024-03-05T07:24:27","guid":{"rendered":"https:\/\/michigandigitalnews.com\/index.php\/2024\/03\/05\/anthropic-says-its-new-claude-3-ai-chatbot-scores-better-on-key-benchmarks-than-gpt-4\/"},"modified":"2025-06-25T17:21:13","modified_gmt":"2025-06-25T17:21:13","slug":"anthropic-says-its-new-claude-3-ai-chatbot-scores-better-on-key-benchmarks-than-gpt-4","status":"publish","type":"post","link":"https:\/\/michigandigitalnews.com\/index.php\/2024\/03\/05\/anthropic-says-its-new-claude-3-ai-chatbot-scores-better-on-key-benchmarks-than-gpt-4\/","title":{"rendered":"Anthropic says its new Claude 3 AI chatbot scores better on key benchmarks than GPT-4"},"content":{"rendered":"<p> [ad_1]<br \/>\n<\/p>\n<div>\n<p>The battle between AI chatbots is more than a two-horse race. <a data-i13n=\"cpos:1;pos:1\" href=\"https:\/\/www.engadget.com\/anthropics-chatgpt-rival-claude-can-now-analyze-150000-words-in-one-prompt-201033756.html\" data-ylk=\"slk:Anthropic;cpos:1;pos:1;elm:context_link;itc:0;sec:content-canvas\" class=\"link \">Anthropic<\/a>, the company formed by several ex-OpenAI employees, claims its new Claude 3 language model outperforms ChatGPT and Google&#8217;s Gemini in several key industry benchmarks. It even hit &#8220;near-human&#8221; levels on some tasks, the company <a data-i13n=\"cpos:2;pos:1\" href=\"https:\/\/www.anthropic.com\/news\/claude-3-family\" rel=\"nofollow noopener\" target=\"_blank\" data-ylk=\"slk:wrote in a blog;cpos:2;pos:1;elm:context_link;itc:0;sec:content-canvas\" class=\"link \">wrote in a blog<\/a>.<\/p>\n<p>There are three new chatbots under the Claude 3 umbrella, including Haiku, Sonnet, and Opus. Sonnet powers the <a data-i13n=\"cpos:3;pos:1\" href=\"https:\/\/claude.ai\/login?returnTo=%2F\" rel=\"nofollow noopener\" target=\"_blank\" data-ylk=\"slk:Claude.ai chatbot;cpos:3;pos:1;elm:context_link;itc:0;sec:content-canvas\" class=\"link \">Claude.ai chatbot<\/a> and is offered for free with an email sign-in. Meanwhile, Opus is the largest and most powerful LLM and will be available with a $20 per month subscription via the &#8220;Claude Pro&#8221; service. It&#8217;s also multi-modal, so it can work with both text and image inputs, unlike past versions.<\/p>\n<p>All Claude 3 models &#8220;can power live customer chats, auto-completions and data extraction tasks where responses must be immediate and in real-time,&#8221; the company said. On top of promising &#8220;near-instant results,&#8221; they can supposedly handle longer, multi-step instructions with increased accuracy.<\/p>\n<figure class=\"caas-figure\">\n<div class=\"caas-figure-with-pb\" style=\"max-height: 853px\">\n<div>\n<div class=\"caas-img-container caas-img-loader\" style=\"padding-bottom:89%\"><img decoding=\"async\" class=\"caas-img caas-lazy has-preview\" alt=\"Anthropic says its new Claude 3 AI chatbot scores better on key benchmarks than GPT-4\" src=\"https:\/\/s.yimg.com\/ny\/api\/res\/1.2\/ykuM8P6gekwH2dsmdnW.EQ--\/YXBwaWQ9aGlnaGxhbmRlcjt3PTk2MDtoPTg1Mw--\/https:\/\/s.yimg.com\/os\/creatr-uploaded-images\/2024-03\/18996670-dabb-11ee-8ff9-c981e15a0e1f\"\/><noscript><img decoding=\"async\" alt=\"Anthropic says its new Claude 3 AI chatbot scores better on key benchmarks than GPT-4\" src=\"https:\/\/s.yimg.com\/ny\/api\/res\/1.2\/ykuM8P6gekwH2dsmdnW.EQ--\/YXBwaWQ9aGlnaGxhbmRlcjt3PTk2MDtoPTg1Mw--\/https:\/\/s.yimg.com\/os\/creatr-uploaded-images\/2024-03\/18996670-dabb-11ee-8ff9-c981e15a0e1f\" class=\"caas-img\"\/><\/noscript><\/div>\n<\/div>\n<\/div>\n<p><figcaption class=\"caption-collapse\"><span class=\"caption-credit\"> Anthropic<\/span><\/figcaption><\/p>\n<\/figure>\n<p>Opus showed better graduate-level reasoning than GPT-4, scoring 14.7 percent higher in that test than GPT-4. It also beat OpenAI&#8217;s chatbot in tasks involving math, coding, reasoning and knowledge.<\/p>\n<p>They also top past Claude models. &#8220;For the vast majority of workloads, Sonnet is 2x faster than Claude 2 and Claude 2.1 with higher levels of intelligence. It excels at tasks demanding rapid responses, like knowledge retrieval or sales automation. Opus delivers similar speeds to Claude 2 and 2.1, but with much higher levels of intelligence,&#8221; according to Anthropic.<\/p>\n<p>Meanwhile Haiku, the smallest version of Claude 3, is &#8220;the fastest and most cost-effective model on the market.&#8221; To that end, it&#8217;s capable of reading a dense research paper complete with charts and graphs in under three seconds.<\/p>\n<p>The company also noted that Claude 3 &#8220;can process a wide range of visual formats, including photos, charts, graphs and technical diagrams,&#8221; aiding companies that use PDFs, flowcharts, or presentation slides. It&#8217;ll also be less likely to refuse harmless content thanks to a more nuanced understanding of requests, while still recognizing &#8220;real harm.&#8221;<\/p>\n<p>Anthropic has said that Claude AI is <a data-i13n=\"cpos:4;pos:1\" href=\"https:\/\/www.engadget.com\/anthropics-claude-ai-is-guided-by-10-secret-foundational-pillars-of-fairness-193058471.html\" data-ylk=\"slk:guided by;cpos:4;pos:1;elm:context_link;itc:0;sec:content-canvas\" class=\"link \">guided by<\/a> 10 secret foundational pillars of fairness. Claude 3 was trained on both nonpublic internal and public-facing data, using hardware from Amazon Web Services (AWS) and Google Cloud (Amazon <a data-i13n=\"cpos:5;pos:1\" href=\"https:\/\/www.engadget.com\/amazons-invests-4-billion-in-anthropic-openai-rival-095321755.html\" data-ylk=\"slk:recently invested;cpos:5;pos:1;elm:context_link;itc:0;sec:content-canvas\" class=\"link \">recently invested<\/a> $4 billion in Anthropic).<\/p>\n<p>Claude 3 Opus and Claude 3 Sonnet are available now through Anthropic&#8217;s API, with Haiku set to follow soon. Sonnet is also accessible through Amazon Bedrock and in private preview on <a data-i13n=\"elm:affiliate_link;sellerN:;elmt:;cpos:6;pos:1\" href=\"https:\/\/shopping.yahoo.com\/rdlw?siteId=us-engadget&amp;pageId=1p-autolink&amp;featureId=text-link&amp;custData=eyJzb3VyY2VOYW1lIjoiV2ViLURlc2t0b3AtVmVyaXpvbiIsImxhbmRpbmdVcmwiOiJodHRwczovL2Nsb3VkLmdvb2dsZS5jb20vYmxvZy9wcm9kdWN0cy9haS1tYWNoaW5lLWxlYXJuaW5nL2Fubm91bmNpbmctYW50aHJvcGljcy1jbGF1ZGUtMy1tb2RlbHMtaW4tZ29vZ2xlLWNsb3VkLXZlcnRleC1haS8iLCJjb250ZW50VXVpZCI6IjBiMDllNjQ4LTUxMzUtNDk3Yi1iNzA1LTU5MGQwOWM5Yzc0YiJ9&amp;signature=AQAAAVA5V8WLAaF72aGAaulbLo-BVFlc02UX4dAzer3l4qff&amp;gcReferrer=https%3A%2F%2Fcloud.google.com%2Fblog%2Fproducts%2Fai-machine-learning%2Fannouncing-anthropics-claude-3-models-in-google-cloud-vertex-ai%2F\" class=\"link  rapid-with-clickid etailiffa-link\" rel=\"nofollow noopener\" target=\"_blank\" data-ylk=\"slk:Google Cloud's Vertex AI Model Garden;elm:affiliate_link;sellerN:;elmt:;cpos:6;pos:1;itc:0;sec:content-canvas\">Google Cloud&#8217;s Vertex AI Model Garden<\/a>.<\/p>\n<p>This article contains affiliate links; if you click such a link and make a purchase, we may earn a commission.<\/p>\n<\/div>\n<p>[ad_2]<br \/>\n<br \/><a href=\"https:\/\/www.engadget.com\/anthropic-says-its-new-claude-3-ai-chatbot-scores-better-on-key-benchmarks-than-gpt-4-071343736.html?src=rss\">Source link <\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>[ad_1] The battle between AI chatbots is more than a two-horse race. Anthropic, the company formed by several ex-OpenAI employees, claims its new Claude 3<\/p>\n","protected":false},"author":1,"featured_media":210310,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"_uf_show_specific_survey":0,"_uf_disable_surveys":false,"footnotes":""},"categories":[159],"tags":[],"_links":{"self":[{"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/posts\/210309"}],"collection":[{"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/comments?post=210309"}],"version-history":[{"count":1,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/posts\/210309\/revisions"}],"predecessor-version":[{"id":340149,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/posts\/210309\/revisions\/340149"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/media\/210310"}],"wp:attachment":[{"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/media?parent=210309"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/categories?post=210309"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/tags?post=210309"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}