{"id":234557,"date":"2024-06-20T17:06:34","date_gmt":"2024-06-20T17:06:34","guid":{"rendered":"https:\/\/michigandigitalnews.com\/index.php\/2024\/06\/20\/anthropics-newest-claude-chatbot-beats-openais-gpt-4o-in-some-benchmarks\/"},"modified":"2025-06-25T17:16:35","modified_gmt":"2025-06-25T17:16:35","slug":"anthropics-newest-claude-chatbot-beats-openais-gpt-4o-in-some-benchmarks","status":"publish","type":"post","link":"https:\/\/michigandigitalnews.com\/index.php\/2024\/06\/20\/anthropics-newest-claude-chatbot-beats-openais-gpt-4o-in-some-benchmarks\/","title":{"rendered":"Anthropic\u2019s newest Claude chatbot beats OpenAI\u2019s GPT-4o in some benchmarks"},"content":{"rendered":"<div>\n<p>Anthropic <a data-i13n=\"elm:context_link;elmt:doNotAffiliate;cpos:1;pos:1\" class=\"link \" href=\"https:\/\/www.anthropic.com\/news\/claude-3-5-sonnet\" rel=\"nofollow noopener\" target=\"_blank\" data-ylk=\"slk:rolled out;elm:context_link;elmt:doNotAffiliate;cpos:1;pos:1;itc:0;sec:content-canvas\">rolled out<\/a> its newest AI language model on Thursday, Claude 3.5 Sonnet. The updated chatbot outperforms the company\u2019s previous top-tier model, Claude 3 Opus, while working at twice the speed. Claude users (including those on free accounts) can check it out beginning today.<\/p>\n<p>Sonnet, which tends to be Anthropic\u2019s most balanced model, is the first release in the Claude 3.5 family. The company says Claude 3.5 Haiku (the fastest in each generation) and Claude 3.5 Opus (the most powerful) will arrive later this year. (Those models will stay on version 3 in the meantime.) 
The Sonnet update comes only a few months after <a data-i13n=\"cpos:2;pos:1\" href=\"https:\/\/www.engadget.com\/anthropic-says-its-new-claude-3-ai-chatbot-scores-better-on-key-benchmarks-than-gpt-4-071343736.html\" data-ylk=\"slk:the arrival of the Claude 3 family,;cpos:2;pos:1;elm:context_link;itc:0;sec:content-canvas\" class=\"link \">the arrival of the Claude 3 family,<\/a> showcasing the breakneck pace at which AI companies are releasing new models.<\/p>\n<figure class=\"caas-figure\">\n<div class=\"caas-figure-with-pb\" style=\"max-height: 826px\">\n<div>\n<div class=\"caas-img-container caas-img-loader\" style=\"padding-bottom:86%\"><img decoding=\"async\" class=\"caas-img caas-lazy has-preview\" alt=\"Chart showing benchmarks comparisons between recent AI chatbot models: Claude 3.5 Sonnet, Claude 3 Opus, GPT-4o, Gemini 1.5 Pro and Llama-400b.\" src=\"https:\/\/s.yimg.com\/ny\/api\/res\/1.2\/I5R0ABnqpPVYzKnGweF7Dw--\/YXBwaWQ9aGlnaGxhbmRlcjt3PTk2MDtoPTgyNg--\/https:\/\/s.yimg.com\/os\/creatr-uploaded-images\/2024-06\/1118e620-2f23-11ef-9987-07ade62445de\"\/><img decoding=\"async\" alt=\"Chart showing benchmarks comparisons between recent AI chatbot models: Claude 3.5 Sonnet, Claude 3 Opus, GPT-4o, Gemini 1.5 Pro and Llama-400b.\" src=\"https:\/\/s.yimg.com\/ny\/api\/res\/1.2\/I5R0ABnqpPVYzKnGweF7Dw--\/YXBwaWQ9aGlnaGxhbmRlcjt3PTk2MDtoPTgyNg--\/https:\/\/s.yimg.com\/os\/creatr-uploaded-images\/2024-06\/1118e620-2f23-11ef-9987-07ade62445de\" class=\"caas-img\"\/><\/div>\n<\/div>\n<\/div>\n<p><figcaption class=\"caption-collapse\"><span class=\"caption-credit\"> Anthropic<\/span><\/figcaption><\/p>\n<\/figure>\n<p>Anthropic claims Claude 3.5 Sonnet marks a step forward in understanding nuance, humor and complicated prompts, and it can write in a more natural tone. Anthropic\u2019s published benchmarks (above) show the new model setting industry records for graduate-level reasoning, undergraduate-level knowledge and coding proficiency. 
It beats <a data-i13n=\"cpos:3;pos:1\" href=\"https:\/\/www.engadget.com\/openai-claims-that-its-free-gpt-4o-model-can-talk-laugh-sing-and-see-like-a-human-184249780.html\" data-ylk=\"slk:OpenAI\u2019s GPT-4o;cpos:3;pos:1;elm:context_link;itc:0;sec:content-canvas\" class=\"link \">OpenAI\u2019s GPT-4o<\/a> on many of the benchmarks Anthropic published. However, the latest Claude, ChatGPT, <a data-i13n=\"cpos:4;pos:1\" href=\"https:\/\/www.engadget.com\/googles-gemini-15-pro-is-a-new-more-efficient-ai-model-181909354.html\" data-ylk=\"slk:Gemini;cpos:4;pos:1;elm:context_link;itc:0;sec:content-canvas\" class=\"link \">Gemini<\/a> and <a data-i13n=\"cpos:5;pos:1\" href=\"https:\/\/www.engadget.com\/meta-rolls-out-an-updated-ai-assistant-built-with-the-long-awaited-llama-3-160053435.html\" data-ylk=\"slk:Llama;cpos:5;pos:1;elm:context_link;itc:0;sec:content-canvas\" class=\"link \">Llama<\/a> models tend to score within a few percentage points of each other on most tests, underscoring the tight competition.<\/p>\n<p>The company claims Claude 3.5 Sonnet is also better at interpreting visual input than Claude 3 Opus. 
Anthropic says the new model can \u201caccurately transcribe text from imperfect images,\u201d a skill it hopes will attract customers in retail, logistics and financial services who need to grok data from charts, graphs and other visual cues.<\/p>\n<div class=\"caas-iframe-wrapper\" data-embed-anchor=\"048e9119-68ba-5272-9b88-00a0d11aa07e\">\n<div class=\"caas-iframe youtube\" style=\"padding-bottom:56%\" data-type=\"youtube\">\n<blockquote data-src=\"https:\/\/www.youtube.com\/embed\/rHqk0ZGb6qo\"><p><noscript><iframe title=\"Claude 3.5 Sonnet for sparking creativity\" width=\"1200\" height=\"675\" src=\"https:\/\/www.youtube.com\/embed\/rHqk0ZGb6qo?feature=oembed\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" referrerpolicy=\"strict-origin-when-cross-origin\" allowfullscreen><\/iframe><\/noscript><\/p><\/blockquote>\n<\/div>\n<\/div>\n<p>Claude\u2019s update also brings a new workspace the company calls Artifacts (above). When you prompt the chatbot to generate content like code, text documents or web designs, a dedicated window appears to the right of the chat. From there, you can prompt Claude to make changes, and it will keep the Artifacts window updated with its latest output.<\/p>\n<p>The company sees Artifacts as a first step towards making Claude a space for broader team collaboration. 
\u201cIn the near future, teams \u2014 and eventually entire organizations \u2014 will be able to securely centralize their knowledge, documents, and ongoing work in one shared space, with Claude serving as an on-demand teammate,\u201d the company wrote in a press release.<\/p>\n<p>Claude 3.5 Sonnet is available now for anyone with an account to try on <a data-i13n=\"elm:context_link;elmt:doNotAffiliate;cpos:6;pos:1\" class=\"link \" href=\"https:\/\/claude.ai\/\" rel=\"nofollow noopener\" target=\"_blank\" data-ylk=\"slk:its website;elm:context_link;elmt:doNotAffiliate;cpos:6;pos:1;itc:0;sec:content-canvas\">its website<\/a>, as well as in the <a data-i13n=\"elm:context_link;elmt:doNotAffiliate;cpos:7;pos:1\" class=\"link \" href=\"https:\/\/apps.apple.com\/us\/app\/claude-by-anthropic\/id6473753684\" rel=\"nofollow noopener\" target=\"_blank\" data-ylk=\"slk:Claude iOS app;elm:context_link;elmt:doNotAffiliate;cpos:7;pos:1;itc:0;sec:content-canvas\">Claude iOS app<\/a>. (On both of those platforms, Claude Pro and Team subscribers get higher token counts.) You can also access it through the Anthropic API, Amazon Bedrock and Google Cloud\u2019s Vertex AI. It costs $3 per million input tokens and $15 per million output tokens \u2014\u00a0the same as the previous model.<\/p>\n<\/div>\n<p><a href=\"https:\/\/www.engadget.com\/anthropics-newest-claude-chatbot-beats-openais-gpt-4o-in-some-benchmarks-170135962.html?src=rss\">Source link<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Anthropic rolled out its newest AI language model on Thursday, Claude 3.5 Sonnet. 
The updated chatbot outperforms the company\u2019s previous top-tier model, Claude 3<\/p>\n","protected":false},"author":1,"featured_media":234558,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"_uf_show_specific_survey":0,"_uf_disable_surveys":false,"footnotes":""},"categories":[159],"tags":[],"_links":{"self":[{"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/posts\/234557"}],"collection":[{"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/comments?post=234557"}],"version-history":[{"count":0,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/posts\/234557\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/media\/234558"}],"wp:attachment":[{"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/media?parent=234557"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/categories?post=234557"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/tags?post=234557"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}