{"id":10648,"date":"2024-04-25T16:19:25","date_gmt":"2024-04-25T16:19:25","guid":{"rendered":"https:\/\/wsd.com\/?p=10648"},"modified":"2024-05-02T12:45:42","modified_gmt":"2024-05-02T12:45:42","slug":"meta-releases-the-first-variants-of-its-new-llama-3-large-language-model","status":"publish","type":"post","link":"https:\/\/wsd.com\/de\/meta-releases-the-first-variants-of-its-new-llama-3-large-language-model\/","title":{"rendered":"Meta releases the first variants of its new Llama-3 large language model"},"content":{"rendered":"<div data-elementor-type=\"wp-post\" data-elementor-id=\"10648\" class=\"elementor elementor-10648\" data-elementor-post-type=\"post\">\n\t\t\t\t<div class=\"elementor-element elementor-element-2ebfe4bb e-flex e-con-boxed e-con e-parent\" data-id=\"2ebfe4bb\" data-element_type=\"container\">\n\t\t\t\t\t<div class=\"e-con-inner\">\n\t\t\t\t<div class=\"elementor-element elementor-element-77245016 elementor-widget elementor-widget-text-editor\" data-id=\"77245016\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p data-v-2ef80dd2=\"\">On Thursday, Meta released the first variants of its new Llama-3 large language model, including Llama-3 8b, which has been trained on a incredible 15 trillion tokens. After spending a full weekend with it, I&#8217;m in complete awe of what Meta has created. It&#8217;s a beast!<\/p><p data-v-2ef80dd2=\"\"><span style=\"color: var( --e-global-color-text ); font-family: inherit; font-weight: inherit; text-align: var(--text-align); letter-spacing: 0px;\">For starters, I&#8217;m able to run it on my Macbook M3 Max with 48 GB RAM. This is true of other compact LLMs as well, up to and including Mixtral 8x7b, but given that Llama-3 has &#8220;only&#8221; 8 bilion parameters, it&#8217;s lightning fast. The added bonus of being able to run it locally means I can access it anywhere at any time, without worrying about internet connectivity or throttling. Plus, my Mac shrugs it off &#8211; no fan noise, no battery drain.<\/span><\/p><p data-v-2ef80dd2=\"\"><span style=\"color: var( --e-global-color-text ); font-family: inherit; font-weight: inherit; letter-spacing: 0px; text-align: var(--text-align);\">Maybe the most interesting aspect of my weekend adventures is that I found LLama-3 to be superior to Chat GPT 4 when it comes to polishing emails and other business communications.<\/span><\/p><p data-v-2ef80dd2=\"\"><span style=\"color: var( --e-global-color-text ); font-family: inherit; font-weight: inherit; text-align: var(--text-align); letter-spacing: 0px;\">The vanilla version of Llama-3 is also the only compact LLM (and in fact the only model apart from Chat GPT 4 that I&#8217;m aware of) that can solve the logical problems I&#8217;ve presented to pretty much every LLM since October 2022:<\/span><\/p><p data-v-2ef80dd2=\"\"><span style=\"color: var( --e-global-color-text ); font-family: inherit; font-weight: inherit; text-align: var(--text-align); letter-spacing: 0px;\">1. The Sock Problem. Imagine being in a dark room with a box of black and white socks. How often do you need to reach into the box to guarantee a matching pair?<\/span><\/p><p data-v-2ef80dd2=\"\"><span style=\"color: var( --e-global-color-text ); font-family: inherit; font-weight: inherit; text-align: var(--text-align); letter-spacing: 0px;\">2. The Ball-and-Van Problem. A ball rolls onto a street, followed by a van approaching with a loud bang. After the van passes, there&#8217;s a sheet of plastic on the street. What happened?<\/span><\/p><p data-v-2ef80dd2=\"\"><span style=\"color: var( --e-global-color-text ); font-family: inherit; font-weight: inherit; text-align: var(--text-align); letter-spacing: 0px;\">The first problem tests reasoning, while the second tests real-world understanding, and the vanilla version of Llama-3 8b accurately solves both problems.<\/span><\/p><p data-v-2ef80dd2=\"\"><span style=\"color: var( --e-global-color-text ); font-family: inherit; font-weight: inherit; text-align: var(--text-align); letter-spacing: 0px;\">The instruct version doesn&#8217;t do as well on this particular task, but that&#8217;s just a case of horses for courses. What the instruct version does excel at is following instructions, which is, of course, key for companies like ours, which develop software that relies on LLMs for natural language understanding. I look forward to seeing these improvements getting incorporated into Melody, Mike, and our various other generative AI-based solutions.<\/span><\/p><p data-v-2ef80dd2=\"\"><span style=\"color: var( --e-global-color-text ); font-family: inherit; font-weight: inherit; text-align: var(--text-align); letter-spacing: 0px;\">What about other Llama-3 variants? I haven&#8217;t tested Llama-3 70b yet, but I hear good things about it. And Llama-3 400b appears to perform at levels comparable to Chat GPT 4. Further variants are expected to come out in the coming weeks and months, including versions with larger context sizes &#8211; the 8k context window of Llama-3 8b is the only disappointment in an otherwise hugely impressive release (Meta has indicated that it&#8217;s working on an update).<\/span><\/p><p data-v-2ef80dd2=\"\"><span style=\"color: var( --e-global-color-text ); font-family: inherit; font-weight: inherit; text-align: var(--text-align); letter-spacing: 0px;\">By the way: We&#8217;ve organized a few generative AI events, some in partnership with NVIDIA. At these events, we talk about AI generally, answer questions about what&#8217;s possible given the current state of the art, and give product demos using real-world applications. We&#8217;re currently in the process of planning further events in London, New York, and other cities. If you or your organization would like to receive an invitation, please let me know.<\/span><\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>","protected":false},"excerpt":{"rendered":"<p>On Thursday, Meta released the first variants of its new Llama-3 large language model, including Llama-3 8b, which has been trained on a incredible 15 trillion tokens.  <\/p>","protected":false},"author":4,"featured_media":10743,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[2],"tags":[],"class_list":["post-10648","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-insights"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v26.0 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Meta releases the first variants of its new Llama-3 large language model - WSD<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/wsd.com\/de\/meta-releases-the-first-variants-of-its-new-llama-3-large-language-model\/\" \/>\n<meta property=\"og:locale\" content=\"de_DE\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Meta releases the first variants of its new Llama-3 large language model - WSD\" \/>\n<meta property=\"og:description\" content=\"On Thursday, Meta released the first variants of its new Llama-3 large language model, including Llama-3 8b, which has been trained on a incredible 15 trillion tokens.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/wsd.com\/de\/meta-releases-the-first-variants-of-its-new-llama-3-large-language-model\/\" \/>\n<meta property=\"og:site_name\" content=\"WSD\" \/>\n<meta property=\"article:published_time\" content=\"2024-04-25T16:19:25+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-05-02T12:45:42+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/wsd.com\/wp-content\/uploads\/2024\/04\/WSD-Website-Latest-Meta-Llama.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"2560\" \/>\n\t<meta property=\"og:image:height\" content=\"640\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Mathias Strasser\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Verfasst von\" \/>\n\t<meta name=\"twitter:data1\" content=\"Mathias Strasser\" \/>\n\t<meta name=\"twitter:label2\" content=\"Gesch\u00e4tzte Lesezeit\" \/>\n\t<meta name=\"twitter:data2\" content=\"3\u00a0Minuten\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/wsd.com\/meta-releases-the-first-variants-of-its-new-llama-3-large-language-model\/\",\"url\":\"https:\/\/wsd.com\/meta-releases-the-first-variants-of-its-new-llama-3-large-language-model\/\",\"name\":\"Meta releases the first variants of its new Llama-3 large language model - WSD\",\"isPartOf\":{\"@id\":\"https:\/\/fxt.was.mybluehost.me\/website_9080a0c9\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/wsd.com\/meta-releases-the-first-variants-of-its-new-llama-3-large-language-model\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/wsd.com\/meta-releases-the-first-variants-of-its-new-llama-3-large-language-model\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/wsd.com\/wp-content\/uploads\/2024\/04\/WSD-Website-Latest-Meta-Llama.jpg\",\"datePublished\":\"2024-04-25T16:19:25+00:00\",\"dateModified\":\"2024-05-02T12:45:42+00:00\",\"author\":{\"@id\":\"https:\/\/fxt.was.mybluehost.me\/website_9080a0c9\/#\/schema\/person\/e0317c0be744d58ce78abebf3636b99c\"},\"breadcrumb\":{\"@id\":\"https:\/\/wsd.com\/meta-releases-the-first-variants-of-its-new-llama-3-large-language-model\/#breadcrumb\"},\"inLanguage\":\"de\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/wsd.com\/meta-releases-the-first-variants-of-its-new-llama-3-large-language-model\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"de\",\"@id\":\"https:\/\/wsd.com\/meta-releases-the-first-variants-of-its-new-llama-3-large-language-model\/#primaryimage\",\"url\":\"https:\/\/wsd.com\/wp-content\/uploads\/2024\/04\/WSD-Website-Latest-Meta-Llama.jpg\",\"contentUrl\":\"https:\/\/wsd.com\/wp-content\/uploads\/2024\/04\/WSD-Website-Latest-Meta-Llama.jpg\",\"width\":2560,\"height\":640},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/wsd.com\/meta-releases-the-first-variants-of-its-new-llama-3-large-language-model\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/wsd.com\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Meta releases the first variants of its new Llama-3 large language model\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/fxt.was.mybluehost.me\/website_9080a0c9\/#website\",\"url\":\"https:\/\/fxt.was.mybluehost.me\/website_9080a0c9\/\",\"name\":\"WSD\",\"description\":\"\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/fxt.was.mybluehost.me\/website_9080a0c9\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"de\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/fxt.was.mybluehost.me\/website_9080a0c9\/#\/schema\/person\/e0317c0be744d58ce78abebf3636b99c\",\"name\":\"Mathias Strasser\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"de\",\"@id\":\"https:\/\/fxt.was.mybluehost.me\/website_9080a0c9\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/wsd.com\/wp-content\/uploads\/2024\/05\/cropped-cropped-wsd_london_2807-scaled-1-96x96.jpg\",\"contentUrl\":\"https:\/\/wsd.com\/wp-content\/uploads\/2024\/05\/cropped-cropped-wsd_london_2807-scaled-1-96x96.jpg\",\"caption\":\"Mathias Strasser\"},\"url\":\"https:\/\/wsd.com\/de\/author\/mathias-strasser\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Meta releases the first variants of its new Llama-3 large language model - WSD","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/wsd.com\/de\/meta-releases-the-first-variants-of-its-new-llama-3-large-language-model\/","og_locale":"de_DE","og_type":"article","og_title":"Meta releases the first variants of its new Llama-3 large language model - WSD","og_description":"On Thursday, Meta released the first variants of its new Llama-3 large language model, including Llama-3 8b, which has been trained on a incredible 15 trillion tokens.","og_url":"https:\/\/wsd.com\/de\/meta-releases-the-first-variants-of-its-new-llama-3-large-language-model\/","og_site_name":"WSD","article_published_time":"2024-04-25T16:19:25+00:00","article_modified_time":"2024-05-02T12:45:42+00:00","og_image":[{"width":2560,"height":640,"url":"https:\/\/wsd.com\/wp-content\/uploads\/2024\/04\/WSD-Website-Latest-Meta-Llama.jpg","type":"image\/jpeg"}],"author":"Mathias Strasser","twitter_card":"summary_large_image","twitter_misc":{"Verfasst von":"Mathias Strasser","Gesch\u00e4tzte Lesezeit":"3\u00a0Minuten"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/wsd.com\/meta-releases-the-first-variants-of-its-new-llama-3-large-language-model\/","url":"https:\/\/wsd.com\/meta-releases-the-first-variants-of-its-new-llama-3-large-language-model\/","name":"Meta releases the first variants of its new Llama-3 large language model - WSD","isPartOf":{"@id":"https:\/\/fxt.was.mybluehost.me\/website_9080a0c9\/#website"},"primaryImageOfPage":{"@id":"https:\/\/wsd.com\/meta-releases-the-first-variants-of-its-new-llama-3-large-language-model\/#primaryimage"},"image":{"@id":"https:\/\/wsd.com\/meta-releases-the-first-variants-of-its-new-llama-3-large-language-model\/#primaryimage"},"thumbnailUrl":"https:\/\/wsd.com\/wp-content\/uploads\/2024\/04\/WSD-Website-Latest-Meta-Llama.jpg","datePublished":"2024-04-25T16:19:25+00:00","dateModified":"2024-05-02T12:45:42+00:00","author":{"@id":"https:\/\/fxt.was.mybluehost.me\/website_9080a0c9\/#\/schema\/person\/e0317c0be744d58ce78abebf3636b99c"},"breadcrumb":{"@id":"https:\/\/wsd.com\/meta-releases-the-first-variants-of-its-new-llama-3-large-language-model\/#breadcrumb"},"inLanguage":"de","potentialAction":[{"@type":"ReadAction","target":["https:\/\/wsd.com\/meta-releases-the-first-variants-of-its-new-llama-3-large-language-model\/"]}]},{"@type":"ImageObject","inLanguage":"de","@id":"https:\/\/wsd.com\/meta-releases-the-first-variants-of-its-new-llama-3-large-language-model\/#primaryimage","url":"https:\/\/wsd.com\/wp-content\/uploads\/2024\/04\/WSD-Website-Latest-Meta-Llama.jpg","contentUrl":"https:\/\/wsd.com\/wp-content\/uploads\/2024\/04\/WSD-Website-Latest-Meta-Llama.jpg","width":2560,"height":640},{"@type":"BreadcrumbList","@id":"https:\/\/wsd.com\/meta-releases-the-first-variants-of-its-new-llama-3-large-language-model\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/wsd.com\/"},{"@type":"ListItem","position":2,"name":"Meta releases the first variants of its new Llama-3 large language model"}]},{"@type":"WebSite","@id":"https:\/\/fxt.was.mybluehost.me\/website_9080a0c9\/#website","url":"https:\/\/fxt.was.mybluehost.me\/website_9080a0c9\/","name":"WSD","description":"","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/fxt.was.mybluehost.me\/website_9080a0c9\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"de"},{"@type":"Person","@id":"https:\/\/fxt.was.mybluehost.me\/website_9080a0c9\/#\/schema\/person\/e0317c0be744d58ce78abebf3636b99c","name":"Mathias Strasser","image":{"@type":"ImageObject","inLanguage":"de","@id":"https:\/\/fxt.was.mybluehost.me\/website_9080a0c9\/#\/schema\/person\/image\/","url":"https:\/\/wsd.com\/wp-content\/uploads\/2024\/05\/cropped-cropped-wsd_london_2807-scaled-1-96x96.jpg","contentUrl":"https:\/\/wsd.com\/wp-content\/uploads\/2024\/05\/cropped-cropped-wsd_london_2807-scaled-1-96x96.jpg","caption":"Mathias Strasser"},"url":"https:\/\/wsd.com\/de\/author\/mathias-strasser\/"}]}},"jetpack_featured_media_url":"https:\/\/wsd.com\/wp-content\/uploads\/2024\/04\/WSD-Website-Latest-Meta-Llama.jpg","jetpack_sharing_enabled":true,"rttpg_featured_image_url":{"full":["https:\/\/wsd.com\/wp-content\/uploads\/2024\/04\/WSD-Website-Latest-Meta-Llama.jpg",2560,640,false],"landscape":["https:\/\/wsd.com\/wp-content\/uploads\/2024\/04\/WSD-Website-Latest-Meta-Llama.jpg",2560,640,false],"portraits":["https:\/\/wsd.com\/wp-content\/uploads\/2024\/04\/WSD-Website-Latest-Meta-Llama.jpg",2560,640,false],"thumbnail":["https:\/\/wsd.com\/wp-content\/uploads\/2024\/04\/WSD-Website-Latest-Meta-Llama-150x150.jpg",150,150,true],"medium":["https:\/\/wsd.com\/wp-content\/uploads\/2024\/04\/WSD-Website-Latest-Meta-Llama-300x75.jpg",300,75,true],"large":["https:\/\/wsd.com\/wp-content\/uploads\/2024\/04\/WSD-Website-Latest-Meta-Llama-1024x256.jpg",640,160,true],"1536x1536":["https:\/\/wsd.com\/wp-content\/uploads\/2024\/04\/WSD-Website-Latest-Meta-Llama-1536x384.jpg",1536,384,true],"2048x2048":["https:\/\/wsd.com\/wp-content\/uploads\/2024\/04\/WSD-Website-Latest-Meta-Llama-2048x512.jpg",2048,512,true],"trp-custom-language-flag":["https:\/\/wsd.com\/wp-content\/uploads\/2024\/04\/WSD-Website-Latest-Meta-Llama-18x5.jpg",18,5,true],"consultio-large":["https:\/\/wsd.com\/wp-content\/uploads\/2024\/04\/WSD-Website-Latest-Meta-Llama-900x313.jpg",900,313,true],"consultio-medium":["https:\/\/wsd.com\/wp-content\/uploads\/2024\/04\/WSD-Website-Latest-Meta-Llama-600x450.jpg",600,450,true]},"rttpg_author":{"display_name":"Mathias Strasser","author_link":"https:\/\/wsd.com\/de\/author\/mathias-strasser\/"},"rttpg_comment":0,"rttpg_category":"<a href=\"https:\/\/wsd.com\/de\/category\/insights\/\" rel=\"category tag\">Insights<\/a>","rttpg_excerpt":"On Thursday, Meta released the first variants of its new Llama-3 large language model, including Llama-3 8b, which has been trained on a incredible 15 trillion tokens.","_links":{"self":[{"href":"https:\/\/wsd.com\/de\/wp-json\/wp\/v2\/posts\/10648","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/wsd.com\/de\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/wsd.com\/de\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/wsd.com\/de\/wp-json\/wp\/v2\/users\/4"}],"replies":[{"embeddable":true,"href":"https:\/\/wsd.com\/de\/wp-json\/wp\/v2\/comments?post=10648"}],"version-history":[{"count":5,"href":"https:\/\/wsd.com\/de\/wp-json\/wp\/v2\/posts\/10648\/revisions"}],"predecessor-version":[{"id":10744,"href":"https:\/\/wsd.com\/de\/wp-json\/wp\/v2\/posts\/10648\/revisions\/10744"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/wsd.com\/de\/wp-json\/wp\/v2\/media\/10743"}],"wp:attachment":[{"href":"https:\/\/wsd.com\/de\/wp-json\/wp\/v2\/media?parent=10648"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/wsd.com\/de\/wp-json\/wp\/v2\/categories?post=10648"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/wsd.com\/de\/wp-json\/wp\/v2\/tags?post=10648"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}