{"id":111714,"date":"2024-11-04T08:17:04","date_gmt":"2024-11-04T16:17:04","guid":{"rendered":"https:\/\/www.backblaze.com\/blog\/?p=111714"},"modified":"2024-11-04T13:58:10","modified_gmt":"2024-11-04T21:58:10","slug":"solving-the-ai-training-data-challenge-with-decart-ai-and-backblaze","status":"publish","type":"post","link":"https:\/\/www.backblaze.com\/blog\/solving-the-ai-training-data-challenge-with-decart-ai-and-backblaze\/","title":{"rendered":"Solving the AI Training Data Challenge with Decart AI and Backblaze"},"content":{"rendered":"\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1440\" height=\"820\" src=\"https:\/\/www.backblaze.com\/blog\/wp-content\/uploads\/2024\/11\/bb-bh_Decartes-Backblaze-e1730505101240.png\" alt=\"A decorative image showing the logos of Backblaze and Decart. \" class=\"wp-image-111715\"\/><\/figure>\n\n\n\n<div style=\"height:15px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<p>Depending on which LLM you ask, we live in a world with somewhere between 25k and 80k AI startups. It\u2019s a growing, highly competitive market where small startups with a big idea can find themselves toe-to-toe with the goliaths of tech\u2014fighting for money, chips, talent, even raw electrical power.&nbsp;<\/p>\n\n\n\n<p>How does any company differentiate themselves in an explosive burst of technological change, one that requires a lot of investment in talent and infrastructure, where even the richest tech platforms on the planet don\u2019t always succeed? 
Today we\u2019re sharing the story of <a href=\"https:\/\/www.decart.ai\/index.html\">Decart<\/a>\u2014an AI startup that used <a href=\"https:\/\/www.backblaze.com\/cloud-storage\">Backblaze B2 Cloud Storage<\/a> to support the successful launch of an impressive new model that provides an order-of-magnitude improvement in both training and inference for the largest generative models.<\/p>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p>Backblaze is an amazing solution for AI training data. We looked at a number of options and Backblaze is seriously the best.<\/p>\n<cite>\u2014Dean Leitersdorf, Co-Founder and CEO, Decart<\/cite><\/blockquote>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>First, the news<\/strong><\/h2>\n\n\n\n<p>Decart is an AI research lab that <a href=\"https:\/\/www.decart.ai\/articles\/oasis-interactive-ai-video-game-model\">came out of stealth<\/a> on October 31 with an incredible new model:<\/p>\n\n\n\n<figure class=\"wp-block-embed aligncenter is-type-rich is-provider-twitter wp-block-embed-twitter\"><div class=\"wp-block-embed__wrapper\">\n<blockquote class=\"twitter-tweet\" data-width=\"550\" data-dnt=\"true\"><p lang=\"en\" dir=\"ltr\">1\/ We are excited to introduce Oasis, the world&#39;s first real-time AI world model, developed in collaboration with <a href=\"https:\/\/twitter.com\/Etched?ref_src=twsrc%5Etfw\">@Etched<\/a>. 
Imagine a video game entirely generated by AI, or a video you can interact with\u2014constantly rendered at 20 fps, in real-time, with zero latency <a href=\"https:\/\/t.co\/WAJFRyfTzS\">pic.twitter.com\/WAJFRyfTzS<\/a><\/p>&mdash; Decart (@DecartAI) <a href=\"https:\/\/twitter.com\/DecartAI\/status\/1852091173420294291?ref_src=twsrc%5Etfw\">October 31, 2024<\/a><\/blockquote><script async src=\"https:\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script>\n<\/div><\/figure>\n\n\n\n<div style=\"height:15px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<p>While this might look like Minecraft, every pixel you see here and all of the gameplay is being generated by Decart\u2019s Oasis model. It\u2019s like Minecraft in every way you\u2019d expect, except that the entire experience is being generated by AI and you can creatively prompt the model to build beyond the confines of the game. The mindblowing part? Decart says Oasis can perform more than 10 times more efficiently than competitors such as OpenAI\u2019s Sora, which hasn\u2019t been publicly released.<\/p>\n\n\n\n<p>Don\u2019t let the game distract you though\u2014the Minecraft simulation is just an expression of the power of their model. According to the Decart team, this isn&#8217;t even version 1.0 of what their approach is capable of generating\u2014more like version 0.01. Given the <a href=\"https:\/\/www.theinformation.com\/articles\/why-sequoias-shaun-maguire-is-betting-on-a-year-old-ai-video-startup?rc=daakde\">broad coverage<\/a> they\u2019ve already received for their launch, we\u2019re excited to see what\u2019s next.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>How to break out in the AI market<\/strong><\/h2>\n\n\n\n<p>For Decart, the strategy to pull ahead of the crowd was simple: Disrupt the market on inference speed to deliver game changing models, and do that by building the most high-performance multi-cloud model training infrastructure possible. 
Then, iterate on that innovation.<\/p>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p>We crafted state of the art infrastructure that allows us to train models that other people simply can&#8217;t train.<\/p>\n<cite>\u2014Dean Leitersdorf, Co-Founder and CEO, Decart<\/cite><\/blockquote>\n\n\n\n<p>Before we met Dean and the team at Decart, most of the hard work was done: the multi-cloud AI stack for training was dialed in and the models were being put through their paces. They just had one simple, but big, problem holding them back:<\/p>\n\n\n\n<p><strong>The price and the logistics of moving and storing training data were going to limit their growth.<\/strong><br \/><br \/>They were burning through free data storage credits from a traditional cloud provider and had data spread across a range of other cloud providers and GPU clusters. Their training data needed to scale from hundreds of <strong>thousands<\/strong> of hours of video data to hundreds of <strong>millions<\/strong> of hours, and they needed a storage solution that could handle that scale in three key areas:<\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li><strong>Reliably high performance:<\/strong> Decart needed to know that when they got time on a cluster, they could move data in as fast as possible, the moment that time became available.<\/li>\n\n\n\n<li><strong>GPU interoperability:<\/strong> They needed to be sure that whatever storage platform they chose would work well with a multi-cluster training approach. 
Being able to shop jobs between different GPU clouds and distribute training was essential for Dean\u2019s team.<\/li>\n\n\n\n<li><strong>Efficiency:<\/strong> Every dollar an AI startup spends on anything other than training time is a competitive disadvantage, so ensuring that storage costs were low without any surprise fees for data retention or download was key.<\/li>\n<\/ol>\n\n\n\n<p>Decart discovered Backblaze while researching storage alternatives. After a quick call and two fast months of testing Backblaze in a wide variety of usage patterns, it was clear to the team that they had found the storage foundation they needed.<\/p>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p>We chose Backblaze because everything works. It&#8217;s super stable, and we had zero problems. That&#8217;s number one.<\/p>\n<cite>\u2014Dean Leitersdorf, Co-Founder and CEO, Decart<\/cite><\/blockquote>\n\n\n\n<p>When it came time to start moving data from Backblaze to GPU clusters, they had no problem transferring petabyte-scale datasets. The only minor challenge was ensuring that the compute provider\u2019s pipe could take the volume of data streaming in.<\/p>\n\n\n\n<p>Here\u2019s where things ended up working for Decart:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Performance: <\/strong>They were blown away by the performance they achieved with Backblaze (more to come on that later).<\/li>\n\n\n\n<li><strong>Price: <\/strong>With pricing at one-fifth the cost of traditional cloud providers, Backblaze unlocked a significant amount of budget.<\/li>\n\n\n\n<li><strong>Free egress: <\/strong>The true game changer. Decart, for a number of reasons, trains their models on multiple different GPU clusters at the same time. 
With Backblaze, they can egress their full dataset to up to three training sites every month at no additional cost.<\/li>\n<\/ul>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p>B2 Cloud Storage was literally the only technical thing we used in training these models that didn&#8217;t crash the first time we tried it. We&#8217;re in an industry where everything fails, but Backblaze didn\u2019t. <\/p>\n<cite>\u2014Dean Leitersdorf, Co-Founder and CEO, Decart<\/cite><\/blockquote>\n\n\n\n<h2 class=\"wp-block-heading\">Looking forward<\/h2>\n\n\n\n<p>With performance, flexibility, and affordability squared away in their data storage approach, the Decart team is now in a position to move on from this impressive first model and build whatever comes next. With the fundamentals working at the level Backblaze consistently provides and Decart is happy with, the two teams are now working together to find further efficiencies and optimizations and to stand up the best possible infrastructure for training AI models.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>How Decart launched a video-generating AI model using Backblaze B2 Cloud Storage. 
<\/p>\n","protected":false},"author":182,"featured_media":111715,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"content-type":"","footnotes":""},"categories":[111,7,438,18,483],"tags":[468],"class_list":["post-111714","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-entrepreneurship","category-cloud-storage","category-featured-cloud-storage","category-startup-life","category-tech-lab","tag-b2cloud","entry"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.5 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Solving the AI Training Data Challenge with Decart AI and Backblaze<\/title>\n<meta name=\"description\" content=\"Decart launched a video-generating AI model using Backblaze B2 Cloud Storage. See how they did it and preview the model&#039;s capabilities.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.backblaze.com\/blog\/solving-the-ai-training-data-challenge-with-decart-ai-and-backblaze\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Solving the AI Training Data Challenge with Decart AI and Backblaze\" \/>\n<meta property=\"og:description\" content=\"Decart launched a video-generating AI model using Backblaze B2 Cloud Storage. 
See how they did it and preview the model&#039;s capabilities.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.backblaze.com\/blog\/solving-the-ai-training-data-challenge-with-decart-ai-and-backblaze\/\" \/>\n<meta property=\"og:site_name\" content=\"Backblaze Blog | Cloud Storage &amp; Cloud Backup\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/backblaze\" \/>\n<meta property=\"article:published_time\" content=\"2024-11-04T16:17:04+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-11-04T21:58:10+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.backblaze.com\/blog\/wp-content\/uploads\/2024\/11\/bb-bh_Decartes-Backblaze-e1730505101240.png\" \/>\n\t<meta property=\"og:image:width\" content=\"1440\" \/>\n\t<meta property=\"og:image:height\" content=\"820\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Stephanie Doyle\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@backblaze\" \/>\n<meta name=\"twitter:site\" content=\"@backblaze\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Stephanie Doyle\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"5 minutes\" \/>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Solving the AI Training Data Challenge with Decart AI and Backblaze","description":"Decart launched a video-generating AI model using Backblaze B2 Cloud Storage. 
See how they did it and preview the model's capabilities.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.backblaze.com\/blog\/solving-the-ai-training-data-challenge-with-decart-ai-and-backblaze\/","og_locale":"en_US","og_type":"article","og_title":"Solving the AI Training Data Challenge with Decart AI and Backblaze","og_description":"Decart launched a video-generating AI model using Backblaze B2 Cloud Storage. See how they did it and preview the model's capabilities.","og_url":"https:\/\/www.backblaze.com\/blog\/solving-the-ai-training-data-challenge-with-decart-ai-and-backblaze\/","og_site_name":"Backblaze Blog | Cloud Storage &amp; Cloud Backup","article_publisher":"https:\/\/www.facebook.com\/backblaze","article_published_time":"2024-11-04T16:17:04+00:00","article_modified_time":"2024-11-04T21:58:10+00:00","og_image":[{"width":1440,"height":820,"url":"https:\/\/www.backblaze.com\/blog\/wp-content\/uploads\/2024\/11\/bb-bh_Decartes-Backblaze-e1730505101240.png","type":"image\/png"}],"author":"Stephanie Doyle","twitter_card":"summary_large_image","twitter_creator":"@backblaze","twitter_site":"@backblaze","twitter_misc":{"Written by":"Stephanie Doyle","Est. 
reading time":"5 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.backblaze.com\/blog\/solving-the-ai-training-data-challenge-with-decart-ai-and-backblaze\/#article","isPartOf":{"@id":"https:\/\/www.backblaze.com\/blog\/solving-the-ai-training-data-challenge-with-decart-ai-and-backblaze\/"},"author":{"name":"Stephanie Doyle","@id":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/#\/schema\/person\/688f3962fd24d8155ef726bc94d75058"},"headline":"Solving the AI Training Data Challenge with Decart AI and Backblaze","datePublished":"2024-11-04T16:17:04+00:00","dateModified":"2024-11-04T21:58:10+00:00","mainEntityOfPage":{"@id":"https:\/\/www.backblaze.com\/blog\/solving-the-ai-training-data-challenge-with-decart-ai-and-backblaze\/"},"wordCount":969,"commentCount":0,"publisher":{"@id":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/#organization"},"image":{"@id":"https:\/\/www.backblaze.com\/blog\/solving-the-ai-training-data-challenge-with-decart-ai-and-backblaze\/#primaryimage"},"thumbnailUrl":"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2024\/11\/bb-bh_Decartes-Backblaze-e1730505101240.png","keywords":["B2Cloud"],"articleSection":["Business Lab","Cloud Storage","Featured-Cloud Storage","Startup Life","Tech Lab"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.backblaze.com\/blog\/solving-the-ai-training-data-challenge-with-decart-ai-and-backblaze\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.backblaze.com\/blog\/solving-the-ai-training-data-challenge-with-decart-ai-and-backblaze\/","url":"https:\/\/www.backblaze.com\/blog\/solving-the-ai-training-data-challenge-with-decart-ai-and-backblaze\/","name":"Solving the AI Training Data Challenge with Decart AI and 
Backblaze","isPartOf":{"@id":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.backblaze.com\/blog\/solving-the-ai-training-data-challenge-with-decart-ai-and-backblaze\/#primaryimage"},"image":{"@id":"https:\/\/www.backblaze.com\/blog\/solving-the-ai-training-data-challenge-with-decart-ai-and-backblaze\/#primaryimage"},"thumbnailUrl":"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2024\/11\/bb-bh_Decartes-Backblaze-e1730505101240.png","datePublished":"2024-11-04T16:17:04+00:00","dateModified":"2024-11-04T21:58:10+00:00","description":"Decart launched a video-generating AI model using Backblaze B2 Cloud Storage. See how they did it and preview the model's capabilities.","breadcrumb":{"@id":"https:\/\/www.backblaze.com\/blog\/solving-the-ai-training-data-challenge-with-decart-ai-and-backblaze\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.backblaze.com\/blog\/solving-the-ai-training-data-challenge-with-decart-ai-and-backblaze\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.backblaze.com\/blog\/solving-the-ai-training-data-challenge-with-decart-ai-and-backblaze\/#primaryimage","url":"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2024\/11\/bb-bh_Decartes-Backblaze-e1730505101240.png","contentUrl":"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2024\/11\/bb-bh_Decartes-Backblaze-e1730505101240.png","width":1440,"height":820,"caption":"A decorative image showing the logos of Backblaze and Decart."},{"@type":"BreadcrumbList","@id":"https:\/\/www.backblaze.com\/blog\/solving-the-ai-training-data-challenge-with-decart-ai-and-backblaze\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/"},{"@type":"ListItem","position":2,"name":"Solving the AI Training Data Challenge with Decart AI and 
Backblaze"}]},{"@type":"WebSite","@id":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/#website","url":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/","name":"Backblaze Cloud Solutions Blog","description":"Cloud Storage &amp; Cloud Backup","publisher":{"@id":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/#organization","name":"Backblaze","url":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/i0.wp.com\/www.backblaze.com\/blog\/wp-content\/uploads\/2017\/12\/backblaze_icon_transparent.png?fit=512%2C512&ssl=1","contentUrl":"https:\/\/i0.wp.com\/www.backblaze.com\/blog\/wp-content\/uploads\/2017\/12\/backblaze_icon_transparent.png?fit=512%2C512&ssl=1","width":512,"height":512,"caption":"Backblaze"},"image":{"@id":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/backblaze","https:\/\/x.com\/backblaze","https:\/\/www.youtube.com\/user\/Backblaze","https:\/\/en.wikipedia.org\/wiki\/Backblaze"]},{"@type":"Person","@id":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/#\/schema\/person\/688f3962fd24d8155ef726bc94d75058","name":"Stephanie 
Doyle","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2022\/12\/headshot-4-1-e1670452405672-150x150.jpg","url":"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2022\/12\/headshot-4-1-e1670452405672-150x150.jpg","contentUrl":"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2022\/12\/headshot-4-1-e1670452405672-150x150.jpg","caption":"Stephanie Doyle"},"description":"Stephanie is the Technical Narrative Content Manager at Backblaze. She specializes in taking complex topics and writing relatable, engaging, and user-friendly content. You can most often find her reading in public places, and can connect with her on LinkedIn.","url":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/author\/stephanie\/"}]}},"jetpack_featured_media_url":"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2024\/11\/bb-bh_Decartes-Backblaze-e1730505101240.png","_links":{"self":[{"href":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/wp-json\/wp\/v2\/posts\/111714","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/wp-json\/wp\/v2\/users\/182"}],"replies":[{"embeddable":true,"href":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/wp-json\/wp\/v2\/comments?post=111714"}],"version-history":[{"count":0,"href":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/wp-json\/wp\/v2\/posts\/111714\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/wp-json\/wp\/v2\/media\/111715"}],"wp:attachment":[{"href":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/wp-json\/wp\/v2\/media?parent=111714"}],"wp:term":[{"taxonomy":"category","embeddable"
:true,"href":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/wp-json\/wp\/v2\/categories?post=111714"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/wp-json\/wp\/v2\/tags?post=111714"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}