{"id":113057,"date":"2026-06-15T12:16:52","date_gmt":"2026-06-15T19:16:52","guid":{"rendered":"https:\/\/www.backblaze.com\/blog\/?p=113057"},"modified":"2026-06-15T12:16:54","modified_gmt":"2026-06-15T19:16:54","slug":"why-ai-projects-stall-data-silos","status":"publish","type":"post","link":"https:\/\/www.backblaze.com\/blog\/why-ai-projects-stall-data-silos\/","title":{"rendered":"Why AI Projects Stall: Data Silos"},"content":{"rendered":"\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"583\" src=\"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2026\/06\/DataSilos-1024x583.png\" alt=\"Isometric illustration of isolated data cubes on a blue gradient background, representing fragmented enterprise data silos.\" class=\"wp-image-113058\" srcset=\"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2026\/06\/DataSilos-1024x583.png 1024w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2026\/06\/DataSilos-300x171.png 300w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2026\/06\/DataSilos-768x437.png 768w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2026\/06\/DataSilos.png 1440w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<div style=\"height:15px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<p>Customer records live in the database. Payment activity is safely stored in your payment processor. Call recordings and transcripts live in Zoom, Teams, Webex, or another video conferencing application\u2013or are shared to Gong for customer insights. Telemetry resides in an observability tool like Grafana or DataDog. Your own day-to-day work is in Google Drive or OneDrive. It takes hundreds of human hours to figure out what customer behavior and business continuity patterns can be extracted from all of this data.<\/p>\n\n\n\n<p>Extracting insights from your data starts with knowing what you have. The first step is centralizing it \u2014 pulling multimodal data from across your systems into a single storage repository where your engineering team and AI agents can actually access it. From there, you can assess what&#8217;s useful, what&#8217;s usable, and what still needs to be labeled or anonymized before it&#8217;s ready to work with.<\/p>\n\n\n\n<p>Assessing your data is like cleaning out the garage: first, you have to do a full inventory to know what you actually have before deciding on new data destinations and purposes.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>The hidden data silos most organizations overlook<\/strong><\/h2>\n\n\n\n<p>One of the less-discussed barriers to <a href=\"https:\/\/www.backblaze.com\/blog\/your-ai-strategy-is-only-as-strong-as-your-data-foundation\/\" target=\"_blank\" rel=\"noreferrer noopener\">AI readiness<\/a> is that many organizations lack a complete picture of their own data assets.<\/p>\n\n\n\n<p>Financial assets are documented. Physical assets are tracked. But images, audio recordings, video files, email archives, documents, logs, and customer interaction histories often sit across systems with inconsistent labeling, unclear ownership, disparate tooling, and no centralized catalog.<\/p>\n\n\n\n<p>Customer calls, support chat transcripts, QA screen captures, surveillance footage, and product images all contain operational insight that can inform <a href=\"https:\/\/www.backblaze.com\/cloud-storage\/industries\/ai-ml\" target=\"_blank\" rel=\"noreferrer noopener\">AI applications<\/a>, assuming they&#8217;re stored in a way that makes them accessible and usable. Most organizations haven&#8217;t done that inventory and don\u2019t know what data they&#8217;re sitting on.<\/p>\n\n\n\n<p>In our experience, organizations that broaden their definition of data \u2014 and build infrastructure to collect and manage it centrally \u2014 consistently find that their AI potential is larger than they initially estimated. The inverse is also true. Organizations that skip this step tend to hit the data silo problem mid-project, when data they assumed was available turns out to be fragmented, unlabeled, or simply missing.<\/p>\n\n\n\n<p>The term &#8220;<a href=\"https:\/\/www.backblaze.com\/blog\/building-multimodal-ai-data-infrastructure-with-pixeltable\/\" target=\"_blank\" rel=\"noreferrer noopener\">multimodal<\/a>&#8221; describes this in practice: datasets that span formats\u2014images, audio, video, text, and structured records\u2014within the same pipeline. Managing multimodal data at a meaningful scale requires infrastructure decisions made well before an AI project kicks off.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>Where the infrastructure question meets the strategy question<\/strong><\/h2>\n\n\n\n<p>Here&#8217;s what aligning AI strategy with data strategy actually requires:<\/p>\n\n\n\n<p>Inventory what you have. Before sourcing anything new, take stock of what exists. Support call recordings, usage footage, survey data, transaction histories\u2014these are continuously generated across most organizations and rarely treated as AI assets. A governance committee (described below) is the natural owner of this inventory.<\/p>\n\n\n\n<p>Establish governance before you deploy. Who can use which data, under what conditions, and for what purposes. When data governance is established early, teams get answers in days rather than weeks. When it&#8217;s deferred, it becomes a bottleneck mid-project.<\/p>\n\n\n\n<p><a href=\"https:\/\/www.backblaze.com\/blog\/gpus-are-only-half-the-equation\/\" target=\"_blank\" rel=\"noreferrer noopener\">Plan storage infrastructure for what you will have, not just what you have<\/a>. A storage decision made today carries a different cost profile 18 months from now. Hyperscaler egress fees that look manageable on a pilot-scale workload become structural constraints at training scale. Archive tiers that appear to reduce costs carry retrieval latencies incompatible with active AI pipelines. Modeling these costs before committing to a provider architecture prevents the predictable trade-offs: smaller datasets, shorter retention windows, fewer training cycles.<\/p>\n\n\n\n<p>Make the C-suite part of the conversation. <a href=\"https:\/\/www.ibm.com\/thought-leadership\/institute-business-value\/c-suite-study\/ceo\" target=\"_blank\" rel=\"noreferrer noopener\">IBM&#8217;s 2025 CEO Study<\/a> found that 68% of AI-first organizations have mature, well-established data and governance frameworks. When the CEO is involved in AI governance decisions, the conversation stays connected to business strategy instead of fragmenting into siloed technical decisions.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>The competitive advantage lives in the data (silos)<\/strong><\/h2>\n\n\n\n<p>Foundation models are increasingly commoditized. The leading model today will be superseded within months, and capable alternatives are widely available from multiple providers. The latest generation from any major provider is capable, widely available, and will be superseded by something better within months. What cannot be licensed, replicated, or accessed by a competitor is the proprietary data your organization has built up over years of operation: customer patterns, process histories, institutional knowledge.<\/p>\n\n\n\n<p>Getting that data foundation right is what separates AI programs that scale from those that stall.&nbsp;<\/p>\n\n\n\n<p>Organizations that align their AI strategy with their data strategy from the start make fundamentally different infrastructure decisions. They choose storage providers that support active data movement without penalizing it. They build governance structures that give the right people access without creating bottlenecks. And they treat data growth as a business opportunity, not a cost to manage.<\/p>\n\n\n\n<p>For most organizations, that shift in thinking starts with a simple question: who owns AI strategy? If the answer is &#8220;it&#8217;s fragmented across different teams,&#8221; then the second question is: what would it take to bring those conversations into one room?<\/p>\n\n\n\n<p>Everything that follows\u2014the data readiness, the governance, the infrastructure that actually works at scale\u2014flows from that first alignment.<\/p>\n\n\n\n<p>Read the Backblaze ebook, <a href=\"https:\/\/www.backblaze.com\/contact-sales\/navigating-multimodal-dataset-economics-ebook\" target=\"_blank\" rel=\"noreferrer noopener\">Navigating Multimodal Dataset Economics<\/a>, to make decisions about the AI datasets at your organization.&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Customer records live in the database. Payment activity is safely stored in your payment processor. Call recordings and transcripts live in Zoom, Teams, Webex, or another video conferencing application\u2013or are shared to Gong for customer insights. Telemetry resides in an observability tool like Grafana or DataDog. Your own day-to-day work is in Google Drive or&hellip; <a class=\"more-link\" href=\"https:\/\/www.backblaze.com\/blog\/why-ai-projects-stall-data-silos\/\">Continue reading <span class=\"screen-reader-text\">Why AI Projects Stall: Data Silos<\/span><\/a><\/p>\n","protected":false},"author":224,"featured_media":113058,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"content-type":"","footnotes":"","jetpack_post_was_ever_published":false},"categories":[7,434,1],"tags":[489,468,373],"class_list":["post-113057","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-cloud-storage","category-featured-1","category-uncategorized","tag-ai-ml","tag-b2cloud","tag-developer","entry"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.8 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Why AI Projects Stall: Data Silos<\/title>\n<meta name=\"description\" content=\"AI data silos are the #1 reason AI programs stall. Discover how to audit, govern, and align your data strategy before your next AI project.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.backblaze.com\/blog\/why-ai-projects-stall-data-silos\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Why AI Projects Stall: Data Silos\" \/>\n<meta property=\"og:description\" content=\"AI data silos are the #1 reason AI programs stall. Discover how to audit, govern, and align your data strategy before your next AI project.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.backblaze.com\/blog\/why-ai-projects-stall-data-silos\/\" \/>\n<meta property=\"og:site_name\" content=\"Backblaze Blog | Cloud Storage &amp; Cloud Backup\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/backblaze\" \/>\n<meta property=\"article:published_time\" content=\"2026-06-15T19:16:52+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2026-06-15T19:16:54+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2026\/06\/DataSilos.png\" \/>\n\t<meta property=\"og:image:width\" content=\"1440\" \/>\n\t<meta property=\"og:image:height\" content=\"820\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Maddie Presland\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@backblaze\" \/>\n<meta name=\"twitter:site\" content=\"@backblaze\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Maddie Presland\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"4 minutes\" \/>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Why AI Projects Stall: Data Silos","description":"AI data silos are the #1 reason AI programs stall. Discover how to audit, govern, and align your data strategy before your next AI project.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.backblaze.com\/blog\/why-ai-projects-stall-data-silos\/","og_locale":"en_US","og_type":"article","og_title":"Why AI Projects Stall: Data Silos","og_description":"AI data silos are the #1 reason AI programs stall. Discover how to audit, govern, and align your data strategy before your next AI project.","og_url":"https:\/\/www.backblaze.com\/blog\/why-ai-projects-stall-data-silos\/","og_site_name":"Backblaze Blog | Cloud Storage &amp; Cloud Backup","article_publisher":"https:\/\/www.facebook.com\/backblaze","article_published_time":"2026-06-15T19:16:52+00:00","article_modified_time":"2026-06-15T19:16:54+00:00","og_image":[{"width":1440,"height":820,"url":"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2026\/06\/DataSilos.png","type":"image\/png"}],"author":"Maddie Presland","twitter_card":"summary_large_image","twitter_creator":"@backblaze","twitter_site":"@backblaze","twitter_misc":{"Written by":"Maddie Presland","Est. reading time":"4 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.backblaze.com\/blog\/why-ai-projects-stall-data-silos\/#article","isPartOf":{"@id":"https:\/\/www.backblaze.com\/blog\/why-ai-projects-stall-data-silos\/"},"author":{"name":"Maddie Presland","@id":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/#\/schema\/person\/5a95887c8e781ea9cf10472e47175ce0"},"headline":"Why AI Projects Stall: Data Silos","datePublished":"2026-06-15T19:16:52+00:00","dateModified":"2026-06-15T19:16:54+00:00","mainEntityOfPage":{"@id":"https:\/\/www.backblaze.com\/blog\/why-ai-projects-stall-data-silos\/"},"wordCount":871,"commentCount":0,"publisher":{"@id":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/#organization"},"image":{"@id":"https:\/\/www.backblaze.com\/blog\/why-ai-projects-stall-data-silos\/#primaryimage"},"thumbnailUrl":"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2026\/06\/DataSilos.png","keywords":["AI\/ML","B2Cloud","Developer"],"articleSection":["Cloud Storage","Featured"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.backblaze.com\/blog\/why-ai-projects-stall-data-silos\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.backblaze.com\/blog\/why-ai-projects-stall-data-silos\/","url":"https:\/\/www.backblaze.com\/blog\/why-ai-projects-stall-data-silos\/","name":"Why AI Projects Stall: Data Silos","isPartOf":{"@id":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.backblaze.com\/blog\/why-ai-projects-stall-data-silos\/#primaryimage"},"image":{"@id":"https:\/\/www.backblaze.com\/blog\/why-ai-projects-stall-data-silos\/#primaryimage"},"thumbnailUrl":"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2026\/06\/DataSilos.png","datePublished":"2026-06-15T19:16:52+00:00","dateModified":"2026-06-15T19:16:54+00:00","description":"AI data silos are the #1 reason AI programs stall. Discover how to audit, govern, and align your data strategy before your next AI project.","breadcrumb":{"@id":"https:\/\/www.backblaze.com\/blog\/why-ai-projects-stall-data-silos\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.backblaze.com\/blog\/why-ai-projects-stall-data-silos\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.backblaze.com\/blog\/why-ai-projects-stall-data-silos\/#primaryimage","url":"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2026\/06\/DataSilos.png","contentUrl":"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2026\/06\/DataSilos.png","width":1440,"height":820,"caption":"Isometric illustration of isolated data cubes on a blue gradient background, representing fragmented enterprise data silos."},{"@type":"BreadcrumbList","@id":"https:\/\/www.backblaze.com\/blog\/why-ai-projects-stall-data-silos\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/"},{"@type":"ListItem","position":2,"name":"Why AI Projects Stall: Data Silos"}]},{"@type":"WebSite","@id":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/#website","url":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/","name":"Backblaze Cloud Solutions Blog","description":"Cloud Storage &amp; Cloud Backup","publisher":{"@id":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/#organization","name":"Backblaze","url":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/i0.wp.com\/www.backblaze.com\/blog\/wp-content\/uploads\/2017\/12\/backblaze_icon_transparent.png?fit=512%2C512&ssl=1","contentUrl":"https:\/\/i0.wp.com\/www.backblaze.com\/blog\/wp-content\/uploads\/2017\/12\/backblaze_icon_transparent.png?fit=512%2C512&ssl=1","width":512,"height":512,"caption":"Backblaze"},"image":{"@id":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/backblaze","https:\/\/x.com\/backblaze","https:\/\/www.youtube.com\/user\/Backblaze","https:\/\/en.wikipedia.org\/wiki\/Backblaze"]},{"@type":"Person","@id":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/#\/schema\/person\/5a95887c8e781ea9cf10472e47175ce0","name":"Maddie Presland","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2025\/10\/Backblaze_Author-Maddie-Presland_Square-150x150.jpg","url":"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2025\/10\/Backblaze_Author-Maddie-Presland_Square-150x150.jpg","contentUrl":"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2025\/10\/Backblaze_Author-Maddie-Presland_Square-150x150.jpg","caption":"Maddie Presland"},"description":"Maddie Presland is a Product Marketing Manager at Backblaze specializing in app storage use cases for multi-cloud architectures and AI. Maddie has more than five years of experience as a product marketer focusing on cloud infrastructure and developing technical marketing content for developers. With a background in journalism, she combines storytelling with her technical curiosity and ability to crash course just about anything. Connect with her on LinkedIn.","url":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/author\/maddiepresland\/"}]}},"jetpack_featured_media_url":"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2026\/06\/DataSilos.png","_links":{"self":[{"href":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/wp-json\/wp\/v2\/posts\/113057","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/wp-json\/wp\/v2\/users\/224"}],"replies":[{"embeddable":true,"href":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/wp-json\/wp\/v2\/comments?post=113057"}],"version-history":[{"count":0,"href":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/wp-json\/wp\/v2\/posts\/113057\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/wp-json\/wp\/v2\/media\/113058"}],"wp:attachment":[{"href":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/wp-json\/wp\/v2\/media?parent=113057"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/wp-json\/wp\/v2\/categories?post=113057"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/wp-json\/wp\/v2\/tags?post=113057"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}