{"id":111089,"date":"2024-04-11T09:36:57","date_gmt":"2024-04-11T16:36:57","guid":{"rendered":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/?p=111089"},"modified":"2024-08-14T10:41:25","modified_gmt":"2024-08-14T17:41:25","slug":"ai-101-what-is-model-serving","status":"publish","type":"post","link":"https:\/\/www.backblaze.com\/blog\/ai-101-what-is-model-serving\/","title":{"rendered":"AI 101: What Is Model Serving?"},"content":{"rendered":"<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1440\" height=\"820\" src=\"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2024\/04\/bb-bh-What-Is-Model-Serving_Design-A-1.png\" alt=\"A decorative image showing a computer, a cloud, and a building. \" class=\"wp-image-111090\" srcset=\"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2024\/04\/bb-bh-What-Is-Model-Serving_Design-A-1.png 1440w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2024\/04\/bb-bh-What-Is-Model-Serving_Design-A-1-300x171.png 300w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2024\/04\/bb-bh-What-Is-Model-Serving_Design-A-1-1024x583.png 1024w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2024\/04\/bb-bh-What-Is-Model-Serving_Design-A-1-768x437.png 768w\" sizes=\"auto, (max-width: 1440px) 100vw, 1440px\" \/><\/figure>\n<\/div>\n\n\n<div style=\"height:15px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<p class=\"wp-block-paragraph\">If you read a blog article that starts with \u201cIn today&#8217;s fast-paced business landscape\u2026\u201d you can be 99% sure that content is AI generated. 
While large language models (LLMs) like ChatGPT, Gemini, and Claude may be the shiniest of AI applications from a consumer standpoint, they still have <a href=\"https:\/\/www.searchenginejournal.com\/humans-vs-machines-ad-copy-content-test-data-study\/509942\" target=\"_blank\" rel=\"noreferrer noopener\">a ways to go from a creativity standpoint<\/a>.\u00a0<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">That said, there are exciting possibilities for artificial intelligence and machine learning (AI\/ML) algorithms to improve and create products now and in the future, many of which focus on replicated operations, split-second database predictions, <a href=\"https:\/\/www.brighttalk.com\/webcast\/14807\/611588\" target=\"_blank\" rel=\"noreferrer noopener\">natural language processing<\/a>, <a href=\"https:\/\/www.backblaze.com\/cloud-storage\/case-studies\/urlscan-io\" target=\"_blank\" rel=\"noreferrer noopener\">threat analysis<\/a>, and more. As you might imagine, deploying those algorithms comes with its own set of complexities.\u00a0<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">To address those complexities, specialized operations platforms have sprung up\u2014specifically, AI\/ML model serving platforms. 
Let\u2019s talk about AI\/ML model serving and how it fits into \u201ctoday\u2019s fast-paced business landscape.\u201d (Don\u2019t worry\u2014we wrote that one.)<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">What Is AI\/ML Model Serving?<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">AI\/ML model serving refers to the process of deploying machine learning models into production environments where they can be used to make predictions or perform tasks based on real-time or batch input data.&nbsp;<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Trained machine learning models are made accessible via APIs or other interfaces, allowing external applications or systems to send real-world data to the models for <a href=\"https:\/\/www.backblaze.com\/blog\/ai-101-training-vs-inference\/\" target=\"_blank\" rel=\"noreferrer noopener\">inference<\/a>. The served models process the incoming data and return predictions, classifications, or other outputs based on the learned patterns encoded in the model parameters.\u00a0<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Practically, you can compare building an application that uses an AI\/ML algorithm to a car engine. The whole application (the engine) is built to solve a problem; in this case \u201ctransport me faster than walking.\u201d There are various subtasks to help you solve that problem well. Let\u2019s take the exhaust system as an example. The exhaust fundamentally does the same thing from car to car\u2014it moves hot air off the engine\u2014but once you upgrade your exhaust system (i.e. 
add an AI algorithm to your application), you can tell how your engine works differently by comparing your car\u2019s performance to a base-level version of the same car.\u00a0<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Now let\u2019s plug in our <a href=\"https:\/\/www.backblaze.com\/blog\/ai-101-how-cognitive-science-and-computer-processors-create-artificial-intelligence\/\">\u201csmart\u201d element<\/a>, and it\u2019s more like your exhaust can see that your car has terrible fuel efficiency, identify that it\u2019s because you\u2019re not moving hot air off the engine well enough, and re-route the pathway it\u2019s using through your pipes, mufflers, and catalytic converters to improve itself. (Saving you money on gas\u2014wins all around.)&nbsp;<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Model serving, in this example, would be a shop that specializes in installing and maintaining exhausts. They\u2019re experts at plugging in your new exhaust and having it work well with the rest of the engine even if it\u2019s a newer type of tech (so, interoperability via API), and they have thought through and created frameworks for how to make sure the exhaust is functioning once you\u2019re driving around (i.e. metrics). They\u2019ve got a ton of ready-made parts and exhaust systems to recommend (that\u2019s your model registry). When they install your new system in your engine, they might have some tweaks that work specifically in your system, too (versioning over time to serve your specific product).&nbsp;<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">OK, back to the technical details. From an architecture standpoint, model serving also lets you separate your production model from the base AI\/ML model in addition to creating an accessible endpoint (read: an API or HTTPS access point, etc.). 
This separation has benefits\u2014making tracking model drift and versioning simpler, for instance.\u00a0<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure data-wp-context=\"{&quot;imageId&quot;:&quot;6a3f5667141ac&quot;}\" data-wp-interactive=\"core\/image\" data-wp-key=\"6a3f5667141ac\" class=\"aligncenter size-full wp-lightbox-container\"><img loading=\"lazy\" decoding=\"async\" width=\"936\" height=\"450\" data-wp-class--hide=\"state.isContentHidden\" data-wp-class--show=\"state.isContentVisible\" data-wp-init=\"callbacks.setButtonStyles\" data-wp-on--click=\"actions.showLightbox\" data-wp-on--load=\"callbacks.setButtonStyles\" data-wp-on--pointerdown=\"actions.preloadImage\" data-wp-on--pointerenter=\"actions.preloadImageWithDelay\" data-wp-on--pointerleave=\"actions.cancelPreload\" data-wp-on-window--resize=\"callbacks.setButtonStyles\" src=\"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2024\/04\/AI_ML-Model-Serving_1_Model-Serving.png\" alt=\"A diagram of a typical model serving process. 
\" class=\"wp-image-111091\" srcset=\"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2024\/04\/AI_ML-Model-Serving_1_Model-Serving.png 936w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2024\/04\/AI_ML-Model-Serving_1_Model-Serving-300x144.png 300w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2024\/04\/AI_ML-Model-Serving_1_Model-Serving-768x369.png 768w\" sizes=\"auto, (max-width: 936px) 100vw, 936px\" \/><button\n\t\t\tclass=\"lightbox-trigger\"\n\t\t\ttype=\"button\"\n\t\t\taria-haspopup=\"dialog\"\n\t\t\tdata-wp-bind--aria-label=\"state.thisImage.triggerButtonAriaLabel\"\n\t\t\tdata-wp-init=\"callbacks.initTriggerButton\"\n\t\t\tdata-wp-on--click=\"actions.showLightbox\"\n\t\t\tdata-wp-style--right=\"state.thisImage.buttonRight\"\n\t\t\tdata-wp-style--top=\"state.thisImage.buttonTop\"\n\t\t>\n\t\t\t<svg xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"12\" height=\"12\" fill=\"none\" viewBox=\"0 0 12 12\">\n\t\t\t\t<path fill=\"#fff\" d=\"M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z\" \/>\n\t\t\t<\/svg>\n\t\t<\/button><figcaption class=\"wp-element-caption\"><a href=\"https:\/\/thenewstack.io\/model-server-the-critical-building-block-of-mlops\/\" target=\"_blank\" rel=\"noreferrer noopener\">Source.<\/a><\/figcaption><\/figure>\n<\/div>\n\n\n<p class=\"wp-block-paragraph\">Like traditional software engineering, most AI\/ML model serving platforms also have code libraries of fully or partially trained models\u2014the model registry in the image above. 
For example, if you\u2019re running a <a href=\"https:\/\/www.backblaze.com\/blog\/shooting-for-the-clouds-how-one-photo-storage-service-moved-beyond-physical-devices\/\" target=\"_blank\" rel=\"noreferrer noopener\">photo management application<\/a>, you might grab an image recognition model and plug it into your larger application.\u00a0<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">This is a tad more complex than other types of code deployment because you can\u2019t <em>really <\/em>tell if an AI\/ML model is functioning correctly until it\u2019s working on real-world data. Certainly, that\u2019s somewhat true of all code deployments\u2014you always find more bugs when you\u2019re live\u2014but because AI\/ML models are <a href=\"https:\/\/www.backblaze.com\/blog\/ai-101-gpu-vs-tpu-vs-npu\/\" target=\"_blank\" rel=\"noreferrer noopener\">performing complex tasks<\/a> like making predictions, natural language processing, etc., even a trained model has more room for \u201cerror\u201d that becomes evident when it\u2019s in a live environment. And, in many use cases\u2014like fraud detection or network intrusion detection\u2014models need to have <a href=\"https:\/\/www.backblaze.com\/blog\/navigating-cloud-storage-what-is-latency-and-why-does-it-matter\/\" target=\"_blank\" rel=\"noreferrer noopener\">very low latency<\/a> to perform properly.\u00a0<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Because of that, deciding what kind of <a href=\"https:\/\/www.qwak.com\/post\/shadow-deployment-vs-canary-release-of-machine-learning-models\" target=\"_blank\" rel=\"noreferrer noopener\">code deployment<\/a> to use can have a high impact on your end users. For example, lots of experts recommend leveraging shadow deployment techniques, where your AI\/ML model is ingesting live data, but running on a parallel environment invisible to end users, for phase one of your deployment.\u00a0<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">Machine Learning Operations (MLOps) vs. 
AI\/ML Model Serving<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">In reading about model serving, you\u2019ll inevitably come across folks talking about MLOps as well. (An Ops for every occasion, as they say. \u201cThey\u201d being me.) You can think of MLOps as being responsible for the entire end-to-end process, whereas AI\/ML model serving focuses on one part of the process. Here\u2019s a handy diagram that outlines the whole MLOps lifecycle:<\/p>\n\n\n\n<figure data-wp-context=\"{&quot;imageId&quot;:&quot;6a3f56671b761&quot;}\" data-wp-interactive=\"core\/image\" data-wp-key=\"6a3f56671b761\" class=\"wp-block-image size-full wp-lightbox-container\"><img loading=\"lazy\" decoding=\"async\" width=\"780\" height=\"515\" data-wp-class--hide=\"state.isContentHidden\" data-wp-class--show=\"state.isContentVisible\" data-wp-init=\"callbacks.setButtonStyles\" data-wp-on--click=\"actions.showLightbox\" data-wp-on--load=\"callbacks.setButtonStyles\" data-wp-on--pointerdown=\"actions.preloadImage\" data-wp-on--pointerenter=\"actions.preloadImageWithDelay\" data-wp-on--pointerleave=\"actions.cancelPreload\" data-wp-on-window--resize=\"callbacks.setButtonStyles\" src=\"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2024\/04\/AI_ML-Model-Serving_2_MLOps.png\" alt=\"A diagram showing a typical MLOps process. 
\" class=\"wp-image-111092\" srcset=\"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2024\/04\/AI_ML-Model-Serving_2_MLOps.png 780w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2024\/04\/AI_ML-Model-Serving_2_MLOps-300x198.png 300w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2024\/04\/AI_ML-Model-Serving_2_MLOps-768x507.png 768w\" sizes=\"auto, (max-width: 780px) 100vw, 780px\" \/><button\n\t\t\tclass=\"lightbox-trigger\"\n\t\t\ttype=\"button\"\n\t\t\taria-haspopup=\"dialog\"\n\t\t\tdata-wp-bind--aria-label=\"state.thisImage.triggerButtonAriaLabel\"\n\t\t\tdata-wp-init=\"callbacks.initTriggerButton\"\n\t\t\tdata-wp-on--click=\"actions.showLightbox\"\n\t\t\tdata-wp-style--right=\"state.thisImage.buttonRight\"\n\t\t\tdata-wp-style--top=\"state.thisImage.buttonTop\"\n\t\t>\n\t\t\t<svg xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"12\" height=\"12\" fill=\"none\" viewBox=\"0 0 12 12\">\n\t\t\t\t<path fill=\"#fff\" d=\"M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z\" \/>\n\t\t\t<\/svg>\n\t\t<\/button><figcaption class=\"wp-element-caption\"><a href=\"https:\/\/www.iguazio.com\/blog\/introduction-to-tf-serving\/\" target=\"_blank\" rel=\"noreferrer noopener\">Source.<\/a><\/figcaption><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">And, of course, you\u2019ll see one box on there that\u2019s called \u201cmodel serving\u201d.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">How to Choose a Model Serving Platform<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">AI model serving platforms typically provide features such as scalability to handle varying workloads, low latency for real-time predictions, monitoring capabilities to track model performance and health, versioning to manage multiple model versions, and integration with other software systems or 
frameworks.\u00a0<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><a href=\"https:\/\/www.backblaze.com\/blog\/ai-101-do-the-dollars-make-sense\/\" target=\"_blank\" rel=\"noreferrer noopener\">Choosing the right one<\/a> is not a one-size-fits-all decision. Model serving platforms give you a whole host of benefits, operationally speaking\u2014they deliver better performance, scale easily with your business, integrate well with other applications, and give you valuable monitoring tools from both a performance and a security perspective. But there are a ton of other factors that can come into play that aren\u2019t immediately apparent, such as preferred programming languages (Python is right up there), the processing\/hardware platform you\u2019re using, budget, what level of control and fine-tuning you want over APIs, how much management you want to do in-house vs. outsourcing, how much support\/engagement there is in the developer community, and so on.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">Popular Model Serving Platforms<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">Now that you know what model serving is, you might be wondering how you can use it yourself. We rounded up some of the more popular platforms so you can get a sense of the diversity in the marketplace:&nbsp;<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a href=\"https:\/\/www.tensorflow.org\/tfx\/guide\/serving\" target=\"_blank\" rel=\"noreferrer noopener\">TensorFlow Serving<\/a>: An open-source serving system for deploying machine learning models built with TensorFlow. It provides efficient and scalable serving of TensorFlow models for both online and batch predictions.\u00a0<\/li>\n\n\n\n<li><a href=\"https:\/\/aws.amazon.com\/sagemaker\/\" target=\"_blank\" rel=\"noreferrer noopener\">Amazon SageMaker<\/a>: A fully managed service provided by Amazon Web Services (AWS) for building, training, and deploying machine learning models at scale. 
SageMaker includes built-in model serving capabilities for deploying models to production.<\/li>\n\n\n\n<li><a href=\"https:\/\/cloud.google.com\/ai-platform\/docs\/technical-overview\" target=\"_blank\" rel=\"noreferrer noopener\">Google Cloud AI Platform<\/a>: A suite of cloud-based machine learning services provided by Google Cloud Platform (GCP), since succeeded by Vertex AI. It offers tools for training, evaluation, and deployment of machine learning models, including model serving features for deploying models in production environments.<\/li>\n\n\n\n<li><a href=\"https:\/\/azure.microsoft.com\/en-us\/products\/machine-learning\" target=\"_blank\" rel=\"noreferrer noopener\">Microsoft Azure Machine Learning<\/a>: A cloud-based service offered by Microsoft Azure for building, training, and deploying machine learning models. Azure Machine Learning includes features for deploying models as web services for real-time scoring and batch inferencing.<\/li>\n\n\n\n<li><a href=\"https:\/\/www.backblaze.com\/cloud-storage\/solutions\/kubernetes-backup\" target=\"_blank\" rel=\"noreferrer noopener\">Kubernetes (K8s)<\/a>: While not a model serving platform in itself, Kubernetes is a popular open-source container orchestration platform that is often used for deploying and managing machine learning models at scale. 
Several tools and frameworks, such as Kubeflow and KServe (formerly KFServing), provide extensions for serving models on Kubernetes clusters.<\/li>\n\n\n\n<li><a href=\"https:\/\/huggingface.co\/\" target=\"_blank\" rel=\"noreferrer noopener\">Hugging Face<\/a>: Known for its open-source libraries for natural language processing (NLP), Hugging Face also provides a model serving platform for deploying and managing NLP models in production environments.<\/li>\n<\/ul>\n\n\n\n<h2 class=\"wp-block-heading\">The Practical Approach<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">In short, AI\/ML model serving platforms make ML algorithms much more manageable and accessible for all kinds of applications. Choosing the right one (as always) comes down to your particular use case\u2014so, test thoroughly, and let us know what\u2019s working for you in the comments.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>When you&#8217;re deciding how to use an artificial intelligence algorithm in your product, model serving can help you straighten out the operational details. Let&#8217;s talk about what it is. 
<\/p>\n","protected":false},"author":182,"featured_media":111090,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"content-type":"","footnotes":"","jetpack_post_was_ever_published":false},"categories":[7,434,438],"tags":[489,468],"class_list":["post-111089","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-cloud-storage","category-featured-1","category-featured-cloud-storage","tag-ai-ml","tag-b2cloud","entry"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.8 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>AI 101: What Is Model Serving?<\/title>\n<meta name=\"description\" content=\"Learn how AI\/ML model serving streamlines the deployment of machine learning models, enhancing business efficiency and addressing complexities.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.backblaze.com\/blog\/ai-101-what-is-model-serving\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"AI 101: What Is Model Serving?\" \/>\n<meta property=\"og:description\" content=\"Learn how AI\/ML model serving streamlines the deployment of machine learning models, enhancing business efficiency and addressing complexities.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.backblaze.com\/blog\/ai-101-what-is-model-serving\/\" \/>\n<meta property=\"og:site_name\" content=\"Backblaze Blog | Cloud Storage &amp; Cloud Backup\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/backblaze\" \/>\n<meta property=\"article:published_time\" content=\"2024-04-11T16:36:57+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-08-14T17:41:25+00:00\" \/>\n<meta 
property=\"og:image\" content=\"https:\/\/www.backblaze.com\/blog\/wp-content\/uploads\/2024\/04\/bb-bh-What-Is-Model-Serving_Design-A-1.png\" \/>\n\t<meta property=\"og:image:width\" content=\"1440\" \/>\n\t<meta property=\"og:image:height\" content=\"820\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Stephanie Doyle\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@backblaze\" \/>\n<meta name=\"twitter:site\" content=\"@backblaze\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Stephanie Doyle\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"7 minutes\" \/>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"AI 101: What Is Model Serving?","description":"Learn how AI\/ML model serving streamlines the deployment of machine learning models, enhancing business efficiency and addressing complexities.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.backblaze.com\/blog\/ai-101-what-is-model-serving\/","og_locale":"en_US","og_type":"article","og_title":"AI 101: What Is Model Serving?","og_description":"Learn how AI\/ML model serving streamlines the deployment of machine learning models, enhancing business efficiency and addressing complexities.","og_url":"https:\/\/www.backblaze.com\/blog\/ai-101-what-is-model-serving\/","og_site_name":"Backblaze Blog | Cloud Storage &amp; Cloud 
Backup","article_publisher":"https:\/\/www.facebook.com\/backblaze","article_published_time":"2024-04-11T16:36:57+00:00","article_modified_time":"2024-08-14T17:41:25+00:00","og_image":[{"width":1440,"height":820,"url":"https:\/\/www.backblaze.com\/blog\/wp-content\/uploads\/2024\/04\/bb-bh-What-Is-Model-Serving_Design-A-1.png","type":"image\/png"}],"author":"Stephanie Doyle","twitter_card":"summary_large_image","twitter_creator":"@backblaze","twitter_site":"@backblaze","twitter_misc":{"Written by":"Stephanie Doyle","Est. reading time":"7 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.backblaze.com\/blog\/ai-101-what-is-model-serving\/#article","isPartOf":{"@id":"https:\/\/www.backblaze.com\/blog\/ai-101-what-is-model-serving\/"},"author":{"name":"Stephanie Doyle","@id":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/#\/schema\/person\/688f3962fd24d8155ef726bc94d75058"},"headline":"AI 101: What Is Model Serving?","datePublished":"2024-04-11T16:36:57+00:00","dateModified":"2024-08-14T17:41:25+00:00","mainEntityOfPage":{"@id":"https:\/\/www.backblaze.com\/blog\/ai-101-what-is-model-serving\/"},"wordCount":1390,"commentCount":0,"publisher":{"@id":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/#organization"},"image":{"@id":"https:\/\/www.backblaze.com\/blog\/ai-101-what-is-model-serving\/#primaryimage"},"thumbnailUrl":"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2024\/04\/bb-bh-What-Is-Model-Serving_Design-A-1.png","keywords":["AI\/ML","B2Cloud"],"articleSection":["Cloud Storage","Featured","Featured-Cloud Storage"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.backblaze.com\/blog\/ai-101-what-is-model-serving\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.backblaze.com\/blog\/ai-101-what-is-model-serving\/","url":"https:\/\/www.backblaze.com\/blog\/ai-101-what-is-model-serving\/","name":"AI 101: What Is Model 
Serving?","isPartOf":{"@id":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.backblaze.com\/blog\/ai-101-what-is-model-serving\/#primaryimage"},"image":{"@id":"https:\/\/www.backblaze.com\/blog\/ai-101-what-is-model-serving\/#primaryimage"},"thumbnailUrl":"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2024\/04\/bb-bh-What-Is-Model-Serving_Design-A-1.png","datePublished":"2024-04-11T16:36:57+00:00","dateModified":"2024-08-14T17:41:25+00:00","description":"Learn how AI\/ML model serving streamlines the deployment of machine learning models, enhancing business efficiency and addressing complexities.","breadcrumb":{"@id":"https:\/\/www.backblaze.com\/blog\/ai-101-what-is-model-serving\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.backblaze.com\/blog\/ai-101-what-is-model-serving\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.backblaze.com\/blog\/ai-101-what-is-model-serving\/#primaryimage","url":"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2024\/04\/bb-bh-What-Is-Model-Serving_Design-A-1.png","contentUrl":"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2024\/04\/bb-bh-What-Is-Model-Serving_Design-A-1.png","width":1440,"height":820,"caption":"A decorative image showing a computer, a cloud, and a building."},{"@type":"BreadcrumbList","@id":"https:\/\/www.backblaze.com\/blog\/ai-101-what-is-model-serving\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/"},{"@type":"ListItem","position":2,"name":"AI 101: What Is Model Serving?"}]},{"@type":"WebSite","@id":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/#website","url":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/","name":"Backblaze Cloud Solutions Blog","description":"Cloud Storage &amp; Cloud 
Backup","publisher":{"@id":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/#organization","name":"Backblaze","url":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/i0.wp.com\/www.backblaze.com\/blog\/wp-content\/uploads\/2017\/12\/backblaze_icon_transparent.png?fit=512%2C512&ssl=1","contentUrl":"https:\/\/i0.wp.com\/www.backblaze.com\/blog\/wp-content\/uploads\/2017\/12\/backblaze_icon_transparent.png?fit=512%2C512&ssl=1","width":512,"height":512,"caption":"Backblaze"},"image":{"@id":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/backblaze","https:\/\/x.com\/backblaze","https:\/\/www.youtube.com\/user\/Backblaze","https:\/\/en.wikipedia.org\/wiki\/Backblaze"]},{"@type":"Person","@id":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/#\/schema\/person\/688f3962fd24d8155ef726bc94d75058","name":"Stephanie Doyle","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2022\/12\/headshot-4-1-e1670452405672-150x150.jpg","url":"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2022\/12\/headshot-4-1-e1670452405672-150x150.jpg","contentUrl":"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2022\/12\/headshot-4-1-e1670452405672-150x150.jpg","caption":"Stephanie Doyle"},"description":"Stephanie is the Technical Narrative Content Manager at Backblaze. 
She specializes in taking complex topics and writing relatable, engaging, and user-friendly content. You can most often find her reading in public places, and can connect with her on LinkedIn.","url":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/author\/stephanie\/"}]}},"jetpack_featured_media_url":"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2024\/04\/bb-bh-What-Is-Model-Serving_Design-A-1.png","_links":{"self":[{"href":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/wp-json\/wp\/v2\/posts\/111089","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/wp-json\/wp\/v2\/users\/182"}],"replies":[{"embeddable":true,"href":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/wp-json\/wp\/v2\/comments?post=111089"}],"version-history":[{"count":0,"href":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/wp-json\/wp\/v2\/posts\/111089\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/wp-json\/wp\/v2\/media\/111090"}],"wp:attachment":[{"href":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/wp-json\/wp\/v2\/media?parent=111089"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/wp-json\/wp\/v2\/categories?post=111089"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/wp-json\/wp\/v2\/tags?post=111089"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}