MLOps is becoming increasingly popular. Anyone with a background in traditional DevOps might wonder why it needs a separate name at all. In this post we'll look at how MLOps differs from traditional DevOps and discuss the challenges it presents to the industry.

The key difference between DevOps and MLOps is how machine learning uses data: how data volume, transformation, and quality are handled shapes the whole MLOps life cycle.

Current State of DevOps and MLOps

DevOps refers to a set of established practices that ensure smooth build, deploy, and monitor cycles, built on infrastructure automation and CI/CD. Typical tools include Jira, Git, Jenkins, Docker, and Kubernetes.

MLOps has not reached the same maturity: by some estimates, 87% of machine-learning projects never go live.

ML infrastructure can be complex. Workflows go beyond producing build artifacts to include data collection, validation, and preparation. Many kinds of hardware resources can be involved, and GPUs in particular need management. Both the incoming data and the quality of the predictions need monitoring. All of this adds up to a complex landscape.
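To make the monitoring point concrete, here is a minimal, hypothetical sketch of one kind of data monitoring: flagging drift when the mean of a feature seen in production moves too far from what was seen at training time. The function name, data, and threshold are all made up for illustration.

```python
# Hypothetical data-drift check: compare the mean of a feature at
# training time against the mean observed on live traffic.
def drifted(train_values, live_values, threshold=0.5):
    mean = lambda xs: sum(xs) / len(xs)
    return abs(mean(train_values) - mean(live_values)) > threshold

# Made-up feature values (e.g. customer age).
train_ages = [30, 35, 40, 45, 50]
live_ages_ok = [32, 36, 41, 44, 49]
live_ages_bad = [60, 65, 70, 72, 75]

print(drifted(train_ages, live_ages_ok))   # False: distribution looks similar
print(drifted(train_ages, live_ages_bad))  # True: live data has shifted
```

Real MLOps platforms use far richer statistics than a mean comparison, but the idea is the same: production data is itself something to monitor.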

Why is it so different?

All of these differences come down to machine learning itself and how it is practised. Like mainstream software, ML responds to inputs with actions; what differs is how those actions are codified.

Conventional software codifies actions as explicit rules. The simplest example is the 'hello, world' program, which simply specifies that the program should output 'hello, world'. Adding control structures allows more complex actions to be performed in response to inputs; learning a programming language is largely learning these structures. This rule-based input/output pattern is easiest to see in older terminal systems, where input comes only from the keyboard and output is only text, but it holds for most of the software we use, even though the types and complexity of inputs and outputs vary widely.
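A tiny sketch makes the rule-based pattern explicit. The `classify_temperature` helper below is hypothetical; the point is that a human wrote every rule by hand:

```python
# Conventional software: the action taken for each input is an
# explicit, hand-written rule.
def classify_temperature(celsius):
    if celsius < 0:
        return "freezing"
    elif celsius < 20:
        return "cold"
    else:
        return "warm"

print(classify_temperature(-5))   # freezing
print(classify_temperature(25))   # warm
```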

ML doesn't codify rules explicitly. Instead, rules are captured indirectly from patterns in data. This makes ML suited to a narrower class of problems, ones that can be expressed numerically. Predicting salary from data points/features such as education, experience, and location is an example: this is a regression problem, where the goal is to predict a salary value from previous data. Machine learning can also solve classification problems. Instead of predicting the value of a variable, the model outputs the probability that a given data point falls into a specific class. Some examples of classification problems:

  • Predicting which digit a handwritten example represents.
  • Sorting images into categories, e.g. types of flowers.
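To contrast with the rule-based sketch above, here is a minimal classifier whose "rules" come from data rather than from hand-written logic. This is a toy 1-nearest-neighbour classifier with made-up feature values; real flower classification would use far richer features and a proper toolkit.

```python
# Minimal 1-nearest-neighbour classifier: the decision rule is
# captured from labelled data points, not written by hand.
def nearest_neighbour(train, point):
    """train: list of ((x, y), label); point: (x, y) to classify."""
    def dist2(a, b):
        return (a[0] - b[0]) ** 2 + (a[1] - b[1]) ** 2
    _, label = min(train, key=lambda item: dist2(item[0], point))
    return label

# Toy data: two flower "types" separated in a 2-D feature space.
train = [((1.0, 1.0), "iris"), ((1.2, 0.8), "iris"),
         ((4.0, 4.2), "tulip"), ((3.8, 4.0), "tulip")]

print(nearest_neighbour(train, (1.1, 0.9)))  # iris
print(nearest_neighbour(train, (4.1, 4.1)))  # tulip
```

Change the training data and the classifier's behaviour changes with it, without touching a line of code: that is the essential difference from explicit rules.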

You don't need to know every detail of how ML works, but a basic understanding of how models are trained helps. Let's walk through a regression problem: for example, predicting salary from data such as education, experience, and location.

The coefficients/weights are set to initial values at random. To make predictions, the equation is applied to the training data. The first run of predictions is likely to be poor. The error, which measures how poor the predictions are, is the sum of the distances between the predicted values and the actual output-variable (e.g. salary) samples. To reduce the error, we update the weights and repeat the cycle of updating and making new predictions. This is known as 'fitting', or 'training'. The end result is a set of weights that can be used for predictions.
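The fitting loop can be sketched in a few lines of plain Python. The toy data, learning rate, and single `experience` feature below are all assumptions for illustration; real training uses many features and a proper framework, but the cycle is the same: predict, measure error, nudge the weights, repeat.

```python
import random

random.seed(0)

# Toy data: salary is roughly 2 * experience + 1 (arbitrary units).
data = [(x, 2.0 * x + 1.0) for x in range(10)]

w, b = random.random(), random.random()  # weights start at random values
lr = 0.01                                # learning rate (step size)

for _ in range(2000):                    # training iterations
    for x, y in data:
        pred = w * x + b                 # apply the equation to the data
        err = pred - y                   # distance from the true value
        w -= lr * err * x                # update weights to reduce error
        b -= lr * err

print(round(w, 2), round(b, 2))          # approaches 2.0 and 1.0
```

After enough iterations the weights settle close to the values that generated the data, and that final set of weights is what gets used for predictions.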

The basic picture, then, is that ML training iterates to update weights and improve prediction accuracy, and this is where ML departs from traditional programming. From a DevOps perspective, the key points to take away are:

  1. The training data, together with the code, drives fitting.
  2. A trained/weighted model is the closest thing to an executable. Its format depends on the model type and the ML toolkit used (TensorFlow, scikit-learn, R, H2O, etc.).
  3. Retraining is sometimes necessary. If your model makes predictions on data that changes seasonally, such as how many clothing items will be sold in a given month, a model trained on summer data may predict well in summer but poorly in winter.
  4. Training can be demanding because of the large data volumes involved.
  5. Data scientists work in an exploratory mode, and visualisations are an important part of that process.