Multiple Regression

Learn about multiple regression.

Understanding multiple regression

Multiple regression is an extension of simple linear regression. It is defined as a linear regression with multiple features. It is the most widely used machine learning technique. This technique is useful across many domains as we cannot depend on one single variable to do effective modeling. Now, let’s formulate an equation for multiple linear regression. We have a feature set X, which consists of p features.

We can present the equation as

yi=w0+w1xi1+w2xi2+w3xi3+...+wpxip+Errory_i= w_0 +w_1x_{i1}+ w_2x_{i2}+ w_3x_{i3} + ... + w_px_{ip}+ Error

Instead of using the features directly, we can use transform features.

yi=w0+w1T(xi1)+w2T(xi2)+w3T(xi3)+...+wkT(xik)+Errory_i= w_0 +w_1T(x_{i1})+ w_2T(x_{i2})+ w_3T(x_{i3}) + ... + w_kT(x_{ik})+ Error

Here, T(xij)T(x_{ij}) is the ithi_{th} record and jthj_{th} transformation is applied. Transformation could be anything, including as higher order polynomial, logarithm, exponential, etc.

Consider these examples:

f(x)=25+35x12+12log(x2)f(x) = 25 + 35x_1^2 + 12log(x_2)

f(x)=3+4x1+12x24f(x) = 3 + 4x_1 + 12x_2^4

Level up your interview prep. Join Educative to access 70+ hands-on prep courses.