Visualize the Working of K-Nearest Neighbors

Learn to visualize the working principle behind k-nearest neighbors.

Let’s move on and put what we have learned so far into practice. As always, we need to import some basic libraries.

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
sns.set(font_scale=1.5)    # setting the font size for the whole notebook
sns.set_style('whitegrid') # setting the style -- just optional!

Let's generate a dataset with two classes and see how the KNN algorithm assigns a class to any new data point in practice.

The dataset

We can use make_biclusters() from scikit-learn to create a simple dataset with two features (columns) and 50 observations (data points). We can also add Gaussian noise while creating the clusters and assign each cluster a class. Let's do this.

Python 3.8
## Generate 2 random clusters, create dataframe
from sklearn.datasets import make_biclusters # to generate data
X, classes, cols = make_biclusters(shape=(50, 2),   # (n_rows, n_cols): 50 observations, 2 features
                                   n_clusters=2,    # number of classes we want
                                   noise=50,        # the standard deviation of the Gaussian noise
                                   random_state=101) # to re-generate the same data every time
# Creating the dataframe
df = pd.DataFrame(X, columns=['feature_2', 'feature_1'])
df['target'] = classes[0]
# Well, instead of True/False, let's replace with 1/0 targets -- a practice for map and lambda!
df['target'] = df['target'].map(lambda t: '1' if t == 0 else '0')
print(df.tail(2)) # tail this time!

Let's check the class distribution.

Python 3.8
print(df.target.value_counts())

As seen from the code output above, we have a dataset with two features and a target column.

Visualize training and the test data

Let's create a scatterplot and visualize the distribution of the data points. We can use the hue parameter to show the classes in different colors. In another plot (right side), we can add a test point whose class is unknown and which we want KNN to classify.

Python 3.8
# Create the figure with two subplots
fig, (ax1, ax2) = plt.subplots(nrows=1, ncols=2, figsize=(16, 8))
# Figure 1 (left): the training data
sns.scatterplot(x='feature_1', y='feature_2', data=df, hue='target', ax=ax1, s=150)
ax1.set_title("The data -- two classes")
ax1.set_xlabel('Feature 1')
ax1.set_ylabel('Feature 2')
ax1.legend().set_title('Target')
# Our new (unknown) point
test_point = [[10, 50]]
# Figure 2 (right): the training data plus the test point
sns.scatterplot(x='feature_1', y='feature_2', data=df, hue='target', ax=ax2, s=150)
ax2.scatter(x=test_point[0][0], y=test_point[0][1], color="red", marker="*", s=1000)
ax2.set_title('Red star is a test (unknown) point')
ax2.set_xlabel('Feature 1')
ax2.set_ylabel('Feature 2')
ax2.legend().set_title('Target')

The red star is a new, unknown data point whose class we want our KNN algorithm to predict, and for this purpose, we need to perform the following ...
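Before moving on, those steps can be sketched by hand with plain NumPy: compute the distance from the test point to every training point, pick the nearest neighbors, and take a majority vote. This is only a minimal illustration, not the final implementation; the choice of k = 5 and the use of Euclidean distance here are assumptions for the sketch.

```python
import numpy as np
import pandas as pd
from sklearn.datasets import make_biclusters

# Recreate the same dataset as above (same parameters and random_state)
X, classes, cols = make_biclusters(shape=(50, 2), n_clusters=2,
                                   noise=50, random_state=101)
df = pd.DataFrame(X, columns=['feature_2', 'feature_1'])
df['target'] = classes[0]
df['target'] = df['target'].map(lambda t: '1' if t == 0 else '0')

test_point = np.array([10, 50])  # (feature_1, feature_2) of the red star
k = 5                            # assumed number of neighbors for this sketch

# Step 1: Euclidean distance from the test point to every training point
dists = np.sqrt(((df[['feature_1', 'feature_2']].values - test_point) ** 2).sum(axis=1))

# Step 2: indices of the k nearest training points
nearest = np.argsort(dists)[:k]

# Step 3: majority vote among the neighbors' labels
predicted = df['target'].iloc[nearest].mode()[0]
print("Predicted class for the test point:", predicted)
```

The predicted class is simply the most common label among the five closest training points; this is exactly what scikit-learn's `KNeighborsClassifier` does internally (by default with uniform weights).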