DBSCAN in R

Here, we’ll use the rnorm method to generate 100 normally distributed random numbers in 2D space by setting the number of columns in the dataset equal to two. (i.e ncols = 2).

Step 2: Run DBSCAN

Next, we move on to the major step of this Answer, which is to run the DBSCAN algorithm itself using the dbscan library. For this coding example, we'll use parameter values eps = 0.5 (radius for searching of neighboring points around a certain data point) and minPts = 5 (i.e., the minimum number of points to form a dense region).

Code explanation

The line-by-line explanation of the code above is given below:

Line 2: Here, we take the result of DBSCAN, which is a data frame object, and plot it with the x-axis set to the first column of the dataset and the y-axis set to the second column of the dataset (inside aes).
Line 3: We take the size of each point to 3 via geom_point(size=3) for greater legibility.
Lines 4-5: We set the title of the graph to DBSCAN Clustering in R as well as setting the axes labels to Variable 1 for the x-axis and Variable 2 for the y-axis.
Line 6: Here, the theme of the plot is set to a minimal theme, which typically removes background gridlines and other non-essential elements.
Line 7: This sets the color scale for the cluster variable, with the name of the color legend to Cluster.

Conclusion

Overall, we learned how the DBSCAN algorithm is performed on a random dataset and how the generated data is visualized with the help of a scatter plot. The parameters and styling for DBSCAN can be adjusted as needed for our specific dataset and preferences.

Free AI Mock Interviews

Coding Interview

Coding PatternsFree Interview

Gain insights and practical experience with coding patterns through targeted MCQs and coding problems, designed to match and challenge your expertise level.

System Design

YouTubeFree Interview

Learn to design a video streaming platform like YouTube by tackling functional and non-functional requirements, core components, and high-level to detailed design challenges.

Free Resources

DBSCAN in R

Step 1: Create a sample dataset

Step 2: Run DBSCAN

Step 3: Plot the generated clusters

Code explanation

Conclusion