
The Essential k-NN Algorithm

Explore the essential k-Nearest Neighbors algorithm, including steps to classify data points and how to optimize performance by minimizing sorting overhead with Python's bisect and heapq modules. Understand computational costs and benchmark different approaches to enhance algorithm efficiency.

Overview

We can summarize the k-NN algorithm as having the following steps:

  1. Create a list of all (distance, training sample) pairs.
  2. Sort these in ascending order.
  3. Pick the first k; these are the k nearest neighbors.
  4. Choose the mode (the highest-frequency) label for the k nearest neighbors.

The implementation would look like this:

Python 3.10.4
import collections
from collections import Counter
from typing import NamedTuple

# TrainingKnownSample, DistanceFunc, TrainingList, and AnySample
# are defined earlier in the case study.

class Measured(NamedTuple):
    distance: float
    sample: TrainingKnownSample

def k_nn_1(
    k: int, dist: DistanceFunc, training_data: TrainingList,
    unknown: AnySample
) -> str:
    # Compute a distance for every training sample, then sort them all.
    distances = sorted(
        map(lambda t: Measured(dist(t, unknown), t), training_data)
    )
    # The first k entries are the k nearest neighbors.
    k_nearest = distances[:k]
    # The classification is the mode (most common) species among them.
    k_frequencies: Counter[str] = collections.Counter(
        s.sample.sample.species for s in k_nearest
    )
    mode, fq = k_frequencies.most_common(1)[0]
    return mode
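
The names TrainingKnownSample, DistanceFunc, TrainingList, and AnySample come from earlier in the case study. To experiment with the function on its own, minimal stand-ins such as the following could be defined before the code above (the annotations are evaluated when the classes and function are created). These particular definitions, the euclidean helper, and the sample data are illustrative assumptions, not the case study's actual classes:

from typing import Callable, NamedTuple

class KnownSample(NamedTuple):
    # Hypothetical stand-in; the case study's sample classes differ.
    features: tuple[float, ...]
    species: str

class TrainingKnownSample(NamedTuple):
    sample: KnownSample

TrainingList = list[TrainingKnownSample]
AnySample = tuple[float, ...]
DistanceFunc = Callable[[TrainingKnownSample, AnySample], float]

def euclidean(t: TrainingKnownSample, unknown: AnySample) -> float:
    # Euclidean distance between the known sample's features and the unknown.
    return sum(
        (a - b) ** 2 for a, b in zip(t.sample.features, unknown)
    ) ** 0.5

training = [
    TrainingKnownSample(KnownSample((5.1, 3.5), "setosa")),
    TrainingKnownSample(KnownSample((6.3, 2.9), "virginica")),
    TrainingKnownSample(KnownSample((5.0, 3.4), "setosa")),
]
print(k_nn_1(2, euclidean, training, (5.0, 3.3)))  # setosa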

While clear, this implementation accumulates a large number of distance values in the distances list when only k are actually needed. The sorted() function consumes the source generator and creates a (potentially large) list of intermediate values.
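
One way to bound the memory footprint, using the bisect module mentioned in the overview, is to keep only a sorted window of the k best candidates seen so far. This is a sketch of that idea reusing the names from the code above, not necessarily the chapter's exact implementation; the key= argument to bisect.insort requires Python 3.10 or later:

import bisect

def k_nn_bisect(
    k: int, dist: DistanceFunc, training_data: TrainingList,
    unknown: AnySample
) -> str:
    # Maintain a sorted window of at most k (distance, sample) pairs.
    window: list[Measured] = []
    for t in training_data:
        m = Measured(dist(t, unknown), t)
        # key= keeps ordering by distance and avoids comparing
        # sample objects when two distances tie.
        bisect.insort(window, m, key=lambda meas: meas.distance)
        if len(window) > k:
            window.pop()  # discard the current (k+1)-th nearest
    k_frequencies: Counter[str] = collections.Counter(
        s.sample.sample.species for s in window
    )
    mode, fq = k_frequencies.most_common(1)[0]
    return mode

The window never holds more than k + 1 items, so memory stays bounded no matter how large the training set is.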

One of the high-cost parts of this specific k-NN algorithm is sorting the entire set of training data after the distances have been computed. We summarize the complexity of this sort as an O(n log n) operation. A way to avoid this cost is to avoid sorting the entire set of distance computations.
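
The heapq module, also mentioned in the overview, offers one such route: heapq.nsmallest consumes the distance generator while keeping a bounded heap of the k smallest entries, which is roughly O(n log k) rather than O(n log n). A sketch, again reusing the names from the code above rather than reproducing the chapter's exact implementation:

import heapq

def k_nn_heapq(
    k: int, dist: DistanceFunc, training_data: TrainingList,
    unknown: AnySample
) -> str:
    # nsmallest keeps at most k items on a heap while consuming the
    # generator, so the full list of n distances is never materialized.
    k_nearest = heapq.nsmallest(
        k,
        (Measured(dist(t, unknown), t) for t in training_data),
        key=lambda m: m.distance,
    )
    k_frequencies: Counter[str] = collections.Counter(
        s.sample.sample.species for s in k_nearest
    )
    mode, fq = k_frequencies.most_common(1)[0]
    return mode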

Steps ...