A Microbenchmark Example
Discover the benchmarking results of linear and binary search algorithms.
Benchmarking linear and binary search algorithms
We will wrap this chapter up by going back to our initial examples of linear search and binary search, demonstrating how they can be benchmarked using a benchmarking framework.
We began this chapter by comparing two ways of searching for an integer in a std::vector. If we knew that the vector was already sorted, we could use a binary search, which outperformed the simple linear search algorithm. We will not repeat the definition of the functions here, but the declaration looked like this:
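A sketch of what those declarations might look like follows. The exact signatures and bodies from earlier in the chapter may differ; minimal definitions are included here only to make the example self-contained (the binary version is sketched on top of std::binary_search):

```cpp
#include <algorithm>
#include <vector>

// Hypothetical signatures; the chapter's originals may differ slightly.
bool linear_search(const std::vector<int>& vals, int key) {
  for (const auto& v : vals) {   // scan every element until key is found
    if (v == key) {
      return true;
    }
  }
  return false;
}

bool binary_search(const std::vector<int>& vals, int key) {
  // Requires vals to be sorted; delegates to the standard algorithm.
  return std::binary_search(vals.begin(), vals.end(), key);
}
```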
The difference in the execution time of these functions is very obvious once the input is sufficiently large, but it will serve as a good enough example for our purpose. We will begin by only measuring linear_search(). Then, when we have a working benchmark in place, we will add binary_search() and compare the two versions.
In order to make a testing program, we first need a way to generate a sorted vector of integers. A simple implementation, as follows, will be sufficient for our needs:
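One plausible implementation (the helper name gen_vec is an assumption) fills the vector with std::iota:

```cpp
#include <numeric>
#include <vector>

// Hypothetical helper: returns a sorted vector containing 0, 1, ..., n - 1.
auto gen_vec(int n) {
  auto v = std::vector<int>(n);
  std::iota(v.begin(), v.end(), 0);  // fill with consecutive integers from 0
  return v;
}
```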
The vector that is returned will contain all integers between 0 and n - 1. Once we have that in place, we can create a naive test program like this:
We are searching for the value n, which we know isn't in the vector, so the algorithm will exhibit its worst-case performance using this test data. That's the good part of this test. Other than that, it is afflicted with many flaws that will make this benchmark useless:
- Compiling this code with optimizations enabled will most likely remove the code entirely, because the compiler can see that the results from the functions are not being used.
- We don't want to measure the time it takes to create and fill the std::vector.
- By only running the linear_search() function once, we will not achieve a statistically stable result.
- It's cumbersome to test for different input sizes.
Let's see how these problems can be addressed by using a microbenchmarking support library. There are various tools/libraries for benchmarking, but we will use Google Benchmark.
Microbenchmark of linear_search()
Here is how a simple microbenchmark of linear_search() might look when using Google Benchmark:
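A sketch of such a benchmark follows (the function name bm_linear_search is an assumption, and linear_search()/gen_vec() are assumed from earlier; the program must be linked against the Google Benchmark library):

```cpp
#include <benchmark/benchmark.h>
#include <numeric>
#include <vector>

// Assumed from earlier in the chapter.
bool linear_search(const std::vector<int>& vals, int key) {
  for (const auto& v : vals) {
    if (v == key) {
      return true;
    }
  }
  return false;
}

auto gen_vec(int n) {
  auto v = std::vector<int>(n);
  std::iota(v.begin(), v.end(), 0);
  return v;
}

static void bm_linear_search(benchmark::State& state) {
  auto n = 1024;         // hardcoded for now; parameterized later
  auto v = gen_vec(n);   // setup code: not included in the measurement
  for (auto _ : state) { // only the body of this loop is measured
    benchmark::DoNotOptimize(linear_search(v, n));
  }
}
BENCHMARK(bm_linear_search);

BENCHMARK_MAIN();
```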
Note: Benchmark values can be different every time we run the program.
That's it! The only thing we haven't addressed yet is that the input size is hardcoded to 1024; we will fix that shortly. Compiling and running this program will generate something like this:
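The output format resembles the following (the benchmark name follows our assumed naming, and the measured values are elided since they vary from machine to machine and run to run):

```
-------------------------------------------------------------
Benchmark                   Time             CPU   Iterations
-------------------------------------------------------------
bm_linear_search          ... ns          ... ns          ...
```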
The rightmost column reports the number of times the loop had to execute before a statistically stable result was achieved. The state object passed to our benchmarking function determines when to stop. The average time per iteration is reported in two columns: Time is the wall-clock time, and CPU is the time spent on the CPU by the main thread. In this case, they were the same, but if linear_search() had been blocked waiting for I/O (for example), the CPU time would have been lower than the wall-clock time.
Another important thing to note is that the code that generates the vector is not included in the reported time. The only code that is being measured is the code inside this loop:
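That measured loop, extracted from the benchmark function (v and n are the locals set up before the loop), is just:

```cpp
for (auto _ : state) {
  benchmark::DoNotOptimize(linear_search(v, n));
}
```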
The boolean value returned from our search functions is wrapped inside benchmark::DoNotOptimize(). This is the mechanism used to ensure that the returned value is not optimized away, which could make the entire call to linear_search() disappear.
Now let's make this benchmark a little more interesting by varying the input size. We can do that by passing arguments to our benchmarking function using the state object. Here is how to do it:
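A sketch of the parameterized version follows (names and helpers are assumptions, as before; links against Google Benchmark). The input size is read from the state object with state.range(0), and the sizes to run are registered on the benchmark:

```cpp
#include <benchmark/benchmark.h>
#include <numeric>
#include <vector>

bool linear_search(const std::vector<int>& vals, int key) {
  for (const auto& v : vals) {
    if (v == key) {
      return true;
    }
  }
  return false;
}

auto gen_vec(int n) {
  auto v = std::vector<int>(n);
  std::iota(v.begin(), v.end(), 0);
  return v;
}

static void bm_linear_search(benchmark::State& state) {
  auto n = static_cast<int>(state.range(0));  // input size supplied below
  auto v = gen_vec(n);
  for (auto _ : state) {
    benchmark::DoNotOptimize(linear_search(v, n));
  }
}
// Start at 64 and double until 256: runs with n = 64, 128, 256.
BENCHMARK(bm_linear_search)->RangeMultiplier(2)->Range(64, 256);

BENCHMARK_MAIN();
```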
This will start with an input size of 64 and double the size until it reaches 256. On our machine, the test generated the following output:
As a final example, we will benchmark both the linear_search() and the binary_search() functions using a variable input size and also trying to let the framework estimate the time complexity of our functions. This can be done by providing the input size to the state object using the SetComplexityN() function. The complete microbenchmark example looks like this:
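A sketch of that complete program follows. The benchmark names, the helper functions, and the size range 64 to 4096 are assumptions; the call state.SetComplexityN(n) records the input size, and ->Complexity() asks the framework to fit a big-O curve to the measurements:

```cpp
#include <algorithm>
#include <benchmark/benchmark.h>
#include <numeric>
#include <vector>

bool linear_search(const std::vector<int>& vals, int key) {
  for (const auto& v : vals) {
    if (v == key) {
      return true;
    }
  }
  return false;
}

bool binary_search(const std::vector<int>& vals, int key) {
  return std::binary_search(vals.begin(), vals.end(), key);
}

auto gen_vec(int n) {
  auto v = std::vector<int>(n);
  std::iota(v.begin(), v.end(), 0);
  return v;
}

static void bm_linear_search(benchmark::State& state) {
  auto n = static_cast<int>(state.range(0));
  auto v = gen_vec(n);
  for (auto _ : state) {
    benchmark::DoNotOptimize(linear_search(v, n));
  }
  state.SetComplexityN(n);  // lets the framework estimate the complexity
}

static void bm_binary_search(benchmark::State& state) {
  auto n = static_cast<int>(state.range(0));
  auto v = gen_vec(n);
  for (auto _ : state) {
    benchmark::DoNotOptimize(binary_search(v, n));
  }
  state.SetComplexityN(n);
}

BENCHMARK(bm_linear_search)->RangeMultiplier(2)->Range(64, 4096)->Complexity();
BENCHMARK(bm_binary_search)->RangeMultiplier(2)->Range(64, 4096)->Complexity();

BENCHMARK_MAIN();
```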
When executing the benchmark, the performance results may vary on each run due to a number of factors, such as system load, memory allocation, and other concurrent processes.
The output aligns with our initial results, where we concluded that the algorithms exhibit linear runtime and logarithmic runtime, respectively. If we plot the values, we can clearly see the linear and logarithmic growth rates of the functions.
The following figure was generated using Python with Matplotlib:
We now have a lot of tools and insights for finding and improving the performance of our code. We cannot stress enough the importance of measuring and setting goals when working with performance.