String Reconstruction With Overlap Graph: The Genome Vanishes
Learn to construct and interpret overlap graphs from collections of k-mers, a crucial step in genome assembly. This lesson guides you through understanding how reads overlap by suffix and prefix matching, enabling reconstruction of genome sequences even when direct paths seem to vanish.
We'll cover the following...
The genome can still be traced out in the graph in the
STOP and Think: Can any other strings be reconstructed by following a path visiting all the nodes in the above figure?
Finding genome path
The genome path may have disappeared to the naked eye, but it must still be there, since we’ve simply rearranged the nodes of the graph. Indeed, the figure below (top) highlights the genome path spelling out TAATGCCATGGGATGTT. However, if we had given you this graph to begin with, you would’ve ...