Feature #8: Similarity Measure Between DNA Samples

Explore how to measure similarity between two DNA samples by calculating the minimum number of edits needed to transform one sequence into another. Understand the dynamic programming approach including base cases and recursive solutions to effectively solve this computational biology problem.

We'll cover the following...

Description
Solution
Complexity measures

Time complexity
Space complexity

Description

The DNA of an alien species consists of a sequence of nucleotides, where each nucleotide is represented by a letter. We received two such DNA samples, and we need to measure the extent of similarity—also known as the edit distance—between them. The prevalent measure of similarity between these two samples is the minimum number of edits that are required to convert one DNA sample to the other.

Note: We are only permitted to insert, delete, or update a nucleotide in a DNA sample.

Given two DNA samples as strings, sample1 and sample2, we have to return the minimum number of operations that are required to convert sample1 to sample2.

The following examples may clarify this problem:

Solution

We’ll compare the sample1 string with the sample2 string, one character at a time. If the characters at the current position match, then no edit operation will be required.

On the other hand, if the characters at the current position in the two strings don’t match, we can perform one of the following three operations, whichever will result in the fewest edit operations:

Insert a character to sample1 at the current position.
Delete the character in sample1 at the current position.
Replace the character in sample1 at the current position with the character at the current position in sample2.

The choice shown above can’t be made on the basis of local ...

1.✨Getting Started

2.Netflix

3.Facebook

4.Search Engine

5.Google Calendar

6.Stock Scraper

7.UBER

8.Amazon

9.Zoom

10.Plagiarism Checker

11.Network

12.Cyber Security

13.Operating System

14.Language Compiler

15.Boggle

16.Scrabble 2.0

17.Game

18.Stocks

19.Computational Biology

20.Cellular Operator(AT&T)

21.Twitter

22.Trees

23.Miscellaneous

24.Conclusion

Feature #8: Similarity Measure Between DNA Samples

Description

Solution