Function Repository Resource:

AlignNearlyIdenticalSequences

Align sequences known to be nearly identical

Contributed by: John Cassel, Wolfram|Alpha Scientific Content

ResourceFunction["AlignNearlyIdenticalSequences"][seq₁,seq₂]

create an alignment between two strings or biomolecular sequences seq₁ and seq₂.

Details and Options

The "UniqueSubsequenceLength" option asserts how long a contiguous subsequence must be to be different than all other contiguous subsequences of the same length. The default value is 1000.

The result of this function is an alignment given as a list of successive matching and differing sequences. See SequenceAlignment for further documentation and examples.

The alignment generated by this method may not be optimal, though it should always be correct if the subsequence length parameter is set appropriately.

Intended for similar sequences as are typically found in organisms of the same species.

Examples

Basic Examples (1)

Find an alignment for nearly identical sequences:

In[1]:=

Out[1]=

Scope (1)

This function is suitable for aligning biomolecular sequences:

In[2]:=

Out[2]=

Properties and Relations (1)

This function can be used with the AlignmentToPositionDifferences resource function to produce manageable differences between quite large but nearly identical sequences:

In[3]:=