Assembling genomes: Transforming Read-Pairs to Long Virtual Reads
Let’s see how read-pairs can be transformed into long virtual reads, and how a de Bruijn graph is constructed from those long virtual reads.
Constructing de Bruijn graph from long reads
Let Reads be the collection of all 2N k-mer reads taken from N read-pairs. Note that a read-pair formed by k-mer reads Read1 and Read2 corresponds to two edges in the de Bruijn graph DeBruijn_k(Reads). Since these reads are separated by distance d in the genome, there must be a path of length k + d + 1 in DeBruijn_k(Reads) connecting the node at the beginning of the edge corresponding to Read1 with the node at the end of the edge corresponding to Read2, as shown in the figure below. The path has k + d + 1 edges because a string of length 2 · k + d contains exactly (2 · k + d) − k + 1 = k + d + 1 consecutive k-mers. If there’s only one path of length k + d + 1 connecting these nodes, or if all such paths spell out the same string, then we can transform a read-pair formed by reads Read1 and Read2 into a virtual read of length 2 · k + d that starts as Read1, spells out this path, and ends with Read2.
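The transformation described above can be sketched in Python. This is a minimal illustration rather than a reference implementation: the function names `de_bruijn` and `virtual_read` are hypothetical, the graph is stored as a plain adjacency map, and the search enumerates every path of exactly k + d + 1 edges, which may become expensive on highly repetitive graphs. The sketch also assumes that only paths whose first edge is Read1 and whose last edge is Read2 should count, so that the result starts as Read1 and ends with Read2.

```python
from collections import defaultdict


def de_bruijn(reads):
    """Build a de Bruijn graph: each k-mer read is an edge from its
    (k-1)-mer prefix to its (k-1)-mer suffix (hypothetical helper)."""
    graph = defaultdict(set)
    for read in reads:
        graph[read[:-1]].add(read[1:])
    return graph


def virtual_read(graph, read1, read2, d):
    """Return the virtual read of length 2*k + d for a read-pair, or None
    if no path exists or the paths spell different strings (a sketch)."""
    k = len(read1)
    n_edges = k + d + 1                    # path length stated in the text
    start, end = read1[:-1], read2[1:]     # first node of Read1, last node of Read2
    spelled = set()
    stack = [(start, start, 0)]            # (current node, string so far, edges used)
    while stack:
        node, text, used = stack.pop()
        if used == n_edges:
            # Assumption: keep only paths that begin with Read1 and end with Read2.
            if node == end and text.startswith(read1) and text.endswith(read2):
                spelled.add(text)
            continue
        for nxt in graph.get(node, ()):
            stack.append((nxt, text + nxt[-1], used + 1))
    # Transform the pair only when every qualifying path spells the same string.
    return spelled.pop() if len(spelled) == 1 else None


# Usage: all (3,1) read-pairs from a small example genome.
genome, k, d = "TAATGCCATGGGATGTT", 3, 1
pairs = [(genome[i:i + k], genome[i + k + d:i + 2 * k + d])
         for i in range(len(genome) - (2 * k + d) + 1)]
reads = [read for pair in pairs for read in pair]   # the 2N reads
graph = de_bruijn(reads)
print(virtual_read(graph, "TAA", "GCC", d))         # TAATGCC
```

When `virtual_read` returns `None`, the read-pair is left untransformed, since the graph alone does not determine a single string between the two reads.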