Once we have a read sequence, we search the entire genome for matching sequences. Genomes are often billions of letters long, so we use specialized algorithms to quickly and efficiently search for matches. Once we know where in the genome a read aligns, then we can determine which gene it came from.

Basic alignment figure