Analysis of Sequence Composition
This class will focus on aspects of sequence (primarily genomic) composition and problems related to its analysis, which will serve as an introductory "warm-up" to algorithms. We will be discussing the problem of calculating the GC% of a given sequence, its comparison to that of another species and ways to detect outlier regions in terms of genomic in a given genome.
Algorithms that we will be discussing include:
a.calculating the GC% of a DNA sequence
b. the computation of k-mer frequencies from a DNA sequence
c. locating candidate regions of horizontal gene transfer based on k-mer frequencies
d. the definition of a sequence's origin of replication based on sequence composition.