UT Dallas 2015 Graduate Catalog

CS6365 - Data and Text Mining for Computational Biology

CS 6365 Data and Text Mining for Computational Biology (3 semester credit hours) The course introduces data and text mining as practiced currently in the bioinformatics field. Major topics include: sequence alignment for determining similarity between proteins and genes; properties of similarities and distances; genomic, proteomic, and text databases in the real world; finding patterns (motifs) in genes and proteins; differentiating between valid patterns and noise; classification; clustering and its application to phylogenetic trees; and selected topics from text mining. Prerequisite: CS 6325. (3-0) Y