671 - Data Mining: Methods and Applications
With the explosive growth of information generated from different sources, in a variety of formats, and with various qualities, information analysis has become challenging for researchers in many disciplines. Automatic, robust, and intelligent data mining techniques have become essential tools to handle heterogeneous, noisy, nontraditional, and large-scale data sets. This is a seminar course of advanced topics in data mining, the state-of-the-art methods to analyze different genres of information, and the applications to many real world problems. The course will highlight the practical applications of data mining instead of the theoretical foundations of machine learning and statistical computing. The course materials will focus on how the information in different real world problems can be represented as particular genres, or formats of data, and how the basic mining tasks of each genre of data can be accomplished using the state-of-the-art techniques. To this end, the course is suitable for those who are consumers of data mining techniques in their own disciplines, such as natural language processing, networks science, human computer interaction, economics, social computing, sociology, business intelligence, and biomedical informatics, etc.
- Intermediate Python Programming
- [SI 507 or waiver or 508] and [SI 544 or waiver or BIOSTAT 501 or 521 or 601] and [SI 618 or waiver]; (C- or better) or Graduate standing in Applied Statistics, Master's in Data Science, or Electrical Engineering & Computer Science