HDS 5300 Principles of Data Science
This course is an introduction to data science. Data science is the study of data to extract knowledge and meaningful insights from noisy data. It is an emerging interdisciplinary field that uses techniques and theories from mathematics, statistics, computer sciences, and domain knowledge to analyze large amounts of data. The objective of the course is to equip students with the necessary problem-solving skills and computational thinking required for data science. The course covers the entire data science pipeline, starting from data acquisition, data cleanup, data exploration and visualization, modeling and inference, and professional reporting. The course will teach fundamental concepts, methods, and tools in data science, with a focus on how to apply these methods in health science and biomedical research.