Course Description

The course introduces students to applied examples of data collection, processing, transformation, management, and analysis to provide students with hands-on introduction to data science experience. Students will explore key concepts related to data science, including applied statistics, information visualization, text mining and machine learning. “R”, the open source statistical analysis and visualization system, will be used throughout the course. R is reckoned by many to be the most popular choice among data analysts worldwide; having knowledge and skill with using it is considered a valuable and marketable job skill for most data scientists. 



Professor of Record

Jeffrey Saltz


Learning Objectives

After taking this course, students will be able to:

  1. Understand essential concepts and characteristics of data. 
  2. Understand scripting/code development for data management using R and R-Studio.
  3. Understand principles and practices in data screening, cleaning, and linking.
  4. Understand communication of results to decision makers. 
  5. Identify a problem and the data needed for addressing the problem. 
  6. Perform basic computational scripting using R and other optional tools.  
  7. Transform data through processing, linking, aggregation, summarization, and searching. 
  8. Organize and manage data at various stages of a project life-cycle.
  9. Determine appropriate techniques for analyzing data. 

Course Syllabus

IST 687 Fall 2020 Syllabus - Jeffrey Saltz

IST 687 Spring 2021 Syllabus - Stephen Wallace

Other iSchool Courses