School of Information Studies
Page tree

Course Description

The course introduces students to fundamentals about data and the standards, technologies, and methods for organizing, managing, curating, preserving, and using data. It discusses broader issues relating to data management, ethics, quality control and publication of data. Applied examples of data collection, processing, transformation, management, and analysis as well as a hands-on introduction to the emerging field of data science are provided. Students will explore key concepts related to data science, including applied statistics, information visualization, text mining and machine learning. "R", the open source statistical analysis and visualization system, will be used throughout the course. R is reckoned by many to be the most popular choice among data analysts worldwide; having knowledge and skill with using it is considered a valuable and marketable job skill for most data scientists.



Professor of Record

Zhasmina Tacheva


Undergraduate students.

Learning Objectives

After taking this course, students will be expected to understand:

  1. Essential concepts and characteristics of data
  2. The purpose of scripting for data management using R and R-Studio
  3. Principles and practices in data screening, cleaning, linking, and visualizations
  4. The importance of clear communication of results to decision-makers

After taking this course, students will be able to:

  1. Identify a problem and the data needed for addressing the problem.
  2. Perform basic computational scripting using R and other optional tools.
  3. Transform data through processing, linking, aggregation, summarization, and searching.
  4. Organize and manage data at various stages of a project life cycle.
  5. Determine appropriate techniques for analyzing data.

Course Syllabus

IST 387 Fall 2020 Syllabus- Tacheva, Zhasmina

Other iSchool Courses