Intro to Data Science

CSC 599.70 · Fall 2018 · City College of New York


Instructor Grant Long
Course Email itds.ccny@gmail.com
Lecture Monday 6:30pm-9:00PM, NAC 5-110 / NAC 7106: Linux Lab
Credits & Hours 3 credits, 3 hours
Office Hours Varies, by appointment
Syllabus Available here

Resources

The following resources will be crucial to the class. All are available to students at no cost.

About This Course

This course consists of a survey of analytical tools and concepts in data science, with goal of equipping students with an understanding of the best practices used by professional data scientists and analysts in top companies in technology, finance, and media. The course begins with an overview of fundamentals in data handling and exploratory data analysis, followed by an introduction to core concepts in statistical modeling and machine learning, and concludes with a brief introduction advanced concepts in data science.

Students will work with a wide variety of real world data sets throughout the course in order to gain hands on experience. Emphasis will be placed on frequent practice through writing and reviewing code each week. In addition, students will be assigned and expected to discuss short reading assignments ranging from academic reviews of popular topics in analytics as well as data science and engineering blog posts from companies such as Airbnb, Spotify, and Facebook. Tasks and readings will aim to demystify the work of data teams in the real world, and familiarize students with the concepts and resources needed to secure and succeed in analytical roles.

Grading

    Project (40%) + Homework & Quizzes (30%) + Midterm Exam (20%) + Participation (10%)

    • Project. The bulk of the course grade will be a group project that will be due in December (exact date TBD). Students will be expected to work on the project during the second half of the class and will be required to present their progress throughout the course of the semester. Grades will be assigned on the basis of overall project quality, demonstration of core principles taught in the class, and individual contributions to the group's effort. Details on the project are available here.
    • Assigments. This class includes short, frequent assignments to check comprehension. All assignments and quizzes will be graded on a 5-point scale. All quizzes will be announced in advance of class.
      • No late assignments accepted. Assignments not turned in by the set deadline will be scored as 0/5. Exceptions will be granted only as mandated by CUNY policy.
      • Worst two assignments dropped, includes missed assignments.
    • Exam. A short midterm exam will be held in October and will focus on broad concepts the course has surveyed thus far. The format will mimic the style of questions frequently asked in interviews for data-related roles.
    • Participation. Students are expected to attend class and be active participants in discussion. This includes, but is not limited to, discussing assigned readings and videos and sharing ideas during classroom exercises.
Important Dates
  • Project Teams Formed, October 1.
  • Project Proposals Due via Email, October 19.
  • Midterm Exam (45 min) Monday, October 22nd.
  • First Project Update, November 5.
  • Second Project Update, November 26.
  • Projects Due Finals period.
Recommended Texts and Materials
  • Required Text: Data Science from Scratch, Joel Grus. 2nd Edition, April 2015 (O'Reilly). Available online.
  • Additional required readings and videos will be made available to students in advance of each week's assignments. All will be availble online at no cost.
  • In addition to the required materials, students may find the following resources helpful in supplementing course materials:
    • Recommended Text: Python for Data Analysis, Wes McKinney. 2nd Edition, October 2017 (O'Reilly). Available online.
    • Recommended Text: Elements of Statistical Learning, Trevor Hastie, Robert Tibshirani and Jerome Friedman. 2nd Edition, 2009 (Springer). Available free online here.

The CUNY Policy on Academic Integrity governs behavior in this class. Academic dishonesty is prohibited in the City University of New York and is punishable by penalties, including failing grades, suspension, and expulsion.


Schedule