Course Intro: What is Data Science and Why Does It Matter?
September 5, 2019
What is Data Science?
Is that helpful?
Who are Data Scientists?
About Me
About Tech in Residence
About this Course
grantmlong.com/teaching
Official Course Objectives
Explain the key steps in a data science project.
Apply Python to load, clean, and process data sets.
Identify key elements of and patterns in a data set using computational analysis and statistical methods.
Explain and visualize empirical findings using Python and other resources.
Explain fundamental principles of machine learning.
Apply predictive algorithms to a data set.
Work effectively in a team dedicated to analyzing data.
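As a rough illustration of what several of these objectives look like in practice, the sketch below loads a tiny data set, cleans it, and fits a simple predictive line using only the Python standard library. The data values, column names, and function names are all hypothetical and made up for this example; the course itself may use different tools and data.

```python
import csv
import io
import statistics

# Hypothetical inline data standing in for a real file (values are invented).
RAW_CSV = """name,hours_studied,score
Ana,2,65
Ben,4,
Cal,6,85
Dee,8,95
"""


def load_rows(text):
    """Load: parse CSV text into a list of dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))


def clean_rows(rows):
    """Clean: drop rows with a missing score and convert strings to numbers."""
    cleaned = []
    for row in rows:
        if not row["score"]:
            continue  # skip incomplete records
        cleaned.append({
            "name": row["name"],
            "hours": float(row["hours_studied"]),
            "score": float(row["score"]),
        })
    return cleaned


def fit_line(xs, ys):
    """Predict: fit a least-squares line y = slope * x + intercept."""
    mean_x = statistics.mean(xs)
    mean_y = statistics.mean(ys)
    slope = (
        sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
        / sum((x - mean_x) ** 2 for x in xs)
    )
    intercept = mean_y - slope * mean_x
    return slope, intercept


rows = clean_rows(load_rows(RAW_CSV))
hours = [r["hours"] for r in rows]
scores = [r["score"] for r in rows]

# Identify: a simple summary statistic of the cleaned data.
print("Mean score:", statistics.mean(scores))

# Predict: estimate the score for a student who studies 4 hours.
slope, intercept = fit_line(hours, scores)
print("Predicted score for 4 hours:", slope * 4 + intercept)
```

The steps are kept as separate functions so that each one (load, clean, predict) can be checked on its own, which mirrors how a project is typically broken down.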
Why Take this Course?
Careers in data are abundant, lucrative, and rewarding.
The bulk of the course grade will be a group project that will be due in advance of the last class on December 9. Students will be expected to work on the project during the second half of the class and will be required to present their progress throughout the course of the semester. Grades will be assigned on the basis of overall project quality, demonstration of core principles taught in the class, and individual contributions to the group's effort. More details on the project will be discussed in the second week of class.
Assignments and Exams
Assignments. This class includes frequent assignments to encourage mastery of basic concepts and check comprehension, predominantly through DataCamp. All assignments and quizzes will be graded on a 10-point scale. All quizzes will be announced in advance of class.
Assignments not turned in by the set deadline are eligible to be completed for half credit by the final class on December 9. Exceptions will be granted only as mandated by CUNY policy.
Exam. A short midterm exam will be held in November and will focus on broad concepts the course has surveyed thus far. The format will mimic the style of questions frequently asked in interviews for data-related roles.
Texts and Materials
Required Text: Data Science from Scratch, Joel Grus. 2nd Edition, May 2019 (O'Reilly). Available online.
Additional required readings and videos will be made available to students in advance of each week's assignments. All will be available online at no cost.
In addition to the required materials, students may find the following resources helpful in supplementing course materials:
Recommended Text: Foundations of Data Science, Avrim Blum, John Hopcroft, and Ravindran Kannan. January 2018. Available free online.
Recommended Text: Elements of Statistical Learning, Trevor Hastie, Robert Tibshirani, and Jerome Friedman. 2nd Edition, 2009 (Springer). Available free online.
Recommended Text: Python for Data Analysis, Wes McKinney. 2nd Edition, October 2017 (O'Reilly). Available online.
Cheating
Academic dishonesty is prohibited in The City University of New York. Penalties for academic dishonesty include academic sanctions, such as failing or otherwise reduced grades, and/or disciplinary sanctions, including suspension or expulsion.