City College, Fall 2019
Intro to Data Science
Week 14: Life in Data
December 9, 2019
Today's Agenda
- Project Debrief
- What to Expect in an Interview
- What to Look for in a Data Job
- Ethical Considerations in Data
Projects: Results from Submission 2
Rank | Team Name | MSE | Points | Median Error | Share under 10% |
1 | GIN | 435,192 | +12 | $196 | 65.6% |
2 | Science Data | 488,104 | +12 | $215 | 60.7% |
3 | EDS | 545,513 | +8 | $249 | 55.2% |
4 | The Data Scientists | 548,583 | +8 | $223 | 59.9% |
5 | GodZillow | 582,126 | +4 | $251 | 54.9% |
6 | The Divers | 678,275 | +4 | $325 | 45.7% |
7 | Datalicious | 699,689 | +4 | $201 | 62.8% |
- | Demo Model | 1,669,750 | - | $366 | 42.2% |
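The three metrics in the table can be computed along these lines. This is a minimal sketch, not the course's actual scoring script: the function name `evaluate` and the sample values are hypothetical, and it assumes "Share under 10%" means the share of predictions whose absolute error is below 10 percent of the actual value.

```python
from statistics import mean, median


def evaluate(actual, predicted):
    """Return (mse, median_abs_error, share_under_10pct) for paired lists."""
    if len(actual) != len(predicted) or not actual:
        raise ValueError("actual and predicted must be non-empty and equal length")
    errors = [p - a for a, p in zip(actual, predicted)]
    # Mean squared error: large misses dominate this number.
    mse = mean(e * e for e in errors)
    # Median absolute error: the typical miss, robust to a few large ones.
    median_abs_error = median(abs(e) for e in errors)
    # Assumption: "under 10%" means absolute error below 10% of the actual value.
    within = sum(1 for a, e in zip(actual, errors) if abs(e) < 0.10 * abs(a))
    return mse, median_abs_error, within / len(actual)


# Hypothetical dollar amounts, for illustration only.
actual = [2000, 3000, 1500, 4000]
predicted = [2100, 2900, 1800, 4000]
mse, med, share = evaluate(actual, predicted)
print(mse, med, share)
```

Because MSE squares each error, a handful of large misses can push a team down the MSE ranking even when its median error is low, which is one way a team can rank last on MSE with a median error near the top.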
Successful Strategies
- Simple models can work well, but adding data helps: every team in the table beat the demo model's MSE of 1,669,750
- Nonlinear models outperformed linear models
- Building details were helpful: ~35 percent of buildings in test had units in train
- Monolithic models rule
- More data would have been helpful...
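The building-overlap figure above can be checked with a few lines of code. This is a sketch under assumptions: the function name `share_seen_in_training` and the building IDs are hypothetical, and it assumes each unit carries a building identifier and that the share is taken over distinct test buildings.

```python
def share_seen_in_training(train_buildings, test_buildings):
    """Share of distinct test buildings that also appear in the training set."""
    train_set = set(train_buildings)
    test_unique = set(test_buildings)
    if not test_unique:
        raise ValueError("test_buildings must not be empty")
    return len(test_unique & train_set) / len(test_unique)


# Hypothetical building IDs, one entry per unit, for illustration only.
train = ["b1", "b1", "b2", "b3"]
test = ["b2", "b4", "b5", "b2"]
overlap = share_seen_in_training(train, test)
print(f"{overlap:.1%} of test buildings have units in train")
```

A high overlap means building-level features learned from training units can carry over directly to test units in the same building.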
What to Expect in an Interview
My Interviews
- Recruiter screen
- Semi-technical phone screen
- Coding exercise
- Onsite with the team
Tips
- Know the basics
- Sweat the details
- Be resourceful
- Show your passion
What to Look for in a Data Job
How good is their data? How much of your time will be spent cleaning it?
Are they looking for a data [scientist/engineer/analyst] or a wizard?
Who is your boss? Is it someone you can learn from?
What kind of technical resources do you have? (Computer, installation rights, cloud resources, visualization software)
Do they use the latest and greatest in open source software?
How seriously do they take recruiting?
You can do amazing things with data!
Should you?
Major Ethical Issues in Data
- Storage and Handling of Sensitive Information
- Bias in Algorithms
- Consent to Share Data
- Using Technology to Circumvent Law
- Bullsh!t
You can do amazing things with data!
You should.