Welcome to data science's dirty secret: real-world data is messy.
Data scientists must spend a good deal of time playing software developer, writing code to clean up data before they can actually do anything constructive with it.
This is a necessary evil, but we can still make the most of it.