In this big data age, everyone is required to have some data sense. Statistics as a science of data, becomes more and more important nowadays. Harvard’s Statistics 101 is a good starting point.
Among the three, which one is the kernel?
As the new professional data scientist emerges, many statisticians begin to wonder what is the difference between a data scientist and a statistician. I came across the data science certificate at Havard Extension School at
It is clear that data scientist needs to know some statistical techniques as well as compute database knowledge. Statistics conbsists of descriptive and inferential statistics. The descriptive statistics includes visualization and summary statistics, while data science is mainly statistics via visualization, association study and making insightful discovery of the data. The main challenge is the high dimension and hence dimension reduction is the main issue.