It’s a Whole New Data Game for Business - WSJ
Feb. 9, 2015 | WSJ |

opportunistic data collection is leading to entirely new kinds of data that aren’t well suited to the existing statistical and data-mining methodologies. So point number one is that you need to think hard about the questions that you have and about the way that the data were collected and build new statistical tools to answer those questions. Don’t expect that the existing software package you have is going to give you the tools you need....Point number two is having to deal with distributed data....What do you do when the data that you want to analyze are actually in different places?

There are lots of clever solutions for doing that. But at some point, the volume of data is going to outstrip the ability to do that. You’re forced to think about how you might, for example, reduce those data sets, so that they’re easier to move.
data  data_collection  datasets  data_mining  massive_data_sets  distributed_data  haystacks  questions  tools  unstructured_data 
february 2015 by jerryking
