Big data, cows and cadastres
Jul 5, 2012 | KMWorld Magazine July/August 2012 [Vol 21, Issue 7] | by Stephen E. Arnold

The hero of the story is a bull named Badger-Bluff Fanny Freddie. Dairy cattle sired by him yield more milk. Genetic information processed by sophisticated numerical recipes yields more efficiency. With Badger-Bluff Fanny Freddie, the dairy industry has an opportunity to convert big data into more milk per head. The knowledge generated by big data analytics therefore translates directly into money.

The article explained: "Dairy breeding is perfect for quantitative analysis. Pedigree records have been assiduously kept; relatively easy artificial insemination has helped centralize genetic information in a small number of key bulls since the 1960s; there are a relatively small and easily measurable number of traits—milk production, fat in the milk, protein in the milk, longevity, udder quality—that breeders want to optimize; each cow works for three or four years, which means that farmers invest thousands of dollars into each animal, so it's worth it to get the best semen money can buy. The economics push breeders to use the genetics."

...The IBM approach is to understand the prospect's or customer's problem, develop a plan of action and then assemble the solution from the components in IBM's toolbox.

...The only problem is that the user-friendly system assumes that the marketing manager understands sample size, the strengths and weaknesses of specific statistical methods and the output itself. Eye-catching graphics are not the same as statistically valid data.

The challenges

The problem in those two examples boils down to people. There is a shortage of staff with big data and analytics skills. The problem is not local; it is global. The volume of data and the need to exploit it are growing faster than the pool of people able to use the sophisticated, increasingly user-friendly systems. Kolmogorov worked with pen and paper. He could have tapped into today's powerful systems because he had the mathematical expertise required to tame big data. Using a mouse is the trivial part of figuring out cow genetics.
