recentpopularlog in

mcherm : mapreduce   4

Bloom Filters - optimizing network bandwidth: Ruminations of a Programmer
Another great idea: transfer a Bloom filter over the network and use it to pre-filter the data that's going to be merged together (perhaps when doing a map-reduce for instance). Then you can send a much smaller amount of data over the network to do the actual merge.
bloomfilter  programming  mapreduce  datastructures  via:DebasishGhosh  DebasishGhosh  concurrency 
march 2009 by mcherm
A Scalable Language, and a Scalable Framework: Scala Blog
This is an excellent example of how a more powerful language makes something FAR easier to program and use. The example compares Scala and Java using the same Java library for doing map-reduce.
mapreduce  programming  scala  java  hadoop  languagedesign  languages 
september 2008 by mcherm
Good Math, Bad Math : Databases are hammers; MapReduce is a screwdriver.
Some thoughts on why the map-reduce tool is not the same as a relational database.
mapreduce  parallelprogramming  programming 
january 2008 by mcherm
Welcome to Hadoop!
An implementation of Google's map-reduce (and also their distributed file system) that is open source.
hardware  mapreduce  google  scalability  programming  library  filesystem 
january 2008 by mcherm

Copy this bookmark:

to read