Josh Wills offers pointers on setting up a case-control study with Hadoop and notes that a toolkit for constructing case-control studies is available on Cloudera’s github repository, released under the Apache License.