Setting up an AWS Cluster to use snow in R

Setting up AWS Cluster

I wanted to set up an AWS cluster to take a shot at a Kaggle contest, the dunnhumby Challenge:

http://www.kaggle.com/c/dunnhumbychallenge

For this, I found StarCluster to be a great help. It lets you set up AWS nodes with a few lines of configuration, and it also handles things like choosing AMIs and cluster configurations.

http://web.mit.edu/stardev/cluster/

Make sure you use the Bioconductor AMI, which comes bundled with R and a host of preinstalled packages.

http://www.bioconductor.org/help/bioconductor-cloud-ami/

I used the “snowfall” package, a wrapper around snow, for parallel processing.
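
As a rough illustration of what the snowfall side can look like, here is a minimal sketch rather than the exact code I ran. The `hosts` vector and the `square` function are hypothetical placeholders. StarCluster typically names its nodes `master`, `node001`, and so on, but check your own cluster before relying on that. With `hosts` left as `NULL`, the sketch runs on two local worker processes so it can be tried without a cluster.

```r
library(snowfall)

# Hypothetical host list (an assumption, not from my actual setup).
# On a StarCluster cluster this might be c("master", "node001", "node002").
# NULL means: start the workers on the local machine instead.
hosts <- NULL

# Use one worker per listed host, or two local workers when no hosts are given.
n_workers <- if (is.null(hosts)) 2 else length(hosts)

# Start a socket cluster; socketHosts = NULL falls back to localhost.
sfInit(parallel = TRUE, cpus = n_workers, type = "SOCK", socketHosts = hosts)

# Toy workload standing in for a real per-item computation.
square <- function(x) x^2

# Apply the function to each input in parallel and simplify to a vector.
result <- sfSapply(1:8, square)

# Always shut the workers down when finished.
sfStop()

print(result)
```

For real work, objects and packages the workers need have to be sent over explicitly with `sfExport()` and `sfLibrary()` before calling `sfSapply()` or `sfLapply()`.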

Relevant Stack Overflow questions I asked:

http://stackoverflow.com/questions/7241244/using-aws-for-parallel-processing-with-r

http://stackoverflow.com/questions/7333801/using-snow-and-snowfall-with-aws-for-parallel-processing-in-r


About indiacrunchin
Any sufficiently advanced technology is indistinguishable from magic -- Arthur C. Clarke
