My talk for the San Diego Data Science meetup:


  • Setting up StarCluster to launch EC2 instances
  • Running IPython Notebook on Amazon EC2
  • Running single node Machine Learning jobs using multiple cores
  • Distributing jobs with IPython parallel to multiple EC2 instances
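
As a rough illustration of the single-node, multi-core item above, here is a minimal sketch that is not taken from the talk materials. It spreads a toy hyperparameter search across local cores using only the Python standard library. The names `evaluate` and `grid_search`, the synthetic data, and the one-weight ridge model are all assumptions made up for this example.

```python
from concurrent.futures import ProcessPoolExecutor
import random


def evaluate(alpha):
    """Score one regularization value on a toy one-weight ridge regression."""
    rng = random.Random(0)  # fixed seed: every worker rebuilds the same data
    xs = [rng.uniform(-1, 1) for _ in range(200)]
    ys = [3.0 * x + rng.gauss(0, 0.1) for x in xs]
    train_x, test_x = xs[:150], xs[150:]
    train_y, test_y = ys[:150], ys[150:]
    # Closed-form ridge solution for a single weight, no intercept.
    w = sum(x * y for x, y in zip(train_x, train_y)) / (
        sum(x * x for x in train_x) + alpha
    )
    mse = sum((w * x - y) ** 2 for x, y in zip(test_x, test_y)) / len(test_x)
    return alpha, mse


def grid_search(alphas, map_fn=map):
    """Evaluate every alpha through map_fn; return the (alpha, mse) pair with lowest error."""
    return min(map_fn(evaluate, alphas), key=lambda pair: pair[1])


if __name__ == "__main__":
    alphas = [0.0, 0.01, 0.1, 1.0, 10.0, 100.0]
    # By default the pool sizes itself from the number of available CPUs.
    with ProcessPoolExecutor() as pool:
        best_alpha, best_mse = grid_search(alphas, map_fn=pool.map)
    print(best_alpha, best_mse)
```

Because `grid_search` only needs a map-like callable, the same function could in principle be handed a distributed map, for example the `map_sync` method of an IPython parallel view, to cover the multi-instance case. That wiring is not shown here and would depend on the cluster setup.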

  • See HTML5 slides:

  • See the IPython notebook sources of the slides:

Finally, the GitHub repository with additional material, under the MIT license:

Any feedback is appreciated, via Google+, Twitter, or email.