News from Mar 14, 2012
The grant will enable us to use the Amazon Elastic Compute Cloud (Amazon EC2) infrastructure to:
The grant translates into approximately 25,000 machine/hours on EC2. One Web Data Commons extraction run consumes around 3,500 machine/hours, which means that we are now well prepared for the next versions of the Common Crawl corpus which are scheduled to be published at regular intervals during 2012.
Thank you, AWS!