Previously, we posted some information on Amazon’s foray into making huge public data sets available to users of their web services. Yesterday they announced the addition of some very sizable additions:
- US Bureau of Transportation Statistics
- DBpedia Knowledge Base (67 GB)
- Freebase Data Dump (66 GB)
- Genbank Genetic Sequence Database(250 GB)
If you use AWS, the announcement provides more info on these datasets as well as how to access them. If you don’t use AWS, you can still access much of this data directly from the websites linked above.