| Long-time user of boto[1] here. It has been the go-to library for hooking your Python code into AWS, and it has a fairly active following on GitHub[2]. One thing I've found lacking in boto's API is a "sync" command for S3. Take a source directory and a target bucket and push up the differences à la rsync, that's the dream. Boto gives you the ability to push and get S3 resources, but I've had to write my own sync logic. So the first thing I went digging into was the S3 interface of the new CLI, and to my surprise, they've put a direct sync command on the interface[3], huzzah! Their implementation is a little wacky though. Instead of comparing computed hashes, they rely on a combination of file modtimes and file size. Weird. Anyways, glad to see AWS is investing in a consistent interface to make managing their services easier. [1] http://boto.readthedocs.org/en/latest/index.html [2] https://github.com/boto/boto [3] http://docs.aws.amazon.com/cli/latest/reference/s3/sync.html... |
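For anyone curious what a modtime-plus-size comparison looks like, here is a rough sketch in plain Python. It is my reading of the approach described above, not the CLI's actual code. The names `scan_local` and `keys_to_upload` and the `(size, mtime)` tuple shape are made up for illustration, and the remote listing is assumed to have been fetched already (with boto or otherwise) into a dict of the same shape.

```python
import os
from typing import Dict, List, Tuple

# Hypothetical shape for illustration: (size in bytes, mtime as epoch seconds).
FileInfo = Tuple[int, float]


def scan_local(root: str) -> Dict[str, FileInfo]:
    """Walk a directory and map S3-style keys to (size, mtime)."""
    entries: Dict[str, FileInfo] = {}
    for dirpath, _dirs, files in os.walk(root):
        for name in files:
            path = os.path.join(dirpath, name)
            # Use forward slashes so keys look the same on every platform.
            key = os.path.relpath(path, root).replace(os.sep, "/")
            st = os.stat(path)
            entries[key] = (st.st_size, st.st_mtime)
    return entries


def keys_to_upload(local: Dict[str, FileInfo],
                   remote: Dict[str, FileInfo]) -> List[str]:
    """Return keys that are missing remotely, differ in size, or are newer locally."""
    todo: List[str] = []
    for key, (size, mtime) in sorted(local.items()):
        if key not in remote:
            todo.append(key)  # never uploaded
            continue
        remote_size, remote_mtime = remote[key]
        if size != remote_size or mtime > remote_mtime:
            todo.append(key)  # assumed rule: size mismatch or local copy is newer
    return todo
```

The weak spot is an edit that leaves the size unchanged and carries a modtime no newer than the remote copy; a hash comparison would catch that, at the cost of reading every file.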
One disappointing issue is that the listing process on CF is an order of magnitude faster than on S3.
Keep in mind that this is all being run from an EC2 box, so really, S3 should win hands down.