Building a better DynamoDB throughput scaling tool

I use DynamoDB, Amazon Web Services’ managed NoSQL data store. It’s a fantastic tool, where there’s basically no management needs placed on me. That’s with two exceptions: Backups Throughput scaling Backups is fine. I built a small tool that gets the latest changes and stores them elsewhere. Throughput scaling is a bit more involved so I… Continue reading Building a better DynamoDB throughput scaling tool

The economic scalability of DynamoDB

I’ve been using DynamoDB for a few months now after re-architecting a system which started becoming painful to scale on a traditional RDBMS system. The problem wasn’t necessarily read/write performance but rather the total storage space needed as a lot of “unstructured” blobs was stored in the DB. DynamoDB gives me a care free setup… Continue reading The economic scalability of DynamoDB

Space efficient Bloom filter index, 2x performance gain

To quote Wikipedia: A Bloom filter is a space-efficient probabilistic data structure that is used to test whether an element is a member of a set. This is often used by systems in order to avoid accessing slow media, like a disk. Take HBase or Cassandra for instance: Instead of reading their data files in… Continue reading Space efficient Bloom filter index, 2x performance gain