SPA Conference session: Distributed Databases | |||
| One-line description: | Exploring the main ideas behind distributed highly available databases. | ||
| Session format: | Presentation and practical. [read about the different session types] | ||
| Abstract: | We're going to explore the main ideas underpinning distributed, highly available databases - in particular the sorts of databases that are heavily inspired by the ideas in the Dynamo Paper (http://www.allthingsdistributed.com/files/amazon-dynamo-sosp2007.pdf) such as Riak and Cassandra. So we'll cover things like: * what availability and consistency are and how you can trade them off for each other * how to carry on reading and writing in the face of node failures * eventual consistency and repairs Then we'll try out these ideas by building our own simple distributed data store on top of a bunch of in-memory hash-maps, and see what happens when we wreak havoc by simulating node failures. | ||
| Audience background: | Developers who are interested in learning more about dynamo-like distributed databases. | ||
| Benefits of participating: | A familiarity with the main ideas underlying dynamo-like database distribution, that will come in handy when using or considering using these sorts of databases in the future. | ||
| Materials provided: | I'll provide links to the slides, and a github repo that has: a) an implementation of a hashmap wrapped in a simple HTTP API, which can be used as a single node in the distributed system b) a clear lists of tasks to build upon when creating the database yourself. (will include a hint sheet too - with code snippets for those who get very stuck) | ||
| Process: | First I'll present the ideas, and then I'll present how we're going to code our own version. This'll be outlined as a series of tasks that build upon each other. Then people can program on their own or in groups to get through the tasks, and I'll go around trying to help! | ||
| Detailed timetable: | Not sure how long a session should be. Maybe 45mins presentation, 15mins exercise explanation, 90-120mins for the exercise? | ||
| Outputs: | |||
| History: | None. I've presented on Cassandra before, including on the Dynamo paper. But this idea was generated just for SPA :-) | ||
| Presenters | |||
| 1. Emily Green SoundCloud |
2. | 3. | |