Resilient Distributed Datasets - A Fault-Tolerant Abstraction for In-Memory Cluster Computing
Druid - A Real-time Analytical Data Store
Mesos - A Platform for Fine-Grained Resource Sharing in the Data Center
Omega - flexible, scalable schedulers for large compute clusters
Large-scale cluster management at Google with Borg
Apache Hadoop YARN - Yet Another Resource Negotiator
Dominant Resource Fairness - Fair Allocation of Multiple Resource Types.pdf
Paxos Made Simple - Leslie Lamport (2001)
Impossibility of Distributed Consensus with One Faulty Process - Fischer, Lynch (1985)