layout	title	sched-activation
course	Week 5, Day 3 (Friday, February 7)	class="active"

In class---sharding and replication

Suppose our image processing application (Assignment 2) included its own image storage function rather than using Amazon S3. Some of the images on our site might become very popular (say, a striking photograph of a celebrity). When an image is popular, we want to retrieve it with low latency. We also want to ensure our images are retained even if there are disk crashes. (Recall that many data centers intentionally use disks with relatively low reliabilities.)

How would you use the approaches of sharding and replication (from Week 2 Wednesday) to decrease the latency of your image retrievals? For your sharding, how might you group images?

Implementation (from [You Are Building a Distributed System]({{ site.data.bibliography.cavage2013.url }}))

Build versus buy is not a simple choice

Testing is difficult

Transient bugs are common
Total network outage is rare---degraded performance is common
Drive the system at expected load
Induce failures (more on this in Week 8)

Operating the system requires careful monitoring (Week 9)

If it moves, measure it
Need both real-time and analytic data
You will produce lots of data---which might require its own highly scalable, reliable system to manage

Anonymous poll

On an unsigned sheet of paper, briefly complete:

I most want to see the class start doing …
I most want to see the class continue doing …
I most want to see the class stop doing …

No readings over Reading Break

Work on Assignment 3 and review the previous readings.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Week5-Fri.md

Week5-Fri.md

In class---sharding and replication

Implementation (from [You Are Building a Distributed System]({{ site.data.bibliography.cavage2013.url }}))

Anonymous poll

No readings over Reading Break

Files

Week5-Fri.md

Latest commit

History

Week5-Fri.md

File metadata and controls

In class---sharding and replication

Implementation (from [You Are Building a Distributed System]({{ site.data.bibliography.cavage2013.url }}))

Anonymous poll

No readings over Reading Break