Skip to content

Latest commit

 

History

History
130 lines (88 loc) · 5.45 KB

README.markdown

File metadata and controls

130 lines (88 loc) · 5.45 KB

Shuffle

Goal: Rearrange the contents of an array.

Imagine you're making a card game and you need to shuffle a deck of cards. You can represent the deck by an array of Card objects and shuffling the deck means to change the order of those objects in the array. (It's like the opposite of sorting.)

Here is a naive way to approach this in Swift:

extension Array {
  public mutating func shuffle() {
    var temp = [Element]()
    while !isEmpty {
      let i = random(count)
      let obj = removeAtIndex(i)
      temp.append(obj)
    }
    self = temp
  }
}

To try it out, copy the code into a playground and then do:

var list = [ "a", "b", "c", "d", "e", "f", "g" ]
list.shuffle()
list.shuffle()
list.shuffle()

You should see three different arrangements -- or permutations to use math-speak -- of the objects in the array.

This shuffle works in place, it modifies the contents of the original array. The algorithm works by creating a new array, temp, that is initially empty. Then we randomly choose an element from the original array and append it to temp, until the original array is empty. Finally, the temporary array is copied back into the original one.

Note: random() is a helper function that returns a random integer between 0 and the given maximum.

This code works just fine but it's not very efficient. Removing an element from an array is an O(n) operation and we perform this n times, making the total algorithm O(n^2). We can do better!

The Fisher-Yates / Knuth shuffle

Here is a much improved version of the shuffle algorithm:

extension Array {
  public mutating func shuffle() {
    for i in (count - 1).stride(through: 1, by: -1) {
      let j = random(i + 1)
      if i != j {
        swap(&self[i], &self[j])
      }
    }
  }
}

Again, this picks objects at random. In the naive version we placed those objects into a new temporary array so we could keep track of which objects were already shuffled and which still remained to be done. In this improved algorithm, however, we'll move the shuffled objects to the end of the original array.

Note: When you write random(x), the largest number it will return is x - 1. We want to have j <= i, not j < i, so the largest number from the random number generator needs to be i, not i - 1. That's why we do random(i + 1). If we didn't add that 1 to compensate, it would make some shuffle orders more likely to occur than others.

Let's walk through the example. We have the array:

[ "a", "b", "c", "d", "e", "f", "g" ]

The loop starts at the end of the array and works its way back to the beginning. The very first random number can be any element from the entire array. Let's say it returns 2, the index of "c". We swap "c" with "g" to move it to the end:

[ "a", "b", "g", "d", "e", "f" | "c" ]
             *                    *

The array now consists of two regions, indicated by the | bar. Everything to the right of the bar is shuffled already.

The next random number is chosen from the range 0...6, so only from the region [ "a", "b", "g", "d", "e", "f" ]. It will never choose "c" since that object is done and we'll no longer touch it.

Let's say the random number generator picks 0, the index of "a". Then we swap "a" with "f", which is the last element in the unshuffled portion, and the array looks like this:

[ "f", "b", "g", "d", "e" | "a", "c" ]
   *                         *

The next random number is somewhere in [ "f", "b", "g", "d", "e" ], so let's say it is 3. We swap "d" with "e":

[ "f", "b", "g", "e" | "d", "a", "c" ]
                  *     *

And so on... This continues until there is only one element remaining in the left portion. For example:

[ "b" | "e", "f", "g", "d", "a", "c" ]

There's nothing left to swap that "b" with, so we're done.

Because we only look at each array element once, this algorithm has a guaranteed running time of O(n). It's as fast as you could hope to get!

Creating a new array that is shuffled

There is a slight variation on this algorithm that is useful for when you want to create a new array instance that contains the values 0 to n-1 in random order.

Here is the code:

public func shuffledArray(n: Int) -> [Int] {
  var a = [Int](count: n, repeatedValue: 0)
  for i in 0..<n {
    let j = random(i + 1)
    if i != j {
      a[i] = a[j]
    }
    a[j] = i
  }
  return a
}

To use it:

let numbers = shuffledArray(10)

This returns something like [3, 0, 9, 1, 8, 5, 2, 6, 7, 4]. As you can see, every number between 0 and 10 is in that list, but shuffled around. Of course, when you try it for yourself the order of the numbers will be different.

The shuffledArray() function first creates a new array with n zeros. Then it loops n times and in each step adds the next number from the sequence to a random position in the array. The trick is to make sure that none of these numbers gets overwritten with the next one, so it moves the previous number out of the way first!

The algoritm is quite clever and I suggest you walk through an example yourself, either on paper or in the playground. (Hint: Again it splits the array into two regions.)

See also

These Swift implementations are based on pseudocode from the Wikipedia article.

Mike Bostock has a great visualization of the shuffle algorithm.

Written for Swift Algorithm Club by Matthijs Hollemans