I am trying to understand the Scala quicksort example from Wikipedia. How could the sample be disassembled step by step and what does all the syntactic sugar involved mean?
def qsort: List[Int] => List[Int] = {
case Nil => Nil
case pivot :: tail =>
val (smaller, rest) = tail.partition(_ < pivot)
qsort(smaller) ::: pivot :: qsort(rest)
}
As much as I can gather at this stage qsort is a function that takes no parameters and returns a new Function1[List[Int],List[Int]] that implements quicksort through usage of pattern matching, list manipulation and recursive calls. But I can’t quite figure out where the pivot comes from, and how exactly the pattern matching syntax works in this case.
UPDATE:
Thanks everyone for the great explanations!
I just wanted to share another example of quicksort implementation which I have discovered in the Scala by Example by Martin Odersky. Although based around arrays instead of lists and less of a show-off in terms of varios Scala features I personally find it much less convoluted than its Wikipedia counterpart, and just so much more clear and to the point expression of the underlying algorithm:
def sort(xs: Array[Int]): Array[Int] = {
if (xs.length <= 1) xs
else {
val pivot = xs(xs.length / 2)
Array.concat(
sort(xs filter (pivot >)),
xs filter (pivot ==),
sort(xs filter (pivot <)))
}
}
Let’s rewrite that. First, replace the function literal with an instance of
Function1:Next, I’m going to replace the pattern match with equivalent
if/elsestatements. Note that they are equivalent, not the same. The bytecode for pattern matches are more optimized. For instance, the secondifand the exception throwing below do not exist, because the compile knows the second match will always happen if the first fails.Actually,
val (smaller, rest)is pattern match as well, so Let’s decompose it as well:Obviously, this is highly unoptmized. Even worse, there are some function calls being done more than once, which doesn’t happen in the original. Unfortunately, to fix that would require some structural changes to the code.
There’s still some syntactic sugar here. There is an anonymous function being passed to partition, and there is the syntactic sugar for calling functions. Rewriting those yields the following:
For once, the extensive explanations about each syntactic sugar and how it works are being done by others. 🙂 I hope this complements their answers. Just as a final note, the following two lines are equivalent: