You will implement the merge and quick sort algorithms.
Learning Objectives:
- understand how merge sort and quicksort work
- practice recursion
- practice using property-based tests
Real-world application:
Given a dataset, the first step of processing is almost always to sort the data. This allows efficient lookups via binary search, but it even makes sequential search significantly faster due to an optimization in modern CPUs called "branch prediction". (See the most upvoted stackoverflow question of all time for details.) Technical interviews therefore often require you to explain how the merge and quicksort algorithms work.
ASIDE: Recall that Python uses the TimSort algorithm. TimSort was named after the inventor [Tim Peters](https://en.wikipedia.org/wiki/Tim_Peters_(software_engineer\)), who invented it specifically for python. Tim is one of the most influential early python programmers after Python's BDFL Guido van Rossum. Besides TimSort, he is most famous for developing the Zen of Python, which is a collection of sayings describing python's guiding philosophy. You can access the Zen of Python from within python by running the command
>>> import this
In the real world, you should never implement your own sorting algorithm. The algorithms built-in to standard libraries are highly optimized by highly skilled engineers, and therefore will be much faster than anything you are likely to write.
So why are you implementing them in this assignment? The divide-and-conquer strategy used in merge/quick sort turns out to be the most fundamental tool for building your own fast algorithms, and the main purposee of this assignment is to practice this divde-and-conquer programming technique. Next week, we will explore this strategy for massively parallel data analysis of multi-terabyte twitter data. If you take CS148 (graph algorithms), you will explore many more variations of the divide and conquer technique for developing fast algorithms. Data science tasks frequently rely on matrix multiplication and the fourier transform, and the fastest versions of these algorithms are implemented via divide and conquer.
Complete the following tasks:
- Fork the sorting repo and enable github actions
- Update the
README.md
file so that the test case badges point to your repo. - Implement the
_merged
,merge_sorted
, andquick_sorted
functions so that all test cases intests/test_main.py
pass. You must implementmerge_sorted
andquick_sorted
recursively. - (optionally)
You can receive 1 point of extra credit for implementing the
quick_sort
function and passing the tests intests/test_ec.py
.
Submit the link to your forked repository on sakai.