# Go Sort

This module contains a simple implementation of merge sort, following the bottom-up pseudo-code on Wikipedia: https://en.wikipedia.org/wiki/Merge_sort#Bottom-up_implementation.
```
// array A[] has the items to sort; array B[] is a work array
void BottomUpMergeSort(A[], B[], n)
{
    // Each 1-element run in A is already "sorted".
    // Make successively longer sorted runs of length 2, 4, 8, 16...
    // until the whole array is sorted.
    for (width = 1; width < n; width = 2 * width)
    {
        // Array A is full of runs of length width.
        for (i = 0; i < n; i = i + 2 * width)
        {
            // Merge two runs: A[i:i+width-1] and A[i+width:i+2*width-1] to B[]
            // or copy A[i:n-1] to B[] ( if (i+width >= n) )
            BottomUpMerge(A, i, min(i+width, n), min(i+2*width, n), B);
        }
        // Now work array B is full of runs of length 2*width.
        // Copy array B to array A for the next iteration.
        // A more efficient implementation would swap the roles of A and B.
        CopyArray(B, A, n);
        // Now array A is full of runs of length 2*width.
    }
}

// Left run is A[iLeft :iRight-1].
// Right run is A[iRight:iEnd-1 ].
void BottomUpMerge(A[], iLeft, iRight, iEnd, B[])
{
    i = iLeft; j = iRight;
    // While there are elements in the left or right runs...
    for (k = iLeft; k < iEnd; k++) {
        // If left run head exists and is <= existing right run head.
        if (i < iRight && (j >= iEnd || A[i] <= A[j])) {
            B[k] = A[i];
            i = i + 1;
        } else {
            B[k] = A[j];
            j = j + 1;
        }
    }
}

void CopyArray(B[], A[], n)
{
    for (i = 0; i < n; i++)
        A[i] = B[i];
}
```
In addition to this basic version, a parallel version of merge sort is also implemented. In that version, both the sorting subroutines and the merging subroutines are launched in parallel when appropriate. The concurrency model is based on goroutines, and synchronization is achieved via channels. Also, when the number of elements to be sorted drops below a certain threshold (currently 8), the sort degrades to insertion sort to avoid the unnecessary overhead of spawning new goroutines. See the implementation of ParallelMergesort for details.
## APIs

Generic version, serialized bottom-up merge sort, requires Go 1.21+:

```go
func MergesortGx[T cmp.Ordered](c []T)
```

Non-generic version via an interface:

```go
type Comparable interface {
	Less(Comparable) bool
}

func Mergesort(c []Comparable)
```

Generic version, parallelized merge sort, requires Go 1.21+:

```go
func ParallelMergesort[T cmp.Ordered](input []T)
```
## Notes

1. Without using reflection, I could not find a way to easily make a copy of the underlying collection value pointed to by a sort.Interface. This might be resolved with type assertions and type switches, but I did not want to continue in that direction. Instead, I created a custom interface that sets the expectation for the elements passed into the sort API:

```go
type Comparable interface {
	Less(Comparable) bool
}
```

The merge sort then expects the elements to be passed in as a slice of Comparable ([]Comparable):

```go
func Mergesort(c []Comparable)
```
To make our custom type comparable:

```go
type Character struct {
	first, last string
}

func (p Character) Less(c Comparable) bool {
	n := c.(Character)
	if p.first != n.first {
		return p.first < n.first
	}
	return p.last < n.last
}
```
Testing it:

```go
var input4 = []Comparable{
	Character{first: "Tom", last: "The Cat"},
	Character{first: "Jerry", last: "The Mouse"},
	Character{first: "Spike", last: "The Dog"},
}
Mergesort(input4)
if !slices.IsSortedFunc(input4, func(a, b Comparable) int {
	p := a.(Character)
	q := b.(Character)
	if p.Less(q) {
		return -1
	} else if q.Less(p) {
		return 1
	}
	return 0
}) {
	t.Errorf("failed to sort customized types")
}
```
There is just a little more work to make a builtin type comparable:

```go
type myInt int

func (m myInt) Less(c Comparable) bool {
	n := c.(myInt)
	return m < n
}
```
2. Utilize the cmp.Ordered type constraint.

cmp.Ordered is a type constraint introduced in Go 1.21; it is essentially a union of all builtin types that support the <, <=, >=, and > operators. With it, we can easily implement a generic version of merge sort:

```go
func MergesortGx[T cmp.Ordered](c []T)
```

However, the convenience it offers is limited. Unless Go later decides to support operator overloading, we can only pass builtin types (or named types whose underlying type is one of them, since the constraint is written with approximation terms such as ~int).
3. Wait, I've heard recursion is bad. Why use it in the parallelized version?

Yes, I've heard the same thing. That's why I implemented the serialized merge sort in a bottom-up iterative manner. For the parallelized version, however, I couldn't (or was too lazy to) come up with a bottom-up iterative solution; recursion makes the design and implementation easier.

In addition, pay attention to the pivot selection technique applied in the merge subroutine. By selecting a median value among the values to be merged, we keep the number of elements merged in each part balanced. This is analogous to the difference between a balanced binary search tree and a regular one: the worst-case scenario is avoided, and no merge subroutine receives far too many or too few elements. This ensures the recursion depth stays on the order of log N.

Finally, the degradation to insertion sort when the number of elements drops below a certain threshold also helps reduce the height of the recursion tree.