⌘ '

raccourcis clavier

Questions on sortings and medians

Problem 1

Consider the following program

"\\begin{algorithm}\n\\caption{Sort$(L[0 \\ldots N])$}\n\\begin{algorithmic}\n\\REQUIRE $L$ is an array.\n \\WHILE{$L$ is not sorted}\n \\STATE $L \\gets$ a random permutation of $L$.\n \\ENDWHILE\n\\ENSURE $L$ is sorted.\n\\end{algorithmic}\n\\end{algorithm}"

Algorithm 21 Sort $(L[0 \ldots N])$

Require: $L$ is an array.

1:while $L$ is not sorted do

2: $L \gets$ a random permutation of $L$ .

3:end while

Ensure: $L$ is sorted.

Assume we can test that L is sorted $\Theta(|L|)$ time, that we can compute a random permutation of L in $\Theta(|L|)$ time.

P1.1

Does the $SORT$ program sort correctly? If yes, then provide an invariant for the while-loop and provide a bound function that can be used to prove the correctness of the program. If no, then argue why the program is not correct.

Solution

The program does sort correctly, but inefficiently, given enough time. This is known as Bogosort, as there are finite permutation in the list, one of them are the sorted array.

The invariant for the while-loop is as follow:

\text{Invariant: } 0 < i < S \mid L_i \text{ is a permutation of } L_0 \land S = n!

This invariant holds through the while-loop because the permutation of the list is a permutation of the original list, and the number of permutations is $n!$ .

The bound function for the while-loop is as follow:

The algorithm is probabilistic, and the expected number of iterations is $n!$ .
The probability for get the sorted list is $p=\frac{1}{n!}$ .

One can use Bernoulli’s trial to find success probability $p$ . The cumulative probability of having sorted the list after $k$ attempts is $1 - (1 - \frac{1}{n!})^k$

P1.2

Assume the program $SORT$ is correct. Is the program stable? Explain why.

Solution

The program is not stable since SORT is non-deterministic.

Each iteration generates a random permutation of the list. If the list contains duplicate elements, their relative order is then not reserved between permutations.
The algorithm does not consider the original order of elements when determine the list is sorted.

P1.3

What is the worst case runtime complexity of this program? What is the best case runtime complexity of this program? Is this program optimal? Explain your arguments.

Solution

Worst case scenario occurs when the algorithm goes through all $n!$ permutations before finding the sorted list. The worst case runtime complexity is $\Theta(n! \cdot n)$ .

Best case scenario occurs when the first generated permutation is the sorted list, which has the probability of $\frac{1}{n!}$ , which has the runtime of $\Theta(n)$ as it only needs to generate one random permutation and check for sorted.

No, the program is not optimal, since it is based on random permutation, and the expected number of iterations is $n!$ . There are no guarantee that the algorithm is reliable to sort the list. Definitely not as optimal as other sorting algorithm such as merge-sort or heap-sort.

P1.4

What is the expected case runtime complexity of this program? Explain your answer.

Solution

Similar to previous mentioned, one can use Bernoulli’s trial to find success probability $p$ .

This probability $p$ of success of each run is $\frac{1}{n!}$ , where $n$ is the number of elements in the list.

The expected case runtime complexity is $\Theta(n! \cdot n)$ , since the expected number of iterations is $n!$ . (since each permutation will take $\Theta(n)$ time to check if array is sorted.

Problem 2

The median of a list $L$ of distinct values is the middle value $\mathcal{v} \in L$ : an equal number of values in $L$ are smaller and larger than $\mathcal{v}$ . For example, in the list $L = [1,5,4,2,3]$ , the median is 3. Consider two sorted lists $\mathcal{A} \lbrack 0 \ldots N)$ and $\mathcal{B} \lbrack 0 \ldots M)$ with $N + M$ distinct values. You may assume that the total number of values in $\mathcal{A}$ and $\mathcal{B}$ is odd ( $N+M$ is odd). Hence, there is a value $\mathcal{v} \in ( \mathcal{A} \cup \mathcal{B}$ such that an equal amount $E = \lbrack \frac{N+M}{2} \rbrack$ of other values smaller and larger than $v$ .

P2.1

Provide an algorithm Median(A, B) that computes the median of the combined list $\mathcal{A} \cup \mathcal{B}$ in $\mathcal{O}(\log_2(N+M))$ time.

Solution

"\\begin{algorithm}\n\\caption{Median$(A[0 \\ldots N), B[0 \\ldots M])$}\n\\begin{algorithmic}\n\\REQUIRE N < M $|A| \\leq |B|$\n\\STATE $N \\gets |A|$\n\\STATE $M \\gets |B|$\n\\STATE $L \\gets A \\cup B$\n\\STATE $low \\coloneqq 0$\n\\STATE $high \\coloneqq N$\n\\WHILE{$low \\leq high$}\n \\STATE $i \\coloneqq \\lfloor \\frac{low + high}{2} \\rfloor \\gets \\text{index of A}$\n \\STATE $j \\coloneq \\lfloor \\frac{N+M+1}{2} \\rfloor - i \\gets \\text{index of B}$\n \\STATE $A_{\\text{left}} = i > 0 \\space ? \\space A[i-1] \\space : \\space -\\infty$\n \\STATE $A_{\\text{right}} = i < N \\space ? \\space A[i] \\space : \\space \\infty$\n \\STATE $B_{\\text{left}} = j > 0 \\space ? \\space B[j-1] \\space : \\space -\\infty$\n \\STATE $B_{\\text{right}} = j < M \\space ? \\space B[j] \\space : \\space \\infty$\n \\IF{$A_{\\text{left}} \\leq B_{\\text{right}} \\land B_{\\text{left}} \\leq A_{\\text{right}}$}\n \\IF{$(N+M) \\mod 2 == 1$}\n \\RETURN $\\max(A_{\\text{left}}, B_{\\text{left}})$\n \\ELSE\n \\RETURN $\\frac{\\max(A_{\\text{left}}, B_{\\text{left}}) + \\min(A_{\\text{right}}, B_{\\text{right}})}{2}$\n \\ENDIF\n \\ELSIF{$A_{\\text{left}} > B_{\\text{right}}$}\n \\STATE $high \\gets i - 1$\n \\ELSE\n \\STATE $low \\gets i + 1$\n \\ENDIF\n\\ENDWHILE\n\\end{algorithmic}\n\\end{algorithm}"

Algorithm 22 Median $(A[0 \ldots N), B[0 \ldots M])$

Require: N < M $|A| \leq |B|$

1: $N \gets |A|$

2: $M \gets |B|$

3: $L \gets A \cup B$

4: $low \coloneqq 0$

5: $high \coloneqq N$

6:while $low \leq high$ do

7: $i \coloneqq \lfloor \frac{low + high}{2} \rfloor \gets \text{index of A}$

8: $j \coloneq \lfloor \frac{N+M+1}{2} \rfloor - i \gets \text{index of B}$

9: $A_{\text{left}} = i > 0 \space ? \space A[i-1] \space : \space -\infty$

10: $A_{\text{right}} = i < N \space ? \space A[i] \space : \space \infty$

11: $B_{\text{left}} = j > 0 \space ? \space B[j-1] \space : \space -\infty$

12: $B_{\text{right}} = j < M \space ? \space B[j] \space : \space \infty$

13:if $A_{\text{left}} \leq B_{\text{right}} \land B_{\text{left}} \leq A_{\text{right}}$ then

14:if $(N+M) \mod 2 == 1$ then

15:return $\max(A_{\text{left}}, B_{\text{left}})$

16:else

17:return $\frac{\max(A_{\text{left}}, B_{\text{left}}) + \min(A_{\text{right}}, B_{\text{right}})}{2}$

18:end if

19:else if $A_{\text{left}} > B_{\text{right}}$ then

20: $high \gets i - 1$

21:else

22: $low \gets i + 1$

23:end if

24:end while

P2.2

Explain why your algorithm is correct and why the complexity is $\Theta(\log_2(N+M))$ .

Solution

The median of combined list $\mathcal{A} \cup \mathcal{B}$ is the value $\mathcal{v}$ such that it either the maximum value of left elements or minimum value of right elements (since $N+M$ is odd). Additionally, it partitions the array such that left side will always contains $\lfloor \frac{M+N}{2} \rfloor$ elements.

Since it employs binary search on two smaller arrays and adjusting the partition $A[i-1], A[i], B[j-1], B[j]$ , it halves the search space through each iteration to smaller array.

Thus, one can then achieve the complexity (of binary search) as $\Theta(\log_2(N+M))$ .

P2.3

Let $\mathcal{P}$ be an algorithm with complexity $\Theta(\log_2(N+M))$ that computes the middle value $A \cup B$ . Argue how we can use $P$ to break up the Merge-step necessary to merge two sorted lists with $N+M = 2E + 1$ values into two independent Merge-steps that each merge only $E$ values.

Solution

After using $\mathcal{P}$ to find median of $\mathcal{A} \cup \mathcal{B}$ , given that $N+M = 2E+1$ , median will split the list into two halves, each with $E$ elements.

Partition $\mathcal{A}$ and $\mathcal{B}$ into two subsets $\mathcal{A}_{\text{left}}, \mathcal{A}_{\text{right}}$ and $\mathcal{B}_{\text{left}}, \mathcal{B}_{\text{right}}$ such that left subsets contains items $\leq$ median, right subsets contains $gee$ median.

Proceed with two independent Merge-steps that each merge only $E$ values for both lower and higher sets. Finally concatenate these two arrays into one sorted lists.

Overall complexity for the merge ops is $O(2E)$ as each sub-problem involves merging $E$ elements.

Questions on sortings and medians

Étiquette

publié à

modifié à

durée

source

Problem 1

Problem 2

Vous pourriez aimer ce qui suit