Good day, everyone! I have here a program that sorts 50,000 words from a file using merge sort. I followed Thomas Cormen’s pseudocode in his Introduction to Algorithms and it seems right when I’m “debuuging” it by hand manually. However, when I run the program it says Exception in thread "main" java.lang.ArrayIndexOutOfBoundsException: 2 . Yes, I think it is due to the large NO_OF_WORDS (ie, 50,000) but even though I decreased it to 10, still, it shows the same error.
import java.io.*;
import java.util.*;
public class SortingAnalysis {
public static void merge(String[] A, int p, int q, int r) {
int n1 = q-p+1;
int n2 = r-q;
String[] L = new String[n1+1];
String[] R = new String[n2+1];
for (int i=1; i<n1; i++) {
L[i] = A[p+i-1];
}
for (int j=1; j<n2; j++) {
R[j] = A[q+j];
}
L[n1+1] = "zzzzz"; //for infinity because if I use Math.floor, it will return a double
R[n2+1] = "zzzzz";
int i=1;
int j=1;
for (int k=p; k<=r; k++) {
int comparison = L[i].compareTo(R[j]);
if (comparison <= 0){
A[k] = L[i];
i++;
}
else {
A[k] = R[j];
j++;
}
}
}
public static void mergeSort (String[] A, int p, int r) {
if (p<r) {
int q = (p+r)/2;
mergeSort(A, p, q);
mergeSort(A, q+1, r);
merge(A, p, q, r);
}
}
public static void main(String[] args) {
final int NO_OF_WORDS = 50000;
try {
Scanner file = new Scanner(new File(args[0]));
String[] words = new String[NO_OF_WORDS];
int i = 0;
while(file.hasNext() && i < NO_OF_WORDS) {
words[i] = file.next();
i++;
}
long start = System.currentTimeMillis();
mergeSort(words, 0, words.length-1);
long end = System.currentTimeMillis();
System.out.println("Sorted Words: ");
for(int j = 0; j < words.length; j++) {
System.out.println(words[j]);
}
System.out.print("Running time: " + (end - start) + "ms");
}
catch(SecurityException securityException) {
System.err.println("Error");
System.exit(1);
}
catch(FileNotFoundException fileNotFoundException) {
System.err.println("Error");
System.exit(1);
}
}
}
I think it’s because of the declaration of String[] L and R. Or not. Please help me what’s the problem. Thank you very much!
EDIT
Cormen’s Pseudocode
MERGE(A, p, q, r )
n1 ← q − p + 1
n2 ←r − q
create arrays L[1 . . n1 + 1] and R[1 . . n2 + 1]
for i ← 1 to n1
do L[i ] ← A[p + i − 1]
for j ← 1 to n2
do R[ j ] ← A[q + j ]
L[n1 + 1]←∞
R[n2 + 1]←∞
i ← 1
j ← 1
for k ← p to r
do if L[i ] ≤ R[ j ]
then A[k] ← L[i ]
i ←i + 1
else A[k] ← R[ j ]
j ← j + 1
I don’t know what is your pseudocode but your implementation seems wrong. I’ve look at the wikipedia merge sort and it’s quite different.
So I will not give you the full working algorithm here. I’ll just give you the solution to resolve your problem of indexOutOfBounds but you still have to work more on your implementation.
In Java when you do that :
You declare an array of string that can contains
5strings within.The access to those strings is made this way :
L[anIndex].The first element is at index
0.So if you have an array of size
5then the last element is at index4(because we start from 0).In your code you do this :
then :
So here you always try to access a string at an index that doesn’t exist.
The last element in each array is respectively
n1andn2(because arrays size aren1+1andn2+1).I hope you’ll understand better how array works in Java with this explanation. Now you have to improve your implementation because it’s still not working. Maybe give us the pseudocode you use if you don’t understand it well.
EDIT :
Ok I made some correction.
Here is the working algorithm. I’ve had to change several index to fit Java “based-0 arrays”, take a look :
Note that I’ve change your Main, now I use an arrayList to avoid null value, if your text contains less words than the original array size. With your solution if you don’t fill the 50000 words you get null in the array and then nullPointerException in the merge algo.