The following is an interesting problem from HackerRank. I thought I might share my solution to this problem, as well as the explanation, I found it rather fascinating.
Click here to view the problem on the HackerRank website.
Click here to skip to the solution explanation.
Sansa has an array. She wants to find the value obtained by XOR-ing the contiguous subarrays, followed by XOR-ing the values thus obtained. Determine this value.
For example, if arr = [3, 4, 5]:
Subarray Operation Result 3 None 3 4 None 4 5 None 5 3,4 3 XOR 4 7 4,5 4 XOR 5 1 3,4,5 3 XOR 4 XOR 5 2
Now we take the resultant values and XOR them together:
3 ⊕ 4 ⊕ 5 ⊕ 7 ⊕ 1 ⊕ 2
Function Description
Complete the sansaXor function in the editor below. It should return an integer that represents the results of the calculations.
sansaXor has the following parameter(s):
- arr: an array of integers
Input Format
The first line contains an integer t, the number of the test cases.
Each of the next t pairs of lines is as follows:
- The first line of each test case contains an integer , the number of elements in arr.
- The second line of each test case contains space-separated integers arr[i].
Constraints
1 ≤ t ≤ 5
2 ≤ n ≤ 105
2 ≤ arr[i] ≤ 108
Output Format
Print the results of each test case on a separate line.
Sample Input 0
2 3 1 2 3 4 4 5 7 5
Sample Output 0
2 0
Explanation 0
Test case 0:
1 ⊕ 2 ⊕ 3 ⊕ (1 ⊕ 2) ⊕ (2 ⊕ 3) ⊕ (1 ⊕ 2 ⊕ 3) = 2
Test case 1:
4 ⊕ 5 ⊕ 7 ⊕ 5 ⊕ (4 ⊕ 5) ⊕ (5 ⊕ 7) ⊕ (7 ⊕ 5) ⊕ (4 ⊕ 5 ⊕ 7) ⊕ (5 ⊕ 7 ⊕ 5) ⊕ (4 ⊕ 5 ⊕ 7 ⊕ 5) = 0
Sample Input 1
2 3 98 74 12 3 50 13 2
Sample Output 1
110 48
Explanation 1
Test Case 0:
98 ⊕ 74 ⊕ 12 ⊕ (98 ⊕ 74) ⊕ (74 ⊕ 12) ⊕ (98 ⊕ 74 ⊕ 12) = 110
Test Case 1:
50 ⊕ 13 ⊕ 2 ⊕ (50 ⊕ 13) ⊕ (13 ⊕ 2) ⊕ (50 ⊕ 13 ⊕ 2) = 48
At first, this problem seems a little daunting. However, after a deeper analysation, and a little bit of trial & error, it comes out to be the total opposite.
Lets analyse a sample input. Lets take arr = [2, 3, 4, 5].
From the question, we can see that this can be represented as follows:
2 ⊕ 3 ⊕ 4 ⊕ 5 ⊕ (2 ⊕ 3) ⊕ (3 ⊕ 4) ⊕ (4 ⊕ 5) ⊕ (2 ⊕ 3 ⊕ 4) ⊕ (3 ⊕ 4 ⊕ 5) ⊕ (2 ⊕ 3 ⊕ 4 ⊕ 5)
From mathematical knowlege, it is understood that XOR is a commutitave and associative operator. Therefore, the brackets can be completely ignored, in fact removed, and we would be left with the following:
2 ⊕ 3 ⊕ 4 ⊕ 5 ⊕ 2 ⊕ 3 ⊕ 3 ⊕ 4 ⊕ 4 ⊕ 5 ⊕ 2 ⊕ 3 ⊕ 4 ⊕ 3 ⊕ 4 ⊕ 5 ⊕ 2 ⊕ 3 ⊕ 4 ⊕ 5
Lets manipulate the above expression, so that all the like terms are grouped. For convenience, lets also surround the like terms with brackets, like so:
(2 ⊕ 2 ⊕ 2 ⊕ 2) ⊕ (3 ⊕ 3 ⊕ 3 ⊕ 3 ⊕ 3 ⊕ 3) ⊕ (4 ⊕ 4 ⊕ 4 ⊕ 4 ⊕ 4 ⊕ 4) ⊕ (5 ⊕ 5 ⊕ 5 ⊕ 5)
Lets now attempt to reduce this expression bracket by bracket. Before that, however, lets have a quick look at what happens when you XOR a value, with itself. Lets XOR the binary representation of 2 with itself:
210 = 102
12 ⊕ 12 = 02
02 ⊕ 02 = 02
=> 102 ⊕ 102 = 002
∴ 2 ⊕ 2 = 0
This shows us that whenever we XOR a number x with itself, it always results to zero. If we now XOR the resulting zero with the same number x (i.e. 2), the following happens:
02 ⊕ 12 = 12
02 ⊕ 02 = 02
=> 002 ⊕ 102 = 102
∴ 0 ⊕ 2 = 2
This shows us that whenever we XOR zero with a number x, it always results back the number x. lets now go back and use this knowledge to simplify the expression. Looking at the first bracket, we have the following:
2 ⊕ 2 ⊕ 2 ⊕ 2
lets put brackets around groups of two terms and solve them each:
(2 ⊕ 2) ⊕ (2 ⊕ 2)
As discussed, the resulting values in both of these brackets will be zero:
(0) ⊕ (0)
= 0
From this, we can tell that if the following conditions are met:
- We have an expression in which any value x is being XOR'ed with itself any number of times
- We can divide the expression into subexpressions of two terms with no remaining terms (i.e. number of x's is even)
the resulting value will be zero.
We have established that the first subexpression of the original expression simplifies to 0. We can now simplify the next part.
3 ⊕ 3 ⊕ 3 ⊕ 3 ⊕ 3 ⊕ 3
We can straight away see that the number of 3's in the above subexpression is 6 (even), which means that it would simplify to zero. The third and fourth subexpressions also simplify to zero, since the number of 4's and 5's in each subexpression, respectively, is even.
0 ⊕ 0 ⊕ 0 ⊕ 0
= 0
With further inspection, it is observed that, in fact, for all input arrays that have an even length, the output is zero since all the values in an input array of even length occur exactly an even amount of times in the expression. From this, we can speculate that the problem must lie in the amount of times a number, x, occurs within the expression. If we can somehow find a pattern that reveals to us whether each element of the array occurs an even or odd number of times, we could use this to solve the problem.
Taking all of the above into account, what if the length of the input array is odd? Lets take arr = [2, 3, 4] to evaluate this scenario.
The expression would be as follows:
2 ⊕ 3 ⊕ 4 ⊕ (2 ⊕ 3) ⊕ (3 ⊕ 4) ⊕ (2 ⊕ 3 ⊕ 4)
Following the exact same steps as with the previous scenario, we end up with the following expression:
(2 ⊕ 2 ⊕ 2) ⊕ (3 ⊕ 3 ⊕ 3 ⊕ 3) ⊕ (4 ⊕ 4 ⊕ 4)
As before, lets take the first bracketed subexpression of the above expression and divide it into groups of two terms:
(2 ⊕ 2) ⊕ 2
We can notice that the number of 2's in the above subexpression is 3 (odd). We can simplify the grouped terms to zero, furthermore, we know that when we XOR any value with zero, we get the same value back. This implies the following:
0 ⊕ 2
= 2
Looking at the second part of the expression, we can see that there are an even number of 3's in the subexpression. Therefore, as before, this part of the expression would simplify to zero. Finally, the third part of the equation has an odd number of 4's, which we now know would simplify to 4. The following expression is obtained:
2 ⊕ 0 ⊕ 4
The above expression is the key to the solution. Lets take a minute to compare this expression with our input array:
[2, 3, 4]
2 ⊕ 0 ⊕ 4
We can observe that the first element of the array, arr[0] (even index), is equal to the first term of the expression. Similarly, the third element of the array, arr[2] (even index), is also equal to the third term of the expression. However, the second term of the expression is zero, which is not equal to the second element of the array, arr[1] (odd index). Now that we have noticed something, lets quickly simplify the expression to get the answer:
2 ⊕ 0 ⊕ 4
= 2 ⊕ 4
= 6
After observing other inputs for confirmation, we can verify a pattern. For any arr = [a, b, c, d, e, f, g, ...] where the length of arr is odd, the following would be the output:
a ⊕ 0 ⊕ c ⊕ 0 ⊕ e ⊕ 0 ⊕ g
= a ⊕ c ⊕ e ⊕ g
All the elements at even indeces of the input array, occur exactly an odd number of times in the expression, therefore being carried into the answer. All the elements at odd indeces of the input array, occur exactly an even number of times in the expression, therefore being reduced to zero.
With this information, coding the solution becomes much easier.