Solution: Partition Equal Subset Sum
Let's solve the Partition Equal Subset Sum problem using the Dynamic Programming pattern.
Statement
Given a non-empty array of positive integers, determine if the array can be divided into two subsets so that the sum of both subsets is equal.
Constraints:
-
nums.length
-
nums[i]
Solution
So far, you’ve probably brainstormed some approaches and have an idea of how to solve this problem. Let’s explore some of these approaches and figure out which one to follow based on considerations such as time complexity and any implementation constraints.
Naive approach
We can solve this problem with the following two steps:
- First, we calculate the sum of the array. If the sum of the array is odd, there can’t be two subsets with an equal sum, so we return FALSE.
- If the sum is even, we calculate and find a subset of the array with a sum equal to .
The naive approach is to solve the second step using recursion. In this approach, we calculate the result repeatedly every time. For example, consider an array, [1, 6, 20, 7, 8]
, which can be partitioned into [1, 20]
and [6, 7, 8]
with the sum 21
. While computing the sum, we encounter 21
twice: once in the first subset (6 + 7 = 13, 13 + 8 = 21
) and once in the second subset (1 + 20 = 21
). However, since we’re not storing these sums, it is computed twice.
The time complexity of this approach is . In the worst case, for each element in an array, this solution tests two possibilities, i.e., whether to include or exclude it. We can avoid the repeated work done in the naive approach by storing the result calculated at each step. We store all the values in a lookup table.
Optimized approach using dynamic programming
We use the bottom-up approach of dynamic programming, also known as the tabulation technique. In this approach, the smallest problem is solved, the result is saved, and larger subproblems are computed based on the evaluated results. The problem is divided into subproblems, which are dependent on each other. We start by initializing a lookup table and setting up the values of the base cases. For every subsequent, larger subproblem, we fetch the results of the required preceding smaller subproblems and use them to get the solution to the current subproblem.
Here is how we implement this algorithm:
-
First, we calculate the sum of the array,
nums
. If the sum of the array is odd, there can’t be two subsets with an equal sum, so we return FALSE. -
Create a lookup table,
dp
, of size , where is the sum, and is the size of the array. Thedp[0][0]
represents that the sum is , and none of the elements is included in the sum. Therefore, rows and columns are needed. Initialize all cells ofdp
with FALSE. -
Since each element in the array is a positive number, therefore the sum of elements can’t be . Hence, each element of the first row in
dp
is set to TRUE to represent the solution of the smallest sub-problem. -
The FALSE in the first column except location indicates that an empty array has no subset whose sum is greater than .
-
Fill the table in a bottom-up approach where
[i][j]
represents the current row and column entry.-
If the
j
element of the array is greater thani
, it will make the sum greater thani
, which means we cannot include this element in our subset. Therefore, we copy the previous column’s value, which isdp[i][j-1]
, intodp[i][j]
. -
If the
j
element of the array is less than or equal toi
, we have two choices: either include it in our subset or exclude it. Here, we want to find out if it is possible to form a subset with a sum ofi
using the firstj
elements of the array.-
In the first choice, we need to find a subset that adds up to
i - nums[j-1]
using the firstj-1
elements of thenums
array. That means we are looking at the value ofdp[i - nums[j - 1]][j - 1]
. -
In the second choice, we exclude the
j
element from our subset and find a subset that adds up toi
using the firstj-1
elements of the nums array. This means we are looking at the value ofdp[i][j - 1]
. -
Finally, we set
dp[i][j]
to the logical OR of these two choices:dp[i][j] = dp[i - nums[j - 1]][j - 1] OR dp[i][j - 1]
.
-
-
-
Return the value present at the last row and last column of the
dp
, which denotes whether the array can be partitioned or not.- If we get TRUE, then the array can be partitioned.
- If we get FALSE, then the array can not be partitioned.
Here’s the demonstration of the steps above:
Let’s look at the code for this solution below:
import java.util.*;class PartitionEqualSum {public static boolean canPartitionArray(int[] nums) {int arraySum = 0;for (int num : nums) {arraySum += num;}if (arraySum % 2 != 0) {return false;}int subsetSum = arraySum / 2;boolean[][] dp = new boolean[subsetSum + 1][nums.length + 1];for (int i = 0; i <= nums.length; i++) {dp[0][i] = true;}for (int i = 1; i <= subsetSum; i++) {for (int j = 1; j <= nums.length; j++) {if (nums[j - 1] > i) {dp[i][j] = dp[i][j - 1];} else {dp[i][j] = dp[i - nums[j - 1]][j - 1] || dp[i][j - 1];}}}return dp[subsetSum][nums.length];}// Driver Codepublic static void main(String[] args) {int[][] input = {{3, 1, 1, 2, 2, 1},{1, 3, 7, 3}, {1, 2, 3},{1, 2, 5}, {1, 3, 4, 8},{1, 2, 3, 2, 3, 5},{1, 5, 3, 2, 3, 19, 3},{1, 2, 3, 5, 3, 2, 1}};for (int i = 0; i < input.length; i++) {System.out.print(i + 1);System.out.println(".\tGiven array: " + Arrays.toString(input[i]));Boolean result = canPartitionArray(input[i]);System.out.print("\n\tCan we partition the array into equal sum arrays?: " + result + "\n");System.out.println(new String(new char[100]).replace('\0', '-'));}}}
Solution summary
To recap, the solution to this problem can be divided into the following parts:
-
Create a lookup table and initialize the first row with TRUE.
-
Fill the lookup table in the bottom-up approach by checking if the current number in the input array can be included in the subset sum.
-
In case it can be included in the subset sum, then the table entry is marked TRUE, otherwise, it is marked FALSE and is calculated based on the previous values in the lookup table.
-
After filling up, the value present at the last row and column of the lookup table denotes whether the array can be partitioned or not.
Time complexity
The time complexity of the solution above is , where is the size of the input array and is the sum of the array. This is the time required to fill the lookup table.
Space complexity
The space complexity of the above solution is . This is space taken by the lookup table.
Can we do better?
A space-optimized solution can be devised by using an array instead of a table, which reduces the space complexity from to . We initialize an array dp
of size with FALSE
in each slot in the array except the first slot, which is initialized with TRUE. This means that a subset with a sum of can always be formed by selecting no elements from the input array.
We can then iterate through the elements in the input array nums
. For each element val
in nums
, we can update dp
as follows:
- Iterate through
dp
in reverse order, starting from down toval
. - For each index
j
in thedp
, the algorithm setsdp[j]
to TRUE ifdp[j]
is already TRUE or ifdp[j - val]
is TRUE. The reason for this is that ifdp[j - val]
is TRUE, it means that a subset with the sumj - val
can be constructed using the previous elements of the input array. Therefore, by including the current elementval
, a subset with sumj
can be constructed.
In the end, the last element of dp
represents the output.
Level up your interview prep. Join Educative to access 70+ hands-on prep courses.