Solution: Minimum Window Substring
Let's solve the Minimum Window Substring problem using the Sliding Window pattern.
Statement
Given two strings, s
and t
, find the minimum window substring in s
, which has the following properties:
-
It is the shortest substring of
s
that includes all of the characters present int
. -
It must contain at least the same frequency of each character as in
t
. -
The order of the characters does not matter here.
Constraints:
-
Strings
s
andt
consist of uppercase and lowercase English characters. -
1
s.length
,t.length
Solution
So far, you’ve probably brainstormed some approaches and have an idea of how to solve this problem. Let’s explore some of these approaches and figure out which one to follow based on considerations such as time complexity and any implementation constraints.
Naive approach
The naive approach would be to find all possible substrings of s
and then identify the shortest substring that contains all characters of t
with corresponding frequencies equal to or greater than those in t
.
To find all possible substrings, we will iterate over s
one character at a time. For each character, we will form all possible substrings starting from that character.
We will keep track of the frequencies of the characters in the current substring. If the frequencies of the characters of t
in the substring are equal to or greater than their overall frequencies in t
, save the substring given that the length of this substring is less than the one already saved. After traversing s
, return the minimum window substring.
The time complexity of this approach will be , where is the length of s
. The space complexity of this approach will be , the space used in memory to track the frequencies of the characters of the current substring.
Optimized approach using sliding window
To eliminate the cost of iterating over each substring separately, we use the sliding window pattern. We are searching for the shortest substring of s
that contains all the characters of t
. Once we have found the initial window in s
that contains all the characters of t
, we can slide the window in order the find the shortest one. Let’s see how this approach can efficiently solve this problem.
The first step is to verify whether or not t
is a valid string. If it isn’t, we return an empty string as our output. Then, we initialize two pointers to apply the sliding window technique to our solution. Before we discuss how they’re being used in our solution, we need to take a look at the other components at work.
There are two separate hash maps that we initialize, reqCount
and window
. We populate the reqCount
hash map with the characters in t
and their corresponding frequencies. This is done by traversing each character of t
. If it doesn’t already exist in the hash map, we add it with count , but if it does, we increment its count. The window
hash map is initialized to contain the same characters present in the reqCount
hash map with the corresponding counts set to . The window
hash map will be used to keep track of the frequency of the characters of t
in the current_ window.
We also set up two more variables called current
and required
, which tell us whether we need to increase or decrease the size of our sliding window. The current
variable will initially hold the value but will be incremented by when we find a character whose frequency in the window
hash map matches its frequency in the reqCount
hash map. The required
variable will store the size of the reqCount
hash map. The moment these two become equal, we have found all the characters that we were looking for in the current window. So, we can start trying to reduce our window size to find the shortest possible substring.
Next, let’s look at how we create this window and adjust it. We initialize a variable called left
, which acts as the left pointer, but on the other side, we don’t need to initialize a right pointer explicitly. It can simply be the iterator of our loop, right
, which traverses the array from left to right. In each iteration of this loop, we perform the following steps:
-
If the new character is present in the
window
hash map, we increment its frequency by . -
If the new character occurs in
t
, we check if its frequency in thewindow
hash map is equal to its frequency in thereqCount
hash map. Here, we are actually checking if the current character has appeared the same number of times in the current window as it appears int
. If so, we incrementcurrent
by . -
Finally, if
current
andrequired
become equal this means that we have found a substring that meets our requirements. So, we start reducing the size of the current window to find a shorter substring that still meets these requirements. As long ascurrent
andrequired
remain equal, we move theleft
pointer forward, decrementing the frequency of the character being dropped out of the window. By doing this, we remove all the unnecessary characters present at the start of the substring. We keep comparing the size of the current window with the length of the shortest substring we have found so far. If the size of the current window is less than the length of the minimum substring, we update the minimum substring. -
Once
current
andrequired
become unequal, it means we have checked all the possible substrings that meet our requirement. Now, we slide the right edge of the window one step forward and continue iterating overs
.
When s
has been traversed, we return the minimum window substring.
The slides below illustrate how we would like the algorithm to run:
Note: and represent the
left
andright
pointers respectively.
Let’s have a look at the code for the algorithm we just discussed.
import java.util.HashMap;import java.util.Map;import java.util.*;public class Solution {public static String minWindow(String s, String t) {if (t.isEmpty()) {return "";}Map<Character, Integer> reqCount = new HashMap<>();Map<Character, Integer> window = new HashMap<>();for (char c : t.toCharArray()) {reqCount.put(c, reqCount.getOrDefault(c, 0) + 1);}int current = 0;int required = reqCount.size();int[] res = {-1, -1};int resLen = Integer.MAX_VALUE;int left = 0;for (int right = 0; right < s.length(); right++) {char c = s.charAt(right);if (reqCount.containsKey(c)) {window.put(c, window.getOrDefault(c, 0) + 1);if (window.get(c).equals(reqCount.get(c))) {current++;}}while (current == required) {if ((right - left + 1) < resLen) {res[0] = left;res[1] = right;resLen = (right - left + 1);}char leftChar = s.charAt(left);if (reqCount.containsKey(leftChar)) {window.put(leftChar, window.get(leftChar) - 1);if (window.get(leftChar) < reqCount.get(leftChar)) {current--;}}left++;}}return res[0] == -1 ? "" : s.substring(res[0], res[1] + 1);}public static void main(String[] args) {String[] s = {"PATTERN", "LIFE", "ABRACADABRA", "STRIKER", "DFFDFDFVD"};String[] t = {"TN", "I", "ABC", "RK", "VDD"};for (int i = 0; i < s.length; i++) {System.out.printf("%d.\ts: %s\n\tt: %s\n\tThe minimum substring containing %s is: %s\n",i + 1, s[i], t[i], t[i], minWindow(s[i], t[i]));System.out.println(new String(new char[100]).replace('\0', '-'));}}}
Solution summary
To recap, the solution can be divided into the following parts:
-
We validate the inputs. If
t
is an empty string, we return an empty string. -
Next, we initialize two hash maps:
reqCount
, to save the frequency of characters int
, andwindow
, to keep track of the frequency of characters oft
in the current window. We also initialize a variable,required
, to hold the number of unique characters int
. Lastly, we initializecurrent
which keeps track of the characters that occur int
whose frequency in the current window is equal to or greater than their corresponding frequency int
. -
Then, we iterate over
s
and in each iteration we perform the following steps:-
If the current character occurs in
t
, we update its frequency in thewindow
hash map. -
If the frequency of the new character is equal to its frequency in
reqCount
, we incrementcurrent
. -
If
current
is equal torequired
, we decrease the size of the window from the start. As long ascurrent
andrequired
are equal, we decrease the window size one character at a time, while also updating the minimum window substring. Oncecurrent
falls belowrequired
, we slide the right edge of the window forward and move on to the next iteration.
-
-
Finally, when
s
has been traversed completely, we return the minimum window substring.
Time complexity
The time complexity of the algorithm is , where is the length of the string s
and is the length of the string t
. This complexity arises because the algorithm processes each character of s
exactly once during the sliding window traversal, and operations on the hash maps for tracking character frequencies are on average. The preprocessing step to build the hash map reqCount
for string t
also takes . Together, these contribute to the overall linear runtime. In the worst case, hash map operations can degrade to due to collisions, resulting in a worst-case complexity of , but this is rare with well-distributed hash functions. For most practical purposes, the algorithm runs efficiently in .
Space complexity
Since the characters in t
are limited to uppercase and lowercase English letters, there is a maximum of possible characters. Therefore, the size of the reqCount
and window
hash maps will be at most , regardless of the length of t
. Therefore, the space complexity of this solution will be .
Level up your interview prep. Join Educative to access 70+ hands-on prep courses.