Longest Substring with at Most K Distinct Characters

Question

Given a string $S$ of length $n$ over a fixed alphabet (e.g., lowercase English letters) and a non-negative integer $K$, find the length of the longest contiguous substring of $S$ that contains at most $K$ distinct characters. Constraints: - $0 \leq K \leq 26$ - $0 \leq n \leq 10^5$ - $S$ consists…

Accepted Answer

How to Think About It: This is a classic sliding window problem. You maintain a window $[l, r]$ that represents the current candidate substring. As you expand the right boundary, you might add new distinct characters. When the number of distinct characters exceeds $K$, you shrink from the left until you are back to at most $K$ distinct characters. The key observation is that both $l$ and $r$ only move forward, so the total work is $O(n)$. Algorithm: 1. Maintain an array count[26] tracking the frequency of each character in the current window. 2. Maintain a variable distinct counting how many characters have non-zero frequency. 3. Use two pointers l = 0 and r = 0. Expand r one step at a time: - Add S[r] to the window: if count[S[r]] was 0, increment distinct; increment count[S[r]]. - While distinct > K: remove S[l] from the window (decrement count, if it hits 0 decrement distinct), then increment l. - Update best = max(best, r - l + 1). 4. Return best. Code: python def longest_substring_k_distinct(s: str, k: int) -> int: if k == 0 or not s: return 0 count = [0] * 26 distinct = 0 best…

Longest Substring with at Most K Distinct Characters

Hints

Worked Solution

Intuition