Skip to main content

Find Median from Data Stream (LeetCode 295) – Why Linked List TLEs & Two Heaps Work

📌 Problem Overview

LeetCode 295, Find Median from Data Stream, asks us to design a data structure that efficiently supports:

  • addNum(num) – add a number from the data stream
  • findMedian() – return the median of all elements so far

The challenge is not correctness, but performance. The operations may be called up to 10⁵ times, so naive approaches often lead to Time Limit Exceeded (TLE).


❌ Why a Sorted Linked List Causes TLE

A common first attempt is to maintain a sorted doubly linked list:

  • Traverse the list to find the correct position
  • Insert the number while keeping the list sorted
  • Walk from both ends to compute the median

⏱ Time Complexity

Operation Time Complexity
addNum() O(n)
findMedian() O(n)

With up to 100,000 operations, this approach degrades to O(n²) overall, which is why it results in TLE on LeetCode.


✅ Optimal Approach: Two Heaps

The correct and accepted solution uses two heaps instead of maintaining full order.

  • Max Heap (left) → stores the smaller half of numbers
  • Min Heap (right) → stores the larger half of numbers

The idea is to maintain balance, not full sorting.

📐 Invariant Rules

  • The size difference between heaps is at most 1
  • Max heap always has the extra element when count is odd

🧠 How Median Is Calculated

  • If total count is odd → median is top of max heap
  • If total count is even → median is average of both heap tops

🚀 Time Complexity (Optimized)

Operation Time Complexity
addNum() O(log n)
findMedian() O(1)

This guarantees excellent performance even for large input sizes.


🐍 Python Implementation (Two Heaps)

import heapq

class MedianFinder:
    def __init__(self):
        self.left = []   # max heap
        self.right = []  # min heap

    def addNum(self, num: int) -> None:
        heapq.heappush(self.left, -num)
        heapq.heappush(self.right, -heapq.heappop(self.left))
        if len(self.right) > len(self.left):
            heapq.heappush(self.left, -heapq.heappop(self.right))

    def findMedian(self) -> float:
        if len(self.left) > len(self.right):
            return -self.left[0]
        return (-self.left[0] + self.right[0]) / 2

📎 Full Solution on GitHub

You can find the complete and tested implementation here:

👉 LeetCode 295 – Find Median from Data Stream (Python)


🎯 Key Takeaways

  • Maintaining a fully sorted structure is expensive for data streams
  • Balancing data is more important than ordering it
  • Two heaps provide the perfect trade-off between speed and simplicity

This problem is a classic example of how choosing the right data structure matters more than clever logic.

Comments

Popular posts from this blog

Introducing CodeMad: Your Ultimate Universal IDE with Custom Shortcuts

Introducing CodeMad: Your Ultimate Multi-Language IDE with Custom Shortcuts Welcome to the world of CodeMad, your all-in-one Integrated Development Environment (IDE) that simplifies coding and boosts productivity. Developed in Python, CodeMad is designed to make your coding experience smoother and more efficient across a variety of programming languages, including C, C++, Java, Python, and HTML. Whether you're a beginner or an experienced programmer, CodeMad is your go-to tool. In this blog, we'll dive deep into the workings of CodeMad, highlighting its unique features and easy installation process. The Power of Shortcuts CodeMad's intuitive interface is built around a set of powerful keyboard shortcuts that make coding a breeze. Here are some of the key shortcuts you'll find in CodeMad: Copy (Ctrl+C) : Duplicate text with ease. Paste (Ctrl+V) : Quickly insert copied content into your code. Undo (Ctrl+Z) and Redo (Ctrl+Y) : Correct mistakes and s...

LeetCode 88 Explained: Four Approaches, Mistakes, Fixes & the Final Optimal Python Solution

Evolving My Solution to “Merge Sorted Array” A practical, beginner-friendly walkthrough showing four versions of my code (from a naive approach to the optimal in-place two-pointer solution). Includes explanations, complexity and ready-to-paste code. Problem Summary You are given two sorted arrays: nums1 with size m + n (first m are valid) nums2 with size n Goal: Merge nums2 into nums1 in sorted order in-place . Version 1 — Beginner Approach (Extra List) I merged into a new list then copied back. Works, but not in-place and uses extra memory. class Solution: def merge(self, nums1, m, nums2, n): result = [] p1 = 0 p2 = 0 for _ in range(m+n): if p1 >= m: result.extend(nums2[p2:n]) break elif p2 >= n: result.extend(nums1[p1:m]) break elif nu...

How do I run Python on Google Colab using android phone?

Regardless of whether you are an understudy keen on investigating Machine Learning yet battling to direct reproductions on huge datasets, or a specialist playing with ML frantic for extra computational force, Google Colab is the ideal answer for you. Google Colab or "the Colaboratory" is a free cloud administration facilitated by Google to support Machine Learning and Artificial Intelligence research, where frequently the obstruction to learning and achievement is the necessity of gigantic computational force. Table of content- What is google colab? how to use python in google colab? Program to add two strings given by the user. save the file in google colab? What is google colab? You will rapidly learn and utilize Google Colab on the off chance that you know and have utilized Jupyter notebook previously. Colab is fundamentally a free Jupyter notebook climate running completely in the cloud. In particular, Colab doesn't need an arrangement, in addition to the notebook tha...