Segment Trees

When to use?

Use a segment tree when you need fast range queries (max, min, or sum) combined with point updates.

Square root decomposition is an alternative technique worth checking for the same kind of problem.

Data structure design & operations

The segment tree stores the actual array data in its leaf nodes, and each internal node stores the result of a specific operation (sum, min, max) applied to its children, that is, to a segment of the array. This design allows efficient computation of range queries.

It supports two key operations:

range query - O(log n)
point update - O(log n)

Example

Given an array a[N] = {2,4,5,1,6,4, ...}, we need to handle the update and queries:

update a[i] to x
query sum for range [l,r] = a[l]+a[l+1]+a[l+2]... +a[r] # could also be Range minimum query

In this case, each internal node of the segment tree stores the sum of its children, that is, the sum of the array segment it covers.

Visual Representation

1. Original Input Array

The segment tree is built from an input array. Each element will eventually be stored in a leaf node of the tree.

2. Segment Tree Structure

The tree stores array elements at leaf nodes (with bold borders) and aggregated values (sums) at internal nodes. Each node tracks its range coverage [left, right] and the corresponding tree array index.

Key observations:

The root node tree[1] covers the entire array range [0-5] with sum = 22
Each parent’s value equals the sum of its two children
Leaf nodes (bold borders) store individual array elements
Node indices follow the pattern: left child = 2*i, right child = 2*i+1, parent = i/2

3. Memory Layout (Array Representation)

The tree is stored as a 1D array using 1-based indexing. This allows efficient parent-child navigation using simple arithmetic.

Navigation Formulas:

Left child of node i: 2 × i
Right child of node i: 2 × i + 1
Parent of node i: i ÷ 2 (integer division)
Space Complexity: the tree array is allocated with 4n slots for an input of size n, which is O(n)
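The navigation formulas above can be written as small helpers. This is only a sketch: the class name `TreeIndex` and its method names are illustrative and not part of the original code, and the allocation simply follows the 4n bound stated above.

```java
// Illustrative helpers; the class and method names are assumptions.
class TreeIndex {
    static int left(int i)   { return 2 * i; }      // left child of node i
    static int right(int i)  { return 2 * i + 1; }  // right child of node i
    static int parent(int i) { return i / 2; }      // integer division

    // Backing array for an input of size n; index 0 stays unused.
    static int[] allocate(int n) { return new int[4 * n]; }

    public static void main(String[] args) {
        // prints "6 7 3"
        System.out.println(left(3) + " " + right(3) + " " + parent(7));
    }
}
```

Both children of a node map back to the same parent under integer division, which is what keeps the 1-based layout free of special cases.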

Why 1-based indexing?

Simplifies parent-child calculations (no need for 2*i+1 vs 2*i+2)
tree[0] remains unused but makes the math cleaner
A widely used convention in segment tree implementations

Implementation

Range Sum Query

The range sum query recursively traverses the tree to find the sum of elements in the query range [queryL, queryR].

public int rangeSum(int[] tree, int node, int segL, int segR, int queryL, int queryR) {
    // Case 1: Complete overlap - current segment is completely inside query range
    if (queryL <= segL && segR <= queryR) {
        return tree[node];
    }

    // Case 2: No overlap - current segment is completely outside query range
    if (segR < queryL || queryR < segL) {
        return 0;  // Identity element for sum
    }

    // Case 3: Partial overlap - query range partially overlaps current segment
    int mid = (segL + segR) / 2;
    int leftSum = rangeSum(tree, 2*node, segL, mid, queryL, queryR);
    int rightSum = rangeSum(tree, 2*node+1, mid+1, segR, queryL, queryR);

    return leftSum + rightSum;
}

// Usage: rangeSum(tree, 1, 0, n-1, queryL, queryR)

Algorithm Explanation:

Complete Overlap (queryL <= segL && segR <= queryR):
- The current segment [segL, segR] is completely inside the query range [queryL, queryR]
- Return the precomputed value at this node
- Example: Query [1,4] with segment [1,1] → return tree[node]
No Overlap (segR < queryL || queryR < segL):
- The current segment is completely outside the query range
- Return identity element (0 for sum, infinity for min, -infinity for max)
- Example: Query [1,4] with segment [5,5] → return 0
Partial Overlap:
- The segment partially overlaps with the query range
- Recursively query both left and right children
- Combine results using the operation (sum, min, max, etc.)
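To illustrate how the identity element changes with the operation, here is a minimal range-minimum sketch. It assumes the same three overlap cases as the sum query; the class name `RangeMinSketch` and the convenience method `query` are hypothetical and exist only for this example.

```java
// Range-minimum variant; class and method names are illustrative assumptions.
class RangeMinSketch {
    static void build(int[] arr, int[] tree, int node, int l, int r) {
        if (l == r) { tree[node] = arr[l]; return; }
        int mid = (l + r) / 2;
        build(arr, tree, 2 * node, l, mid);
        build(arr, tree, 2 * node + 1, mid + 1, r);
        tree[node] = Math.min(tree[2 * node], tree[2 * node + 1]);
    }

    static int rangeMin(int[] tree, int node, int segL, int segR, int qL, int qR) {
        if (qL <= segL && segR <= qR) return tree[node];       // complete overlap
        if (segR < qL || qR < segL) return Integer.MAX_VALUE;  // no overlap: identity for min
        int mid = (segL + segR) / 2;                           // partial overlap
        return Math.min(rangeMin(tree, 2 * node, segL, mid, qL, qR),
                        rangeMin(tree, 2 * node + 1, mid + 1, segR, qL, qR));
    }

    // Builds a fresh tree over arr and answers a single query.
    static int query(int[] arr, int qL, int qR) {
        int n = arr.length;
        int[] tree = new int[4 * n];
        build(arr, tree, 1, 0, n - 1);
        return rangeMin(tree, 1, 0, n - 1, qL, qR);
    }

    public static void main(String[] args) {
        System.out.println(query(new int[]{2, 4, 5, 1, 6, 4}, 1, 4)); // prints 1
    }
}
```

Only the combine step and the value returned for "no overlap" differ from the sum version; the traversal is unchanged.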

Example Query: sum([1, 4])

For array a[] = {2,4,5,1,6,4}, querying sum of range [1,4]:

Query: [1,4]
├─ Node 1 [0-5]: Partial overlap → recurse
   ├─ Node 2 [0-2]: Partial overlap → recurse
   │  ├─ Node 4 [0-1]: Partial overlap → recurse
   │  │  ├─ Node 8 [0-0]: No overlap → return 0
   │  │  └─ Node 9 [1-1]: Complete overlap → return 4 ✓
   │  └─ Node 5 [2-2]: Complete overlap → return 5 ✓
   └─ Node 3 [3-5]: Partial overlap → recurse
      ├─ Node 6 [3-4]: Complete overlap → return 7 ✓
      └─ Node 7 [5-5]: No overlap → return 0

Result: 4 + 5 + 7 = 16

Time Complexity: O(log n) - At most 4 nodes per level are visited
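The trace above can be reproduced with a small program. This is a sketch rather than the original code: the class name `QueryTraceDemo` and the `visited` counter are added here purely to count how many nodes one query touches.

```java
// Illustrative demo; the class name and the visited counter are assumptions.
class QueryTraceDemo {
    static int visited; // nodes touched by the most recent query

    static void build(int[] arr, int[] tree, int node, int l, int r) {
        if (l == r) { tree[node] = arr[l]; return; }
        int mid = (l + r) / 2;
        build(arr, tree, 2 * node, l, mid);
        build(arr, tree, 2 * node + 1, mid + 1, r);
        tree[node] = tree[2 * node] + tree[2 * node + 1];
    }

    static int rangeSum(int[] tree, int node, int segL, int segR, int qL, int qR) {
        visited++;
        if (qL <= segL && segR <= qR) return tree[node]; // complete overlap
        if (segR < qL || qR < segL) return 0;            // no overlap
        int mid = (segL + segR) / 2;                     // partial overlap
        return rangeSum(tree, 2 * node, segL, mid, qL, qR)
             + rangeSum(tree, 2 * node + 1, mid + 1, segR, qL, qR);
    }

    // Builds a fresh tree over arr, resets the counter, and answers one query.
    static int query(int[] arr, int qL, int qR) {
        int n = arr.length;
        int[] tree = new int[4 * n];
        build(arr, tree, 1, 0, n - 1);
        visited = 0;
        return rangeSum(tree, 1, 0, n - 1, qL, qR);
    }

    public static void main(String[] args) {
        int sum = query(new int[]{2, 4, 5, 1, 6, 4}, 1, 4);
        // prints "16 from 9 visited nodes"
        System.out.println(sum + " from " + visited + " visited nodes");
    }
}
```

The nine visited nodes are exactly the nodes 1 through 9 listed in the trace.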

Building the Segment Tree

public void buildTree(int[] arr, int[] tree, int node, int start, int end) {
    if (start == end) {
        // Leaf node - store array element
        tree[node] = arr[start];
    } else {
        int mid = (start + end) / 2;

        // Build left and right subtrees
        buildTree(arr, tree, 2*node, start, mid);
        buildTree(arr, tree, 2*node+1, mid+1, end);

        // Internal node - store sum of children
        tree[node] = tree[2*node] + tree[2*node+1];
    }
}

// Usage: buildTree(arr, tree, 1, 0, n-1)

Time Complexity: O(n) - Each of the 2n-1 tree nodes is computed exactly once
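As a quick check of the build step, the sketch below builds the tree for the example array and prints the first 13 slots. The wrapper class `BuildDemo` and its `build` helper are hypothetical names introduced for this example.

```java
// Illustrative demo; the class name and the build(int[]) helper are assumptions.
class BuildDemo {
    static void buildTree(int[] arr, int[] tree, int node, int start, int end) {
        if (start == end) {
            tree[node] = arr[start];                 // leaf: copy the array element
            return;
        }
        int mid = (start + end) / 2;
        buildTree(arr, tree, 2 * node, start, mid);
        buildTree(arr, tree, 2 * node + 1, mid + 1, end);
        tree[node] = tree[2 * node] + tree[2 * node + 1]; // internal: sum of children
    }

    // Allocates 4n slots and builds the tree rooted at index 1.
    static int[] build(int[] arr) {
        int[] tree = new int[4 * arr.length];
        buildTree(arr, tree, 1, 0, arr.length - 1);
        return tree;
    }

    public static void main(String[] args) {
        int[] tree = build(new int[]{2, 4, 5, 1, 6, 4});
        // prints: 22 11 11 6 5 7 4 2 4 0 0 1 6
        for (int i = 1; i <= 13; i++) System.out.print(tree[i] + " ");
        System.out.println();
    }
}
```

Slots 10 and 11 stay at 0 because node 5 covers the single element [2-2] and has no children. Gaps like this are why the array is over-allocated rather than sized to exactly 2n-1.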

Point Update

public void updatePoint(int[] tree, int node, int segL, int segR, int idx, int value) {
    if (segL == segR) {
        // Leaf node - update value
        tree[node] = value;
    } else {
        int mid = (segL + segR) / 2;

        if (idx <= mid) {
            // Update in left subtree
            updatePoint(tree, 2*node, segL, mid, idx, value);
        } else {
            // Update in right subtree
            updatePoint(tree, 2*node+1, mid+1, segR, idx, value);
        }

        // Update current node
        tree[node] = tree[2*node] + tree[2*node+1];
    }
}

// Usage: updatePoint(tree, 1, 0, n-1, idx, value)

Time Complexity: O(log n) - Traverse from root to leaf and update path
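A short end-to-end sketch shows a point update followed by a query on the example array. The class name `PointUpdateDemo` and the chosen update (setting a[3] to 7) are illustrative assumptions.

```java
// Illustrative demo; the class name and the sample update are assumptions.
class PointUpdateDemo {
    static void build(int[] arr, int[] tree, int node, int l, int r) {
        if (l == r) { tree[node] = arr[l]; return; }
        int mid = (l + r) / 2;
        build(arr, tree, 2 * node, l, mid);
        build(arr, tree, 2 * node + 1, mid + 1, r);
        tree[node] = tree[2 * node] + tree[2 * node + 1];
    }

    static void updatePoint(int[] tree, int node, int segL, int segR, int idx, int value) {
        if (segL == segR) { tree[node] = value; return; }  // leaf: overwrite
        int mid = (segL + segR) / 2;
        if (idx <= mid) updatePoint(tree, 2 * node, segL, mid, idx, value);
        else updatePoint(tree, 2 * node + 1, mid + 1, segR, idx, value);
        tree[node] = tree[2 * node] + tree[2 * node + 1];  // refresh the path upward
    }

    static int rangeSum(int[] tree, int node, int segL, int segR, int qL, int qR) {
        if (qL <= segL && segR <= qR) return tree[node];
        if (segR < qL || qR < segL) return 0;
        int mid = (segL + segR) / 2;
        return rangeSum(tree, 2 * node, segL, mid, qL, qR)
             + rangeSum(tree, 2 * node + 1, mid + 1, segR, qL, qR);
    }

    public static void main(String[] args) {
        int[] arr = {2, 4, 5, 1, 6, 4};
        int n = arr.length;
        int[] tree = new int[4 * n];
        build(arr, tree, 1, 0, n - 1);
        updatePoint(tree, 1, 0, n - 1, 3, 7);                 // a[3]: 1 -> 7
        System.out.println(rangeSum(tree, 1, 0, n - 1, 1, 4)); // prints 22
    }
}
```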

Lazy Propagation

Problem with Range Updates

In the basic segment tree, updating a range [L, R] means updating each element individually with a point update. For a range of k elements this costs O(k log n), which is O(n log n) in the worst case. This becomes inefficient when range updates are frequent.

Example: Update all elements in range [2, 5] by adding 10.

Without lazy propagation: Update each of the 4 elements → 4 × O(log n); for a range of k elements this is k × O(log n), up to O(n log n)
With lazy propagation: Update only necessary nodes → O(log n)
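To make the cost of the naive approach concrete, the sketch below performs the range update as one point update per element and counts the node writes. The class `NaiveRangeAdd`, the `pointAdd` variant (which adds a delta instead of overwriting), and the `writes` counter are assumptions made for this illustration.

```java
// Illustrative baseline; class, method, and counter names are assumptions.
class NaiveRangeAdd {
    static int writes; // number of tree nodes written so far

    static void build(int[] arr, int[] tree, int node, int l, int r) {
        if (l == r) { tree[node] = arr[l]; return; }
        int mid = (l + r) / 2;
        build(arr, tree, 2 * node, l, mid);
        build(arr, tree, 2 * node + 1, mid + 1, r);
        tree[node] = tree[2 * node] + tree[2 * node + 1];
    }

    // Point update that adds delta to a[idx]; every node on the path is rewritten.
    static void pointAdd(int[] tree, int node, int l, int r, int idx, int delta) {
        writes++;
        if (l == r) { tree[node] += delta; return; }
        int mid = (l + r) / 2;
        if (idx <= mid) pointAdd(tree, 2 * node, l, mid, idx, delta);
        else pointAdd(tree, 2 * node + 1, mid + 1, r, idx, delta);
        tree[node] = tree[2 * node] + tree[2 * node + 1];
    }

    // Range update done the slow way: one root-to-leaf walk per element.
    static void rangeAdd(int[] tree, int n, int from, int to, int delta) {
        for (int i = from; i <= to; i++) pointAdd(tree, 1, 0, n - 1, i, delta);
    }

    public static void main(String[] args) {
        int[] arr = {2, 4, 5, 1, 6, 4};
        int[] tree = new int[4 * arr.length];
        build(arr, tree, 1, 0, arr.length - 1);
        rangeAdd(tree, arr.length, 2, 5, 10);
        System.out.println(tree[1] + " " + writes); // prints "62 14"
    }
}
```

Each of the four elements triggers its own walk from the root, so the root alone is rewritten four times. Lazy propagation avoids exactly this repeated work.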

Lazy Propagation Concept

Core Idea: Postpone updates to children until they are actually needed.

Maintain a separate lazy[] array to store pending updates
When updating a range, mark nodes with pending updates in lazy[]
Only propagate updates down when querying or updating that subtree
This allows range updates in O(log n) time

Key Principles:

Lazy values represent pending operations on a node’s entire range
Before processing a node, apply and push down any pending updates
Leaf nodes have no children, so a pending value on a leaf is applied to the leaf itself and never pushed further
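The "apply and push down" principle can be isolated in a small helper. This is a sketch for a sum tree with range addition; the class name `LazyPush` and the method name `push` are hypothetical, and it assumes that lazy[node] holds an addition not yet reflected in tree[node].

```java
// Illustrative helper; the class and method names are assumptions.
class LazyPush {
    // Applies the pending addition at node to tree[node], defers it to the
    // children (if any), and clears the pending value.
    static void push(int[] tree, int[] lazy, int node, int segL, int segR) {
        if (lazy[node] == 0) return;                   // nothing pending
        tree[node] += (segR - segL + 1) * lazy[node];  // every element in the range gets the addition
        if (segL != segR) {                            // leaves have no children to mark
            lazy[2 * node] += lazy[node];
            lazy[2 * node + 1] += lazy[node];
        }
        lazy[node] = 0;
    }

    public static void main(String[] args) {
        int[] tree = new int[8];
        int[] lazy = new int[8];
        tree[1] = 10;  // node 1 covers [0-3]
        lazy[1] = 3;   // pending: add 3 to each of the 4 elements
        push(tree, lazy, 1, 0, 3);
        System.out.println(tree[1] + " " + lazy[2] + " " + lazy[3]); // prints "22 3 3"
    }
}
```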

Range Update with Lazy Propagation

public void updateRange(int[] tree, int[] lazy, int node, int segL, int segR,
                       int updateL, int updateR, int value) {
    // Step 1: Check if there's a pending update for this node
    if (lazy[node] != 0) {
        // Apply the pending update to current node
        tree[node] += (segR - segL + 1) * lazy[node];

        // Propagate to children if not a leaf node
        if (segL != segR) {
            lazy[2*node] += lazy[node];
            lazy[2*node+1] += lazy[node];
        }

        // Clear the lazy value
        lazy[node] = 0;
    }

    // Step 2: No overlap - current segment is outside update range
    if (segR < updateL || updateR < segL) {
        return;
    }

    // Step 3: Complete overlap - current segment is completely inside update range
    if (updateL <= segL && segR <= updateR) {
        // Update current node
        tree[node] += (segR - segL + 1) * value;

        // Mark children as lazy (postpone their updates)
        if (segL != segR) {
            lazy[2*node] += value;
            lazy[2*node+1] += value;
        }
        return;
    }

    // Step 4: Partial overlap - recursively update children
    int mid = (segL + segR) / 2;
    updateRange(tree, lazy, 2*node, segL, mid, updateL, updateR, value);
    updateRange(tree, lazy, 2*node+1, mid+1, segR, updateL, updateR, value);

    // Update current node from children
    tree[node] = tree[2*node] + tree[2*node+1];
}

// Usage: updateRange(tree, lazy, 1, 0, n-1, updateL, updateR, value)

Algorithm Steps:

Apply pending updates from lazy[node] before processing
Check overlap conditions (no overlap, complete, or partial)
For complete overlap: Update node and mark children as lazy
For partial overlap: Recursively update both children
Recompute parent value from updated children

Range Query with Lazy Propagation

public int queryRange(int[] tree, int[] lazy, int node, int segL, int segR,
                     int queryL, int queryR) {
    // Step 1: Apply pending update before querying
    if (lazy[node] != 0) {
        tree[node] += (segR - segL + 1) * lazy[node];

        if (segL != segR) {
            lazy[2*node] += lazy[node];
            lazy[2*node+1] += lazy[node];
        }

        lazy[node] = 0;
    }

    // Step 2: No overlap
    if (segR < queryL || queryR < segL) {
        return 0;
    }

    // Step 3: Complete overlap
    if (queryL <= segL && segR <= queryR) {
        return tree[node];
    }

    // Step 4: Partial overlap
    int mid = (segL + segR) / 2;
    int leftSum = queryRange(tree, lazy, 2*node, segL, mid, queryL, queryR);
    int rightSum = queryRange(tree, lazy, 2*node+1, mid+1, segR, queryL, queryR);

    return leftSum + rightSum;
}

// Usage: queryRange(tree, lazy, 1, 0, n-1, queryL, queryR)

Example: Range Update Walkthrough

Initial Array: a[] = {2, 4, 5, 1, 6, 4}
Operation: Add 10 to range [1, 3]

Step 1: Start at root [0-5]
├─ Partial overlap → recurse to children
   ├─ Node 2 [0-2]: Partial overlap
   │  ├─ Process lazy updates (if any)
   │  ├─ Recurse to children
   │  │  ├─ Node 4 [0-1]: Partial overlap
   │  │  │  ├─ Node 8 [0-0]: No overlap → return
   │  │  │  └─ Node 9 [1-1]: Complete overlap
   │  │  │     └─ Update: tree[9] += 1*10 (leaf, so no children to mark lazy)
   │  │  └─ Node 5 [2-2]: Complete overlap
   │  │     └─ Update: tree[5] += 1*10
   │  └─ Update parent: tree[2] = tree[4] + tree[5]
   │
   └─ Node 3 [3-5]: Partial overlap
      ├─ Node 6 [3-4]: Partial overlap
      │  ├─ Node 12 [3-3]: Complete overlap
      │  │  └─ Update: tree[12] += 1*10
      │  └─ Node 13 [4-4]: No overlap → return
      └─ Node 7 [5-5]: No overlap → return

After update: a[] = {2, 14, 15, 11, 6, 4}
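Lazy array: Remains all zeros in this example, because every fully covered node is a leaf; pending values appear only when a fully covered internal node defers the update to its children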

Key Advantage: Only O(log n) nodes updated, not O(n)
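The walkthrough can be checked with a compact, self-contained version of the lazy tree. This is a sketch that wraps the same logic in a class; the name `LazySumTree`, the `push` helper, and the `add`/`sum` method names are assumptions for this example, not the original API.

```java
// Illustrative wrapper around the lazy sum tree; all names are assumptions.
class LazySumTree {
    final int n;
    final int[] tree, lazy;

    LazySumTree(int[] arr) {
        n = arr.length;
        tree = new int[4 * n];
        lazy = new int[4 * n];
        build(arr, 1, 0, n - 1);
    }

    private void build(int[] arr, int node, int l, int r) {
        if (l == r) { tree[node] = arr[l]; return; }
        int mid = (l + r) / 2;
        build(arr, 2 * node, l, mid);
        build(arr, 2 * node + 1, mid + 1, r);
        tree[node] = tree[2 * node] + tree[2 * node + 1];
    }

    // Applies the pending addition to this node and defers it to the children.
    private void push(int node, int l, int r) {
        if (lazy[node] == 0) return;
        tree[node] += (r - l + 1) * lazy[node];
        if (l != r) { lazy[2 * node] += lazy[node]; lazy[2 * node + 1] += lazy[node]; }
        lazy[node] = 0;
    }

    void add(int ul, int ur, int value) { add(1, 0, n - 1, ul, ur, value); }

    private void add(int node, int l, int r, int ul, int ur, int value) {
        push(node, l, r);
        if (r < ul || ur < l) return;                      // no overlap
        if (ul <= l && r <= ur) {                          // complete overlap
            tree[node] += (r - l + 1) * value;
            if (l != r) { lazy[2 * node] += value; lazy[2 * node + 1] += value; }
            return;
        }
        int mid = (l + r) / 2;                             // partial overlap
        add(2 * node, l, mid, ul, ur, value);
        add(2 * node + 1, mid + 1, r, ul, ur, value);
        tree[node] = tree[2 * node] + tree[2 * node + 1];
    }

    int sum(int ql, int qr) { return sum(1, 0, n - 1, ql, qr); }

    private int sum(int node, int l, int r, int ql, int qr) {
        push(node, l, r);
        if (r < ql || qr < l) return 0;
        if (ql <= l && r <= qr) return tree[node];
        int mid = (l + r) / 2;
        return sum(2 * node, l, mid, ql, qr) + sum(2 * node + 1, mid + 1, r, ql, qr);
    }

    public static void main(String[] args) {
        LazySumTree t = new LazySumTree(new int[]{2, 4, 5, 1, 6, 4});
        t.add(1, 3, 10);
        // prints: 2 14 15 11 6 4
        for (int i = 0; i < t.n; i++) System.out.print(t.sum(i, i) + " ");
        System.out.println();
    }
}
```

A follow-up call such as `t.add(0, 5, 1)` stops at the root and leaves the value 1 pending in lazy[2] and lazy[3], which is the case where the deferred work actually pays off.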

Complexity Analysis

Operation	Without Lazy Propagation	With Lazy Propagation
Point Update	O(log n)	O(log n)
Range Update	O(n log n)	O(log n)
Range Query	O(log n)	O(log n)
Space	O(4n)	O(8n)

Common Use Cases

Range Addition/Subtraction: Add/subtract value to all elements in range
Range Set: Set all elements in range to a specific value
Combination queries: Update ranges and query multiple times efficiently
Interval scheduling: Track overlapping intervals with updates
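For the "Range Set" use case, a pending assignment needs a flag, because 0 is itself a valid value to assign. The sketch below supports only range assignment and range sum; combining it with range addition needs extra ordering rules that are not shown here. The class `RangeAssignTree` and its field and method names are hypothetical.

```java
// Illustrative range-assign tree; all names are assumptions.
class RangeAssignTree {
    final int n;
    final int[] tree, pendingValue;
    final boolean[] hasPending; // flag needed because 0 is a legal value to assign

    RangeAssignTree(int[] arr) {
        n = arr.length;
        tree = new int[4 * n];
        pendingValue = new int[4 * n];
        hasPending = new boolean[4 * n];
        build(arr, 1, 0, n - 1);
    }

    private void build(int[] arr, int node, int l, int r) {
        if (l == r) { tree[node] = arr[l]; return; }
        int mid = (l + r) / 2;
        build(arr, 2 * node, l, mid);
        build(arr, 2 * node + 1, mid + 1, r);
        tree[node] = tree[2 * node] + tree[2 * node + 1];
    }

    // Marks both children with a pending assignment, replacing any older one.
    private void markChildren(int node, int value) {
        pendingValue[2 * node] = value;     hasPending[2 * node] = true;
        pendingValue[2 * node + 1] = value; hasPending[2 * node + 1] = true;
    }

    private void push(int node, int l, int r) {
        if (!hasPending[node]) return;
        tree[node] = (r - l + 1) * pendingValue[node];
        if (l != r) markChildren(node, pendingValue[node]);
        hasPending[node] = false;
    }

    void assign(int ul, int ur, int value) { assign(1, 0, n - 1, ul, ur, value); }

    private void assign(int node, int l, int r, int ul, int ur, int value) {
        push(node, l, r);
        if (r < ul || ur < l) return;
        if (ul <= l && r <= ur) {
            tree[node] = (r - l + 1) * value;
            if (l != r) markChildren(node, value);
            return;
        }
        int mid = (l + r) / 2;
        assign(2 * node, l, mid, ul, ur, value);
        assign(2 * node + 1, mid + 1, r, ul, ur, value);
        tree[node] = tree[2 * node] + tree[2 * node + 1];
    }

    int sum(int ql, int qr) { return sum(1, 0, n - 1, ql, qr); }

    private int sum(int node, int l, int r, int ql, int qr) {
        push(node, l, r);
        if (r < ql || qr < l) return 0;
        if (ql <= l && r <= qr) return tree[node];
        int mid = (l + r) / 2;
        return sum(2 * node, l, mid, ql, qr) + sum(2 * node + 1, mid + 1, r, ql, qr);
    }
}
```

For the example array {2, 4, 5, 1, 6, 4}, calling `assign(1, 4, 0)` followed by `sum(0, 5)` returns 6. Without the flag, the pending zero would be indistinguishable from "nothing pending".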

Implementation Tips

Initialize lazy array to 0 (identity element for the operation)
Always check lazy values before processing any node
Clear lazy values after propagating them down
For set operations, use a flag to distinguish between “add” and “set” modes
Test edge cases: single element ranges, full array updates, overlapping updates