Lemma
math, backwards

Compression

High-dimensional data with too much redundancy. Change basis until most coordinates are small. Drop the small ones. Reconstruct what's left. JPEG, TF-IDF, and Huffman codes are the *same procedure*, just applied to pixels, words, and probabilities.

the skeleton
  1. 1
    change basis
    Pick a basis where signal concentrates in few coordinates.
  2. 2
    drop small
    Zero out (or quantise) coordinates below a threshold.
  3. 3
    reconstruct
    Invert the basis change with the surviving coordinates.
instances · 3
leans on
walk the instances
How Compression Works →

One three-step procedure under JPEG, TF-IDF, and the raw-pixel entropy floor — across graphics and ML.