Beyond Indistinguishability: Measuring Extraction Risk in LLM APIs

Ruixuan Liu · David Evans · Li Xiong
IEEE Symposium on Security and Privacy (S&P) 2026
This interactive demo runs GPT-2 entirely in your browser using Transformers.js + WebGPU. No data leaves your machine. The first visit downloads the model (~250 MB); it is cached afterwards.

Input Text

Summary

b (inextractability)
p* (max window prob)
η (greedy rate)
Extractable / Total
Extraction Cost Heatmap
Greedy Extractability

Each token is colored by the minimum b across all l-gram windows containing it. Red = low b (easy to extract), green = high b (hard to extract).

Color scale: Vulnerable (red) to Safe (green)
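
The heatmap coloring rule (each token takes the minimum b over every l-gram window that contains it) can be sketched as follows. This is a minimal illustration, not the demo's actual code: the function name `token_min_b` and the input layout (one b value per window, where window `i` covers tokens `i` through `i + l - 1`) are assumptions, and how b itself is computed is left to the paper.

```python
from typing import List


def token_min_b(window_b: List[float], l: int) -> List[float]:
    """Per-token minimum of b over all l-gram windows containing the token.

    Assumption: window_b[i] is the b value of the window covering
    tokens i .. i + l - 1, so there are len(window_b) + l - 1 tokens.
    """
    if not window_b:
        return []
    n_tokens = len(window_b) + l - 1
    token_b = [float("inf")] * n_tokens
    for start, b in enumerate(window_b):
        # Every token inside this window inherits b if it is the lowest so far.
        for t in range(start, start + l):
            token_b[t] = min(token_b[t], b)
    return token_b


if __name__ == "__main__":
    # Three overlapping 2-gram windows over four tokens (hypothetical values).
    print(token_min_b([3.0, 1.0, 2.0], l=2))
```

Under this layout a single low-b window pulls every token it covers toward the red end of the scale, which is why vulnerable regions show up as blocks rather than isolated tokens.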

Red = rank 1 (greedy-decodable), green = rank > 1. Contiguous red spans of length ≥ l are greedy-extractable windows.
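
Finding greedy-extractable windows reduces to locating runs of rank-1 tokens that are at least l long. The sketch below assumes that each token's rank is its position in the model's next-token ordering (1 meaning the model's top prediction); the function name `greedy_extractable_spans` and the half-open `(start, end)` span format are illustrative choices, not taken from the demo.

```python
from typing import List, Optional, Tuple


def greedy_extractable_spans(ranks: List[int], l: int) -> List[Tuple[int, int]]:
    """Return half-open (start, end) spans of consecutive rank-1 tokens
    whose length is at least l.

    Assumption: ranks[i] == 1 means token i is the model's top prediction
    given the preceding tokens, so greedy decoding would emit it.
    """
    spans: List[Tuple[int, int]] = []
    run_start: Optional[int] = None
    # A trailing sentinel (rank 0, never equal to 1) closes a run at the end.
    for i, rank in enumerate(ranks + [0]):
        if rank == 1:
            if run_start is None:
                run_start = i
        else:
            if run_start is not None and i - run_start >= l:
                spans.append((run_start, i))
            run_start = None
    return spans


if __name__ == "__main__":
    ranks = [1, 1, 1, 5, 1, 1, 2, 1, 1, 1, 1]  # hypothetical per-token ranks
    print(greedy_extractable_spans(ranks, l=3))
```

A run of length m ≥ l contains m − l + 1 distinct l-gram windows, so a caller that needs a window count rather than spans can derive it from each returned pair.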