SLM Inference Race: Autoregressive vs. Discrete Neural Flow

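Compares the output stability and decoding efficiency of Discrete Neural Flow, which decodes all positions in parallel in a number of sequential steps that does not grow with output length (O(1) steps), against a standard autoregressive GPT-2 baseline.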
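To make the O(1) comparison concrete, here is a minimal sketch that counts sequential model calls for each decoding style. It is not the Discrete Neural Flow or GPT-2 code behind this page: `toy_next_token`, `toy_refine`, the `MASK` id, and the choice of 8 refinement steps are all hypothetical stand-ins chosen for illustration.

```python
from typing import Callable, List, Tuple

MASK = -1  # hypothetical id marking a position that has not been decoded yet


def autoregressive_decode(
    next_token_fn: Callable[[List[int]], int], length: int
) -> Tuple[List[int], int]:
    """Generate one token per model call; returns (tokens, sequential_calls)."""
    tokens: List[int] = []
    calls = 0
    for _ in range(length):
        tokens.append(next_token_fn(tokens))  # each call depends on the previous one
        calls += 1
    return tokens, calls


def parallel_decode(
    refine_fn: Callable[[List[int], int], List[int]], length: int, num_steps: int
) -> Tuple[List[int], int]:
    """Refine every position at once for a fixed number of steps."""
    tokens = [MASK] * length
    calls = 0
    for step in range(num_steps):
        tokens = refine_fn(tokens, step)  # one call updates the whole sequence
        calls += 1
    return tokens, calls


def toy_next_token(prefix: List[int]) -> int:
    """Toy stand-in for an autoregressive model: emits the next index."""
    return len(prefix)


def toy_refine(tokens: List[int], step: int) -> List[int]:
    """Toy stand-in for a parallel denoiser: fills every position in one call."""
    return list(range(len(tokens)))


for n in (16, 64, 256):
    _, ar_calls = autoregressive_decode(toy_next_token, n)
    _, par_calls = parallel_decode(toy_refine, n, num_steps=8)
    print(f"length={n}: autoregressive calls={ar_calls}, parallel calls={par_calls}")
```

The autoregressive call count grows with the output length, while the parallel count stays at `num_steps`. Note that this counts sequential calls only: each parallel call still processes the full sequence, so the work per call grows with length even though the number of steps does not.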

Baseline: GPT-2 (Autoregressive)

Waiting for run...

SOTA: Discrete Neural Flow (Linear Schedule)

Waiting for run...
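
The "Linear Schedule" label is not defined on this page. One common reading, assumed here, is that the fraction of positions committed grows linearly with the step index, so that after step k of T roughly k/T of the sequence has been decoded. The sketch below computes how many positions each step would commit under that assumption; the function names are hypothetical and do not come from the demo's code.

```python
from typing import List


def linear_cumulative_targets(length: int, num_steps: int) -> List[int]:
    """Positions decoded after each step, assuming a linear schedule k / num_steps."""
    if length < 0 or num_steps <= 0:
        raise ValueError("length must be >= 0 and num_steps must be > 0")
    # Integer arithmetic avoids floating-point rounding; the last entry equals length.
    return [(length * k) // num_steps for k in range(1, num_steps + 1)]


def per_step_counts(length: int, num_steps: int) -> List[int]:
    """Number of newly decoded positions at each step (differences of the targets)."""
    targets = linear_cumulative_targets(length, num_steps)
    previous = [0] + targets[:-1]
    return [t - p for t, p in zip(targets, previous)]


print(linear_cumulative_targets(10, 4))
print(per_step_counts(10, 4))
```

Under this reading every step commits about the same number of positions, and the per-step counts always sum to the sequence length.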