SLM Inference Race: Autoregressive vs. Discrete Neural Flow
Testing the stability and O(1) parallel decoding efficiency of Discrete Neural Flow against a standard GPT-2 baseline.
Baseline: GPT-2 (Autoregressive)
Waiting for run...
SOTA: Discrete Neural Flow (Linear Schedule)
Waiting for run...