DFlash is a lightweight block diffusion model designed for speculative decoding. It enables efficient and high-quality parallel drafting. We will also open-source the training recipe soon, so you can ...