
Mercury 2, the first diffusion-based reasoning large language model, introduces a new approach to token generation by refining multiple tokens in parallel rather than sequentially. This shift enables Mercury 2 to achieve speeds of up to 1,000 tokens per secon…
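To make the contrast between sequential and parallel generation concrete, here is a minimal toy sketch. It is not Mercury 2's algorithm: the function names, the `tokens_per_step` parameter, and the use of a known target sentence in place of model predictions are all assumptions made for illustration. It only counts how many passes each style of loop needs to produce the same output.

```python
MASK = None  # sentinel for a position that has not been filled yet


def sequential_decode(target):
    """Toy autoregressive loop: commit one token per step, left to right."""
    out, steps = [], 0
    for tok in target:  # each iteration stands in for one model call
        out.append(tok)
        steps += 1
    return out, steps


def parallel_refine(target, tokens_per_step):
    """Toy diffusion-style loop: start fully masked, fill several positions per step.

    Assumption: positions are filled in order for simplicity; a real model
    would choose which positions to update and could also revise earlier ones.
    """
    if tokens_per_step < 1:
        raise ValueError("tokens_per_step must be at least 1")
    out = [MASK] * len(target)
    steps = 0
    while MASK in out:
        masked = [i for i, tok in enumerate(out) if tok is MASK]
        for i in masked[:tokens_per_step]:  # one pass updates many positions
            out[i] = target[i]
        steps += 1
    return out, steps


sentence = "the quick brown fox jumps over the lazy dog".split()
seq_out, seq_steps = sequential_decode(sentence)
par_out, par_steps = parallel_refine(sentence, tokens_per_step=4)
print(seq_steps, par_steps)  # 9 3
```

In this toy setup the nine-token sentence takes nine sequential steps but only three parallel passes. Fewer passes for the same output is the intuition behind the speed claim, though real throughput depends on how expensive each pass is.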






