-
1
ResDecode: Accelerating Large Language Models Inference via Residual Decoding Heads
Published 2025-06-01Subjects: “…speculative decoding…”
Get full text
Article -
2
Accelerating the inference of string generation-based chemical reaction models for industrial applications
Published 2025-03-01Subjects: Get full text
Article