Send the following on WhatsApp
Continue to ChatTriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding https://rmag.eu/triforce-lossless-acceleration-of-long-sequence-generation-with-hierarchical-speculative-decoding/