0.1.0

@feifeibear feifeibear released this 19 Sep 08:07
· 34 commits to main since this release
c3e97f5

Demonstrates speculative sampling using the BLOOM 560m and 7b1 models.
Supports KV cache optimization.
Only works with a batch size of 1.
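
For readers new to the technique, here is a minimal sketch of the accept/reject step at the core of speculative sampling. It is not this project's implementation: it uses toy probability lists instead of BLOOM model outputs, and the names `residual_distribution`, `sample_from`, and `verify_draft_token` are hypothetical, chosen only for illustration. Here `p` stands for the target model's next-token distribution and `q` for the draft model's.

```python
import random


def residual_distribution(p, q):
    """Return max(0, p - q) normalized to sum to 1 (hypothetical helper)."""
    diff = [max(0.0, pi - qi) for pi, qi in zip(p, q)]
    total = sum(diff)
    if total == 0.0:
        # p and q are identical, so fall back to the target distribution.
        return list(p)
    return [d / total for d in diff]


def sample_from(probs, rng):
    """Draw one token index according to the given probabilities."""
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


def verify_draft_token(token, p, q, rng):
    """Accept a draft token with probability min(1, p[token] / q[token]).

    Assumes `token` was sampled from q, so q[token] > 0.
    On rejection, a replacement token is drawn from the residual distribution.
    Returns (accepted, token).
    """
    accept_prob = min(1.0, p[token] / q[token])
    if rng.random() < accept_prob:
        return True, token
    return False, sample_from(residual_distribution(p, q), rng)


if __name__ == "__main__":
    rng = random.Random(0)
    p = [0.5, 0.5]  # toy target distribution
    q = [0.9, 0.1]  # toy draft distribution
    draft = sample_from(q, rng)
    print(verify_draft_token(draft, p, q, rng))
```

In a full decoding loop the draft model would propose several tokens, the target model would score them in a single forward pass, and this check would be applied to each proposed token in order until the first rejection.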