Skip to content

v0.2.1

Choose a tag to compare

@github-actions github-actions released this 16 Oct 20:01
· 10191 commits to main since this release
651c614

Major Changes

  • PagedAttention V2 kernel: Up to 20% end-to-end latency reduction
  • Support log probabilities for prompt tokens
  • AWQ support for Mistral 7B

What's Changed

New Contributors

Full Changelog: v0.2.0...v0.2.1