Commit 8f4d134

bump version 4.4.0 (#1774)
* bump version 4.4.0
* fix changelog
* fix wrong commit
1 parent f89fa2b commit 8f4d134

2 files changed: +18 -1 lines changed

CHANGELOG.md

Lines changed: 17 additions & 0 deletions
@@ -4,6 +4,23 @@
 
 ### Fixes and improvements
 
+## [v4.4.0](https://github.com/OpenNMT/CTranslate2/releases/tag/v4.4.0) (2024-09-09)
+**Removed**: Flash Attention support in the Python package due to significant package size increase with minimal performance gain.
+Note: Flash Attention remains supported in the C++ package with the `WITH_FLASH_ATTN` option.
+Flash Attention may be re-added in the future if substantial improvements are made.
+
+### New features
+* Support Llama3 (#1751)
+* Support Gemma2 (1772)
+* Add log probs for all tokens in vocab (#1755)
+* Grouped conv1d (#1749 + #1758)
+
+### Fixes and improvements
+* Fix pipeline (#1723 + #1747)
+* Some improvements in flash attention (#1732)
+* Fix crash when using return_alternative on CUDA (#1733)
+* Quantization AWQ GEMM + GEMV (#1727)
+
 ## [v4.3.1](https://github.com/OpenNMT/CTranslate2/releases/tag/v4.3.1) (2024-06-10)
 Note: Because of exceeding project's size on Pypi (> 20 GB), the release v4.3.0 was pushed unsuccessfully.
 

python/ctranslate2/version.py

Lines changed: 1 addition & 1 deletion
@@ -1,3 +1,3 @@
 """Version information."""
 
-__version__ = "4.3.1"
+__version__ = "4.4.0"
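
Since this release removes Flash Attention from the Python package, downstream code may want to branch on the version string bumped above. The following is a minimal sketch, not part of the commit: the helper names `parse_version` and `is_at_or_after_flash_attention_removal` are hypothetical, and it assumes plain `X.Y.Z` version strings with no pre-release suffixes.

```python
from typing import Tuple

# Assumption: per the changelog entry in this commit, 4.4.0 is the first
# release whose Python package ships without Flash Attention.
FLASH_ATTENTION_REMOVED_IN = (4, 4, 0)


def parse_version(version: str) -> Tuple[int, ...]:
    """Turn a plain 'X.Y.Z' string into a tuple of ints for comparison."""
    return tuple(int(part) for part in version.split("."))


def is_at_or_after_flash_attention_removal(version: str) -> bool:
    """Return True if `version` is 4.4.0 or later.

    This only compares against the removal release; it does not say
    whether an older release had Flash Attention in the first place.
    """
    return parse_version(version) >= FLASH_ATTENTION_REMOVED_IN


if __name__ == "__main__":
    for candidate in ("4.3.1", "4.4.0"):
        print(candidate, is_at_or_after_flash_attention_removal(candidate))
```

In practice the argument would be the `__version__` string defined in `python/ctranslate2/version.py`. Comparing integer tuples rather than raw strings avoids the pitfall where `"4.10.0" < "4.4.0"` lexicographically.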
