This repository was archived by the owner on Jun 3, 2025. It is now read-only.
DeepSparse v0.11.0
·
2 commits
to release/0.11
since this release
New Features:
- High-performance sparse quantized convolutional neural networks supported on AVX2 systems.
- CCX detection added to the DeepSparse Engine for AMD systems.
deepsparse.serverintegration and CLIs added with Hugging Face transformers pipelines support.
Changes:
Performance improvements made for
- FP32 sparse BERT models
- batch size 1 networks
- quantized sparse BERT models
- Pooling operations
Resolved Issues:
- When hyperthreads are disabled in the BIOS, core/socket information on certain systems can now be detected.
- Hugging Face transformers validation flows for QQP now giving correct accuracy metrics.
- PyTorch downloaded for YOLO model stubs now supported.
Known Issues:
- When running NanoDet-Plus-m, the DeepSparse Engine will fail with an assertion (See #279). A hotfix is being pursued.