v0.8.0: performance improvement, CUDA feature detection and kernel-memory integration

@SanftMonster SanftMonster released this 12 Nov 19:53
597d329

What's Changed

  • fix: binary not copied on MAC platform. by @AsakusaRinne in #238
  • docs: add related repos. by @AsakusaRinne in #240
  • docs: add example models for v0.7.0. by @AsakusaRinne in #243
  • Adapts to SK Kernel Memory by @xbotter in #226
  • CodeQL Pointer Arithmetic by @martindevans in #246
  • build(deps): bump xunit from 2.5.0 to 2.6.1 by @dependabot in #233
  • build(deps): bump xunit.runner.visualstudio from 2.5.0 to 2.5.3 by @dependabot in #234
  • build(deps): bump Swashbuckle.AspNetCore from 6.2.3 to 6.5.0 by @dependabot in #235
  • build(deps): bump Microsoft.SemanticKernel from 1.0.0-beta1 to 1.0.0-beta4 by @dependabot in #231
  • feat(kernel-memory): avoid loading model twice. by @AsakusaRinne in #248
  • GitHub Action Pipeline Improvements by @martindevans in #245
  • Update README.md by @hswlab in #252
  • Removed some CI targets by @martindevans in #253
  • Removed Old Targets From CI matrix by @martindevans in #254
  • Align with llama.cpp b1488 by @SignalRT in #249
  • Enhance framework compatibility by @Uralstech in #259
  • Update LLama.Examples using Spectre.Console by @xbotter in #255
  • Context Size Autodetect by @martindevans in #263
  • Prevent duplication of user prompts / chat history in ChatSession. by @philippjbauer in #266
  • build: add package for kernel-memory integration. by @AsakusaRinne in #244
  • Exposed YaRN scaling parameters in IContextParams by @martindevans in #257
  • Update ToLLamaSharpChatHistory extension method to be public and support semantic-kernel author roles by @kidkych in #274
  • Runtime detection MacOS by @SignalRT in #258
  • feat: cuda feature detection. by @AsakusaRinne in #275

New Contributors

Full Changelog: v0.7.0...v0.8.0