Skip to content

Releases: aime-team/aime-api-server

aime-api-server-0.9.0

29 Nov 15:08

Choose a tag to compare

Changes

Version 0.9.0

aime-api-server-0.8.5

29 Aug 15:11

Choose a tag to compare

Changes

Version 0.8.5

  • Stable Diffusion 3 (sd3) example endpoint added
  • Tortois TTS example endpoint added
  • Improved and optimized ffmpeg image and audio conversion
  • Input parameters attribute 'keep_aspect_ratio' added as option how to rescale images
  • Llama3 and Mixtral endpoints: added max_gen_tokens input parameter

aime-api-server-0.8.1

27 Jun 09:13

Choose a tag to compare

Changes

Version 0.8.1

  • New input parameter type 'json', for submitting escaped raw JSON data to endpoint, added
  • Llama3 (llama3_chat) instruct chat example endpoint added
  • Mixtral 8x7B, 8x22B (mixtral_chat) instruct chat example endpoint added
  • Improved authentication mechanism
  • improved benchmark tool to benchmark instruct chat models throughput

aime-api-server-0.8.0

04 Apr 14:35

Choose a tag to compare

AIME API Server - The Scalable Model Inference API Server

With AIME API one deploys deep learning models (Pytorch, Tensorflow) through a job queue as scalable API endpoint capable of serving millions of model inference requests.

Turn a console Python script to a secure and robust web API acting as your interface to the mobile, browser and desktop world.

The central part is the AIME API Server, an efficient asynchronous HTTP/HTTPS web server which can be used stand-alone web server or integrated into Apache, NGINX or similar web servers. It takes the client requests, load balances the requests and distributes them to the API compute workers.

Features

  • Fast - asynchronous and multi process API server
  • Scalable & Robust- distributed cluster ready architecture
  • Secure - type safe interface and input validation
  • Aggregates API requests to GPU batch jobs for maximum throughput
  • Easy integratable into exisiting Python and Tensorflow projects
  • High performance image and audio input/ouput conversion for common web formats
  • Pythonic - easily extendable in your favourite programming language

Changes

Version 0.8.0

Initial Release