Image analysis with CLIP model

This project pulls together an end-to-end image analysis API built on text-image embeddings. The intent is to build it in several phases:

Build a basic API to upload and download images along with their metadata
Integrate pre-trained CLIP models to compute and save an embedding for each image at ingestion
Build out search functionality to allow image search by natural-language query, combined with metadata filters
Test training of a custom CLIP-style model, including the image and text encoders and the projection layers that map both into a shared embedding space
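
As a rough illustration of the search phase, the sketch below ranks stored images against a query embedding by cosine similarity after applying exact-match metadata filters. It is not code from this repository: `ImageRecord`, `cosine_similarity`, and `search` are hypothetical names, and the three-dimensional vectors are toy stand-ins for the embeddings a CLIP image or text encoder would produce.

```python
from __future__ import annotations

import math
from dataclasses import dataclass, field


@dataclass
class ImageRecord:
    """Hypothetical stored image: an ID, its embedding, and free-form metadata."""
    image_id: str
    embedding: list[float]
    metadata: dict[str, str] = field(default_factory=dict)


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def search(
    records: list[ImageRecord],
    query_embedding: list[float],
    filters: dict[str, str] | None = None,
    top_k: int = 5,
) -> list[tuple[str, float]]:
    """Return up to top_k (image_id, score) pairs, best match first."""
    filters = filters or {}
    # Keep only records whose metadata matches every requested key/value pair.
    candidates = [
        r for r in records
        if all(r.metadata.get(key) == value for key, value in filters.items())
    ]
    scored = [
        (r.image_id, cosine_similarity(r.embedding, query_embedding))
        for r in candidates
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


# Toy data: in the real pipeline these vectors would be saved at ingestion.
records = [
    ImageRecord("cat.jpg", [0.9, 0.1, 0.0], {"source": "upload"}),
    ImageRecord("dog.jpg", [0.1, 0.9, 0.0], {"source": "upload"}),
    ImageRecord("car.jpg", [0.0, 0.1, 0.9], {"source": "crawl"}),
]

# Stand-in for the text embedding of a natural-language query.
query = [1.0, 0.0, 0.0]
results = search(records, query, filters={"source": "upload"}, top_k=2)
for image_id, score in results:
    print(f"{image_id}: {score:.3f}")
```

A linear scan like this is only reasonable for small collections; a larger deployment would likely swap in a vector index, which is outside what this README specifies.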

At some point it may also require a basic front-end to demonstrate the API functionality.

