ComicFinder is an AI-powered content-based recommendation system built using Python and OpenAI Embeddings. It helps users discover semantically similar manga, manhwa, manhua, and webtoons based on natural language descriptions, genres, or titles β ideal for fans seeking personalized recommendations beyond keyword search.
https://comicfinder.streamlit.app/
- π Recommends similar manga/manhwa/manhua/webtoon based on descriptions or titles
- π¦ Utilizes precomputed
clean_embeddings.npy
for fast results - π§ Embedding generation using OPENAI embeddings api
- β‘ Fast cosine similarity search for real-time recommendation
- π₯οΈ Clean Streamlit-based frontend
- π Organized data and scripts for easy retraining or extension
comic-recommender/
βββ app.py # Main application script
βββ data/
β βββ data.csv # Original manhwa dataset
β βββ clean_data.csv # Cleaned and preprocessed data
β βββ clean_embeddings.npy # (Ignored from Git, must be downloaded separately)
βββ scripts/
β βββ clean_dataset.py # Data cleaning script
β βββ generate_embeddings.py # Embedding generation
β βββ recommend.py # Similarity-based recommendations but CLI version
βββ .env # Store API keys
βββ requirements.txt # Python dependencies
βββ README.md # You're here!
git clone https://github.com/AdityaEXP/ComicFinder.git
cd ComicFinder
# Optional: Create virtual environment
python -m venv venv
source venv/bin/activate # On Windows: venv\Scripts\activate
pip install -r requirements.txt
streamlit run .\app.py
- Find romance manhwa similar to What's Wrong with Secretary Kim?
- Get fantasy webtoon recommendations with strong male leads
- Discover hidden manga gems with character development arcs
- Replace genre filters with AI-powered natural language queries
Since clean_embeddings.npy is large, itβs not included in this repo. π¦ Download clean_embeddings.npy Or you can also generate the clean_embeddings.npy using your own openai api key it will cost around $0.02 per generation
Create a .env file for your OpenAI API Key
OPENAIKEY=sk-xxxxxx
MIT β free to use, modify, and distribute.
Aditya π οΈ AI + Python + Web3 Enthusiast
This project uses data inspired by or adapted from the following Kaggle dataset:
π Kaggle - Manhwa and Webtoon Dataset
Credit to Victor Soeiro for compiling and sharing this dataset.