Certificate AI Text Extraction

A Next.js application that extracts text from certificate images and PDFs using Google Gemini AI and traditional OCR.

Features

Upload certificate files (JPG, PNG, PDF)
Extract text using Google Gemini AI, with Tesseract.js OCR (Optical Character Recognition) as a fallback
Parse common certificate fields (name, roll number, marks, etc.)
Modern, responsive UI built with TailwindCSS
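
To illustrate the field-parsing feature, here is a minimal sketch of pulling a few labelled fields out of raw extracted text with regular expressions. The `CertificateFields` shape, the `parseCertificateFields` function, and the label patterns are all assumptions made for illustration; the project's actual parser may work differently.

```typescript
// Hypothetical field shape; the project's real parser may expose other fields.
interface CertificateFields {
  name?: string;
  rollNumber?: string;
  marks?: string;
}

// Assumed label spellings. Real certificates vary, so these would need tuning.
const FIELD_PATTERNS: Record<keyof CertificateFields, RegExp> = {
  name: /^\s*(?:student\s+)?name\s*[:\-]\s*(.+)$/im,
  rollNumber: /^\s*roll\s*(?:no\.?|number)\s*[:\-]\s*(\S+)/im,
  marks: /^\s*(?:total\s+)?marks(?:\s+obtained)?\s*[:\-]\s*(\d+(?:\s*\/\s*\d+)?)/im,
};

// Returns only the fields whose label was found in the text.
function parseCertificateFields(rawText: string): CertificateFields {
  const fields: CertificateFields = {};
  for (const key of Object.keys(FIELD_PATTERNS) as (keyof CertificateFields)[]) {
    const match = FIELD_PATTERNS[key].exec(rawText);
    if (match) {
      fields[key] = match[1].trim();
    }
  }
  return fields;
}
```

Fields that are not found are simply left out of the result, so the UI can still show the raw text alongside whatever was recognised.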

Setup

Install dependencies:

npm install

Set up Google Gemini AI API key:
- Get your API key from Google AI Studio
- Create a .env.local file in the project root
- Add your API key: GEMINI_API_KEY=your_api_key_here

Run the development server:

npm run dev

Open http://localhost:3000 in your browser.
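
Next.js loads .env.local automatically, and a variable without the NEXT_PUBLIC_ prefix is only available to server-side code. The sketch below shows one way server code could fail early when the key from the setup steps is missing. The helper name `requireGeminiApiKey` is hypothetical and not part of this project.

```typescript
// Hypothetical helper: returns the Gemini key or throws a descriptive error.
// Pass in process.env from server-side code, e.g. requireGeminiApiKey(process.env).
function requireGeminiApiKey(env: Record<string, string | undefined>): string {
  const key = env.GEMINI_API_KEY?.trim();
  // Treat the placeholder from the setup instructions as "not configured".
  if (!key || key === "your_api_key_here") {
    throw new Error(
      "GEMINI_API_KEY is missing; add it to .env.local and restart the dev server."
    );
  }
  return key;
}
```

Restart `npm run dev` after editing .env.local, since environment files are read when the server starts.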

Usage

Upload a certificate file (JPG, PNG, or PDF)
Click "Extract Text" to process the document
View the extracted data and raw text

Technology Stack

Next.js 14
React 18
Google Gemini AI for advanced text extraction from images
Tesseract.js for OCR (fallback)
Formidable for file upload handling
pdf-parse for direct PDF text extraction
TailwindCSS for styling
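
The stack lists Tesseract.js as a fallback to Gemini. How the project decides to fall back is not documented here, so the following is only a sketch of one possible pattern: try the primary extractor, and use the fallback if it throws or returns empty text. The `Extractor` type and `extractWithFallback` function are hypothetical names.

```typescript
// Hypothetical extractor signature: takes file bytes, resolves to text.
type Extractor = (file: Uint8Array) => Promise<string>;

interface ExtractionResult {
  text: string;
  engine: "primary" | "fallback";
}

// Tries the primary extractor first; falls back on error or empty output.
async function extractWithFallback(
  file: Uint8Array,
  primary: Extractor,
  fallback: Extractor
): Promise<ExtractionResult> {
  try {
    const text = await primary(file);
    if (text.trim().length > 0) {
      return { text, engine: "primary" };
    }
  } catch {
    // Assumption: any primary failure (network, quota, bad key) triggers the fallback.
  }
  return { text: await fallback(file), engine: "fallback" };
}
```

Under this sketch, the primary extractor would wrap the Gemini call and the fallback would wrap Tesseract.js; both wrappers are left out because their exact usage in this project is not shown.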

API Endpoints

POST /api/gemini-ocr - Processes uploaded files using Gemini AI and returns extracted text
POST /api/ocr - Legacy OCR endpoint using Tesseract.js
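
As a rough illustration of calling these endpoints from the browser, the sketch below posts a file as multipart form data. The form field name `file`, the JSON response, and the function names are assumptions; check the route handlers under src for the actual contract.

```typescript
// Assumption: the route reads the upload from a multipart field named "file".
const UPLOAD_FIELD = "file";

// Builds the multipart body for an upload request.
function buildUploadBody(file: Blob, filename: string): FormData {
  const body = new FormData();
  body.append(UPLOAD_FIELD, file, filename);
  return body;
}

// Posts the file to the given endpoint and returns the parsed response.
// Assumption: the endpoint answers with JSON; its exact shape is not documented here.
async function extractCertificateText(
  file: Blob,
  filename: string,
  endpoint: string = "/api/gemini-ocr"
): Promise<unknown> {
  const response = await fetch(endpoint, {
    method: "POST",
    body: buildUploadBody(file, filename),
  });
  if (!response.ok) {
    throw new Error(`Extraction failed with HTTP ${response.status}`);
  }
  return response.json();
}
```

Passing "/api/ocr" as the third argument would target the legacy Tesseract.js endpoint instead, assuming it accepts the same upload format.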
