- User interface of initial page.
- User interface while in use.
- networkx
- nltk
- numpy
- opencv-python
- pytesseract
- python-dotenv
- quart
- Quart-CORS
- scipy
- Tesseract OCR engine
- Python packages using pip:
  `python -m pip install networkx nltk numpy opencv-python pytesseract python-dotenv quart Quart-CORS scipy`
- Installing Tesseract OCR engine:
  - Install to the default location (recommended): `C:\\Program Files\\Tesseract-OCR\\tesseract.exe`
  - Or change the value of `TESSERACT` in the `.env` file accordingly.
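As a rough illustration of how the `TESSERACT` setting could be consumed, the sketch below falls back to the default install location when the variable is unset. The helper names `resolve_tesseract_path` and `configure_pytesseract` are hypothetical, and how `server.py` actually reads the value is not shown here, so treat this as an assumption rather than the project's implementation.

```python
import os
from pathlib import Path

# Default install location named above (raw string, so single backslashes).
DEFAULT_TESSERACT = r"C:\Program Files\Tesseract-OCR\tesseract.exe"


def resolve_tesseract_path(env=None):
    """Return the TESSERACT value from `env`, or the default location."""
    env = os.environ if env is None else env
    return env.get("TESSERACT") or DEFAULT_TESSERACT


def configure_pytesseract(env=None):
    """Point pytesseract at the resolved executable (hypothetical helper)."""
    import pytesseract  # imported lazily so the resolver works without it

    path = resolve_tesseract_path(env)
    if not Path(path).is_file():
        raise FileNotFoundError(f"Tesseract executable not found: {path}")
    # pytesseract looks up the executable through this module attribute.
    pytesseract.pytesseract.tesseract_cmd = path
    return path
```

If `python-dotenv` is used to load `.env` into the process environment first, `resolve_tesseract_path()` with no arguments would pick up the configured value.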
- Enter the following command in the `server` directory: `python server.py`
- Configured to run on `PORT 5000` by default.
- `PORT` can be changed via `.env`. Important: the client `.env` must be edited accordingly.
- Visit http://127.0.0.1:5000/api/ to test the server.
- Visit http://127.0.0.1:5000/api/test/ocr and http://127.0.0.1:5000/api/test/summary to test both engines.
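The three test URLs above can also be checked from a script instead of a browser. The following is a minimal sketch using only the standard library. The function names are hypothetical, and because the response format of the endpoints is not documented here, it reports only the HTTP status code.

```python
import urllib.error
import urllib.request

# Paths taken from the test URLs listed above.
ENDPOINTS = ("/api/", "/api/test/ocr", "/api/test/summary")


def build_urls(port=5000, host="127.0.0.1"):
    """Build the test URLs for the given host and port."""
    return [f"http://{host}:{port}{path}" for path in ENDPOINTS]


def check(url, timeout=10):
    """Return the HTTP status code, or None if the server is unreachable."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status
    except urllib.error.HTTPError as exc:
        return exc.code  # server answered, but with an error status
    except (urllib.error.URLError, OSError):
        return None  # connection refused, timeout, etc.


def smoke_test(port=5000):
    """Map each test URL to its status code (or None)."""
    return {url: check(url) for url in build_urls(port)}
```

With the server running, `smoke_test()` returns one entry per URL. If `PORT` was changed in `.env`, pass the same value, for example `smoke_test(port=8080)`.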
- Use a simple web server to serve the `dist` folder in the `client` directory.
- For example, execute `python -m http.server` inside `dist` to serve the files of the directory, then visit the given link (i.e. http://localhost:8000/) to access the web client.
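The same static server can be started from a script, which avoids having to change into `dist` first. This is a sketch built on the standard-library `http.server` module. The relative path `client/dist` and the function names are assumptions for illustration.

```python
from functools import partial
from http.server import SimpleHTTPRequestHandler, ThreadingHTTPServer


def make_server(directory, port=8000, host="127.0.0.1"):
    """Create a static file server rooted at `directory`."""
    # The `directory` argument makes the handler serve from that folder
    # rather than the current working directory.
    handler = partial(SimpleHTTPRequestHandler, directory=str(directory))
    return ThreadingHTTPServer((host, port), handler)


def serve(directory="client/dist", port=8000):
    """Serve `directory` until interrupted with Ctrl+C."""
    with make_server(directory, port) as httpd:
        print(f"Serving {directory} at http://localhost:{port}/")
        try:
            httpd.serve_forever()
        except KeyboardInterrupt:
            pass
```

Calling `serve()` from the repository root would be roughly equivalent to running `python -m http.server` inside `dist`, except that this sketch binds to 127.0.0.1 only.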
- Re-implement API in `JavaScript` and `Express.js` while keeping OCR/Summarization functionality in Python.

