This project combines Optical Character Recognition (OCR) with a Vision-Language Model (VLM) to extract text from images and generate concise summaries.
git clone https://github.com/Utkarsh251106/Handwritten-Text-Recognition-and-Processing.git
conda create -n venv python=3.12.7 -y
conda activate venv
pip install -r requirements.txt
To run the code
# Start the Jupyter Notebook environment using the command
jupyter notebook