
Utilizes PyTesseract for OCR and a pre-trained VLM (Hugging Face Transformers) to extract and summarize text from images.


Utkarsh251106/Handwritten-Text-Recognition-and-Processing


Description

This project combines Optical Character Recognition (OCR) with a Vision-Language Model (VLM) to extract text from images and generate concise summaries.
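The two-stage flow described above (OCR first, then summarization) can be sketched as follows. This is a minimal illustration, not the notebook's actual code: the function names `default_ocr`, `normalize_text`, and `extract_and_summarize` are hypothetical, and because the README does not name the model it uses, the summarizer is passed in as a plain callable rather than tied to a specific Hugging Face model.

```python
import re
from typing import Callable, Dict


def default_ocr(image_path: str) -> str:
    """Run Tesseract on an image file via PyTesseract.

    Imports are done lazily so the rest of the module can be used
    (and tested) without the OCR dependencies installed.
    """
    from PIL import Image
    import pytesseract

    return pytesseract.image_to_string(Image.open(image_path))


def normalize_text(raw: str) -> str:
    """Collapse the stray newlines and repeated spaces common in OCR output."""
    return re.sub(r"\s+", " ", raw).strip()


def extract_and_summarize(
    image_path: str,
    summarize: Callable[[str], str],
    ocr: Callable[[str], str] = default_ocr,
) -> Dict[str, str]:
    """Extract text from an image, then summarize it.

    `summarize` is any function mapping text to a summary; in this project
    it would wrap a pre-trained model (an assumption, the model is not
    named in the README).
    """
    text = normalize_text(ocr(image_path))
    if not text:
        # Nothing was recognized, so skip the model call entirely.
        return {"text": "", "summary": ""}
    return {"text": text, "summary": summarize(text)}
```

Keeping the OCR and summarization steps as injectable callables makes it possible to try the wiring with stand-ins, for example `extract_and_summarize("note.png", summarize=lambda t: t[:100])`, before loading a large model.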

How to run it?

Step 1: Clone the Repository:

git clone https://github.com/Utkarsh251106/Handwritten-Text-Recognition-and-Processing.git

Step 2: Create a conda environment:

conda create -n venv python=3.12.7 -y
conda activate venv

Step 3: Install the requirements:

pip install -r requirements.txt
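Note that PyTesseract is only a wrapper: the Tesseract OCR engine itself has to be installed separately and be available on your PATH. A quick way to confirm the environment is ready is a small check like the one below. It is a sketch under assumptions: the helper name `check_environment` is hypothetical, and the module list is a guess at what requirements.txt contains, so adjust it as needed.

```python
import importlib.util
import shutil
from typing import Dict, Iterable


def check_environment(
    modules: Iterable[str] = ("pytesseract", "PIL", "transformers"),  # assumed list
    binary: str = "tesseract",
) -> Dict[str, bool]:
    """Report which Python modules are importable and whether the OCR binary is on PATH."""
    report = {name: importlib.util.find_spec(name) is not None for name in modules}
    report[f"{binary} (binary)"] = shutil.which(binary) is not None
    return report


if __name__ == "__main__":
    for name, ok in check_environment().items():
        print(f"{name}: {'found' if ok else 'MISSING'}")
```

If the binary is reported as missing, install Tesseract with your system's package manager before running the notebook.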

Step 4: Run the code:

# Start the Jupyter Notebook environment
jupyter notebook

Then open Code.ipynb in the Jupyter interface and run its cells.
