Skip to content

Files

Latest commit

aef4cb6 · Jun 21, 2021

History

History
This branch is 7 commits behind HarshCasper/Rotten-Scripts:master.

PDF_Tables_Extractor

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
Dec 11, 2020
Jun 21, 2021

Readme.md

PDF tables extractor.

  • This script will convert the tables in the PDF file into CSV files.
  • For each tables in the PDF new CSV file will be generated.
  • Technology used: Tabula-py

Setting up:

  • Install the requirements
  $  pip install tabula-py 

Running the script:

  $  # Specify the no. of pages to scan
  $ python3 pdf_to_csv.py <no. of pages>    
  $  # If you want all pages to scan
  $ python3 pdf_to_csv.py all