PDF tables extractor.

This script extracts the tables from a PDF file and converts them into CSV files.
A new CSV file is generated for each table found in the PDF.
Technology used: Tabula-py

Setting up:

Install the requirements (tabula-py wraps tabula-java, so a Java runtime must also be installed):

  $ pip install tabula-py

Running the script:

  $ # Specify the no. of pages to scan
  $ python3 pdf_to_csv.py <no. of pages>
  $ # Scan all pages
  $ python3 pdf_to_csv.py all
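
The source of pdf_to_csv.py is not reproduced in this README, so the following is only a minimal sketch of how such a script might handle the two invocations above with tabula-py. It assumes that "<no. of pages>" means the first N pages of the document. The names `pages_spec`, `extract_tables`, and `main`, the input file `input.pdf`, and the output names `table_1.csv`, `table_2.csv`, ... are hypothetical and not taken from the actual script.

```python
from pathlib import Path


def pages_spec(arg):
    """Turn the command-line argument into a value for tabula-py's `pages` option.

    Assumption: a number N means "scan pages 1 through N".
    """
    if arg.lower() == "all":
        return "all"
    count = int(arg)  # raises ValueError for non-numeric input
    if count < 1:
        raise ValueError("the number of pages must be at least 1")
    return "1" if count == 1 else f"1-{count}"


def extract_tables(pdf_path, pages, out_dir="."):
    """Write every table found on the selected pages to its own CSV file."""
    # Imported here so that argument parsing can be used without tabula-py installed.
    import tabula

    # With multiple_tables=True, read_pdf returns a list of DataFrames,
    # one per detected table.
    tables = tabula.read_pdf(pdf_path, pages=pages, multiple_tables=True)

    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    written = []
    for index, table in enumerate(tables, start=1):
        target = out / f"table_{index}.csv"  # hypothetical naming scheme
        table.to_csv(target, index=False)
        written.append(target)
    return written


def main(argv):
    """Entry point sketch: argv holds the arguments that follow the script name."""
    if len(argv) != 1:
        raise SystemExit("usage: python3 pdf_to_csv.py <no. of pages>|all")
    # "input.pdf" is a placeholder; the real script may locate its input differently.
    for path in extract_tables("input.pdf", pages_spec(argv[0])):
        print(f"wrote {path}")
```

To use this as a command-line script, one would call `main(sys.argv[1:])` at the bottom of the file after importing `sys`. Keeping the argument handling in `pages_spec` separate from the extraction makes the page-range logic easy to check without a PDF or a Java runtime at hand.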
