All codes related to pdf parsing.

Harsh Parikh 2352bd3c3c added the intial codebase for parsing documents		vor 2 Jahren
complaints	2352bd3c3c added the intial codebase for parsing documents	vor 2 Jahren
docker	2352bd3c3c added the intial codebase for parsing documents	vor 2 Jahren
.gitignore	2352bd3c3c added the intial codebase for parsing documents	vor 2 Jahren
LICENSE	95a4f66c64 Initial commit	vor 2 Jahren
README.md	2352bd3c3c added the intial codebase for parsing documents	vor 2 Jahren

pdf_parser

All the codes related to pdf parsing

Launch the terminal.
Enter the following command to go to the base directory:
```
cd ~
```
1. Make a new directory Code by using the following command: bash mkdir Code
Pull the current repository by entering the following command:
```
git pull gogs@git.fafadiatech.com:harsh/pdf_parser.git
```
TODO LIST:
1. Implementing OCR on tika.
2. Dockerising the whole apache tika with ocr.
3. Testing the re on the scanned pdfs.