All codes related to pdf parsing.
Harsh Parikh deff3db35b Updated parser for acronyms | 2 years ago | |
---|---|---|
complaints | 2 years ago | |
docker | 2 years ago | |
.gitignore | 2 years ago | |
LICENSE | 2 years ago | |
README.md | 2 years ago |
virtual environment
.
Go to the home directory by typing the following command.
cd ~
Installs
using the following command:
bash
mkdir Installs
Create a virtual environment venv
python3 -mvenv venv
bash
source ~/Installs/venv/bin/activate
Setting up apache-tika
Enter the following command to go to the base directory:
cd ~
bash
wget https://www.apache.org/dyn/closer.lua/tika/1.28.4/tika-server-1.28.4.jar
Check if java is installed in your machine by running the following command:
java --version
If java is not installed in your local machine, please refer to this documentation.
Running the tika server
Create a new screen called apache-tika
screen -S apache-tika
bash
java -jar {your tika server}
Enter the following command to go to the base directory:
cd ~
Code
by using the following command:
bash
mkdir Code
Pull the current repository by entering the following command:
git pull gogs@git.fafadiatech.com:harsh/pdf_parser.git