All codes related to pdf parsing.
harsh 6d7c714390 Fixed formatting error on README.md | %!s(int64=2) %!d(string=hai) anos | |
---|---|---|
complaints | %!s(int64=2) %!d(string=hai) anos | |
docker | %!s(int64=2) %!d(string=hai) anos | |
.gitignore | %!s(int64=2) %!d(string=hai) anos | |
LICENSE | %!s(int64=2) %!d(string=hai) anos | |
README.md | %!s(int64=2) %!d(string=hai) anos |
virtual environment
.
Go to the home directory by typing the following command.
cd ~
Installs
using the following command:
bash
mkdir Installs
Create a virtual environment venv
python3 -mvenv venv
bash
source ~/Installs/venv/bin/activate
Setting up apache-tika
Enter the following command to go to the base directory:
cd ~
bash
wget https://www.apache.org/dyn/closer.lua/tika/1.28.4/tika-server-1.28.4.jar
Check if java is installed in your machine by running the following command:
java --version
If java is not installed in your local machine, please refer to this documentation.
Running the tika server
Create a new screen called apache-tika
screen -S apache-tika
bash
java -jar {your tika server}
Enter the following command to go to the base directory:
cd ~
Code
by using the following command:
bash
mkdir Code
Pull the current repository by entering the following command:
git pull gogs@git.fafadiatech.com:harsh/pdf_parser.git