|
@@ -76,7 +76,31 @@
|
|
|
git pull gogs@git.fafadiatech.com:harsh/pdf_parser.git
|
|
|
```
|
|
|
|
|
|
+### Running the docker file:
|
|
|
+1. Launch the terminal.
|
|
|
+2. Change the directory to the docker file by the following command.
|
|
|
+ ```bash
|
|
|
+ cd ~/Code/pdf_parser/docker/
|
|
|
+ ```
|
|
|
+3. Check if docker is installed in your machine using the command below. If it throws an error, refer to this [documentation](https://docs.docker.com/engine/install/)
|
|
|
+ ```bash
|
|
|
+ docker ps
|
|
|
+ ```
|
|
|
+4. Pull the required images using the following command.
|
|
|
+ ```bash
|
|
|
+ docker pull docker-compose.yml
|
|
|
+ ```
|
|
|
+5. Build the docker volume using the following command.
|
|
|
+ ```bash
|
|
|
+ docker-compose build
|
|
|
+ ```
|
|
|
+6. Activate a new screen and activate the containers using the following commands.
|
|
|
+ ```bash
|
|
|
+ screen -S docker
|
|
|
+ docker-compose up
|
|
|
+ ```
|
|
|
+
|
|
|
### TODO LIST:
|
|
|
-1. Implementing OCR on tika.
|
|
|
-1. Dockerising the whole apache tika with ocr.
|
|
|
-1. Testing the re on the scanned pdfs.
|
|
|
+- [ ] Implementing OCR on tika.
|
|
|
+- [x] Dockerising apache-tika.
|
|
|
+- [ ] Testing the re on the scanned pdfs
|