ChatDoc+

This repository contains an app that lets users ask questions about a provided document or hold a free-form question-answering conversation with a chatbot. The app is built with Streamlit, LlamaIndex, LangChain, and llama-cpp-python, and is powered by the NeuralBeagle14-7B-GGUF LLM by Maxime Labonne.

  • No OpenAI API key required.
  • Runs locally on Macs (tested on an Apple Silicon Mac with macOS Ventura 13.5.1).
  • You can turn off your internet connection.

Requirements

  • Python 3.11 or above (should also work with Python 3.10)
  • pip (Python’s package installer)

Installation

Follow these steps to install and set up the app:

  1. Clone the Repository and Navigate to the Directory:

    git clone https://github.com/hbacard/chatdoc-plus.git && cd chatdoc-plus
  2. Create a Python Virtual Environment:

    python3 -m venv .venv
  3. Activate the Environment:

    • On Unix-based systems:

      source .venv/bin/activate
  4. Install Dependencies:

    pip install -r requirements.txt
    • Note: On Linux the installation of llama-cpp-python may fail; if so, install the build toolchain and retry:
      • sudo apt-get install build-essential
      • pip install -r requirements.txt
  5. Download the gguf file for NeuralBeagle14-7B-GGUF:

    python3 download_model.py
  6. Enable GPU with llama-cpp-python:

    • On Apple Silicon (METAL):

      CMAKE_ARGS="-DLLAMA_METAL=on" FORCE_CMAKE=1 pip install --force-reinstall llama-cpp-python==0.1.83 --no-cache-dir
      
    • On Linux:

      CMAKE_ARGS="-DLLAMA_CUBLAS=on" FORCE_CMAKE=1 pip install --force-reinstall llama-cpp-python==0.1.83 --no-cache-dir
  7. Run the App:

    streamlit run app.py
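Step 5 above runs download_model.py to fetch the GGUF weights. As a rough, hedged sketch of what such a script does (not the repository's actual code), the standard Hugging Face "resolve" URL pattern can be built like this; the repo id mlabonne/NeuralBeagle14-7B-GGUF and the destination path are assumptions based on the model card:

```python
from urllib.parse import quote
from urllib.request import urlretrieve  # only needed if you actually download

def hf_file_url(repo_id: str, filename: str, revision: str = "main") -> str:
    """Build the direct-download URL Hugging Face serves repository files from."""
    return f"https://huggingface.co/{repo_id}/resolve/{quote(revision)}/{quote(filename)}"

url = hf_file_url("mlabonne/NeuralBeagle14-7B-GGUF", "neuralbeagle14-7b.Q5_K_M.gguf")
print(url)
# To fetch the multi-GB file into ./models/, uncomment:
# urlretrieve(url, "models/neuralbeagle14-7b.Q5_K_M.gguf")
```

The quantized Q5_K_M file is several gigabytes, so make sure you have disk space and a stable connection before downloading.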

Open your browser and navigate to http://localhost:8501 to interact with the app.

  • Note: If the app doesn't start, try these steps:
    • Deactivate the virtual environment with deactivate
    • Reactivate it with source .venv/bin/activate
    • Then run streamlit run app.py

Usage

The app provides two modes of operation:

  1. Q&A Chat Bot: When no document is provided, engage in a question-answering conversation with the AI assistant.
  2. Document Query Mode: Upload a document and ask questions about its content. The AI assistant will process queries using the document as context.
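The two modes above amount to a simple routing decision: if no document is present the model answers from its own knowledge, otherwise the document text is injected as context. A minimal, hypothetical sketch (function and prompt format are illustrative, not the app's actual code):

```python
def answer(question, document_text, llm):
    """Route a question to plain chat or document-grounded Q&A.

    document_text: None for chat-bot mode, or the uploaded document's text.
    llm: any callable that maps a prompt string to a completion string.
    """
    if document_text is None:
        # Q&A chat bot mode: no context, the model answers directly.
        prompt = f"Q: {question}\nA:"
    else:
        # Document query mode: the document is prepended as context.
        prompt = f"Context:\n{document_text}\n\nQ: {question}\nA:"
    return llm(prompt)

# With a stub LLM that echoes its prompt, to show the routing:
echo = lambda p: p
print(answer("What is this about?", None, echo))
print(answer("What is this about?", "Some paper text.", echo))
```

In the real app the `llm` callable would be the llama-cpp-python model loaded from the downloaded GGUF file.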

Contributing

We welcome contributions to improve this app. If you have suggestions or encounter issues, please open an issue or submit a pull request on GitHub.

License

This project is licensed under the MIT License.

Notes

You may need to install Poppler for the pdftotext library to build; see this Stack Overflow question: https://stackoverflow.com/questions/45912641/unable-to-install-pdftotext-on-python-3-6-missing-poppler

On macOS (with Homebrew):

brew install pkg-config poppler python

Differences between this fork and the original repository by Hugo Bacard

Compared to the original repo by Hugo Bacard, this fork has only a few differences:

  • A new page to query and scrape arXiv for a series of PDF articles, based on my previously open-sourced repo ScrapXiv.
  • A new page to search Hugging Face and download models other than neuralbeagle14-7b.Q5_K_M.gguf by Maxime Labonne.
  • The chat now lives on a page that allows selecting documents from the downloaded papers.
  • The Python code is packaged and linted.
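The arXiv query page presumably talks to the public arXiv API. As a hedged illustration (the endpoint and parameter names follow the documented arXiv API, but this helper is not the fork's actual code):

```python
from urllib.parse import urlencode

ARXIV_API = "http://export.arxiv.org/api/query"

def arxiv_query_url(terms, max_results=10):
    """Build an arXiv API URL for a full-text search returning Atom XML."""
    params = {
        "search_query": f"all:{terms}",  # search all fields for the terms
        "start": 0,
        "max_results": max_results,
    }
    return f"{ARXIV_API}?{urlencode(params)}"

print(arxiv_query_url("retrieval augmented generation", max_results=5))
```

Fetching that URL returns an Atom feed whose entries carry PDF links; a scraper like ScrapXiv would download those PDFs for the document-query mode.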

Disclaimers

  • This is a hobby project, and time for active maintenance will be very limited.
  • Please feel free to fork and reuse the code, citing the original repo and its LICENCE.
