File size: 1,652 Bytes
25d41fc |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 |
# kagglex-final-project
A prototype written in Python to illustrate/demonstrate querying the Learning Path Index Dataset (see [Kaggle Dataset](https://www.kaggle.com/datasets/neomatrix369/learning-path-index-dataset) and [GitHub repo](https://github.com/neomatrix369/learning-path-index)), with the help of the OpenAI GPT technology (InstructHPT model and embeddings model), [Langchain](https://python.langchain.com/) and using [Facebook's FAISS library](https://faiss.ai/).

The end-to-end process can be learnt by going through the code base as well as by observing the console logs when using both the Streamlit and the CLI versions.
## Pre-requisites
- Python 3.8.x or above
- OpenAI API Key (see [How to get an OpenAI API Key](https://www.howtogeek.com/885918/how-to-get-an-openai-api-key/) -- note it's may not be FREE anymore)
- Install dependencies from `requirements.txt`
- Basic Command-line experience
- Basic git and GitHub experience
## Install and run
Copy the `.env_template` to `.env` in the current folder and then add your OpenAI API Key to `.env`.
**Please don't modify the `.env_template` file.**
```bash
pip install -r requirements.txt
```
### Interactive session via CLI app
```bash
python main.py
```

### Interactive session via Streamlit app
```bash
streamlit run main.py
```

---
|