This project uses LangChain and OpenAI to classify companies and retrieve related information.
-
Clone the repository:
git clone <repository-url> cd <repository-directory>
-
Create a virtual environment:
python -m venv venv source venv/bin/activate # On Windows use `venv\Scripts\activate`
-
Install dependencies:
pip install -r requirements.txt
-
Set up API keys:
- Create a
config.ini
file in the root directory with the following content:[API_KEYS] TAVILY_API_KEY = your_tavily_api_key OPENAI_API_KEY = your_openai_api_key
- Create a
-
Prepare data files:
- Ensure
taxanomy.csv
anddata.csv
are present in the root directory.
- Ensure
- Run the script:
python genAI_clasification.py
- The script includes error handling for missing files and missing columns in the data.
- If any errors occur during processing, they will be printed to the console.
- langchain-openai
- langchain-community
- langchain-core
- langgraph
- pandas
- configparser