Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Introduced a simple demo of mixed index and text2gremlin demonstration. #54

Merged
merged 38 commits into from
Jul 23, 2024
Merged

Introduced a simple demo of mixed index and text2gremlin demonstration. #54

merged 38 commits into from
Jul 23, 2024

Conversation

vichayturen
Copy link
Contributor

@vichayturen vichayturen commented Jul 2, 2024

Main changes:

  1. Modified the configuration method.
  2. Added vector index for RAG, supported mixed-retrieving and reranking.
  3. Introduced embedding configuration.
  4. Introduced index operation and document operation.
  5. Added task of transferring natural language to gremlin and its demonstration.
  6. Simplified the use and modify the usage in README.

TODOs (add them later) 🔢

  • connection check (GraphDB & LLM Server)
    • openai check
    • qianfan/wenxin check (use auth-api?)
    • local llm check
  • graph error handle
    • print request & response url + body in py-client
    • wrapper some exception (don't throw Error detail directly)
  • fix github security alert
  • radio (Selection)
  • configs improvement (remove duplicate items)
  • support uploading file in UI (text type)
  • allow users to modify prompt template (BASIC)
  • add a one-click graph clean button? (use clear api)
    ... (check the latest doc)

2. Graph RAG增加向量索引和混合索引加重排的两种方式的索引。
3. 增加自然语言转gremlin的任务及演示。
@dosubot dosubot bot added the size:XXL This PR changes 1000+ lines, ignoring generated files. label Jul 2, 2024
@github-actions github-actions bot added the llm label Jul 2, 2024
@dosubot dosubot bot added documentation Improvements or additions to documentation enhancement New feature or request labels Jul 2, 2024
vichayturen and others added 5 commits July 6, 2024 17:08
2. added template input textbox
3. changed ernie llm to use qianfan sdk
4. added function of filtering id length
Also rename the same params (inner & outer)

TODO: Non-Schema mode will throw Error now (we need fix it)
…id querying

- Update BuildSemanticIndex class to match new file naming convention.
- Modify code_format_and_analysis.sh to use a line length of 120.
- Change logging format to use %s instead of {} for string formatting.
1. Introduce a clean-up function to remove index and content files, aiding in maintaining a clean file system after operations.
2. Introduce default demo of build kg.
3. Now it does not clean the kg before building kg by default.
4. Clean the stopwords files.
@imbajin
Copy link
Member

imbajin commented Jul 10, 2024

Now we could use log.xx(msg) to print the log: ↓
image

@ChenZiHong-Gavin
Copy link
Contributor

ChenZiHong-Gavin commented Jul 16, 2024

Now the .config file instead of json file can be generated with command like python -m hugegraph_llm.config.generate --dir_path=./


dirname = os.path.dirname
package_path = dirname(dirname(dirname(dirname(os.path.abspath(__file__)))))
env_path = os.path.join(package_path, ".env")
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

在pip安装之后 运行代码会在python所在路径下生成.env

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There are some aspects that can be optimized in the future.

  • Add some configuration-related comments in the .env file
  • Rename .env filename to something related to hugegraph-llm
  • Unify the variables in the .env file and the variables in the Config class to be all uppercase
  • Differentiate between commonly used parameters and parameters that are secret keys
  • Give user a choice where to place their .env file

@simon824 simon824 merged commit 3a3698b into apache:main Jul 23, 2024
11 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
documentation Improvements or additions to documentation enhancement New feature or request llm python-client size:XXL This PR changes 1000+ lines, ignoring generated files.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants