machine comprehension

TigreGotico · Jul 22, 2024 · 65a15d5 · 65a15d5
1 parent 5d0d63e
commit 65a15d5
Show file tree

Hide file tree

Showing 4 changed files with 257 additions and 84 deletions.
diff --git a/README.md b/README.md
@@ -1,74 +1,128 @@
 # FlashRankMultipleChoiceSolver for OVOS
 
-The `FlashRankMultipleChoiceSolver` plugin is designed for the Open Voice OS (OVOS) platform to help select the best answer to a question from a list of options. This plugin utilizes the FlashRank library to evaluate and rank multiple-choice answers based on their relevance to the given query.
+The `FlashRankMultipleChoiceSolver` plugin is designed for the Open Voice OS (OVOS) platform to help select the best
+answer to a question from a list of options. This plugin utilizes the FlashRank library to evaluate and rank
+multiple-choice answers based on their relevance to the given query.
 
 ## Features
 
 - **Rerank Options**: Reranks a list of options based on their relevance to the query.
 - **Customizable Model**: Allows the use of different ranking models.
 - **Seamless Integration**: Designed to work with OVOS plugin manager.
 
-### Important Note on FlashRank and Llama-CPP Compatibility
+### ReRanking
 
-Installing FlashRank can lead to a downgrade of the `llama-cpp-python` version, which is critical for GPU support and performance, especially for large language models (LLMs). This issue is tracked in [FlashRank's GitHub repository](https://github.com/PrithivirajDamodaran/FlashRank/issues/29).
+ReRanking is a technique used to refine a list of potential answers by evaluating their relevance to a given query.
+This process is crucial in scenarios where multiple options or responses need to be assessed to determine the most
+appropriate one.
 
-**Workaround for GPU Support with `llama-cpp-python`:**
+In retrieval chatbots, ReRanking helps in selecting the best answer from a set of retrieved documents or options,
+enhancing the accuracy of the response provided to the user.
 
-If you need GPU support with `llama-cpp-python`, you might need to reinstall it after installing flashrank with specific CMake arguments:
-```bash
-CMAKE_ARGS="-DGGML_CUDA=on" FORCE_CMAKE=1 pip install llama-cpp-python --force-reinstall --no-cache-dir
-```
+`MultipleChoiceSolver` are integrated into the OVOS Common Query framework, where they are used to select the most
+relevant answer from a set of multiple skill responses.
 
-Be aware that installing FlashRank may undo these custom installations
+#### FlashRankMultipleChoiceSolver
 
-## Usage
+FlashRankMultipleChoiceSolver is designed to select the best answer to a question from a list of options.
 
-### Example Usage
+In the context of retrieval chatbots, FlashRankMultipleChoiceSolver is useful for scenarios where a user query results
+in a list of predefined answers or options.
+The solver ranks these options based on their relevance to the query and selects the most suitable one.
 
 ```python
-if __name__ == "__main__":
-    from flashrank_multiple_choice_solver import FlashRankMultipleChoiceSolver
-
-    p = FlashRankMultipleChoiceSolver()
-    a = p.rerank("what is the speed of light", [
-        "very fast", "10m/s", "the speed of light is C"
-    ])
-    print(a)
-    # Expected output:
-    # [(0.999819, 'the speed of light is C'),
-    #  (2.7686672e-05, 'very fast'),
-    #  (1.2555749e-05, '10m/s')]
-
-    a = p.select_answer("what is the speed of light", [
-        "very fast", "10m/s", "the speed of light is C"
-    ])
-    print(a) # Expected output: the speed of light is C
+from ovos_flashrank_solver import FlashRankMultipleChoiceSolver
+
+solver = FlashRankMultipleChoiceSolver()
+a = solver.rerank("what is the speed of light", [
+    "very fast", "10m/s", "the speed of light is C"
+])
+print(a)
+# 2024-07-22 15:03:10.295 - OVOS - __main__:load_corpus:61 - DEBUG - indexed 3 documents
+# 2024-07-22 15:03:10.297 - OVOS - __main__:retrieve_from_corpus:70 - DEBUG - Rank 1 (score: 0.7198746800422668): the speed of light is C
+# 2024-07-22 15:03:10.297 - OVOS - __main__:retrieve_from_corpus:70 - DEBUG - Rank 2 (score: 0.0): 10m/s
+# 2024-07-22 15:03:10.297 - OVOS - __main__:retrieve_from_corpus:70 - DEBUG - Rank 3 (score: 0.0): very fast
+# [(0.7198747, 'the speed of light is C'), (0.0, '10m/s'), (0.0, 'very fast')]
+
+# NOTE: select_answer is part of the MultipleChoiceSolver base class and uses rerank internally
+a = solver.select_answer("what is the speed of light", [
+    "very fast", "10m/s", "the speed of light is C"
+])
+print(a)  # the speed of light is C
 ```
 
-## Configuration
+#### FlashRankEvidenceSolverPlugin
+
+FlashRankEvidenceSolverPlugin is designed to extract the most relevant sentence from a text passage that answers a given
+question. This plugin uses the FlashRank algorithm to evaluate and rank sentences based on their relevance to the query.
 
-The `FlashRankMultipleChoiceSolver` can be configured to use different ranking models. By default, it uses the `ms-marco-TinyBERT-L-2-v2` model. You can specify a different model in the configuration if needed.
+In text extraction and machine comprehension tasks, FlashRankEvidenceSolverPlugin enables the identification of specific
+sentences within a larger body of text that directly address a user's query.
 
-Example configuration:
-```json
-{
-    "model": "desired-model-name"
+For example, in a scenario where a user queries about the number of rovers exploring Mars, FlashRankEvidenceSolverPlugin
+scans the provided text passage, ranks sentences based on their relevance, and extracts the most informative sentence.
+
+```python
+from ovos_flashrank_solver import FlashRankEvidenceSolverPlugin
+
+config = {
+    "lang": "en-us",
+    "min_conf": 0.4,
+    "n_answer": 1
 }
+solver = FlashRankEvidenceSolverPlugin(config)
+
+text = """Mars is the fourth planet from the Sun. It is a dusty, cold, desert world with a very thin atmosphere. 
+Mars is also a dynamic planet with seasons, polar ice caps, canyons, extinct volcanoes, and evidence that it was even more active in the past.
+Mars is one of the most explored bodies in our solar system, and it's the only planet where we've sent rovers to roam the alien landscape. 
+NASA currently has two rovers (Curiosity and Perseverance), one lander (InSight), and one helicopter (Ingenuity) exploring the surface of Mars.
+"""
+query = "how many rovers are currently exploring Mars"
+answer = solver.get_best_passage(evidence=text, question=query)
+print("Query:", query)
+print("Answer:", answer)
+# 2024-07-22 15:05:14.209 - OVOS - __main__:load_corpus:61 - DEBUG - indexed 5 documents
+# 2024-07-22 15:05:14.209 - OVOS - __main__:retrieve_from_corpus:70 - DEBUG - Rank 1 (score: 1.39238703250885): NASA currently has two rovers (Curiosity and Perseverance), one lander (InSight), and one helicopter (Ingenuity) exploring the surface of Mars.
+# 2024-07-22 15:05:14.210 - OVOS - __main__:retrieve_from_corpus:70 - DEBUG - Rank 2 (score: 0.38667747378349304): Mars is one of the most explored bodies in our solar system, and it's the only planet where we've sent rovers to roam the alien landscape.
+# 2024-07-22 15:05:14.210 - OVOS - __main__:retrieve_from_corpus:70 - DEBUG - Rank 3 (score: 0.15732118487358093): Mars is the fourth planet from the Sun.
+# 2024-07-22 15:05:14.210 - OVOS - __main__:retrieve_from_corpus:70 - DEBUG - Rank 4 (score: 0.10177625715732574): Mars is also a dynamic planet with seasons, polar ice caps, canyons, extinct volcanoes, and evidence that it was even more active in the past.
+# 2024-07-22 15:05:14.210 - OVOS - __main__:retrieve_from_corpus:70 - DEBUG - Rank 5 (score: 0.0): It is a dusty, cold, desert world with a very thin atmosphere.
+# Query: how many rovers are currently exploring Mars
+# Answer: NASA currently has two rovers (Curiosity and Perseverance), one lander (InSight), and one helicopter (Ingenuity) exploring the surface of Mars.
+
 ```
 
+In this example, `FlashRankEvidenceSolverPlugin` effectively identifies and retrieves the most relevant sentence from
+the provided text that answers the query about the number of rovers exploring Mars.
+This capability is essential for applications requiring information extraction from extensive textual content, such as
+automated research assistants or content summarizers.
+
 ## Available Models
 
-The following models are available for use with the `FlashRankMultipleChoiceSolver`:
+Below is the list of models supported as of now:
 
-## Available Models
+| Model Name                                       | Description                                                                                                                                                                                                                                                                                                                                     |
+|--------------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
+| `ms-marco-TinyBERT-L-2-v2` (default)             | [Model card](https://huggingface.co/cross-encoder/ms-marco-TinyBERT-L-2)                                                                                                                                                                                                                                                                        |
+| `ms-marco-MiniLM-L-12-v2`                        | [Model card](https://huggingface.co/cross-encoder/ms-marco-MiniLM-L-12-v2)                                                                                                                                                                                                                                                                      |
+| `rank-T5-flan` (Best non cross-encoder reranker) | [Model card](https://huggingface.co/bergum/rank-T5-flan)                                                                                                                                                                                                                                                                                        |
+| `ms-marco-MultiBERT-L-12`                        | Multi-lingual, [supports 100+ languages](https://github.com/google-research/bert/blob/master/multilingual.md#list-of-languages)                                                                                                                                                                                                                 |
+| `ce-esci-MiniLM-L12-v2`                          | [FT on Amazon ESCI dataset](https://github.com/amazon-science/esci-data) (This is interesting because most models are FT on MSFT MARCO Bing queries) [Model card](https://huggingface.co/metarank/ce-esci-MiniLM-L12-v2)                                                                                                                        |
+| `rank_zephyr_7b_v1_full` (4-bit-quantised GGUF)  | [Model card](https://huggingface.co/castorini/rank_zephyr_7b_v1_full) (Offers very competitive performance, with large context window and relatively faster for a 4GB model). <br> **Important note:** Our current integration of `rank_zephyr` supports a max of 20 passages in one pass. The sliding window logic support is yet to be added. |
+
+## Important Note on FlashRank and Llama-CPP Compatibility
+
+Installing FlashRank can lead to a downgrade of the `llama-cpp-python` version, which is critical for GPU support and
+performance, especially for large language models (LLMs). This issue is tracked
+in [FlashRank's GitHub repository](https://github.com/PrithivirajDamodaran/FlashRank/issues/29).
+
+**Workaround for GPU Support with `llama-cpp-python`:**
 
-The following models are available for use with the `FlashRankMultipleChoiceSolver`:
+If you need GPU support with `llama-cpp-python`, you might need to reinstall it after installing flashrank with specific
+CMake arguments:
 
-| Model Name                   | Description                                                                                                   |
-|------------------------------|---------------------------------------------------------------------------------------------------------------|
-| ms-marco-TinyBERT-L-2-v2     | (default) [Model card](https://www.modelcards.com/ms-marco-TinyBERT-L-2-v2)                                     |
-| ms-marco-MiniLM-L-12-v2      | [Model card](https://www.modelcards.com/ms-marco-MiniLM-L-12-v2)                                               |
-| rank-T5-flan                 | Best non cross-encoder reranker [Model card](https://www.modelcards.com/rank-T5-flan)                          |
-| ms-marco-MultiBERT-L-12      | Multi-lingual, supports 100+ languages                                                                         |
-| ce-esci-MiniLM-L12-v2        | FT on Amazon ESCI dataset (This is interesting because most models are FT on MSFT MARCO Bing queries) [Model card](https://www.modelcards.com/ce-esci-MiniLM-L12-v2) |
-| rank_zephyr_7b_v1_full       | 4-bit-quantised GGUF [Model card](https://www.modelcards.com/rank_zephyr_7b_v1_full) (Offers very competitive performance, with large context window and relatively faster for a 4GB model) |
+```bash
+CMAKE_ARGS="-DGGML_CUDA=on" FORCE_CMAKE=1 pip install llama-cpp-python --force-reinstall --no-cache-dir
+```
+
+Be aware that installing FlashRank may undo these custom installations
diff --git a/ovos_flashrank_plugin/__init__.py b/ovos_flashrank_plugin/__init__.py