5. Passage Filter

🔎 Definition

Passage filtering is a node that filters out passages. It does not compress passages, but it deletes passages that are not relatable enough to the query.

What is difference between Passage Filter and Passage Reranker?

In passage reranker, you can set top_k parameter on your own. Which means, reranker modules return ‘top_k’ passages you set all the time.

On the other hand, passage filter does not guarantee the number of passages to be returned. It can be not filtered, or it can be filtered to 1 passage.

🤸 Benefits

The primary benefit of passage filtering is that you can filter out irrelevant passages. When the LLM gets irrelevant passages, it can be confused and return irrelevant answers. So it is important to filter out irrelevant passages.

Node Parameters

There are no node parameters for passage filter.

Example config.yaml file

node_lines:
  - node_line_name: retrieve_node_line  # Arbitrary node line name
    nodes:
      - node_type: passage_filter
        strategy:
          metrics: [ retrieval_f1, retrieval_recall, retrieval_precision ]
          speed_threshold: 5
        modules:
          - module_type: pass_passage_filter
          - module_type: similarity_threshold_cutoff
            threshold: 0.85

What is pass_passage_filter?

Its purpose is to test the performance that ‘not using’ any passage filter module. Because it can be the better option that not using passage filter node. So with this module, you can automatically test the performance without using any passage filter module.