VectorDB

The VectorDB module is a retrieval module that uses a vector database as its backend. It lets you perform dense retrieval with minimal setup.

It first embeds each passage with an embedding model and stores the resulting vectors in the vector database. At retrieval time, it embeds the query, searches the database for the most similar vectors, and returns the passages attached to those vectors.
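The snippet below is a minimal, self-contained sketch of that flow using ChromaDB directly; it is not the module's internal implementation. The embed() function is a toy placeholder standing in for a real embedding model, and the passages are made up for illustration.

import chromadb


def embed(text: str) -> list[float]:
    # Toy placeholder: a real setup would call an embedding model here.
    return [float(ord(c)) for c in text[:8].ljust(8)]


client = chromadb.Client()  # in-memory ChromaDB: no server, internet, or API key
collection = client.create_collection(name="passages")

# 1. Embed each passage and store the vectors in the vector database.
passages = {
    "p1": "Dense retrieval compares query and passage embeddings.",
    "p2": "ChromaDB is a local, open-source vector database.",
}
collection.add(
    ids=list(passages.keys()),
    embeddings=[embed(text) for text in passages.values()],
    documents=list(passages.values()),
)

# 2. Embed the query and search for the most similar stored vectors.
result = collection.query(query_embeddings=[embed("local vector database")], n_results=1)

# 3. Return the passages attached to the most similar vectors.
print(result["documents"][0])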

Backend Support

As of now, the VectorDB module supports ChromaDB as its backend database. We chose ChromaDB because it is a local vector database that requires no internet connection, server fees, or API keys. It is also open-source software.

Module Parameters

  • Parameter: vectordb

  • Usage: Specifies which VectorDB configuration (and thus which embedding model) the module uses, which affects how passages are represented and retrieved.

Tip

Information about the VectorDB can be found in the VectorDB section. You can also learn how to add a custom embedding model here.

Example config.yaml

modules:
  - module_type: vectordb
    vectordb: [default, openai_chroma, openai_milvus]
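Each name in the vectordb list refers to a VectorDB configuration; listing several, as in this example, lets each configuration be evaluated as a separate option.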