Weaviate¶
The Weaviate
class is an open-source vector database designed to store, query, and manage vector embeddings efficiently.
Configuration¶
To use the Weaviate vector database, you need to configure it in your YAML configuration file.
1. Docker¶
You can see the full installation guide here
Set the client_type
to docker
.
You can run docker with the code below.
docker run -p 8080:8080 -p 50051:50051 cr.weaviate.io/semitechnologies/weaviate:1.27.3
It will automatically be set to host: localhost, port: 8080, and grpc_port: 50051.
If you’re already running weaviate on docker, you’ll need to set the host, port, and grpc_port you’re using.
Additionally, you’ll need to set text_key
, which is the key value of the property where weaviate will store the content.
If you already have content stored in weaviate, you’ll need to set that key value.
Example YAML file¶
- name: openai_weaviate
db_type: weaviate
embedding_model: openai_embed_3_large
collection_name: openai_embed_3_large
client_type: docker
host: localhost
port: 8080
grpc_port: 50051
embedding_batch: 50
similarity_metric: cosine
text_key: content
Here is a simple example of a YAML configuration file that uses the Weaviate vector database and the OpenAI:
vectordb:
- name: openai_weaviate
db_type: weaviate
embedding_model: openai_embed_3_large
collection_name: openai_embed_3_large
client_type: docker
host: localhost
port: 8080
grpc_port: 50051
embedding_batch: 50
similarity_metric: cosine
text_key: content
node_lines:
- node_line_name: retrieve_node_line # Arbitrary node line name
nodes:
- node_type: retrieval
strategy:
metrics: [retrieval_f1, retrieval_recall, retrieval_precision]
top_k: 3
modules:
- module_type: vectordb
vectordb: openai_weaviate
- node_line_name: post_retrieve_node_line # Arbitrary node line name
nodes:
- node_type: prompt_maker
strategy:
metrics: [bleu, meteor, rouge]
modules:
- module_type: fstring
prompt: "Read the passages and answer the given question. \n Question: {query} \n Passage: {retrieved_contents} \n Answer : "
- node_type: generator
strategy:
metrics: [bleu, rouge]
modules:
- module_type: llama_index_llm
llm: openai
model: [ gpt-4o-mini ]
2. Weaviate Cloud¶
You can see the full installation guide here
Example YAML file¶
- name: openai_embed_3_large
db_type: weaviate
embedding_model: openai_embed_3_large
collection_name: openai_embed_3_large
url: ${WEAVIATE_URL}
api_key: ${WEAVIATE_API_KEY}
grpc_port: 50051
embedding_batch: 50
similarity_metric: cosine
text_key: content
Here is a simple example of a YAML configuration file that uses the Weaviate vector database and the OpenAI:
vectordb:
- name: openai_weaviate
db_type: weaviate
embedding_model: openai_embed_3_large
collection_name: openai_embed_3_large
url: ${WEAVIATE_URL}
api_key: ${WEAVIATE_API_KEY}
grpc_port: 50051
embedding_batch: 50
similarity_metric: cosine
text_key: content
node_lines:
- node_line_name: retrieve_node_line # Arbitrary node line name
nodes:
- node_type: retrieval
strategy:
metrics: [retrieval_f1, retrieval_recall, retrieval_precision]
top_k: 3
modules:
- module_type: vectordb
vectordb: openai_weaviate
- node_line_name: post_retrieve_node_line # Arbitrary node line name
nodes:
- node_type: prompt_maker
strategy:
metrics: [bleu, meteor, rouge]
modules:
- module_type: fstring
prompt: "Read the passages and answer the given question. \n Question: {query} \n Passage: {retrieved_contents} \n Answer : "
- node_type: generator
strategy:
metrics: [bleu, rouge]
modules:
- module_type: llama_index_llm
llm: openai
model: [ gpt-4o-mini ]
Parameters¶
embedding_model: str
Purpose: Specifies the name or identifier of the embedding model to be used.
Example: “openai_embed_3_large”
Note: This should correspond to a valid embedding model that your system can use to generate vector embeddings. For more information see custom your embedding model documentation.
collection_name: str
Purpose: Sets the name of the Weaviate collection where the vectors will be stored.
Example: “my_vector_collection”
Note: If the collection doesn’t exist, it will be created. If it exists, it will be loaded.
embedding_batch: int = 100
Purpose: Determines the number of embeddings to process in a single batch.
Default: 100
Note: Adjust this based on your system’s memory and processing capabilities. Larger batches may be faster but require more memory.
similarity_metric: str = "cosine"
Purpose: Specifies the metric used to calculate similarity between vectors.
Default: “cosine”
Options: “cosine”, “l2” (Euclidean distance), “ip” (Inner Product)
Note: Choose the metric that best suits your use case and data characteristics.
Not support “hamming”, “manhattan”
client_type = "docker"
Purpose: Specifies the type of client you’re using to connect to Weaviate.
Default: “docker”
Options: “docker”, “cloud”
Note: Choose the appropriate client type based on your deployment.
host: str = "localhost"
Purpose: The host of the Weaviate server.
Default: “localhost”
Example: “weaviate-server.com”
Note: Use only
client_type: docker
.
port: int = 8080
Purpose: The port of the Weaviate Server.
Default: 8080
Note: Use only
client_type: docker
.
grpc_port: int = 50051
Note: Use only
client_type: docker
.
url: str = ""
Purpose: The URL of the Weaviate Cloud service.
Note: Use only
client_type: cloud
.
api_key: str = ""
Purpose: The API key for authentication with the Weaviate Cloud service.
Note: Use only
client_type: cloud
.
text_key: str = "content"
Purpose: Specifies the name of the property in Weaviate where the text data is stored.
Default: “content”
Note: This should correspond to the property name in your Weaviate schema where the text data is stored.
Usage¶
Here’s a brief overview of how to use the main functions of the Weaviate vector database:
Adding Vectors:
await weaviate_db.add(ids, texts)
This method adds new vectors to the database. It takes a list of IDs and corresponding texts, generates embeddings, and inserts them into the Weaviate collection.
Querying:
ids, scores = await weaviate_db.query(queries, top_k)
Performs a similarity search on the stored vectors. It returns the IDs and their scores. Below you can see how the score is determined.
Fetching Vectors:
vectors = await weaviate_db.fetch(ids)
Retrieves the vectors associated with the given IDs.
Checking Existence:
exists = await weaviate_db.is_exist(ids)
Checks if the given IDs exist in the database.
Deleting Vectors:
await weaviate_db.delete(ids)
Deletes the vectors associated with the given IDs from the database.
Deleting the Collection:
weaviate_db.delete_collection()
Deletes the collection from the Weaviate server.
how the score is determined?¶
Calculate the score based on the distance. Here is how to calculate distance in Weaviate.
Cosine Score = 2 - distance
Inner Product Score = -distance
L2 Score = -distance