autorag.deploy package

Submodules

autorag.deploy.api module

class autorag.deploy.api.ApiRunner(config: Dict, project_dir: str | None = None)[source]

Bases: BaseRunner

extract_retrieve_passage(df: DataFrame) List[RetrievedPassage][source]
run_api_server(host: str = '0.0.0.0', port: int = 8000, remote: bool = True, **kwargs)[source]

Run the pipeline as an api server. Here is api endpoint documentation => https://docs.auto-rag.com/deploy/api_endpoint.html

Parameters:
  • host – The host of the api server.

  • port – The port of the api server.

  • remote – Whether to expose the api server to the public internet using ngrok.

  • kwargs – Other arguments for Flask app.run.

class autorag.deploy.api.QueryRequest(*, query: str, result_column: str | None = 'generated_texts')[source]

Bases: BaseModel

model_computed_fields: ClassVar[Dict[str, ComputedFieldInfo]] = {}

A dictionary of computed field names and their corresponding ComputedFieldInfo objects.

model_config: ClassVar[ConfigDict] = {}

Configuration for the model, should be a dictionary conforming to [ConfigDict][pydantic.config.ConfigDict].

model_fields: ClassVar[Dict[str, FieldInfo]] = {'query': FieldInfo(annotation=str, required=True), 'result_column': FieldInfo(annotation=Union[str, NoneType], required=False, default='generated_texts')}

Metadata about the fields defined on the model, mapping of field names to [FieldInfo][pydantic.fields.FieldInfo] objects.

This replaces Model.__fields__ from Pydantic V1.

query: str
result_column: str | None
class autorag.deploy.api.RetrievalResponse(*, passages: List[RetrievedPassage])[source]

Bases: BaseModel

model_computed_fields: ClassVar[Dict[str, ComputedFieldInfo]] = {}

A dictionary of computed field names and their corresponding ComputedFieldInfo objects.

model_config: ClassVar[ConfigDict] = {}

Configuration for the model, should be a dictionary conforming to [ConfigDict][pydantic.config.ConfigDict].

model_fields: ClassVar[Dict[str, FieldInfo]] = {'passages': FieldInfo(annotation=List[RetrievedPassage], required=True)}

Metadata about the fields defined on the model, mapping of field names to [FieldInfo][pydantic.fields.FieldInfo] objects.

This replaces Model.__fields__ from Pydantic V1.

passages: List[RetrievedPassage]
class autorag.deploy.api.RetrievedPassage(*, content: str, doc_id: str, score: float, filepath: str | None = None, file_page: int | None = None, start_idx: int | None = None, end_idx: int | None = None)[source]

Bases: BaseModel

content: str
doc_id: str
end_idx: int | None
file_page: int | None
filepath: str | None
model_computed_fields: ClassVar[Dict[str, ComputedFieldInfo]] = {}

A dictionary of computed field names and their corresponding ComputedFieldInfo objects.

model_config: ClassVar[ConfigDict] = {}

Configuration for the model, should be a dictionary conforming to [ConfigDict][pydantic.config.ConfigDict].

model_fields: ClassVar[Dict[str, FieldInfo]] = {'content': FieldInfo(annotation=str, required=True), 'doc_id': FieldInfo(annotation=str, required=True), 'end_idx': FieldInfo(annotation=Union[int, NoneType], required=False, default=None), 'file_page': FieldInfo(annotation=Union[int, NoneType], required=False, default=None), 'filepath': FieldInfo(annotation=Union[str, NoneType], required=False, default=None), 'score': FieldInfo(annotation=float, required=True), 'start_idx': FieldInfo(annotation=Union[int, NoneType], required=False, default=None)}

Metadata about the fields defined on the model, mapping of field names to [FieldInfo][pydantic.fields.FieldInfo] objects.

This replaces Model.__fields__ from Pydantic V1.

score: float
start_idx: int | None
class autorag.deploy.api.RunResponse(*, result: str | List[str], retrieved_passage: List[RetrievedPassage])[source]

Bases: BaseModel

model_computed_fields: ClassVar[Dict[str, ComputedFieldInfo]] = {}

A dictionary of computed field names and their corresponding ComputedFieldInfo objects.

model_config: ClassVar[ConfigDict] = {}

Configuration for the model, should be a dictionary conforming to [ConfigDict][pydantic.config.ConfigDict].

model_fields: ClassVar[Dict[str, FieldInfo]] = {'result': FieldInfo(annotation=Union[str, List[str]], required=True), 'retrieved_passage': FieldInfo(annotation=List[RetrievedPassage], required=True)}

Metadata about the fields defined on the model, mapping of field names to [FieldInfo][pydantic.fields.FieldInfo] objects.

This replaces Model.__fields__ from Pydantic V1.

result: str | List[str]
retrieved_passage: List[RetrievedPassage]
class autorag.deploy.api.StreamResponse(*, type: Literal['generated_text', 'retrieved_passage'], generated_text: str | None, retrieved_passage: RetrievedPassage | None, passage_index: int | None)[source]

Bases: BaseModel

When the type is generated_text, only generated_text is returned. The other fields are None. When the type is retrieved_passage, only retrieved_passage and passage_index are returned. The other fields are None.

generated_text: str | None
model_computed_fields: ClassVar[Dict[str, ComputedFieldInfo]] = {}

A dictionary of computed field names and their corresponding ComputedFieldInfo objects.

model_config: ClassVar[ConfigDict] = {}

Configuration for the model, should be a dictionary conforming to [ConfigDict][pydantic.config.ConfigDict].

model_fields: ClassVar[Dict[str, FieldInfo]] = {'generated_text': FieldInfo(annotation=Union[str, NoneType], required=True), 'passage_index': FieldInfo(annotation=Union[int, NoneType], required=True), 'retrieved_passage': FieldInfo(annotation=Union[RetrievedPassage, NoneType], required=True), 'type': FieldInfo(annotation=Literal['generated_text', 'retrieved_passage'], required=True)}

Metadata about the fields defined on the model, mapping of field names to [FieldInfo][pydantic.fields.FieldInfo] objects.

This replaces Model.__fields__ from Pydantic V1.

passage_index: int | None
retrieved_passage: RetrievedPassage | None
type: Literal['generated_text', 'retrieved_passage']
class autorag.deploy.api.VersionResponse(*, version: str)[source]

Bases: BaseModel

model_computed_fields: ClassVar[Dict[str, ComputedFieldInfo]] = {}

A dictionary of computed field names and their corresponding ComputedFieldInfo objects.

model_config: ClassVar[ConfigDict] = {}

Configuration for the model, should be a dictionary conforming to [ConfigDict][pydantic.config.ConfigDict].

model_fields: ClassVar[Dict[str, FieldInfo]] = {'version': FieldInfo(annotation=str, required=True)}

Metadata about the fields defined on the model, mapping of field names to [FieldInfo][pydantic.fields.FieldInfo] objects.

This replaces Model.__fields__ from Pydantic V1.

version: str

autorag.deploy.base module

class autorag.deploy.base.BaseRunner(config: Dict, project_dir: str | None = None)[source]

Bases: object

classmethod from_trial_folder(trial_path: str)[source]

Load Runner from the evaluated trial folder. Must already be evaluated using Evaluator class. It sets the project_dir as the parent directory of the trial folder.

Parameters:

trial_path – The path of the trial folder.

Returns:

Initialized Runner.

classmethod from_yaml(yaml_path: str, project_dir: str | None = None)[source]

Load Runner from the YAML file. Must be extracted YAML file from the evaluated trial using the extract_best_config method.

Parameters:
  • yaml_path – The path of the YAML file.

  • project_dir – The path of the project directory. Default is the current directory.

Returns:

Initialized Runner.

class autorag.deploy.base.Runner(config: Dict, project_dir: str | None = None)[source]

Bases: BaseRunner

run(query: str, result_column: str = 'generated_texts')[source]

Run the pipeline with query. The loaded pipeline must start with a single query, so the first module of the pipeline must be query_expansion or retrieval module.

Parameters:
  • query – The query of the user.

  • result_column – The result column name for the answer. Default is generated_texts, which is the output of the generation module.

Returns:

The result of the pipeline.

autorag.deploy.base.extract_best_config(trial_path: str, output_path: str | None = None) Dict[source]

Extract the optimal pipeline from the evaluated trial.

Parameters:
  • trial_path – The path to the trial directory that you want to extract the pipeline from. Must already be evaluated.

  • output_path – Output path that pipeline yaml file will be saved. Must be .yaml or .yml file. If None, it does not save YAML file and just returns dict values. Default is None.

Returns:

The dictionary of the extracted pipeline.

autorag.deploy.base.extract_node_line_names(config_dict: Dict) List[str][source]

Extract node line names with the given config dictionary order.

Parameters:

config_dict – The YAML configuration dict for the pipeline. You can load this to access trail_folder/config.yaml.

Returns:

The list of node line names. It is the order of the node line names in the pipeline.

autorag.deploy.base.extract_node_strategy(config_dict: Dict) Dict[source]

Extract node strategies with the given config dictionary. The return value is a dictionary of the node type and its strategy.

Parameters:

config_dict – The YAML configuration dict for the pipeline. You can load this to access trail_folder/config.yaml.

Returns:

Key is node_type and value is strategy dict.

autorag.deploy.base.extract_vectordb_config(trial_path: str) List[Dict][source]
autorag.deploy.base.summary_df_to_yaml(summary_df: DataFrame, config_dict: Dict) Dict[source]

Convert trial summary dataframe to config yaml file.

Parameters:
  • summary_df – The trial summary dataframe of the evaluated trial.

  • config_dict – The yaml configuration dict for the pipeline. You can load this to access trail_folder/config.yaml.

Returns:

Dictionary of config yaml file. You can save this dictionary to yaml file.

autorag.deploy.gradio module

class autorag.deploy.gradio.GradioRunner(config: Dict, project_dir: str | None = None)[source]

Bases: BaseRunner

run(query: str, result_column: str = 'generated_texts')[source]

Run the pipeline with query. The loaded pipeline must start with a single query, so the first module of the pipeline must be query_expansion or retrieval module.

Parameters:
  • query – The query of the user.

  • result_column – The result column name for the answer. Default is generated_texts, which is the output of the generation module.

Returns:

The result of the pipeline.

run_web(server_name: str = '0.0.0.0', server_port: int = 7680, share: bool = False, **kwargs)[source]

Run web interface to interact pipeline. You can access the web interface at http://server_name:server_port in your browser

Parameters:
  • server_name – The host of the web. Default is 0.0.0.0.

  • server_port – The port of the web. Default is 7680.

  • share – Whether to create a publicly shareable link. Default is False.

  • kwargs – Other arguments for gr.ChatInterface.launch.

Module contents