autogen.agentchat.contrib.rag.ChromaDBQueryEngine

ChromaDBQueryEngine

ChromaDBQueryEngine(
    host: str | None = 'localhost',
    port: int | None = 8000,
    settings: ForwardRef('Settings') | None = None,
    tenant: str | None = None,
    database: str | None = None,
    embedding_function: Optional[EmbeddingFunction[Any]] = None,
    metadata: dict[str, Any] | None = None,
    llm: ForwardRef('LLM') | None = None,
    collection_name: str | None = None
)

This engine leverages Chromadb to persist document embeddings in a named collection and LlamaIndex’s VectorStoreIndex to efficiently index and retrieve documents, and generate an answer in response to natural language queries. Collection can be regarded as an abstraction of group of documents in the database.
It expects a Chromadb server to be running and accessible at the specified host and port.
Refer to this link for running Chromadb in a Docker container.
If the host and port are not provided, the engine will create an in-memory ChromaDB client.
Initializes the ChromaDBQueryEngine with db_path, metadata, and embedding function and llm.

Parameters:

Name	Description
`host`	Type: str \| None Default: ‘localhost’
`port`	Type: int \| None Default: 8000
`settings`	Type: ForwardRef(‘Settings’) \| None Default: None
`tenant`	Type: str \| None Default: None
`database`	Type: str \| None Default: None
`embedding_function`	Type: Optional[EmbeddingFunction[Any]] Default: None
`metadata`	Type: dict[str, typing.Any] \| None Default: None
`llm`	Type: ForwardRef(‘LLM’) \| None Default: None
`collection_name`	Type: str \| None Default: None

Instance Methods

add_docs

add_docs(
    self,
    new_doc_dir: Path | str | None = None,
    new_doc_paths_or_urls: Sequence[Path | str] | None = None,
    *args: Any,
    **kwargs: Any
) -> None

Add new documents to the underlying database and add to the index.

Parameters:

Name	Description
`new_doc_dir`	A dir of input documents that are used to create the records in database. Type: pathlib.Path \| str \| None Default: None
`new_doc_paths_or_urls`	A sequence of input documents that are used to create the records in database. A document can be a path to a file or a url. Type: Sequence[pathlib.Path \| str] \| None Default: None
`*args`	Any additional arguments Type: Any
`**kwargs`	Any additional keyword arguments Type: Any

connect_db

connect_db(
    self,
    *args: Any,
    **kwargs: Any
) -> bool

Connect to the database.
It does not overwrite the existing collection in the database.
It takes the following steps,

Set up ChromaDB and LlamaIndex storage.
2. Create the llamaIndex vector store index for querying or inserting docs later

Parameters:

Name	Description
`*args`	Any additional arguments Type: Any
`**kwargs`	Any additional keyword arguments Type: Any

Returns:

Type	Description
bool	bool: True if connection is successful

get_collection_name

get_collection_name(self) -> str

Get the name of the collection used by the query engine.
Returns:
The name of the collection.

Returns:

Type	Description
str	The name of the collection.

init_db

init_db(
    self,
    new_doc_dir: Path | str | None = None,
    new_doc_paths_or_urls: Sequence[Path | str] | None = None,
    *args: Any,
    **kwargs: Any
) -> bool

Initialize the database with the input documents or records.
It overwrites the existing collection in the database.
It takes the following steps,

Set up ChromaDB and LlamaIndex storage.
2. insert documents and build indexes upon them.

Parameters:

Name	Description
`new_doc_dir`	a dir of input documents that are used to create the records in database. Type: pathlib.Path \| str \| None Default: None
`new_doc_paths_or_urls`	a sequence of input documents that are used to create the records in database. a document can be a path to a file or a url. Type: Sequence[pathlib.Path \| str] \| None Default: None
`*args`	Any additional arguments Type: Any
`**kwargs`	Any additional keyword arguments Type: Any

Returns:

Type	Description
bool	bool: True if initialization is successful

query

query(self, question: str) -> str

Retrieve information from indexed documents by processing a query using the engine’s LLM.

Parameters:

Name	Description
`question`	A natural language query string used to search the indexed documents. Type: str

Returns:

Type	Description
str	A string containing the response generated by LLM.

Overview LlamaIndexQueryEngine

On this page

ChromaDBQueryEngine
Instance Methods
add_docs
connect_db
get_collection_name
init_db
query

autogen
- Overview
- Agent
- AgentNameConflictError
- AssistantAgent
- Cache
- ChatResult
- ContextExpression
- ConversableAgent
- GroupChat
- GroupChatManager
- InvalidCarryOverTypeError
- LLMConfig
- ModelClient
- NoEligibleSpeakerError
- OpenAIWrapper
- SenderRequiredError
- UndefinedNextAgentError
- UpdateSystemMessage
- UserProxyAgent
- a_initiate_swarm_chat
- a_run_swarm
- config_list_from_dotenv
- config_list_from_json
- config_list_from_models
- config_list_gpt4_gpt35
- config_list_openai_aoai
- filter_config
- gather_usage_summary
- get_config_list
- initiate_chats
- register_function
- run_swarm
- agentchat
  - Overview
  - a_initiate_chats
  - a_initiate_group_chat
  - a_run_group_chat
  - run_group_chat
  - chat
  - contrib
    - agent_eval
    - agent_optimizer
    - capabilities
    - captainagent
    - gpt_assistant_agent
    - graph_rag
    - img_utils
    - llamaindex_conversable_agent
    - llava_agent
    - math_user_proxy_agent
    - multimodal_conversable_agent
    - qdrant_retrieve_user_proxy_agent
    - rag
      - Overview
      - ChromaDBQueryEngine
      - LlamaIndexQueryEngine
      - MongoDBQueryEngine
      - RAGQueryEngine
    - retrieve_assistant_agent
    - retrieve_user_proxy_agent
    - society_of_mind_agent
    - swarm_agent
    - text_analyzer_agent
    - vectordb
    - web_surfer
  - group
  - realtime
  - utils
- agents
- browser_utils
- cache
- code_utils
- coding
- doc_utils
- events
- exception_utils
- fast_depends
- formatting_utils
- graph_utils
- import_utils
- interop
- io
- json_utils
- llm_config
- logger
- math_utils
- mcp
- messages
- oai
- retrieve_utils
- runtime_logging
- token_count_utils
- tools
- types

ChromaDBQueryEngine

ChromaDBQueryEngine(
    host: str | None = 'localhost',
    port: int | None = 8000,
    settings: ForwardRef('Settings') | None = None,
    tenant: str | None = None,
    database: str | None = None,
    embedding_function: Optional[EmbeddingFunction[Any]] = None,
    metadata: dict[str, Any] | None = None,
    llm: ForwardRef('LLM') | None = None,
    collection_name: str | None = None
)

Parameters:

Name	Description
`host`	Type: str \| None Default: ‘localhost’
`port`	Type: int \| None Default: 8000
`settings`	Type: ForwardRef(‘Settings’) \| None Default: None
`tenant`	Type: str \| None Default: None
`database`	Type: str \| None Default: None
`embedding_function`	Type: Optional[EmbeddingFunction[Any]] Default: None
`metadata`	Type: dict[str, typing.Any] \| None Default: None
`llm`	Type: ForwardRef(‘LLM’) \| None Default: None
`collection_name`	Type: str \| None Default: None

Instance Methods

add_docs

add_docs(
    self,
    new_doc_dir: Path | str | None = None,
    new_doc_paths_or_urls: Sequence[Path | str] | None = None,
    *args: Any,
    **kwargs: Any
) -> None

Add new documents to the underlying database and add to the index.

Parameters:

Name	Description
`new_doc_dir`	A dir of input documents that are used to create the records in database. Type: pathlib.Path \| str \| None Default: None
`new_doc_paths_or_urls`	A sequence of input documents that are used to create the records in database. A document can be a path to a file or a url. Type: Sequence[pathlib.Path \| str] \| None Default: None
`*args`	Any additional arguments Type: Any
`**kwargs`	Any additional keyword arguments Type: Any

connect_db

connect_db(
    self,
    *args: Any,
    **kwargs: Any
) -> bool

Connect to the database.
It does not overwrite the existing collection in the database.
It takes the following steps,

Set up ChromaDB and LlamaIndex storage.
2. Create the llamaIndex vector store index for querying or inserting docs later

Parameters:

Name	Description
`*args`	Any additional arguments Type: Any
`**kwargs`	Any additional keyword arguments Type: Any

Returns:

Type	Description
bool	bool: True if connection is successful

get_collection_name

get_collection_name(self) -> str

Get the name of the collection used by the query engine.
Returns:
The name of the collection.

Returns:

Type	Description
str	The name of the collection.

init_db

init_db(
    self,
    new_doc_dir: Path | str | None = None,
    new_doc_paths_or_urls: Sequence[Path | str] | None = None,
    *args: Any,
    **kwargs: Any
) -> bool

Initialize the database with the input documents or records.
It overwrites the existing collection in the database.
It takes the following steps,

Set up ChromaDB and LlamaIndex storage.
2. insert documents and build indexes upon them.

Parameters:

Name	Description
`new_doc_dir`	a dir of input documents that are used to create the records in database. Type: pathlib.Path \| str \| None Default: None
`new_doc_paths_or_urls`	a sequence of input documents that are used to create the records in database. a document can be a path to a file or a url. Type: Sequence[pathlib.Path \| str] \| None Default: None
`*args`	Any additional arguments Type: Any
`**kwargs`	Any additional keyword arguments Type: Any

Returns:

Type	Description
bool	bool: True if initialization is successful

query

query(self, question: str) -> str

Retrieve information from indexed documents by processing a query using the engine’s LLM.

Parameters:

Name	Description
`question`	A natural language query string used to search the indexed documents. Type: str

Returns:

Type	Description
str	A string containing the response generated by LLM.

Overview LlamaIndexQueryEngine

On this page

ChromaDBQueryEngine
Instance Methods
add_docs
connect_db
get_collection_name
init_db
query

​ChromaDBQueryEngine

​Instance Methods

​add_docs

​connect_db

​get_collection_name

​init_db

​query

API Reference

​ChromaDBQueryEngine

​Instance Methods

​add_docs

​connect_db

​get_collection_name

​init_db

​query

ChromaDBQueryEngine

Instance Methods

add_docs

connect_db

get_collection_name

init_db

query

ChromaDBQueryEngine

Instance Methods

add_docs

connect_db

get_collection_name

init_db

query