ml-colabs

BM25sRetriever.predict:v1
Name
Version
Last updated
11 months ago
Calls:
import weave
@weave.op()
def predict(self, query: str, top_k: int = 2):
    """
    Predicts the top-k most relevant chunks for a given query using the BM25 algorithm.
    This function is a wrapper around the `retrieve` method. It takes an input query string,
    tokenizes it using the BM25 tokenizer, and retrieves the top-k most relevant chunks from
    the BM25 index. The results are returned as a list of dictionaries, each containing a chunk
    and its corresponding relevance score.
    !!! example "Example Usage"
        ```python
        import weave
        from dotenv import load_dotenv
        from medrag_multi_modal.retrieval import BM25sRetriever
        load_dotenv()
        weave.init(project_name="ml-colabs/medrag-multi-modal")
        retriever = BM25sRetriever.from_wandb_artifact(
            index_artifact_address="ml-colabs/medrag-multi-modal/grays-anatomy-bm25s:latest"
        )
        retrieved_chunks = retriever.predict(query="What are Ribosomes?")