Utils

LLM models

`llama_utils.utils.models`

LLMs and embedding models.

`LLMModel`

Abstraction layer for different LLM providers: AzureOpenAI, Ollama, and HuggingFace.

Parameters:

Name	Type	Description	Default
`model_type`	`str`	Type of the model ('azure', 'ollama', 'huggingface').	required
`**kwargs`	`dict`	Additional parameters for the model initialization.	`{}`

Examples:

Initialize an Azure OpenAI model: >>> from llama_utils.utils.models import LLMModel >>> from dotenv import load_dotenv >>> load_dotenv() # doctest: +SKIP >>> model = LLMModel(model_type='azure', model_id='gpt-4o', engine='4o') # doctest: +SKIP >>> print(model.base_model.model) # doctest: +SKIP gpt-4o >>> response = model.generate_response("Hello, how are you?") # doctest: +SKIP >>> print(response) # doctest: +SKIP Hello! I'm just a computer program, so I don't have feelings, but I'm here and ready to help you. How can I assist you today?

Initialize an Ollama model: >>> model = LLMModel(model_type='ollama', model_id='llama3.1') >>> response = model.generate_response("Hello, how are you?") # doctest: +SKIP >>> print(response) # doctest: +SKIP I'm just a language model, I don't have emotions or feelings like humans do, so I don't have good or bad days. However, I'm functioning properly and ready to help with any questions or tasks you may have! How about you? How's your day going?

Initialize a HuggingFace model: >>> import os >>> cache_dir = os.getenv("CACHE_DIR") >>> model_kwargs = {} >>> model_kwargs["cache_dir"] = cache_dir >>> model_name = "distilgpt2" >>> model = LLMModel( ... model_type='huggingface', model_name=model_name, tokenizer_name=model_name, model_kwargs=model_kwargs ... ) >>> response = model.generate_response("Hello, how are you?") # doctest: +SKIP >>> print(response) # doctest: +SKIP

Source code in src/llama_utils/utils/models.py

class LLMModel:
    r"""Abstraction layer for different LLM providers: AzureOpenAI, Ollama, and HuggingFace.

    Parameters
    ----------
    model_type : str
        Type of the model ('azure', 'ollama', 'huggingface').
    **kwargs : dict
        Additional parameters for the model initialization.

    Examples
    --------
    Initialize an Azure OpenAI model:
        >>> from llama_utils.utils.models import LLMModel
        >>> from dotenv import load_dotenv
        >>> load_dotenv() # doctest: +SKIP
        >>> model = LLMModel(model_type='azure', model_id='gpt-4o', engine='4o') # doctest: +SKIP
        >>> print(model.base_model.model) # doctest: +SKIP
        gpt-4o
        >>> response = model.generate_response("Hello, how are you?") # doctest: +SKIP
        >>> print(response) # doctest: +SKIP
        Hello! I'm just a computer program, so I don't have feelings, but I'm here and ready to help you. How can I assist you today?

    Initialize an Ollama model:
        >>> model = LLMModel(model_type='ollama', model_id='llama3.1')
        >>> response = model.generate_response("Hello, how are you?") # doctest: +SKIP
        >>> print(response) # doctest: +SKIP
        I'm just a language model, I don't have emotions or feelings like humans do, so I don't have good or bad days. However, I'm functioning properly and ready to help with any questions or tasks you may have! How about you? How's your day going?

    Initialize a HuggingFace model:
        >>> import os
        >>> cache_dir = os.getenv("CACHE_DIR")
        >>> model_kwargs = {}
        >>> model_kwargs["cache_dir"] = cache_dir
        >>> model_name = "distilgpt2"
        >>> model = LLMModel(
        ...     model_type='huggingface', model_name=model_name, tokenizer_name=model_name, model_kwargs=model_kwargs
        ... )
        >>> response = model.generate_response("Hello, how are you?") # doctest: +SKIP
        >>> print(response) # doctest: +SKIP
    """

    def __init__(self, model_type: str, **kwargs):
        """Initialize the LLM model."""
        self._model_type = model_type.lower()
        self._base_model = self._initialize_model(**kwargs)

    @property
    def base_model(self):
        """Get the base model."""
        return self._base_model

    @property
    def model_type(self):
        """Get the model type."""
        return self._model_type

    def _initialize_model(self, **kwargs):
        if self.model_type == "azure":
            return azure_open_ai(
                model_id=kwargs.get("model_id", "gpt-4o"),
                engine=kwargs.get("engine", "4o"),
            )
        elif self.model_type == "ollama":
            return get_ollama_llm(**kwargs)
        elif self.model_type == "huggingface":
            return get_huggingface_llm(**kwargs)
        else:
            raise ValueError(f"Unsupported model type: {self.model_type}")

    def generate_response(self, prompt: str, **kwargs):
        """Generate a response from the model.

        Parameters
        ----------
        prompt : str
            The input prompt.
        **kwargs : dict
            Additional parameters for generation.

        Returns
        -------
        str
            Generated response.

        Examples
        --------
        Generate response using Azure OpenAI:

        >>> model = LLMModel(model_type='azure', model_id='gpt-4o')
        >>> response = model.generate_response("What is AI?") # doctest: +SKIP
        >>> print(response) # doctest: +SKIP

        Generate response using Ollama:

        >>> model = LLMModel(model_type='ollama', model_id='llama3')
        >>> response = model.generate_response("Explain quantum mechanics.") # doctest: +SKIP
        >>> print(response) # doctest: +SKIP

        Generate response using HuggingFace:

        >>> model = LLMModel(model_type='huggingface', model_name='distilgpt2') # doctest: +SKIP
        >>> response = model.generate_response("Write a poem about the sea.") # doctest: +SKIP
        >>> print(response) # doctest: +SKIP
        """
        if self.model_type in ["azure", "ollama", "huggingface"]:
            return self.base_model.complete(prompt, **kwargs)
        else:
            raise ValueError("Invalid model type")

`base_model` `property`

Get the base model.

`model_type` `property`

Get the model type.

`init(model_type: str, **kwargs)`

Initialize the LLM model.

Source code in src/llama_utils/utils/models.py

def __init__(self, model_type: str, **kwargs):
    """Initialize the LLM model."""
    self._model_type = model_type.lower()
    self._base_model = self._initialize_model(**kwargs)

`generate_response(prompt: str, **kwargs)`

Generate a response from the model.

Parameters:

Name	Type	Description	Default
`prompt`	`str`	The input prompt.	required
`**kwargs`	`dict`	Additional parameters for generation.	`{}`

Returns:

Type	Description
`str`	Generated response.

Examples:

Generate response using Azure OpenAI:

>>> model = LLMModel(model_type='azure', model_id='gpt-4o')
>>> response = model.generate_response("What is AI?")
>>> print(response)

Generate response using Ollama:

>>> model = LLMModel(model_type='ollama', model_id='llama3')
>>> response = model.generate_response("Explain quantum mechanics.")
>>> print(response)

Generate response using HuggingFace:

>>> model = LLMModel(model_type='huggingface', model_name='distilgpt2')
>>> response = model.generate_response("Write a poem about the sea.")
>>> print(response)

Source code in src/llama_utils/utils/models.py

def generate_response(self, prompt: str, **kwargs):
    """Generate a response from the model.

    Parameters
    ----------
    prompt : str
        The input prompt.
    **kwargs : dict
        Additional parameters for generation.

    Returns
    -------
    str
        Generated response.

    Examples
    --------
    Generate response using Azure OpenAI:

    >>> model = LLMModel(model_type='azure', model_id='gpt-4o')
    >>> response = model.generate_response("What is AI?") # doctest: +SKIP
    >>> print(response) # doctest: +SKIP

    Generate response using Ollama:

    >>> model = LLMModel(model_type='ollama', model_id='llama3')
    >>> response = model.generate_response("Explain quantum mechanics.") # doctest: +SKIP
    >>> print(response) # doctest: +SKIP

    Generate response using HuggingFace:

    >>> model = LLMModel(model_type='huggingface', model_name='distilgpt2') # doctest: +SKIP
    >>> response = model.generate_response("Write a poem about the sea.") # doctest: +SKIP
    >>> print(response) # doctest: +SKIP
    """
    if self.model_type in ["azure", "ollama", "huggingface"]:
        return self.base_model.complete(prompt, **kwargs)
    else:
        raise ValueError("Invalid model type")

`azure_open_ai(model_id: str = 'gpt-4o', engine: str = '4o')`

Get the Azure OpenAI model.

Parameters:

Name	Type	Description	Default
`model_id`	`str`	The model ID.	`'gpt-4o'`
`engine`	`str`	The engine.	`'4o'`

Returns:

Type	Description
`AzureOpenAI`	The Azure OpenAI model.

Raises:

Type	Description
`ImportError`	If the `llama-index-llms-azure-openai` package is not installed.

Examples:

>>> from llama_utils.utils.models import azure_open_ai
>>> from dotenv import load_dotenv
>>> load_dotenv()
>>> llm = azure_open_ai()
>>> print(llm.model)
gpt-4o
>>> print(llm.metadata)
context_window=128000 num_output=-1 is_chat_model=True is_function_calling_model=True model_name='gpt-4o' system_role=<MessageRole.SYSTEM: 'system'>

Source code in src/llama_utils/utils/models.py

def azure_open_ai(model_id: str = "gpt-4o", engine: str = "4o"):
    """Get the Azure OpenAI model.

    Parameters
    ----------
    model_id: str, optional, default is "gpt-4o"
        The model ID.
    engine: str, optional, default is "4o"
        The engine.

    Returns
    -------
    AzureOpenAI
        The Azure OpenAI model.

    Raises
    ------
    ImportError
        If the `llama-index-llms-azure-openai` package is not installed.

    Examples
    --------
    >>> from llama_utils.utils.models import azure_open_ai
    >>> from dotenv import load_dotenv
    >>> load_dotenv() # doctest: +SKIP
    >>> llm = azure_open_ai() # doctest: +SKIP
    >>> print(llm.model) # doctest: +SKIP
    gpt-4o
    >>> print(llm.metadata) # doctest: +SKIP
    context_window=128000 num_output=-1 is_chat_model=True is_function_calling_model=True model_name='gpt-4o' system_role=<MessageRole.SYSTEM: 'system'>
    """
    try:
        from llama_index.llms.azure_openai import AzureOpenAI
    except ImportError:
        raise ImportError(
            "Please install the `llama-index-llms-azure-openai` package to use the Azure OpenAI model."
        )
    endpoint = os.environ.get("AZURE_OPENAI_ENDPOINT")
    api_key = os.environ.get("AZURE_OPENAI_API_KEY")
    api_version = os.environ.get("AZURE_OPENAI_API_VERSION")

    if endpoint is None or api_key is None or api_version is None:
        warn("Azure OpenAI environment variables are not set.")

    llm = AzureOpenAI(
        engine="4o" if engine is None else engine,
        model="gpt-4o" if model_id is None else model_id,  # o1-preview
        temperature=0.0,
        azure_endpoint=endpoint,
        api_key=api_key,
        api_version=api_version,
    )

    return llm

`get_hugging_face_embedding(model_name: str = 'BAAI/bge-small-en-v1.5', cache_folder: str = None)`

Get the hugging face embedding model.

Parameters:

Name	Type	Description	Default
`model_name`	`str`	Name of the hugging face embedding model.	`'BAAI/bge-small-en-v1.5'`
`cache_folder`	`str`	Folder to cache the model. If not provided the function will search for - `LLAMA_INDEX_CACHE_DIR` in your environment variables. - `~/tmp/llama_index` if your OS is Linux. - `~/Library/Caches/llama_index` if your OS is MacOS. - `~/AppData/Local/llama_index` if your OS is Windows.	`None`

Returns:

Type	Description
`HuggingFaceEmbedding`	The hugging face embedding model.

Raises:

Type	Description
`ImportError`	If the `llama-index-embeddings-huggingface` package is not installed.

Examples:

>>> from llama_utils.utils.models import get_hugging_face_embedding
>>> embedding = get_hugging_face_embedding()
>>> print(embedding.model_name)
BAAI/bge-small-en-v1.5
>>> print(embedding.max_length)
512
>>> print(embedding.embed_batch_size)
10

Source code in src/llama_utils/utils/models.py

def get_hugging_face_embedding(
    model_name: str = "BAAI/bge-small-en-v1.5", cache_folder: str = None
):
    """Get the hugging face embedding model.

    Parameters
    ----------
    model_name: str, optional, default is "BAAI/bge-small-en-v1.5"
        Name of the hugging face embedding model.
    cache_folder: str, optional, default is None
        Folder to cache the model. If not provided the function will search for
        - `LLAMA_INDEX_CACHE_DIR` in your environment variables.
        - `~/tmp/llama_index` if your OS is Linux.
        - `~/Library/Caches/llama_index` if your OS is MacOS.
        - `~/AppData/Local/llama_index` if your OS is Windows.

    Returns
    -------
    HuggingFaceEmbedding
        The hugging face embedding model.

    Raises
    ------
    ImportError
        If the `llama-index-embeddings-huggingface` package is not installed.

    Examples
    --------
    >>> from llama_utils.utils.models import get_hugging_face_embedding
    >>> embedding = get_hugging_face_embedding()
    >>> print(embedding.model_name)
    BAAI/bge-small-en-v1.5
    >>> print(embedding.max_length)
    512
    >>> print(embedding.embed_batch_size)
    10
    """
    try:
        from llama_index.embeddings.huggingface import HuggingFaceEmbedding
    except ImportError:
        raise ImportError(
            "Please install the `llama-index-embeddings-huggingface` package to use the Hugging Face embedding model."
        )

    embedding = HuggingFaceEmbedding(model_name=model_name, cache_folder=cache_folder)
    return embedding

`get_huggingface_llm(**kwargs)`

Initializes and returns a HuggingFaceLLM instance with specified parameters.

Parameters:

Name	Type	Description	Default
`kwargs`		context_window : int, optional The maximum context window size, by default DEFAULT_CONTEXT_WINDOW. max_new_tokens : int, optional The maximum number of new tokens to generate, by default DEFAULT_NUM_OUTPUTS. generate_kwargs : dict, optional Additional arguments for text generation, by default {"temperature": 0.75, "do_sample": False}. query_wrapper_prompt : str, optional The wrapper prompt for query execution, by default "Answer the following question succinctly and informatively.". tokenizer_name : str, optional The tokenizer model name, by default DEFAULT_HUGGINGFACE_MODEL. model_name : str, optional The model name, by default DEFAULT_HUGGINGFACE_MODEL. device_map : str, optional The device mapping strategy, by default "auto". tokenizer_kwargs : dict, optional Additional tokenizer arguments, by default {"max_length": 2048}. model_kwargs : dict, optional Additional model arguments, by default {"torch_dtype": torch.float16}.	`{}`

Returns:

Type	Description
`HuggingFaceLLM`	An instance of the HuggingFaceLLM class.

Source code in src/llama_utils/utils/models.py

def get_huggingface_llm(**kwargs):
    """Initializes and returns a HuggingFaceLLM instance with specified parameters.

    Parameters
    ----------
    kwargs: dict
        context_window : int, optional
            The maximum context window size, by default DEFAULT_CONTEXT_WINDOW.
        max_new_tokens : int, optional
            The maximum number of new tokens to generate, by default DEFAULT_NUM_OUTPUTS.
        generate_kwargs : dict, optional
            Additional arguments for text generation, by default {"temperature": 0.75, "do_sample": False}.
        query_wrapper_prompt : str, optional
            The wrapper prompt for query execution, by default "Answer the following question succinctly and informatively.".
        tokenizer_name : str, optional
            The tokenizer model name, by default DEFAULT_HUGGINGFACE_MODEL.
        model_name : str, optional
            The model name, by default DEFAULT_HUGGINGFACE_MODEL.
        device_map : str, optional
            The device mapping strategy, by default "auto".
        tokenizer_kwargs : dict, optional
            Additional tokenizer arguments, by default {"max_length": 2048}.
        model_kwargs : dict, optional
            Additional model arguments, by default {"torch_dtype": torch.float16}.

    Returns
    -------
    HuggingFaceLLM
        An instance of the HuggingFaceLLM class.
    """
    try:
        import torch
        from llama_index.llms.huggingface import HuggingFaceLLM
    except ImportError:
        raise ImportError(
            "Please install the `llama-index-llms-huggingface` package to use the HuggingFaceLLM model."
        )

    return HuggingFaceLLM(
        context_window=kwargs.get("context_window", DEFAULT_CONTEXT_WINDOW),
        max_new_tokens=kwargs.get("max_new_tokens", DEFAULT_NUM_OUTPUTS),
        generate_kwargs=kwargs.get(
            "generate_kwargs", {"temperature": 0.75, "do_sample": False}
        ),
        query_wrapper_prompt=kwargs.get(
            "query_wrapper_prompt",
            "Answer the following question succinctly and informatively.",
        ),
        tokenizer_name=kwargs.get("tokenizer_name", DEFAULT_HUGGINGFACE_MODEL),
        model_name=kwargs.get("model_name", DEFAULT_HUGGINGFACE_MODEL),
        device_map=kwargs.get("device_map", "auto"),
        tokenizer_kwargs=kwargs.get("tokenizer_kwargs", {"max_length": 2048}),
        model_kwargs=kwargs.get(
            "model_kwargs",
            {"torch_dtype": torch.float16},
        ),
    )

`get_ollama_llm(model_id: str = 'llama3', base_url: str = 'http://localhost:11434', temperature: float = 0.75, context_window: int = DEFAULT_CONTEXT_WINDOW, request_timeout: float = 360.0, prompt_key: str = 'prompt', json_mode: bool = False, additional_kwargs: Dict[str, Any] = {}, is_function_calling_model: bool = True, keep_alive: Optional[Union[float, str]] = None)`

Get the Ollama LLM with flexible parameters.

Parameters:

Name	Type	Description	Default
`model_id`	`str`	The model ID to use.	`'llama3'`
`base_url`	`str`	The base URL of the Ollama API.	`'http://localhost:11434'`
`temperature`	`float`	The temperature setting for response randomness.	`0.75`
`context_window`	`int`	Maximum token window for context.	`DEFAULT_CONTEXT_WINDOW`
`request_timeout`	`float`	Timeout for requests.	`360.0`
`prompt_key`	`str`	Key for the prompt in requests.	`'prompt'`
`json_mode`	`bool`	Whether to return responses in JSON mode.	`False`
`additional_kwargs`	`dict`	Additional model-specific parameters.	`{}`
`is_function_calling_model`	`bool`	Whether the model supports function calling.	`True`
`keep_alive`	`Optional[Union[float, str]]`	Keep-alive duration.	`None`

Returns:

Type	Description
`Ollama`	An instance of the Ollama LLM.

Raises:

Type	Description
`ImportError`	If the `llama-index-llms-ollama` package is not installed.

Examples:

>>> from llama_utils.utils.models import get_ollama_llm
>>> llm = get_ollama_llm()
>>> print(llm.model)
llama3
>>> print(llm.base_url)
http://localhost:11434
>>> print(llm.metadata)
context_window=3900 num_output=256 is_chat_model=True is_function_calling_model=True model_name='llama3' system_role=<MessageRole.SYSTEM: 'system'>

Source code in src/llama_utils/utils/models.py

def get_ollama_llm(
    model_id: str = "llama3",
    base_url: str = "http://localhost:11434",
    temperature: float = 0.75,
    context_window: int = DEFAULT_CONTEXT_WINDOW,
    request_timeout: float = 360.0,
    prompt_key: str = "prompt",
    json_mode: bool = False,
    additional_kwargs: Dict[str, Any] = {},
    is_function_calling_model: bool = True,
    keep_alive: Optional[Union[float, str]] = None,
):
    """Get the Ollama LLM with flexible parameters.

    Parameters
    ----------
    model_id : str
        The model ID to use.
    base_url : str, optional
        The base URL of the Ollama API.
    temperature : float, optional
        The temperature setting for response randomness.
    context_window : int, optional
        Maximum token window for context.
    request_timeout : float, optional
        Timeout for requests.
    prompt_key : str, optional
        Key for the prompt in requests.
    json_mode : bool, optional
        Whether to return responses in JSON mode.
    additional_kwargs : dict, optional
        Additional model-specific parameters.
    is_function_calling_model : bool, optional
        Whether the model supports function calling.
    keep_alive : Optional[Union[float, str]], optional
        Keep-alive duration.

    Returns
    -------
    Ollama
        An instance of the Ollama LLM.

    Raises
    ------
    ImportError
        If the `llama-index-llms-ollama` package is not installed.

    Examples
    --------
    >>> from llama_utils.utils.models import get_ollama_llm
    >>> llm = get_ollama_llm()
    >>> print(llm.model)
    llama3
    >>> print(llm.base_url)
    http://localhost:11434
    >>> print(llm.metadata)
    context_window=3900 num_output=256 is_chat_model=True is_function_calling_model=True model_name='llama3' system_role=<MessageRole.SYSTEM: 'system'>
    """
    try:
        from llama_index.llms.ollama import Ollama
    except ImportError:
        raise ImportError(
            "Please install the `llama-index-llms-ollama` package to use the Ollama model."
        )

    return Ollama(
        model=model_id,
        base_url=base_url,
        temperature=temperature,
        context_window=context_window,
        request_timeout=request_timeout,
        prompt_key=prompt_key,
        json_mode=json_mode,
        additional_kwargs=additional_kwargs,
        is_function_calling_model=is_function_calling_model,
        keep_alive=keep_alive,
    )

config_loader

`llama_utils.utils.config_loader`

A class or function to load configuration.

`ConfigLoader`

A class or function to load configuration.

Source code in src/llama_utils/utils/config_loader.py

class ConfigLoader:
    """A class or function to load configuration."""

    def __init__(
        self,
        llm: Any = None,
        embedding: Any = None,
    ):
        """Initialize the ConfigLoader class.

        Parameters
        ----------
        llm: Any, optional, default is llama3
            llm model to use.
        embedding: Any, optional, default is BAAI/bge-small-en-v1.5
            Embedding model to use.

        Examples
        --------
        ```python
        >>> from llama_utils.utils.config_loader import ConfigLoader
        >>> config = ConfigLoader() # doctest: +SKIP
        >>> print(config.embedding) # doctest: +SKIP
        model_name='BAAI/bge-small-en-v1.5' embed_batch_size=10 callback_manager=<llama_index.core.callbacks.base.CallbackManager object at 0x000002919C16BD40> num_workers=None max_length=512 normalize=True query_instruction=None text_instruction=None cache_folder=None
        >>> print(config.llm.model) # doctest: +SKIP
        llama3

        ```
        """
        if llm is None:
            llm = get_ollama_llm()
        if embedding is None:
            embedding = get_hugging_face_embedding()

        Settings.embed_model = embedding
        Settings.llm = llm
        Settings.text_splitter = SentenceSplitter(chunk_size=1024, chunk_overlap=20)
        self._settings = Settings
        self._embedding = embedding
        self._llm = llm

    @property
    def settings(self):
        """Get the settings."""
        return self._settings

    @property
    def llm(self):
        """Get the llm model."""
        return self._llm

    @llm.setter
    def llm(self, value):
        self._llm = value
        Settings.llm = value

    @property
    def embedding(self):
        """Get the embedding model."""
        return self._embedding

    @embedding.setter
    def embedding(self, value):
        self._embedding = value
        Settings.embed_model = value

`embedding` `property` `writable`

Get the embedding model.

`llm` `property` `writable`

Get the llm model.

`settings` `property`

Get the settings.

`init(llm: Any = None, embedding: Any = None)`

Initialize the ConfigLoader class.

Parameters:

Name	Type	Description	Default
`llm`	`Any`	llm model to use.	`None`
`embedding`	`Any`	Embedding model to use.	`None`

Examples:

>>> from llama_utils.utils.config_loader import ConfigLoader
>>> config = ConfigLoader() # doctest: +SKIP
>>> print(config.embedding) # doctest: +SKIP
model_name='BAAI/bge-small-en-v1.5' embed_batch_size=10 callback_manager=<llama_index.core.callbacks.base.CallbackManager object at 0x000002919C16BD40> num_workers=None max_length=512 normalize=True query_instruction=None text_instruction=None cache_folder=None
>>> print(config.llm.model) # doctest: +SKIP
llama3

Source code in src/llama_utils/utils/config_loader.py

def __init__(
    self,
    llm: Any = None,
    embedding: Any = None,
):
    """Initialize the ConfigLoader class.

    Parameters
    ----------
    llm: Any, optional, default is llama3
        llm model to use.
    embedding: Any, optional, default is BAAI/bge-small-en-v1.5
        Embedding model to use.

    Examples
    --------
    ```python
    >>> from llama_utils.utils.config_loader import ConfigLoader
    >>> config = ConfigLoader() # doctest: +SKIP
    >>> print(config.embedding) # doctest: +SKIP
    model_name='BAAI/bge-small-en-v1.5' embed_batch_size=10 callback_manager=<llama_index.core.callbacks.base.CallbackManager object at 0x000002919C16BD40> num_workers=None max_length=512 normalize=True query_instruction=None text_instruction=None cache_folder=None
    >>> print(config.llm.model) # doctest: +SKIP
    llama3

    ```
    """
    if llm is None:
        llm = get_ollama_llm()
    if embedding is None:
        embedding = get_hugging_face_embedding()

    Settings.embed_model = embedding
    Settings.llm = llm
    Settings.text_splitter = SentenceSplitter(chunk_size=1024, chunk_overlap=20)
    self._settings = Settings
    self._embedding = embedding
    self._llm = llm

helper_functions

`llama_utils.utils.helper_functions`

A collection of helper functions used across different modules (e.g., text preprocessing, validation).

`HelperFunctions`

A collection of helper functions used across different modules (e.g., text preprocessing, validation).

Source code in src/llama_utils/utils/helper_functions.py

class HelperFunctions:
    """A collection of helper functions used across different modules (e.g., text preprocessing, validation)."""

    def __init__(self):
        """Initialize the helper functions."""
        pass

`init()`

Initialize the helper functions.

Source code in src/llama_utils/utils/helper_functions.py

def __init__(self):
    """Initialize the helper functions."""
    pass

`generate_content_hash(content: str)`

Generate a hash for the document content using SHA-256.

Parameters:

Name	Type	Description	Default
`content`	`str`	The content of the document.	required

Returns:

Type	Description
`str`	The SHA-256 hash of the content.

Source code in src/llama_utils/utils/helper_functions.py

def generate_content_hash(content: str):
    """Generate a hash for the document content using SHA-256.

    Parameters
    ----------
    content: str
        The content of the document.

    Returns
    -------
    str
        The SHA-256 hash of the content.
    """
    return hashlib.sha256(content.encode("utf-8")).hexdigest()

`is_sha256(string: str) -> bool`

Check if a string is a valid SHA-256 hash.

Parameters:

Name	Type	Description	Default
`string`	`str`	The string to check.	required

Returns:

Type	Description
`bool`	True if the string is a valid SHA-256 hash, False otherwise.

Source code in src/llama_utils/utils/helper_functions.py

def is_sha256(string: str) -> bool:
    """Check if a string is a valid SHA-256 hash.

    Parameters
    ----------
    string: str
        The string to check.

    Returns
    -------
    bool
        True if the string is a valid SHA-256 hash, False otherwise.
    """
    # SHA-256 hash must be 64 characters long and contain only hexadecimal characters
    return bool(re.fullmatch(r"[a-fA-F0-9]{64}", string))

loggers

`llama_utils.utils.logger`

Logger.

`Logger`

Logger class.

Source code in src/llama_utils/utils/logger.py

class Logger:
    """Logger class."""

    def __init__(self, name: str, level: int = logging.INFO, file_name: str = None):
        """Initialize the logger."""
        logging.basicConfig(
            level=level,
            format="%(asctime)s - %(name)s - %(levelname)s - %(message)s",
            filename=file_name if not None else "llama-utils.log",
        )
        self.logger = logging.getLogger(name)
        self.logger.addHandler(logging.StreamHandler(stream=sys.stdout))

`init(name: str, level: int = logging.INFO, file_name: str = None)`

Initialize the logger.

Source code in src/llama_utils/utils/logger.py

def __init__(self, name: str, level: int = logging.INFO, file_name: str = None):
    """Initialize the logger."""
    logging.basicConfig(
        level=level,
        format="%(asctime)s - %(name)s - %(levelname)s - %(message)s",
        filename=file_name if not None else "llama-utils.log",
    )
    self.logger = logging.getLogger(name)
    self.logger.addHandler(logging.StreamHandler(stream=sys.stdout))

errors

`llama_utils.utils.errors`

Errors module.

`StorageNotFoundError`

Bases: Exception

ReadOnlyError.

Source code in src/llama_utils/utils/errors.py

class StorageNotFoundError(Exception):
    """ReadOnlyError."""

    def __init__(self, error_message: str):
        """__init__."""

`init(error_message: str)`

init.

Source code in src/llama_utils/utils/errors.py

def __init__(self, error_message: str):
    """__init__."""

Utils

LLM models

llama_utils.utils.models

LLMModel

base_model property

model_type property

__init__(model_type: str, **kwargs)

generate_response(prompt: str, **kwargs)

azure_open_ai(model_id: str = 'gpt-4o', engine: str = '4o')

get_hugging_face_embedding(model_name: str = 'BAAI/bge-small-en-v1.5', cache_folder: str = None)

get_huggingface_llm(**kwargs)

config_loader

llama_utils.utils.config_loader

ConfigLoader

embedding property writable

llm property writable

settings property

__init__(llm: Any = None, embedding: Any = None)

helper_functions

llama_utils.utils.helper_functions

HelperFunctions

__init__()

generate_content_hash(content: str)

is_sha256(string: str) -> bool

loggers

llama_utils.utils.logger

Logger

__init__(name: str, level: int = logging.INFO, file_name: str = None)

errors

llama_utils.utils.errors

StorageNotFoundError

__init__(error_message: str)

`llama_utils.utils.models`

`LLMModel`

`base_model` `property`

`model_type` `property`

`init(model_type: str, **kwargs)`

`generate_response(prompt: str, **kwargs)`

`azure_open_ai(model_id: str = 'gpt-4o', engine: str = '4o')`

`get_hugging_face_embedding(model_name: str = 'BAAI/bge-small-en-v1.5', cache_folder: str = None)`

`get_huggingface_llm(**kwargs)`

`llama_utils.utils.config_loader`

`ConfigLoader`

`embedding` `property` `writable`

`llm` `property` `writable`

`settings` `property`

`init(llm: Any = None, embedding: Any = None)`

`llama_utils.utils.helper_functions`

`HelperFunctions`

`init()`

`generate_content_hash(content: str)`

`is_sha256(string: str) -> bool`

`llama_utils.utils.logger`

`Logger`

`init(name: str, level: int = logging.INFO, file_name: str = None)`

`llama_utils.utils.errors`

`StorageNotFoundError`

`init(error_message: str)`