## Configuration through environment variables
The app configuration can be controlled using environment variables, which you can specify in a `.env` file, for example. If you are containerising the app, you must update `.env.docker` before building the Docker image.
The following is a list of the configuration variables. All environment variables must be specified as quoted strings, such as `ENV_KEY = "value"`. The app performs type casting as necessary; the entries below note when a value is parsed as a non-string type. Some of the variables are required or conditionally required.
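For example, every value in the file is written as a quoted string, even when the app later casts it to a number (the variables used here are documented in the sections below):

```
# Stays a string.
LLM__PROVIDER = "Ollama"
# Cast to int by the app.
GRADIO__SERVER_PORT = "7860"
# Cast to float by the app.
LLM__TEMPERATURE = "0.4"
```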
### Gradio server settings
Note that the host and port variables listed below are not the same as the corresponding variables mentioned in the Gradio documentation; the variables that Gradio itself reads are ignored by this app.
- `GRADIO__SERVER_HOST`: The host name to which the Gradio server should bind. Default value: `localhost`. Set to `0.0.0.0` in `.env.docker`.
- `GRADIO__SERVER_PORT`: The port number on `GRADIO__SERVER_HOST` to which the Gradio server should bind. Default value: `7860`. Parsed as non-string type: `int`.
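For example, a container-friendly configuration, matching the `.env.docker` values noted above, looks like this:

```
# Bind on all network interfaces inside the container.
GRADIO__SERVER_HOST = "0.0.0.0"
# Quoted in the file; the app casts this to an int.
GRADIO__SERVER_PORT = "7860"
```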
### General large language model (LLM) settings
- `LLM__PROVIDER`: The provider of the language models. Default value: `Ollama`. Possible values are: `Open AI`, `Cohere`, `Groq` and `Ollama`.
- `LLM__TEMPERATURE`: The temperature parameter for the LLM. The value is usually a decimal number between `0` and `1`, but the actual range depends on the model. For example, for Open AI models, the value can be between `0` and `2`. Default value: `0.4`. Parsed as non-string type: `float`.
- `LLM__TOP_P`: The `top_p` parameter for the LLM, between `0` and `1`. Default value: `0.4`. Parsed as non-string type: `float`.
- `LLM__TOP_K`: The `top_k` parameter for the LLM. Default value: `40`. Parsed as non-string type: `int`.
- `LLM__REPEAT_PENALTY`: The repeat penalty parameter for the LLM. Default value: `1.1`. Parsed as non-string type: `float`.
- `LLM__SEED`: The seed parameter for the LLM. Default value: `1`. Parsed as non-string type: `int`.
For details about these parameter settings, see the documentation of your chosen LLM provider, such as the Ollama documentation on the valid model file parameters and their values. Note that for language model providers other than Ollama, all model parameter settings apart from `LLM__TEMPERATURE` are ignored.
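For example, a `.env` fragment that tunes an Ollama model using only the variables listed above might look like this (the values shown are the defaults):

```
LLM__PROVIDER = "Ollama"
# Sampling controls; cast to float by the app.
LLM__TEMPERATURE = "0.4"
LLM__TOP_P = "0.4"
# Honoured only when the provider is Ollama.
LLM__TOP_K = "40"
LLM__REPEAT_PENALTY = "1.1"
LLM__SEED = "1"
```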
### Language model specific settings
- `LLM__OPENAI_API_KEY`: (required only if the language model provider is Open AI) A valid Open AI API key.
- `LLM__OPENAI_MODEL`: The Open AI model you want to use. Default value: `gpt-4o-mini`.
- `LLM__COHERE_API_KEY`: (required only if the language model provider is Cohere) A valid Cohere API key.
- `LLM__COHERE_MODEL`: The Cohere model you want to use. Default value: `command-r-plus`.
- `LLM__GROQ_API_KEY`: (required only if the language model provider is Groq) A valid Groq API key.
- `LLM__GROQ_MODEL`: The Groq model you want to use. Default value: `llama3-groq-70b-8192-tool-use-preview`.
- `LLM__OLLAMA_URL`: The URL of a running Ollama instance. Default value: `http://localhost:11434`. Set to `http://host.docker.internal:11434` in `.env.docker`. (Learn about Docker networking and set it correctly as needed.)
- `LLM__OLLAMA_MODEL`: The Ollama model you want to use. Default value: `llama3`.
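As an illustration, switching the provider to Groq means setting the corresponding conditionally required API key alongside it (the key below is a placeholder, not a real value):

```
LLM__PROVIDER = "Groq"
# Required because the provider is Groq; replace with your own key.
LLM__GROQ_API_KEY = "<your-groq-api-key>"
# Optional; this is the default model.
LLM__GROQ_MODEL = "llama3-groq-70b-8192-tool-use-preview"
```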