Blame: docs/content/reference/cli-reference.md - mudler/LocalAI

mudler / LocalAI UNCLAIMED

:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more. Features: Generate Text, MCP, Audio, Video, Images, Voice Cloning, Distributed, P2P and decentralized inference

44640 0 4 Go

Normal View History Raw

chore(docs): improve documentation and split into sections bigger topics (#7292) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-11-17 18:39:21 +01:00			`+++`
			`disableToc = false`
			`title = "CLI Reference"`
			`weight = 25`
			`url = '/reference/cli-reference'`
			`+++`

			`Complete reference for all LocalAI command-line interface (CLI) parameters and environment variables.`

feat: docs revamp (#7313) * docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Small enhancements Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Enhancements * Default to zen-dark Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-11-19 22:21:20 +01:00			`> Note: All CLI flags can also be set via environment variables. Environment variables take precedence over CLI flags. See [.env files]({{%relref "advanced/advanced-usage#env-files" %}}) for configuration file support.`
chore(docs): improve documentation and split into sections bigger topics (#7292) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-11-17 18:39:21 +01:00
			`## Global Flags`

			`\| Parameter \| Default \| Description \| Environment Variable \|`
			`\|-----------\|---------\|-------------\|----------------------\|`
			\| `-h, --help` \| \| Show context-sensitive help \| \|
			\| `--log-level` \| `info` \| Set the level of logs to output [error,warn,info,debug,trace] \| `$LOCALAI_LOG_LEVEL` \|
			\| `--debug` \| `false` \| DEPRECATED - Use `--log-level=debug` instead. Enable debug logging \| `$LOCALAI_DEBUG`, `$DEBUG` \|

			`## Storage Flags`

			`\| Parameter \| Default \| Description \| Environment Variable \|`
			`\|-----------\|---------\|-------------\|----------------------\|`
			\| `--models-path` \| `BASEPATH/models` \| Path containing models used for inferencing \| `$LOCALAI_MODELS_PATH`, `$MODELS_PATH` \|
feat: Add --data-path CLI flag for persistent data separation (#8888) feat: add --data-path CLI flag for persistent data separation - Add LOCALAI_DATA_PATH environment variable and --data-path CLI flag - Default data path: /data (separate from configuration directory) - Automatic migration on startup: moves agent_tasks.json, agent_jobs.json, collections/, and assets/ from old config dir to new data path - Backward compatible: preserves old behavior if LOCALAI_DATA_PATH is not set - Agent state and job directories now use DataPath with proper fallback chain - Update documentation with new flag and docker-compose example This separates mutable persistent data (collectiondb, agents, assets, skills) from configuration files, enabling better volume mounting and data persistence in containerized deployments. Signed-off-by: localai-bot <localai-bot@noreply.github.com> Co-authored-by: localai-bot <localai-bot@noreply.github.com> 2026-03-09 14:11:15 +01:00			\| `--data-path` \| `BASEPATH/data` \| Path for persistent data (collectiondb, agent state, tasks, jobs). Separates mutable data from configuration \| `$LOCALAI_DATA_PATH` \|
chore(docs): improve documentation and split into sections bigger topics (#7292) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-11-17 18:39:21 +01:00			\| `--generated-content-path` \| `/tmp/generated/content` \| Location for assets generated by backends (e.g. stablediffusion, images, audio, videos) \| `$LOCALAI_GENERATED_CONTENT_PATH`, `$GENERATED_CONTENT_PATH` \|
			\| `--upload-path` \| `/tmp/localai/upload` \| Path to store uploads from files API \| `$LOCALAI_UPLOAD_PATH`, `$UPLOAD_PATH` \|
feat(ui): runtime settings (#7320) * feat(ui): add watchdog settings Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Do not re-read env Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Some refactor, move other settings to runtime (p2p) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add API Keys handling Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Allow to disable runtime settings Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Documentation Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Small fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * show MCP toggle in index Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Drop context default Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-11-20 22:37:20 +01:00			\| `--localai-config-dir` \| `BASEPATH/configuration` \| Directory for dynamic loading of certain configuration files (currently runtime_settings.json, api_keys.json, and external_backends.json). See [Runtime Settings]({{%relref "features/runtime-settings" %}}) for web-based configuration. \| `$LOCALAI_CONFIG_DIR` \|
chore(docs): improve documentation and split into sections bigger topics (#7292) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-11-17 18:39:21 +01:00			\| `--localai-config-dir-poll-interval` \| \| Time duration to poll the LocalAI Config Dir if your system has broken fsnotify events (example: `1m`) \| `$LOCALAI_CONFIG_DIR_POLL_INTERVAL` \|
			\| `--models-config-file` \| \| YAML file containing a list of model backend configs (alias: `--config-file`) \| `$LOCALAI_MODELS_CONFIG_FILE`, `$CONFIG_FILE` \|

			`## Backend Flags`

			`\| Parameter \| Default \| Description \| Environment Variable \|`
			`\|-----------\|---------\|-------------\|----------------------\|`
			\| `--backends-path` \| `BASEPATH/backends` \| Path containing backends used for inferencing \| `$LOCALAI_BACKENDS_PATH`, `$BACKENDS_PATH` \|
chore: switch from /usr/share to /var/lib for data storage (#7361) * More appropriate place for data storing The /usr/share subtree in Linux is used for data that generally are not supposed to change. Conventional places for changeable data are usually located under /var, so /var/lib seems to be a reasonable default here. * Data paths consistency fix * Directory name consistency fix 2025-11-27 11:18:28 +03:00			\| `--backends-system-path` \| `/var/lib/local-ai/backends` \| Path containing system backends used for inferencing \| `$LOCALAI_BACKENDS_SYSTEM_PATH`, `$BACKEND_SYSTEM_PATH` \|
chore(docs): improve documentation and split into sections bigger topics (#7292) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-11-17 18:39:21 +01:00			\| `--external-backends` \| \| A list of external backends to load from gallery on boot \| `$LOCALAI_EXTERNAL_BACKENDS`, `$EXTERNAL_BACKENDS` \|
			\| `--external-grpc-backends` \| \| A list of external gRPC backends (format: `BACKEND_NAME:URI`) \| `$LOCALAI_EXTERNAL_GRPC_BACKENDS`, `$EXTERNAL_GRPC_BACKENDS` \|
			\| `--backend-galleries` \| \| JSON list of backend galleries \| `$LOCALAI_BACKEND_GALLERIES`, `$BACKEND_GALLERIES` \|
			\| `--autoload-backend-galleries` \| `true` \| Automatically load backend galleries on startup \| `$LOCALAI_AUTOLOAD_BACKEND_GALLERIES`, `$AUTOLOAD_BACKEND_GALLERIES` \|
			\| `--parallel-requests` \| `false` \| Enable backends to handle multiple requests in parallel if they support it (e.g.: llama.cpp or vllm) \| `$LOCALAI_PARALLEL_REQUESTS`, `$PARALLEL_REQUESTS` \|
feat(loader): enhance single active backend to support LRU eviction (#7535) * feat(loader): refactor single active backend support to LRU This changeset introduces LRU management of loaded backends. Users can set now a maximum number of models to be loaded concurrently, and, when setting LocalAI in single active backend mode we set LRU to 1 for backward compatibility. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore: add tests Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Update docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-12-12 12:28:38 +01:00			\| `--max-active-backends` \| `0` \| Maximum number of active backends (loaded models). When exceeded, the least recently used model is evicted. Set to `0` for unlimited, `1` for single-backend mode \| `$LOCALAI_MAX_ACTIVE_BACKENDS`, `$MAX_ACTIVE_BACKENDS` \|
			\| `--single-active-backend` \| `false` \| DEPRECATED - Use `--max-active-backends=1` instead. Allow only one backend to be run at a time \| `$LOCALAI_SINGLE_ACTIVE_BACKEND`, `$SINGLE_ACTIVE_BACKEND` \|
chore(docs): improve documentation and split into sections bigger topics (#7292) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-11-17 18:39:21 +01:00			\| `--preload-backend-only` \| `false` \| Do not launch the API services, only the preloaded models/backends are started (useful for multi-node setups) \| `$LOCALAI_PRELOAD_BACKEND_ONLY`, `$PRELOAD_BACKEND_ONLY` \|
			\| `--enable-watchdog-idle` \| `false` \| Enable watchdog for stopping backends that are idle longer than the watchdog-idle-timeout \| `$LOCALAI_WATCHDOG_IDLE`, `$WATCHDOG_IDLE` \|
			\| `--watchdog-idle-timeout` \| `15m` \| Threshold beyond which an idle backend should be stopped \| `$LOCALAI_WATCHDOG_IDLE_TIMEOUT`, `$WATCHDOG_IDLE_TIMEOUT` \|
			\| `--enable-watchdog-busy` \| `false` \| Enable watchdog for stopping backends that are busy longer than the watchdog-busy-timeout \| `$LOCALAI_WATCHDOG_BUSY`, `$WATCHDOG_BUSY` \|
			\| `--watchdog-busy-timeout` \| `5m` \| Threshold beyond which a busy backend should be stopped \| `$LOCALAI_WATCHDOG_BUSY_TIMEOUT`, `$WATCHDOG_BUSY_TIMEOUT` \|
fix(cli): Fix watchdog running constantly and spamming logs (#8624) * Fix watchdog running constantly and spamming logs Signed-off-by: Andres Smith <andressmithdev@pm.me> * Update docs Signed-off-by: Andres Smith <andressmithdev@pm.me> --------- Signed-off-by: Andres Smith <andressmithdev@pm.me> 2026-02-23 11:57:28 +01:00			\| `--watchdog-interval` \| `500ms` \| Interval between watchdog checks (e.g., `500ms`, `5s`, `1m`) \| `$LOCALAI_WATCHDOG_INTERVAL`, `$WATCHDOG_INTERVAL` \|
feat: disable force eviction (#7725) * feat: allow to set forcing backends eviction while requests are in flight Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat: try to make the request sit and retry if eviction couldn't be done Otherwise calls that in order to pass would need to shutdown other backends would just fail. In this way instead we make the request sit and retry eviction until it succeeds. The thresholds can be configured by the user. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * add tests Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * expose settings to CLI Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Update docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-12-25 14:26:18 +01:00			\| `--force-eviction-when-busy` \| `false` \| Force eviction even when models have active API calls (default: false for safety). Warning: Enabling this can interrupt active requests \| `$LOCALAI_FORCE_EVICTION_WHEN_BUSY`, `$FORCE_EVICTION_WHEN_BUSY` \|
			\| `--lru-eviction-max-retries` \| `30` \| Maximum number of retries when waiting for busy models to become idle before eviction \| `$LOCALAI_LRU_EVICTION_MAX_RETRIES`, `$LRU_EVICTION_MAX_RETRIES` \|
			\| `--lru-eviction-retry-interval` \| `1s` \| Interval between retries when waiting for busy models to become idle (e.g., `1s`, `2s`) \| `$LOCALAI_LRU_EVICTION_RETRY_INTERVAL`, `$LRU_EVICTION_RETRY_INTERVAL` \|
chore(docs): improve documentation and split into sections bigger topics (#7292) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-11-17 18:39:21 +01:00
feat: docs revamp (#7313) * docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Small enhancements Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Enhancements * Default to zen-dark Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-11-19 22:21:20 +01:00			`For more information on VRAM management, see [VRAM and Memory Management]({{%relref "advanced/vram-management" %}}).`
chore(docs): improve documentation and split into sections bigger topics (#7292) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-11-17 18:39:21 +01:00
			`## Models Flags`

			`\| Parameter \| Default \| Description \| Environment Variable \|`
			`\|-----------\|---------\|-------------\|----------------------\|`
			\| `--galleries` \| \| JSON list of galleries \| `$LOCALAI_GALLERIES`, `$GALLERIES` \|
			\| `--autoload-galleries` \| `true` \| Automatically load galleries on startup \| `$LOCALAI_AUTOLOAD_GALLERIES`, `$AUTOLOAD_GALLERIES` \|
			\| `--preload-models` \| \| A list of models to apply in JSON at start \| `$LOCALAI_PRELOAD_MODELS`, `$PRELOAD_MODELS` \|
			\| `--models` \| \| A list of model configuration URLs to load \| `$LOCALAI_MODELS`, `$MODELS` \|
			\| `--preload-models-config` \| \| A list of models to apply at startup. Path to a YAML config file \| `$LOCALAI_PRELOAD_MODELS_CONFIG`, `$PRELOAD_MODELS_CONFIG` \|
			\| `--load-to-memory` \| \| A list of models to load into memory at startup \| `$LOCALAI_LOAD_TO_MEMORY`, `$LOAD_TO_MEMORY` \|

			> Note: You can also pass model configuration URLs as positional arguments: `local-ai run MODEL_URL1 MODEL_URL2 ...`

			`## Performance Flags`

			`\| Parameter \| Default \| Description \| Environment Variable \|`
			`\|-----------\|---------\|-------------\|----------------------\|`
			\| `--f16` \| `false` \| Enable GPU acceleration \| `$LOCALAI_F16`, `$F16` \|
			\| `-t, --threads` \| \| Number of threads used for parallel computation. Usage of the number of physical cores in the system is suggested \| `$LOCALAI_THREADS`, `$THREADS` \|
			\| `--context-size` \| \| Default context size for models \| `$LOCALAI_CONTEXT_SIZE`, `$CONTEXT_SIZE` \|

			`## API Flags`

			`\| Parameter \| Default \| Description \| Environment Variable \|`
			`\|-----------\|---------\|-------------\|----------------------\|`
			\| `--address` \| `:8080` \| Bind address for the API server \| `$LOCALAI_ADDRESS`, `$ADDRESS` \|
			\| `--cors` \| `false` \| Enable CORS (Cross-Origin Resource Sharing) \| `$LOCALAI_CORS`, `$CORS` \|
			\| `--cors-allow-origins` \| \| Comma-separated list of allowed CORS origins \| `$LOCALAI_CORS_ALLOW_ORIGINS`, `$CORS_ALLOW_ORIGINS` \|
			\| `--csrf` \| `false` \| Enable Fiber CSRF middleware \| `$LOCALAI_CSRF` \|
			\| `--upload-limit` \| `15` \| Default upload-limit in MB \| `$LOCALAI_UPLOAD_LIMIT`, `$UPLOAD_LIMIT` \|
			\| `--api-keys` \| \| List of API Keys to enable API authentication. When this is set, all requests must be authenticated with one of these API keys \| `$LOCALAI_API_KEY`, `$API_KEY` \|
			\| `--disable-webui` \| `false` \| Disables the web user interface. When set to true, the server will only expose API endpoints without serving the web interface \| `$LOCALAI_DISABLE_WEBUI`, `$DISABLE_WEBUI` \|
feat(ui): runtime settings (#7320) * feat(ui): add watchdog settings Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Do not re-read env Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Some refactor, move other settings to runtime (p2p) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add API Keys handling Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Allow to disable runtime settings Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Documentation Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Small fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * show MCP toggle in index Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Drop context default Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-11-20 22:37:20 +01:00			\| `--disable-runtime-settings` \| `false` \| Disables the runtime settings feature. When set to true, the server will not load runtime settings from the `runtime_settings.json` file and the settings web interface will be disabled \| `$LOCALAI_DISABLE_RUNTIME_SETTINGS`, `$DISABLE_RUNTIME_SETTINGS` \|
chore(docs): improve documentation and split into sections bigger topics (#7292) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-11-17 18:39:21 +01:00			\| `--disable-gallery-endpoint` \| `false` \| Disable the gallery endpoints \| `$LOCALAI_DISABLE_GALLERY_ENDPOINT`, `$DISABLE_GALLERY_ENDPOINT` \|
			\| `--disable-metrics-endpoint` \| `false` \| Disable the `/metrics` endpoint \| `$LOCALAI_DISABLE_METRICS_ENDPOINT`, `$DISABLE_METRICS_ENDPOINT` \|
			\| `--machine-tag` \| \| If not empty, add that string to Machine-Tag header in each response. Useful to track response from different machines using multiple P2P federated nodes \| `$LOCALAI_MACHINE_TAG`, `$MACHINE_TAG` \|

			`## Hardening Flags`

			`\| Parameter \| Default \| Description \| Environment Variable \|`
			`\|-----------\|---------\|-------------\|----------------------\|`
			\| `--disable-predownload-scan` \| `false` \| If true, disables the best-effort security scanner before downloading any files \| `$LOCALAI_DISABLE_PREDOWNLOAD_SCAN` \|
			\| `--opaque-errors` \| `false` \| If true, all error responses are replaced with blank 500 errors. This is intended only for hardening against information leaks and is normally not recommended \| `$LOCALAI_OPAQUE_ERRORS` \|
			\| `--use-subtle-key-comparison` \| `false` \| If true, API Key validation comparisons will be performed using constant-time comparisons rather than simple equality. This trades off performance on each request for resilience against timing attacks \| `$LOCALAI_SUBTLE_KEY_COMPARISON` \|
			\| `--disable-api-key-requirement-for-http-get` \| `false` \| If true, a valid API key is not required to issue GET requests to portions of the web UI. This should only be enabled in secure testing environments \| `$LOCALAI_DISABLE_API_KEY_REQUIREMENT_FOR_HTTP_GET` \|
fix(ui): Move routes to /app to avoid conflict with API endpoints (#8978) Also test for regressions in HTTP GET API key exempted endpoints because this list can get out of sync with the UI routes. Also fix support for proxying on a different prefix both server and client side. Signed-off-by: Richard Palethorpe <io@richiejp.com> 2026-03-13 20:38:18 +00:00			\| `--http-get-exempted-endpoints` \| `^/$,^/app(/.)?$,^/browse(/.)?$,^/login/?$,^/explorer/?$,^/assets/.$,^/static/.$,^/swagger.*$` \| If `--disable-api-key-requirement-for-http-get` is overridden to true, this is the list of endpoints to exempt. Only adjust this in case of a security incident or as a result of a personal security posture review \| `$LOCALAI_HTTP_GET_EXEMPTED_ENDPOINTS` \|
chore(docs): improve documentation and split into sections bigger topics (#7292) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-11-17 18:39:21 +01:00
feat: add users and authentication support (#9061) * feat(ui): add users and authentication support Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat: allow the admin user to impersonificate users Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore: ui improvements, disable 'Users' button in navbar when no auth is configured Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat: add OIDC support Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix: gate models Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore: cache requests to optimize speed Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * small UI enhancements Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore(ui): style improvements Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix: cover other paths by auth Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore: separate local auth, refactor Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * security hardening, approval mode Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix: fix tests and expectations Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore: update localagi/localrecall Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2026-03-19 21:40:51 +01:00			`## Authentication Flags`

			`\| Parameter \| Default \| Description \| Environment Variable \|`
			`\|-----------\|---------\|-------------\|----------------------\|`
			\| `--auth-enabled` \| `false` \| Enable user authentication and authorization \| `$LOCALAI_AUTH` \|
			\| `--auth-database-url` \| `{DataPath}/database.db` \| Database URL for auth — `postgres://...` for PostgreSQL, or a file path for SQLite \| `$LOCALAI_AUTH_DATABASE_URL`, `$DATABASE_URL` \|
			\| `--github-client-id` \| \| GitHub OAuth App Client ID (auto-enables auth when set) \| `$GITHUB_CLIENT_ID` \|
			\| `--github-client-secret` \| \| GitHub OAuth App Client Secret \| `$GITHUB_CLIENT_SECRET` \|
			\| `--oidc-issuer` \| \| OIDC issuer URL for auto-discovery \| `$LOCALAI_OIDC_ISSUER` \|
			\| `--oidc-client-id` \| \| OIDC Client ID (auto-enables auth when set) \| `$LOCALAI_OIDC_CLIENT_ID` \|
			\| `--oidc-client-secret` \| \| OIDC Client Secret \| `$LOCALAI_OIDC_CLIENT_SECRET` \|
			\| `--auth-base-url` \| \| Base URL for OAuth callbacks (e.g. `http://localhost:8080`) \| `$LOCALAI_BASE_URL` \|
			\| `--auth-admin-email` \| \| Email address to auto-promote to admin role on login \| `$LOCALAI_ADMIN_EMAIL` \|
			\| `--auth-registration-mode` \| `open` \| Registration mode: `open`, `approval`, or `invite` \| `$LOCALAI_REGISTRATION_MODE` \|
			\| `--disable-local-auth` \| `false` \| Disable local email/password registration and login (for OAuth/OIDC-only setups) \| `$LOCALAI_DISABLE_LOCAL_AUTH` \|

			`See [Authentication & Authorization]({{%relref "features/authentication" %}}) for full documentation.`

chore(docs): improve documentation and split into sections bigger topics (#7292) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-11-17 18:39:21 +01:00			`## P2P Flags`

			`\| Parameter \| Default \| Description \| Environment Variable \|`
			`\|-----------\|---------\|-------------\|----------------------\|`
			\| `--p2p` \| `false` \| Enable P2P mode \| `$LOCALAI_P2P`, `$P2P` \|
			\| `--p2p-dht-interval` \| `360` \| Interval for DHT refresh (used during token generation) \| `$LOCALAI_P2P_DHT_INTERVAL`, `$P2P_DHT_INTERVAL` \|
			\| `--p2p-otp-interval` \| `9000` \| Interval for OTP refresh (used during token generation) \| `$LOCALAI_P2P_OTP_INTERVAL`, `$P2P_OTP_INTERVAL` \|
			\| `--p2ptoken` \| \| Token for P2P mode (optional) \| `$LOCALAI_P2P_TOKEN`, `$P2P_TOKEN`, `$TOKEN` \|
			\| `--p2p-network-id` \| \| Network ID for P2P mode, can be set arbitrarily by the user for grouping a set of instances \| `$LOCALAI_P2P_NETWORK_ID`, `$P2P_NETWORK_ID` \|
			\| `--federated` \| `false` \| Enable federated instance \| `$LOCALAI_FEDERATED`, `$FEDERATED` \|

			`## Other Commands`

			LocalAI supports several subcommands beyond `run`:

			- `local-ai models` - Manage LocalAI models and definitions
			- `local-ai backends` - Manage LocalAI backends and definitions
			- `local-ai tts` - Convert text to speech
			- `local-ai sound-generation` - Generate audio files from text or audio
			- `local-ai transcript` - Convert audio to text
			- `local-ai worker` - Run workers to distribute workload (llama.cpp-only)
			- `local-ai util` - Utility commands
			- `local-ai explorer` - Run P2P explorer
			- `local-ai federated` - Run LocalAI in federated mode

			Use `local-ai <command> --help` for more information on each command.

			`## Examples`

			`### Basic Usage`

			```bash
			`./local-ai run`

			`./local-ai run --models-path /path/to/models --address :9090`

			`./local-ai run --f16`
			```

			`### Environment Variables`

			```bash
			`export LOCALAI_MODELS_PATH=/path/to/models`
			`export LOCALAI_ADDRESS=:9090`
			`export LOCALAI_F16=true`
			`./local-ai run`
			```

			`### Advanced Configuration`

			```bash
			`./local-ai run \`
			`--models model1.yaml model2.yaml \`
			`--enable-watchdog-idle \`
			`--watchdog-idle-timeout=10m \`
			`--p2p \`
			`--federated`
			```

			`## Related Documentation`

feat: docs revamp (#7313) * docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Small enhancements Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Enhancements * Default to zen-dark Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-11-19 22:21:20 +01:00			`- See [Advanced Usage]({{%relref "advanced/advanced-usage" %}}) for configuration examples`
			`- See [VRAM and Memory Management]({{%relref "advanced/vram-management" %}}) for memory management options`
chore(docs): improve documentation and split into sections bigger topics (#7292) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-11-17 18:39:21 +01:00

chore(docs): improve documentation and split into sections bigger topics (#7292) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-11-17 18:39:21 +01:00			`+++`
			`disableToc = false`
			`title = "CLI Reference"`
			`weight = 25`
			`url = '/reference/cli-reference'`
			`+++`

			`Complete reference for all LocalAI command-line interface (CLI) parameters and environment variables.`

feat: docs revamp (#7313) * docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Small enhancements Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Enhancements * Default to zen-dark Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-11-19 22:21:20 +01:00			`> Note: All CLI flags can also be set via environment variables. Environment variables take precedence over CLI flags. See [.env files]({{%relref "advanced/advanced-usage#env-files" %}}) for configuration file support.`
chore(docs): improve documentation and split into sections bigger topics (#7292) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-11-17 18:39:21 +01:00
			`## Global Flags`

			`\| Parameter \| Default \| Description \| Environment Variable \|`
			`\|-----------\|---------\|-------------\|----------------------\|`
			\| `-h, --help` \| \| Show context-sensitive help \| \|
			\| `--log-level` \| `info` \| Set the level of logs to output [error,warn,info,debug,trace] \| `$LOCALAI_LOG_LEVEL` \|
			\| `--debug` \| `false` \| DEPRECATED - Use `--log-level=debug` instead. Enable debug logging \| `$LOCALAI_DEBUG`, `$DEBUG` \|

			`## Storage Flags`

			`\| Parameter \| Default \| Description \| Environment Variable \|`
			`\|-----------\|---------\|-------------\|----------------------\|`
			\| `--models-path` \| `BASEPATH/models` \| Path containing models used for inferencing \| `$LOCALAI_MODELS_PATH`, `$MODELS_PATH` \|
feat: Add --data-path CLI flag for persistent data separation (#8888) feat: add --data-path CLI flag for persistent data separation - Add LOCALAI_DATA_PATH environment variable and --data-path CLI flag - Default data path: /data (separate from configuration directory) - Automatic migration on startup: moves agent_tasks.json, agent_jobs.json, collections/, and assets/ from old config dir to new data path - Backward compatible: preserves old behavior if LOCALAI_DATA_PATH is not set - Agent state and job directories now use DataPath with proper fallback chain - Update documentation with new flag and docker-compose example This separates mutable persistent data (collectiondb, agents, assets, skills) from configuration files, enabling better volume mounting and data persistence in containerized deployments. Signed-off-by: localai-bot <localai-bot@noreply.github.com> Co-authored-by: localai-bot <localai-bot@noreply.github.com> 2026-03-09 14:11:15 +01:00			\| `--data-path` \| `BASEPATH/data` \| Path for persistent data (collectiondb, agent state, tasks, jobs). Separates mutable data from configuration \| `$LOCALAI_DATA_PATH` \|
chore(docs): improve documentation and split into sections bigger topics (#7292) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-11-17 18:39:21 +01:00			\| `--generated-content-path` \| `/tmp/generated/content` \| Location for assets generated by backends (e.g. stablediffusion, images, audio, videos) \| `$LOCALAI_GENERATED_CONTENT_PATH`, `$GENERATED_CONTENT_PATH` \|
			\| `--upload-path` \| `/tmp/localai/upload` \| Path to store uploads from files API \| `$LOCALAI_UPLOAD_PATH`, `$UPLOAD_PATH` \|
feat(ui): runtime settings (#7320) * feat(ui): add watchdog settings Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Do not re-read env Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Some refactor, move other settings to runtime (p2p) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add API Keys handling Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Allow to disable runtime settings Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Documentation Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Small fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * show MCP toggle in index Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Drop context default Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-11-20 22:37:20 +01:00			\| `--localai-config-dir` \| `BASEPATH/configuration` \| Directory for dynamic loading of certain configuration files (currently runtime_settings.json, api_keys.json, and external_backends.json). See [Runtime Settings]({{%relref "features/runtime-settings" %}}) for web-based configuration. \| `$LOCALAI_CONFIG_DIR` \|
chore(docs): improve documentation and split into sections bigger topics (#7292) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-11-17 18:39:21 +01:00			\| `--localai-config-dir-poll-interval` \| \| Time duration to poll the LocalAI Config Dir if your system has broken fsnotify events (example: `1m`) \| `$LOCALAI_CONFIG_DIR_POLL_INTERVAL` \|
			\| `--models-config-file` \| \| YAML file containing a list of model backend configs (alias: `--config-file`) \| `$LOCALAI_MODELS_CONFIG_FILE`, `$CONFIG_FILE` \|

			`## Backend Flags`

			`\| Parameter \| Default \| Description \| Environment Variable \|`
			`\|-----------\|---------\|-------------\|----------------------\|`
			\| `--backends-path` \| `BASEPATH/backends` \| Path containing backends used for inferencing \| `$LOCALAI_BACKENDS_PATH`, `$BACKENDS_PATH` \|
chore: switch from /usr/share to /var/lib for data storage (#7361) * More appropriate place for data storing The /usr/share subtree in Linux is used for data that generally are not supposed to change. Conventional places for changeable data are usually located under /var, so /var/lib seems to be a reasonable default here. * Data paths consistency fix * Directory name consistency fix 2025-11-27 11:18:28 +03:00			\| `--backends-system-path` \| `/var/lib/local-ai/backends` \| Path containing system backends used for inferencing \| `$LOCALAI_BACKENDS_SYSTEM_PATH`, `$BACKEND_SYSTEM_PATH` \|
chore(docs): improve documentation and split into sections bigger topics (#7292) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-11-17 18:39:21 +01:00			\| `--external-backends` \| \| A list of external backends to load from gallery on boot \| `$LOCALAI_EXTERNAL_BACKENDS`, `$EXTERNAL_BACKENDS` \|
			\| `--external-grpc-backends` \| \| A list of external gRPC backends (format: `BACKEND_NAME:URI`) \| `$LOCALAI_EXTERNAL_GRPC_BACKENDS`, `$EXTERNAL_GRPC_BACKENDS` \|
			\| `--backend-galleries` \| \| JSON list of backend galleries \| `$LOCALAI_BACKEND_GALLERIES`, `$BACKEND_GALLERIES` \|
			\| `--autoload-backend-galleries` \| `true` \| Automatically load backend galleries on startup \| `$LOCALAI_AUTOLOAD_BACKEND_GALLERIES`, `$AUTOLOAD_BACKEND_GALLERIES` \|
			\| `--parallel-requests` \| `false` \| Enable backends to handle multiple requests in parallel if they support it (e.g.: llama.cpp or vllm) \| `$LOCALAI_PARALLEL_REQUESTS`, `$PARALLEL_REQUESTS` \|
feat(loader): enhance single active backend to support LRU eviction (#7535) * feat(loader): refactor single active backend support to LRU This changeset introduces LRU management of loaded backends. Users can set now a maximum number of models to be loaded concurrently, and, when setting LocalAI in single active backend mode we set LRU to 1 for backward compatibility. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore: add tests Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Update docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-12-12 12:28:38 +01:00			\| `--max-active-backends` \| `0` \| Maximum number of active backends (loaded models). When exceeded, the least recently used model is evicted. Set to `0` for unlimited, `1` for single-backend mode \| `$LOCALAI_MAX_ACTIVE_BACKENDS`, `$MAX_ACTIVE_BACKENDS` \|
			\| `--single-active-backend` \| `false` \| DEPRECATED - Use `--max-active-backends=1` instead. Allow only one backend to be run at a time \| `$LOCALAI_SINGLE_ACTIVE_BACKEND`, `$SINGLE_ACTIVE_BACKEND` \|
chore(docs): improve documentation and split into sections bigger topics (#7292) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-11-17 18:39:21 +01:00			\| `--preload-backend-only` \| `false` \| Do not launch the API services, only the preloaded models/backends are started (useful for multi-node setups) \| `$LOCALAI_PRELOAD_BACKEND_ONLY`, `$PRELOAD_BACKEND_ONLY` \|
			\| `--enable-watchdog-idle` \| `false` \| Enable watchdog for stopping backends that are idle longer than the watchdog-idle-timeout \| `$LOCALAI_WATCHDOG_IDLE`, `$WATCHDOG_IDLE` \|
			\| `--watchdog-idle-timeout` \| `15m` \| Threshold beyond which an idle backend should be stopped \| `$LOCALAI_WATCHDOG_IDLE_TIMEOUT`, `$WATCHDOG_IDLE_TIMEOUT` \|
			\| `--enable-watchdog-busy` \| `false` \| Enable watchdog for stopping backends that are busy longer than the watchdog-busy-timeout \| `$LOCALAI_WATCHDOG_BUSY`, `$WATCHDOG_BUSY` \|
			\| `--watchdog-busy-timeout` \| `5m` \| Threshold beyond which a busy backend should be stopped \| `$LOCALAI_WATCHDOG_BUSY_TIMEOUT`, `$WATCHDOG_BUSY_TIMEOUT` \|
fix(cli): Fix watchdog running constantly and spamming logs (#8624) * Fix watchdog running constantly and spamming logs Signed-off-by: Andres Smith <andressmithdev@pm.me> * Update docs Signed-off-by: Andres Smith <andressmithdev@pm.me> --------- Signed-off-by: Andres Smith <andressmithdev@pm.me> 2026-02-23 11:57:28 +01:00			\| `--watchdog-interval` \| `500ms` \| Interval between watchdog checks (e.g., `500ms`, `5s`, `1m`) \| `$LOCALAI_WATCHDOG_INTERVAL`, `$WATCHDOG_INTERVAL` \|
feat: disable force eviction (#7725) * feat: allow to set forcing backends eviction while requests are in flight Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat: try to make the request sit and retry if eviction couldn't be done Otherwise calls that in order to pass would need to shutdown other backends would just fail. In this way instead we make the request sit and retry eviction until it succeeds. The thresholds can be configured by the user. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * add tests Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * expose settings to CLI Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Update docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-12-25 14:26:18 +01:00			\| `--force-eviction-when-busy` \| `false` \| Force eviction even when models have active API calls (default: false for safety). Warning: Enabling this can interrupt active requests \| `$LOCALAI_FORCE_EVICTION_WHEN_BUSY`, `$FORCE_EVICTION_WHEN_BUSY` \|
			\| `--lru-eviction-max-retries` \| `30` \| Maximum number of retries when waiting for busy models to become idle before eviction \| `$LOCALAI_LRU_EVICTION_MAX_RETRIES`, `$LRU_EVICTION_MAX_RETRIES` \|
			\| `--lru-eviction-retry-interval` \| `1s` \| Interval between retries when waiting for busy models to become idle (e.g., `1s`, `2s`) \| `$LOCALAI_LRU_EVICTION_RETRY_INTERVAL`, `$LRU_EVICTION_RETRY_INTERVAL` \|
chore(docs): improve documentation and split into sections bigger topics (#7292) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-11-17 18:39:21 +01:00
feat: docs revamp (#7313) * docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Small enhancements Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Enhancements * Default to zen-dark Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-11-19 22:21:20 +01:00			`For more information on VRAM management, see [VRAM and Memory Management]({{%relref "advanced/vram-management" %}}).`
chore(docs): improve documentation and split into sections bigger topics (#7292) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-11-17 18:39:21 +01:00
			`## Models Flags`

			`\| Parameter \| Default \| Description \| Environment Variable \|`
			`\|-----------\|---------\|-------------\|----------------------\|`
			\| `--galleries` \| \| JSON list of galleries \| `$LOCALAI_GALLERIES`, `$GALLERIES` \|
			\| `--autoload-galleries` \| `true` \| Automatically load galleries on startup \| `$LOCALAI_AUTOLOAD_GALLERIES`, `$AUTOLOAD_GALLERIES` \|
			\| `--preload-models` \| \| A list of models to apply in JSON at start \| `$LOCALAI_PRELOAD_MODELS`, `$PRELOAD_MODELS` \|
			\| `--models` \| \| A list of model configuration URLs to load \| `$LOCALAI_MODELS`, `$MODELS` \|
			\| `--preload-models-config` \| \| A list of models to apply at startup. Path to a YAML config file \| `$LOCALAI_PRELOAD_MODELS_CONFIG`, `$PRELOAD_MODELS_CONFIG` \|
			\| `--load-to-memory` \| \| A list of models to load into memory at startup \| `$LOCALAI_LOAD_TO_MEMORY`, `$LOAD_TO_MEMORY` \|

			> Note: You can also pass model configuration URLs as positional arguments: `local-ai run MODEL_URL1 MODEL_URL2 ...`

			`## Performance Flags`

			`\| Parameter \| Default \| Description \| Environment Variable \|`
			`\|-----------\|---------\|-------------\|----------------------\|`
			\| `--f16` \| `false` \| Enable GPU acceleration \| `$LOCALAI_F16`, `$F16` \|
			\| `-t, --threads` \| \| Number of threads used for parallel computation. Usage of the number of physical cores in the system is suggested \| `$LOCALAI_THREADS`, `$THREADS` \|
			\| `--context-size` \| \| Default context size for models \| `$LOCALAI_CONTEXT_SIZE`, `$CONTEXT_SIZE` \|

			`## API Flags`

			`\| Parameter \| Default \| Description \| Environment Variable \|`
			`\|-----------\|---------\|-------------\|----------------------\|`
			\| `--address` \| `:8080` \| Bind address for the API server \| `$LOCALAI_ADDRESS`, `$ADDRESS` \|
			\| `--cors` \| `false` \| Enable CORS (Cross-Origin Resource Sharing) \| `$LOCALAI_CORS`, `$CORS` \|
			\| `--cors-allow-origins` \| \| Comma-separated list of allowed CORS origins \| `$LOCALAI_CORS_ALLOW_ORIGINS`, `$CORS_ALLOW_ORIGINS` \|
			\| `--csrf` \| `false` \| Enable Fiber CSRF middleware \| `$LOCALAI_CSRF` \|
			\| `--upload-limit` \| `15` \| Default upload-limit in MB \| `$LOCALAI_UPLOAD_LIMIT`, `$UPLOAD_LIMIT` \|
			\| `--api-keys` \| \| List of API Keys to enable API authentication. When this is set, all requests must be authenticated with one of these API keys \| `$LOCALAI_API_KEY`, `$API_KEY` \|
			\| `--disable-webui` \| `false` \| Disables the web user interface. When set to true, the server will only expose API endpoints without serving the web interface \| `$LOCALAI_DISABLE_WEBUI`, `$DISABLE_WEBUI` \|
feat(ui): runtime settings (#7320) * feat(ui): add watchdog settings Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Do not re-read env Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Some refactor, move other settings to runtime (p2p) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add API Keys handling Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Allow to disable runtime settings Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Documentation Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Small fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * show MCP toggle in index Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Drop context default Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-11-20 22:37:20 +01:00			\| `--disable-runtime-settings` \| `false` \| Disables the runtime settings feature. When set to true, the server will not load runtime settings from the `runtime_settings.json` file and the settings web interface will be disabled \| `$LOCALAI_DISABLE_RUNTIME_SETTINGS`, `$DISABLE_RUNTIME_SETTINGS` \|
chore(docs): improve documentation and split into sections bigger topics (#7292) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-11-17 18:39:21 +01:00			\| `--disable-gallery-endpoint` \| `false` \| Disable the gallery endpoints \| `$LOCALAI_DISABLE_GALLERY_ENDPOINT`, `$DISABLE_GALLERY_ENDPOINT` \|
			\| `--disable-metrics-endpoint` \| `false` \| Disable the `/metrics` endpoint \| `$LOCALAI_DISABLE_METRICS_ENDPOINT`, `$DISABLE_METRICS_ENDPOINT` \|
			\| `--machine-tag` \| \| If not empty, add that string to Machine-Tag header in each response. Useful to track response from different machines using multiple P2P federated nodes \| `$LOCALAI_MACHINE_TAG`, `$MACHINE_TAG` \|

			`## Hardening Flags`

			`\| Parameter \| Default \| Description \| Environment Variable \|`
			`\|-----------\|---------\|-------------\|----------------------\|`
			\| `--disable-predownload-scan` \| `false` \| If true, disables the best-effort security scanner before downloading any files \| `$LOCALAI_DISABLE_PREDOWNLOAD_SCAN` \|
			\| `--opaque-errors` \| `false` \| If true, all error responses are replaced with blank 500 errors. This is intended only for hardening against information leaks and is normally not recommended \| `$LOCALAI_OPAQUE_ERRORS` \|
			\| `--use-subtle-key-comparison` \| `false` \| If true, API Key validation comparisons will be performed using constant-time comparisons rather than simple equality. This trades off performance on each request for resilience against timing attacks \| `$LOCALAI_SUBTLE_KEY_COMPARISON` \|
			\| `--disable-api-key-requirement-for-http-get` \| `false` \| If true, a valid API key is not required to issue GET requests to portions of the web UI. This should only be enabled in secure testing environments \| `$LOCALAI_DISABLE_API_KEY_REQUIREMENT_FOR_HTTP_GET` \|
fix(ui): Move routes to /app to avoid conflict with API endpoints (#8978) Also test for regressions in HTTP GET API key exempted endpoints because this list can get out of sync with the UI routes. Also fix support for proxying on a different prefix both server and client side. Signed-off-by: Richard Palethorpe <io@richiejp.com> 2026-03-13 20:38:18 +00:00			\| `--http-get-exempted-endpoints` \| `^/$,^/app(/.)?$,^/browse(/.)?$,^/login/?$,^/explorer/?$,^/assets/.$,^/static/.$,^/swagger.*$` \| If `--disable-api-key-requirement-for-http-get` is overridden to true, this is the list of endpoints to exempt. Only adjust this in case of a security incident or as a result of a personal security posture review \| `$LOCALAI_HTTP_GET_EXEMPTED_ENDPOINTS` \|
chore(docs): improve documentation and split into sections bigger topics (#7292) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-11-17 18:39:21 +01:00
feat: add users and authentication support (#9061) * feat(ui): add users and authentication support Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat: allow the admin user to impersonificate users Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore: ui improvements, disable 'Users' button in navbar when no auth is configured Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat: add OIDC support Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix: gate models Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore: cache requests to optimize speed Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * small UI enhancements Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore(ui): style improvements Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix: cover other paths by auth Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore: separate local auth, refactor Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * security hardening, approval mode Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix: fix tests and expectations Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore: update localagi/localrecall Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2026-03-19 21:40:51 +01:00			`## Authentication Flags`

			`\| Parameter \| Default \| Description \| Environment Variable \|`
			`\|-----------\|---------\|-------------\|----------------------\|`
			\| `--auth-enabled` \| `false` \| Enable user authentication and authorization \| `$LOCALAI_AUTH` \|
			\| `--auth-database-url` \| `{DataPath}/database.db` \| Database URL for auth — `postgres://...` for PostgreSQL, or a file path for SQLite \| `$LOCALAI_AUTH_DATABASE_URL`, `$DATABASE_URL` \|
			\| `--github-client-id` \| \| GitHub OAuth App Client ID (auto-enables auth when set) \| `$GITHUB_CLIENT_ID` \|
			\| `--github-client-secret` \| \| GitHub OAuth App Client Secret \| `$GITHUB_CLIENT_SECRET` \|
			\| `--oidc-issuer` \| \| OIDC issuer URL for auto-discovery \| `$LOCALAI_OIDC_ISSUER` \|
			\| `--oidc-client-id` \| \| OIDC Client ID (auto-enables auth when set) \| `$LOCALAI_OIDC_CLIENT_ID` \|
			\| `--oidc-client-secret` \| \| OIDC Client Secret \| `$LOCALAI_OIDC_CLIENT_SECRET` \|
			\| `--auth-base-url` \| \| Base URL for OAuth callbacks (e.g. `http://localhost:8080`) \| `$LOCALAI_BASE_URL` \|
			\| `--auth-admin-email` \| \| Email address to auto-promote to admin role on login \| `$LOCALAI_ADMIN_EMAIL` \|
			\| `--auth-registration-mode` \| `open` \| Registration mode: `open`, `approval`, or `invite` \| `$LOCALAI_REGISTRATION_MODE` \|
			\| `--disable-local-auth` \| `false` \| Disable local email/password registration and login (for OAuth/OIDC-only setups) \| `$LOCALAI_DISABLE_LOCAL_AUTH` \|

			`See [Authentication & Authorization]({{%relref "features/authentication" %}}) for full documentation.`

chore(docs): improve documentation and split into sections bigger topics (#7292) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-11-17 18:39:21 +01:00			`## P2P Flags`

			`\| Parameter \| Default \| Description \| Environment Variable \|`
			`\|-----------\|---------\|-------------\|----------------------\|`
			\| `--p2p` \| `false` \| Enable P2P mode \| `$LOCALAI_P2P`, `$P2P` \|
			\| `--p2p-dht-interval` \| `360` \| Interval for DHT refresh (used during token generation) \| `$LOCALAI_P2P_DHT_INTERVAL`, `$P2P_DHT_INTERVAL` \|
			\| `--p2p-otp-interval` \| `9000` \| Interval for OTP refresh (used during token generation) \| `$LOCALAI_P2P_OTP_INTERVAL`, `$P2P_OTP_INTERVAL` \|
			\| `--p2ptoken` \| \| Token for P2P mode (optional) \| `$LOCALAI_P2P_TOKEN`, `$P2P_TOKEN`, `$TOKEN` \|
			\| `--p2p-network-id` \| \| Network ID for P2P mode, can be set arbitrarily by the user for grouping a set of instances \| `$LOCALAI_P2P_NETWORK_ID`, `$P2P_NETWORK_ID` \|
			\| `--federated` \| `false` \| Enable federated instance \| `$LOCALAI_FEDERATED`, `$FEDERATED` \|

			`## Other Commands`

			LocalAI supports several subcommands beyond `run`:

			- `local-ai models` - Manage LocalAI models and definitions
			- `local-ai backends` - Manage LocalAI backends and definitions
			- `local-ai tts` - Convert text to speech
			- `local-ai sound-generation` - Generate audio files from text or audio
			- `local-ai transcript` - Convert audio to text
			- `local-ai worker` - Run workers to distribute workload (llama.cpp-only)
			- `local-ai util` - Utility commands
			- `local-ai explorer` - Run P2P explorer
			- `local-ai federated` - Run LocalAI in federated mode

			Use `local-ai <command> --help` for more information on each command.

			`## Examples`

			`### Basic Usage`

			```bash
			`./local-ai run`

			`./local-ai run --models-path /path/to/models --address :9090`

			`./local-ai run --f16`
			```

			`### Environment Variables`

			```bash
			`export LOCALAI_MODELS_PATH=/path/to/models`
			`export LOCALAI_ADDRESS=:9090`
			`export LOCALAI_F16=true`
			`./local-ai run`
			```

			`### Advanced Configuration`

			```bash
			`./local-ai run \`
			`--models model1.yaml model2.yaml \`
			`--enable-watchdog-idle \`
			`--watchdog-idle-timeout=10m \`
			`--p2p \`
			`--federated`
			```

			`## Related Documentation`

feat: docs revamp (#7313) * docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Small enhancements Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Enhancements * Default to zen-dark Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-11-19 22:21:20 +01:00			`- See [Advanced Usage]({{%relref "advanced/advanced-usage" %}}) for configuration examples`
			`- See [VRAM and Memory Management]({{%relref "advanced/vram-management" %}}) for memory management options`
chore(docs): improve documentation and split into sections bigger topics (#7292) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-11-17 18:39:21 +01:00