# Backend builds are resource-heavy (large compiles, model downloads); force
# them to run serially even under `make -j` to avoid exhausting the builder.
.NOTPARALLEL: backends/diffusers backends/llama-cpp backends/outetts backends/piper backends/stablediffusion-ggml backends/whisper backends/faster-whisper backends/silero-vad backends/local-store backends/huggingface backends/rfdetr backends/kitten-tts backends/kokoro backends/chatterbox backends/llama-cpp-darwin backends/neutts build-darwin-python-backend build-darwin-go-backend backends/mlx backends/diffuser-darwin backends/mlx-vlm backends/mlx-audio backends/mlx-distributed backends/stablediffusion-ggml-darwin backends/vllm backends/vllm-omni backends/moonshine backends/pocket-tts backends/qwen-tts backends/faster-qwen3-tts backends/qwen-asr backends/nemo backends/voxcpm backends/whisperx backends/ace-step backends/acestep-cpp backends/fish-speech backends/voxtral backends/opus backends/trl backends/llama-cpp-quantization
# --- Tooling ---
GOCMD=go
GOTEST=$(GOCMD) test
GOVET=$(GOCMD) vet
BINARY_NAME=local-ai
LAUNCHER_BINARY_NAME=local-ai-launcher

# --- Base image / CUDA knobs (overridable from the command line) ---
UBUNTU_VERSION?=2404
UBUNTU_CODENAME?=noble

GORELEASER?=

# Exported so recursive $(MAKE) invocations and docker builds see them.
export BUILD_TYPE?=
export CUDA_MAJOR_VERSION?=13
export CUDA_MINOR_VERSION?=0

GO_TAGS?=
BUILD_ID?=
NATIVE?=false

# --- Test settings ---
TEST_DIR=/tmp/test
TEST_FLAKES?=5
# Make's own $RANDOM is not a thing; delegate to bash ($$ escapes the shell var).
RANDOM := $(shell bash -c 'echo $$RANDOM')

VERSION?=$(shell git describe --always --tags || echo "dev" )
# go tool nm ./local-ai | grep Commit
LD_FLAGS?=-s -w
# `override` so version/commit stamping survives a command-line LD_FLAGS=...
override LD_FLAGS += -X "github.com/mudler/LocalAI/internal.Version=$(VERSION)"
override LD_FLAGS += -X "github.com/mudler/LocalAI/internal.Commit=$(shell git rev-parse HEAD)"

OPTIONAL_TARGETS?=

# --- Host detection / terminal colors ---
export OS := $(shell uname -s)
ARCH := $(shell uname -m)
GREEN  := $(shell tput -Txterm setaf 2)
YELLOW := $(shell tput -Txterm setaf 3)
WHITE  := $(shell tput -Txterm setaf 7)
CYAN   := $(shell tput -Txterm setaf 6)
RESET  := $(shell tput -Txterm sgr0)

# Default Docker bridge IP
E2E_BRIDGE_IP?=172.17.0.1
ifndef UNAME_S
UNAME_S := $(shell uname -s)
endif

# On macOS, auto-pick the first available codesigning identity unless the
# caller provided one (used by the osx-signed target).
ifeq ($(OS),Darwin)
    ifeq ($(OSX_SIGNING_IDENTITY),)
        OSX_SIGNING_IDENTITY := $(shell security find-identity -v -p codesigning | grep '"' | head -n 1 | sed -E 's/.*"(.*)"/\1/')
    endif
endif

# check if goreleaser exists; fall back to the bootstrap script if missing
ifeq (, $(shell which goreleaser))
GORELEASER=curl -sfL https://goreleaser.com/static/run | bash -s --
else
GORELEASER=$(shell which goreleaser)
endif

TEST_PATHS?=./api/... ./pkg/... ./core/...

.PHONY: all test build vendor

all: help
## GENERIC
rebuild: ## Rebuilds the project
	$(GOCMD) clean -cache
	$(MAKE) build

clean: ## Remove build related file
	$(GOCMD) clean -cache
	rm -f prepare
	rm -rf $(BINARY_NAME)
	rm -rf release/
	$(MAKE) protogen-clean
	rmdir pkg/grpc/proto || true

clean-tests:
	rm -rf test-models
	rm -rf test-dir

## Install Go tools
install-go-tools:
	go install google.golang.org/grpc/cmd/protoc-gen-go-grpc@1958fcbe2ca8bd93af633f11e97d44e567e945af
	go install google.golang.org/protobuf/cmd/protoc-gen-go@v1.34.2
## React UI:
# Conditional is evaluated at parse time: skip the npm build when a dist
# directory already exists (e.g. shipped in a release tarball).
react-ui:
ifneq ($(wildcard core/http/react-ui/dist),)
	@echo "react-ui dist already exists, skipping build"
else
	cd core/http/react-ui && npm install && npm run build
endif

# Same build, but inside a bun container (no local node toolchain needed).
react-ui-docker:
	docker run --entrypoint /bin/bash -v $(CURDIR):/app:z oven/bun:1 \
		-c "cd /app/core/http/react-ui && bun install && bun run build"

core/http/react-ui/dist: react-ui
## Build:
build: protogen-go generate install-go-tools core/http/react-ui/dist ## Build the project
	$(info ${GREEN}I local-ai build info:${RESET})
	$(info ${GREEN}I BUILD_TYPE: ${YELLOW}$(BUILD_TYPE)${RESET})
	$(info ${GREEN}I GO_TAGS: ${YELLOW}$(GO_TAGS)${RESET})
	$(info ${GREEN}I LD_FLAGS: ${YELLOW}$(LD_FLAGS)${RESET})
	$(info ${GREEN}I UPX: ${YELLOW}$(UPX)${RESET})
	rm -rf $(BINARY_NAME) || true
	CGO_LDFLAGS="$(CGO_LDFLAGS)" $(GOCMD) build -ldflags "$(LD_FLAGS)" -tags "$(GO_TAGS)" -o $(BINARY_NAME) ./cmd/local-ai

build-launcher: ## Build the launcher application
	$(info ${GREEN}I local-ai launcher build info:${RESET})
	$(info ${GREEN}I BUILD_TYPE: ${YELLOW}$(BUILD_TYPE)${RESET})
	$(info ${GREEN}I GO_TAGS: ${YELLOW}$(GO_TAGS)${RESET})
	$(info ${GREEN}I LD_FLAGS: ${YELLOW}$(LD_FLAGS)${RESET})
	rm -rf $(LAUNCHER_BINARY_NAME) || true
	CGO_LDFLAGS="$(CGO_LDFLAGS)" $(GOCMD) build -ldflags "$(LD_FLAGS)" -tags "$(GO_TAGS)" -o $(LAUNCHER_BINARY_NAME) ./cmd/launcher

build-all: build build-launcher ## Build both server and launcher

build-dev: ## Run LocalAI in dev mode with live reload
	@command -v air >/dev/null 2>&1 || go install github.com/air-verse/air@latest
	air -c .air.toml

dev-dist:
	$(GORELEASER) build --snapshot --clean

dist:
	$(GORELEASER) build --clean

osx-signed: build
	codesign --deep --force --sign "$(OSX_SIGNING_IDENTITY)" --entitlements "./Entitlements.plist" "./$(BINARY_NAME)"

## Run
run: ## run local-ai
	CGO_LDFLAGS="$(CGO_LDFLAGS)" $(GOCMD) run ./
# Downloads the small fixture models used by the Go test suite. This is a real
# file target: once test-models/testmodel.ggml exists the downloads are skipped.
# (An interleaved git commit message that had been pasted into the middle of
# this recipe was removed — it was VCS residue, not Makefile source.)
test-models/testmodel.ggml:
	mkdir -p test-models
	mkdir -p test-dir
	wget -q https://huggingface.co/mradermacher/gpt2-alpaca-gpt4-GGUF/resolve/main/gpt2-alpaca-gpt4.Q4_K_M.gguf -O test-models/testmodel.ggml
	wget -q https://huggingface.co/ggerganov/whisper.cpp/resolve/main/ggml-base.en.bin -O test-models/whisper-en
	wget -q https://huggingface.co/mudler/all-MiniLM-L6-v2/resolve/main/ggml-model-q4_0.bin -O test-models/bert
	wget -q https://cdn.openai.com/whisper/draft-20220913a/micro-machines.wav -O test-dir/audio.wav
	cp tests/models_fixtures/* test-models

# Refresh model config fixtures (cheap; safe to re-run before every suite).
prepare-test: protogen-go
	cp tests/models_fixtures/* test-models
########################################################
## Tests
########################################################

## Test targets
# NOTE(review): the `export GO_TAGS=...` recipe line runs in its own shell and
# therefore does not affect later lines or the recursive $(MAKE) — kept as-is
# to preserve behavior, but it is effectively a no-op; confirm intent upstream.
test: test-models/testmodel.ggml protogen-go
	@echo 'Running tests'
	export GO_TAGS="debug"
	$(MAKE) prepare-test
	OPUS_SHIM_LIBRARY=$(abspath ./pkg/opus/shim/libopusshim.so) \
	HUGGINGFACE_GRPC=$(abspath ./)/backend/python/transformers/run.sh TEST_DIR=$(abspath ./)/test-dir/ FIXTURES=$(abspath ./)/tests/fixtures CONFIG_FILE=$(abspath ./)/test-models/config.yaml MODELS_PATH=$(abspath ./)/test-models BACKENDS_PATH=$(abspath ./)/backends \
	$(GOCMD) run github.com/onsi/ginkgo/v2/ginkgo --label-filter="!llama-gguf" --flake-attempts $(TEST_FLAKES) --fail-fast -v -r $(TEST_PATHS)
	$(MAKE) test-llama-gguf
	$(MAKE) test-tts
	$(MAKE) test-stablediffusion
########################################################
## E2E AIO tests (uses standard image with pre-configured models)
########################################################
docker-build-e2e:
	docker build \
		--build-arg MAKEFLAGS="--jobs=5 --output-sync=target" \
		--build-arg BASE_IMAGE=$(BASE_IMAGE) \
		--build-arg IMAGE_TYPE=$(IMAGE_TYPE) \
		--build-arg BUILD_TYPE=$(BUILD_TYPE) \
		--build-arg CUDA_MAJOR_VERSION=$(CUDA_MAJOR_VERSION) \
		--build-arg CUDA_MINOR_VERSION=$(CUDA_MINOR_VERSION) \
		--build-arg UBUNTU_VERSION=$(UBUNTU_VERSION) \
		--build-arg UBUNTU_CODENAME=$(UBUNTU_CODENAME) \
		--build-arg GO_TAGS="$(GO_TAGS)" \
		-t local-ai:tests -f Dockerfile .

# Thin wrapper: points the runner at the local backends/models dirs, then
# delegates to run-e2e-aio (env assignments + $(MAKE) are one shell command).
e2e-aio:
	LOCALAI_BACKEND_DIR=$(abspath ./backends) \
	LOCALAI_MODELS_DIR=$(abspath ./tests/e2e-aio/models) \
	LOCALAI_IMAGE_TAG=tests \
	LOCALAI_IMAGE=local-ai \
	$(MAKE) run-e2e-aio

run-e2e-aio: protogen-go
	@echo 'Running e2e AIO tests'
	$(GOCMD) run github.com/onsi/ginkgo/v2/ginkgo --flake-attempts $(TEST_FLAKES) -v -r ./tests/e2e-aio
########################################################
## E2E tests
########################################################

prepare-e2e:
	docker build \
		--build-arg IMAGE_TYPE=core \
		--build-arg BUILD_TYPE=$(BUILD_TYPE) \
		--build-arg BASE_IMAGE=$(BASE_IMAGE) \
		--build-arg CUDA_MAJOR_VERSION=$(CUDA_MAJOR_VERSION) \
		--build-arg CUDA_MINOR_VERSION=$(CUDA_MINOR_VERSION) \
		--build-arg UBUNTU_VERSION=$(UBUNTU_VERSION) \
		--build-arg UBUNTU_CODENAME=$(UBUNTU_CODENAME) \
		--build-arg GO_TAGS="$(GO_TAGS)" \
		--build-arg MAKEFLAGS="$(DOCKER_MAKEFLAGS)" \
		-t localai-tests .

# $(RANDOM) is expanded by make (set once at parse time via bash) so repeated
# runs in one make invocation share the container name.
run-e2e-image:
	docker run -p 5390:8080 -e MODELS_PATH=/models -e THREADS=1 -e DEBUG=true -d --rm -v $(TEST_DIR):/models --name e2e-tests-$(RANDOM) localai-tests

test-e2e: build-mock-backend prepare-e2e run-e2e-image
	@echo 'Running e2e tests'
	BUILD_TYPE=$(BUILD_TYPE) \
	LOCALAI_API=http://$(E2E_BRIDGE_IP):5390 \
	$(GOCMD) run github.com/onsi/ginkgo/v2/ginkgo --flake-attempts $(TEST_FLAKES) -v -r ./tests/e2e
	$(MAKE) clean-mock-backend
	$(MAKE) teardown-e2e
	docker rmi localai-tests

teardown-e2e:
	rm -rf $(TEST_DIR) || true
	docker stop $$(docker ps -q --filter ancestor=localai-tests)
########################################################
## Integration and unit tests
########################################################

test-llama-gguf: prepare-test
	TEST_DIR=$(abspath ./)/test-dir/ FIXTURES=$(abspath ./)/tests/fixtures CONFIG_FILE=$(abspath ./)/test-models/config.yaml MODELS_PATH=$(abspath ./)/test-models BACKENDS_PATH=$(abspath ./)/backends \
	$(GOCMD) run github.com/onsi/ginkgo/v2/ginkgo --label-filter="llama-gguf" --flake-attempts $(TEST_FLAKES) -v -r $(TEST_PATHS)

test-tts: prepare-test
	TEST_DIR=$(abspath ./)/test-dir/ FIXTURES=$(abspath ./)/tests/fixtures CONFIG_FILE=$(abspath ./)/test-models/config.yaml MODELS_PATH=$(abspath ./)/test-models BACKENDS_PATH=$(abspath ./)/backends \
	$(GOCMD) run github.com/onsi/ginkgo/v2/ginkgo --label-filter="tts" --flake-attempts $(TEST_FLAKES) -v -r $(TEST_PATHS)

test-stablediffusion: prepare-test
	TEST_DIR=$(abspath ./)/test-dir/ FIXTURES=$(abspath ./)/tests/fixtures CONFIG_FILE=$(abspath ./)/test-models/config.yaml MODELS_PATH=$(abspath ./)/test-models BACKENDS_PATH=$(abspath ./)/backends \
	$(GOCMD) run github.com/onsi/ginkgo/v2/ginkgo --label-filter="stablediffusion" --flake-attempts $(TEST_FLAKES) -v -r $(TEST_PATHS)

test-stores:
	$(GOCMD) run github.com/onsi/ginkgo/v2/ginkgo --label-filter="stores" --flake-attempts $(TEST_FLAKES) -v -r tests/integration
test-opus:
	@echo 'Running opus backend tests'
	$(MAKE) -C backend/go/opus libopusshim.so
	$(GOCMD) run github.com/onsi/ginkgo/v2/ginkgo --flake-attempts $(TEST_FLAKES) -v -r ./backend/go/opus/...

test-opus-docker:
	@echo 'Running opus backend tests in Docker'
	docker build --target builder \
		--build-arg BUILD_TYPE=$(or $(BUILD_TYPE),) \
		--build-arg BASE_IMAGE=$(or $(BASE_IMAGE),ubuntu:24.04) \
		--build-arg BACKEND=opus \
		-t localai-opus-test -f backend/Dockerfile.golang .
	docker run --rm localai-opus-test \
		bash -c 'cd /LocalAI && go run github.com/onsi/ginkgo/v2/ginkgo --flake-attempts $(TEST_FLAKES) -v -r ./backend/go/opus/...'

test-realtime: build-mock-backend
	@echo 'Running realtime e2e tests (mock backend)'
	$(GOCMD) run github.com/onsi/ginkgo/v2/ginkgo --label-filter="Realtime && !real-models" --flake-attempts $(TEST_FLAKES) -v -r ./tests/e2e

# Real-model realtime tests. Set REALTIME_TEST_MODEL to use your own pipeline,
# or leave unset to auto-build one from the component env vars below.
REALTIME_VAD ?= silero-vad-ggml
REALTIME_STT ?= whisper-1
REALTIME_LLM ?= qwen3-0.6b
REALTIME_TTS ?= tts-1
REALTIME_BACKENDS_PATH ?= $(abspath ./)/backends

test-realtime-models: build-mock-backend
	@echo 'Running realtime e2e tests (real models)'
	REALTIME_TEST_MODEL=$${REALTIME_TEST_MODEL:-realtime-test-pipeline} \
	REALTIME_VAD=$(REALTIME_VAD) \
	REALTIME_STT=$(REALTIME_STT) \
	REALTIME_LLM=$(REALTIME_LLM) \
	REALTIME_TTS=$(REALTIME_TTS) \
	REALTIME_BACKENDS_PATH=$(REALTIME_BACKENDS_PATH) \
	$(GOCMD) run github.com/onsi/ginkgo/v2/ginkgo --label-filter="Realtime" --flake-attempts $(TEST_FLAKES) -v -r ./tests/e2e

# --- Container-based real-model testing ---
REALTIME_BACKEND_NAMES ?= silero-vad whisper llama-cpp kokoro
REALTIME_MODELS_DIR ?= $(abspath ./models)
REALTIME_BACKENDS_DIR ?= $(abspath ./local-backends)
REALTIME_DOCKER_FLAGS ?= --gpus all

local-backends:
	mkdir -p local-backends

# Pattern rule: build the backend image, then unpack its filesystem into
# local-backends/<name> ($$CID is a shell variable; $* is the pattern stem).
extract-backend-%: docker-build-% local-backends
	@echo "Extracting backend $*..."
	@CID=$$(docker create local-ai-backend:$*) && \
	rm -rf local-backends/$* && mkdir -p local-backends/$* && \
	docker cp $$CID:/ - | tar -xf - -C local-backends/$* && \
	docker rm $$CID > /dev/null

extract-realtime-backends: $(addprefix extract-backend-,$(REALTIME_BACKEND_NAMES))

test-realtime-models-docker: build-mock-backend
	docker build --target build-requirements \
		--build-arg BUILD_TYPE=$(or $(BUILD_TYPE),cublas) \
		--build-arg CUDA_MAJOR_VERSION=$(or $(CUDA_MAJOR_VERSION),13) \
		--build-arg CUDA_MINOR_VERSION=$(or $(CUDA_MINOR_VERSION),0) \
		-t localai-test-runner .
	docker run --rm \
		$(REALTIME_DOCKER_FLAGS) \
		-v $(abspath ./):/build \
		-v $(REALTIME_MODELS_DIR):/models:ro \
		-v $(REALTIME_BACKENDS_DIR):/backends \
		-v localai-go-cache:/root/go/pkg/mod \
		-v localai-go-build-cache:/root/.cache/go-build \
		-e REALTIME_TEST_MODEL=$${REALTIME_TEST_MODEL:-realtime-test-pipeline} \
		-e REALTIME_VAD=$(REALTIME_VAD) \
		-e REALTIME_STT=$(REALTIME_STT) \
		-e REALTIME_LLM=$(REALTIME_LLM) \
		-e REALTIME_TTS=$(REALTIME_TTS) \
		-e REALTIME_BACKENDS_PATH=/backends \
		-e REALTIME_MODELS_PATH=/models \
		-w /build \
		localai-test-runner \
		bash -c 'git config --global --add safe.directory /build && \
			make protogen-go && make build-mock-backend && \
			go run github.com/onsi/ginkgo/v2/ginkgo --label-filter="Realtime" --flake-attempts $(TEST_FLAKES) -v -r ./tests/e2e'

test-container:
	docker build --target requirements -t local-ai-test-container .
	docker run -ti --rm --entrypoint /bin/bash -ti -v $(abspath ./):/build local-ai-test-container
########################################################
## Help
########################################################

## Help:
# Self-documenting help: lines with a trailing `## desc` become target docs,
# lines starting with `## ` become section headers.
help: ## Show this help.
	@echo ''
	@echo 'Usage:'
	@echo '  ${YELLOW}make${RESET} ${GREEN}<target>${RESET}'
	@echo ''
	@echo 'Targets:'
	@awk 'BEGIN {FS = ":.*?## "} { \
		if (/^[a-zA-Z_-]+:.*?##.*$$/) {printf "    ${YELLOW}%-20s${GREEN}%s${RESET}\n", $$1, $$2} \
		else if (/^## .*$$/) {printf "  ${CYAN}%s${RESET}\n", substr($$1,4)} \
		}' $(MAKEFILE_LIST)
2023-07-15 01:19:43 +02:00
2025-07-18 13:24:12 +02:00
########################################################
## Backends
########################################################
2024-04-13 02:37:32 -05:00
.PHONY : protogen
2025-07-24 14:49:50 +02:00
protogen : protogen -go
2024-04-13 02:37:32 -05:00
2025-07-22 16:31:04 +02:00
# Download a pinned protoc (v31.1) binary for the host OS/arch into $(CURDIR).
# File target: produces ./protoc, so it only re-downloads when missing/stale.
protoc:
	@OS_NAME=$$(uname -s | tr '[:upper:]' '[:lower:]'); \
	ARCH_NAME=$$(uname -m); \
	if [ "$$OS_NAME" = "darwin" ]; then \
		if [ "$$ARCH_NAME" = "arm64" ]; then \
			FILE=protoc-31.1-osx-aarch_64.zip; \
		elif [ "$$ARCH_NAME" = "x86_64" ]; then \
			FILE=protoc-31.1-osx-x86_64.zip; \
		else \
			echo "Unsupported macOS architecture: $$ARCH_NAME"; exit 1; \
		fi; \
	elif [ "$$OS_NAME" = "linux" ]; then \
		if [ "$$ARCH_NAME" = "x86_64" ]; then \
			FILE=protoc-31.1-linux-x86_64.zip; \
		elif [ "$$ARCH_NAME" = "aarch64" ] || [ "$$ARCH_NAME" = "arm64" ]; then \
			FILE=protoc-31.1-linux-aarch_64.zip; \
		elif [ "$$ARCH_NAME" = "ppc64le" ]; then \
			FILE=protoc-31.1-linux-ppcle_64.zip; \
		elif [ "$$ARCH_NAME" = "s390x" ]; then \
			FILE=protoc-31.1-linux-s390_64.zip; \
		elif [ "$$ARCH_NAME" = "i386" ] || [ "$$ARCH_NAME" = "x86" ]; then \
			FILE=protoc-31.1-linux-x86_32.zip; \
		else \
			echo "Unsupported Linux architecture: $$ARCH_NAME"; exit 1; \
		fi; \
	else \
		echo "Unsupported OS: $$OS_NAME"; exit 1; \
	fi; \
	URL=https://github.com/protocolbuffers/protobuf/releases/download/v31.1/$$FILE; \
	curl -fL $$URL -o protoc.zip && \
	unzip -j -d $(CURDIR) protoc.zip bin/protoc && rm protoc.zip
2024-04-13 02:37:32 -05:00
# Generate Go gRPC bindings from backend/backend.proto into pkg/grpc/proto
# using the locally-downloaded ./protoc and the installed Go protoc plugins.
.PHONY: protogen-go
protogen-go: protoc install-go-tools
	mkdir -p pkg/grpc/proto
	./protoc --experimental_allow_proto3_optional -Ibackend/ --go_out=pkg/grpc/proto/ --go_opt=paths=source_relative --go-grpc_out=pkg/grpc/proto/ --go-grpc_opt=paths=source_relative \
		backend/backend.proto
2023-07-15 01:19:43 +02:00
2026-03-22 00:57:15 +01:00
# File target: fetched via `go generate` only when the file is missing
# (make skips the recipe if the target already exists).
core/config/inference_defaults.json: ## Fetch inference defaults from unsloth (only if missing)
	$(GOCMD) generate ./core/config/...

.PHONY: generate
generate: core/config/inference_defaults.json ## Ensure inference defaults exist

# Unconditional variant: always re-runs go generate, overwriting the file.
.PHONY: generate-force
generate-force: ## Re-fetch inference defaults from unsloth (always)
	$(GOCMD) generate ./core/config/...
2024-04-13 02:37:32 -05:00
# Remove generated Go protobuf bindings and downloaded tool binaries.
.PHONY: protogen-go-clean
protogen-go-clean:
	$(RM) pkg/grpc/proto/backend.pb.go pkg/grpc/proto/backend_grpc.pb.go
	$(RM) bin/*
# Prepare every Python extra backend for testing: regenerate the Python
# protobuf stubs, then run each backend's default make target (env setup).
.PHONY: prepare-test-extra
prepare-test-extra: protogen-python
	$(MAKE) -C backend/python/transformers
	$(MAKE) -C backend/python/outetts
	$(MAKE) -C backend/python/diffusers
	$(MAKE) -C backend/python/chatterbox
	$(MAKE) -C backend/python/vllm
	$(MAKE) -C backend/python/vllm-omni
	$(MAKE) -C backend/python/vibevoice
	$(MAKE) -C backend/python/moonshine
	$(MAKE) -C backend/python/pocket-tts
	$(MAKE) -C backend/python/qwen-tts
	$(MAKE) -C backend/python/fish-speech
	$(MAKE) -C backend/python/faster-qwen3-tts
	$(MAKE) -C backend/python/qwen-asr
	$(MAKE) -C backend/python/nemo
	$(MAKE) -C backend/python/voxcpm
	$(MAKE) -C backend/python/whisperx
	$(MAKE) -C backend/python/ace-step
	$(MAKE) -C backend/python/trl
2023-12-08 15:45:04 +01:00
# Run the test suite of every Python extra backend (list must stay in sync
# with prepare-test-extra above).
.PHONY: test-extra
test-extra: prepare-test-extra
	$(MAKE) -C backend/python/transformers test
	$(MAKE) -C backend/python/outetts test
	$(MAKE) -C backend/python/diffusers test
	$(MAKE) -C backend/python/chatterbox test
	$(MAKE) -C backend/python/vllm test
	$(MAKE) -C backend/python/vllm-omni test
	$(MAKE) -C backend/python/vibevoice test
	$(MAKE) -C backend/python/moonshine test
	$(MAKE) -C backend/python/pocket-tts test
	$(MAKE) -C backend/python/qwen-tts test
	$(MAKE) -C backend/python/fish-speech test
	$(MAKE) -C backend/python/faster-qwen3-tts test
	$(MAKE) -C backend/python/qwen-asr test
	$(MAKE) -C backend/python/nemo test
	$(MAKE) -C backend/python/voxcpm test
	$(MAKE) -C backend/python/whisperx test
	$(MAKE) -C backend/python/ace-step test
	$(MAKE) -C backend/python/trl test
feat(conda): conda environments (#1144)
* feat(autogptq): add a separate conda environment for autogptq (#1137)
**Description**
This PR related to #1117
**Notes for Reviewers**
Here we lock down the version of the dependencies. Make sure it can be
used all the time without failed if the version of dependencies were
upgraded.
I change the order of importing packages according to the pylint, and no
change the logic of code. It should be ok.
I will do more investigate on writing some test cases for every backend.
I can run the service in my environment, but there is not exist a way to
test it. So, I am not confident on it.
Add a README.md in the `grpc` root. This is the common commands for
creating `conda` environment. And it can be used to the reference file
for creating extral gRPC backend document.
Signed-off-by: GitHub <noreply@github.com>
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* [Extra backend] Add seperate environment for ttsbark (#1141)
**Description**
This PR relates to #1117
**Notes for Reviewers**
Same to the latest PR:
* The code is also changed, but only the order of the import package
parts. And some code comments are also added.
* Add a configuration of the `conda` environment
* Add a simple test case for testing if the service can be startup in
current `conda` environment. It is succeed in VSCode, but the it is not
out of box on terminal. So, it is hard to say the test case really
useful.
**[Signed
commits](../CONTRIBUTING.md#signing-off-on-commits-developer-certificate-of-origin)**
- [x] Yes, I signed my commits.
<!--
Thank you for contributing to LocalAI!
Contributing Conventions
-------------------------
The draft above helps to give a quick overview of your PR.
Remember to remove this comment and to at least:
1. Include descriptive PR titles with [<component-name>] prepended. We
use [conventional
commits](https://www.conventionalcommits.org/en/v1.0.0/).
2. Build and test your changes before submitting a PR (`make build`).
3. Sign your commits
4. **Tag maintainer:** for a quicker response, tag the relevant
maintainer (see below).
5. **X/Twitter handle:** we announce bigger features on X/Twitter. If
your PR gets announced, and you'd like a mention, we'll gladly shout you
out!
By following the community's contribution conventions upfront, the
review process will
be accelerated and your PR merged more quickly.
If no one reviews your PR within a few days, please @-mention @mudler.
-->
Signed-off-by: GitHub <noreply@github.com>
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* feat(conda): add make target and entrypoints for the dockerfile
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* feat(conda): Add seperate conda env for diffusers (#1145)
**Description**
This PR relates to #1117
**Notes for Reviewers**
* Add `conda` env `diffusers.yml`
* Add Makefile to create it automatically
* Add `run.sh` to support running as a extra backend
* Also adding it to the main Dockerfile
* Add make command in the root Makefile
* Testing the server, it can start up under the env
Signed-off-by: GitHub <noreply@github.com>
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* feat(conda):Add seperate env for vllm (#1148)
**Description**
This PR is related to #1117
**Notes for Reviewers**
* The gRPC server can be started as normal
* The test case can be triggered in VSCode
* Same to other this kind of PRs, add `vllm.yml` Makefile and add
`run.sh` to the main Dockerfile, and command to the main Makefile
**[Signed
commits](../CONTRIBUTING.md#signing-off-on-commits-developer-certificate-of-origin)**
- [x] Yes, I signed my commits.
<!--
Thank you for contributing to LocalAI!
Contributing Conventions
-------------------------
The draft above helps to give a quick overview of your PR.
Remember to remove this comment and to at least:
1. Include descriptive PR titles with [<component-name>] prepended. We
use [conventional
commits](https://www.conventionalcommits.org/en/v1.0.0/).
2. Build and test your changes before submitting a PR (`make build`).
3. Sign your commits
4. **Tag maintainer:** for a quicker response, tag the relevant
maintainer (see below).
5. **X/Twitter handle:** we announce bigger features on X/Twitter. If
your PR gets announced, and you'd like a mention, we'll gladly shout you
out!
By following the community's contribution conventions upfront, the
review process will
be accelerated and your PR merged more quickly.
If no one reviews your PR within a few days, please @-mention @mudler.
-->
Signed-off-by: GitHub <noreply@github.com>
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* feat(conda):Add seperate env for huggingface (#1146)
**Description**
This PR is related to #1117
**Notes for Reviewers**
* Add conda env `huggingface.yml`
* Change the import order, and also remove the no-used packages
* Add `run.sh` and `make command` to the main Dockerfile and Makefile
* Add test cases for it. It can be triggered and succeed under VSCode
Python extension but it is hang by using `python -m unites
test_huggingface.py` in the terminal
```
Running tests (unittest): /workspaces/LocalAI/extra/grpc/huggingface
Running tests: /workspaces/LocalAI/extra/grpc/huggingface/test_huggingface.py::TestBackendServicer::test_embedding
/workspaces/LocalAI/extra/grpc/huggingface/test_huggingface.py::TestBackendServicer::test_load_model
/workspaces/LocalAI/extra/grpc/huggingface/test_huggingface.py::TestBackendServicer::test_server_startup
./test_huggingface.py::TestBackendServicer::test_embedding Passed
./test_huggingface.py::TestBackendServicer::test_load_model Passed
./test_huggingface.py::TestBackendServicer::test_server_startup Passed
Total number of tests expected to run: 3
Total number of tests run: 3
Total number of tests passed: 3
Total number of tests failed: 0
Total number of tests failed with errors: 0
Total number of tests skipped: 0
Finished running tests!
```
**[Signed
commits](../CONTRIBUTING.md#signing-off-on-commits-developer-certificate-of-origin)**
- [x] Yes, I signed my commits.
<!--
Thank you for contributing to LocalAI!
Contributing Conventions
-------------------------
The draft above helps to give a quick overview of your PR.
Remember to remove this comment and to at least:
1. Include descriptive PR titles with [<component-name>] prepended. We
use [conventional
commits](https://www.conventionalcommits.org/en/v1.0.0/).
2. Build and test your changes before submitting a PR (`make build`).
3. Sign your commits
4. **Tag maintainer:** for a quicker response, tag the relevant
maintainer (see below).
5. **X/Twitter handle:** we announce bigger features on X/Twitter. If
your PR gets announced, and you'd like a mention, we'll gladly shout you
out!
By following the community's contribution conventions upfront, the
review process will
be accelerated and your PR merged more quickly.
If no one reviews your PR within a few days, please @-mention @mudler.
-->
Signed-off-by: GitHub <noreply@github.com>
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* feat(conda): Add the seperate conda env for VALL-E X (#1147)
**Description**
This PR is related to #1117
**Notes for Reviewers**
* The gRPC server cannot start up
```
(ttsvalle) @Aisuko ➜ /workspaces/LocalAI (feat/vall-e-x) $ /opt/conda/envs/ttsvalle/bin/python /workspaces/LocalAI/extra/grpc/vall-e-x/ttsvalle.py
Traceback (most recent call last):
File "/workspaces/LocalAI/extra/grpc/vall-e-x/ttsvalle.py", line 14, in <module>
from utils.generation import SAMPLE_RATE, generate_audio, preload_models
ModuleNotFoundError: No module named 'utils'
```
The installation steps follow
https://github.com/Plachtaa/VALL-E-X#-installation below:
* Under the `ttsvalle` conda env
```
git clone https://github.com/Plachtaa/VALL-E-X.git
cd VALL-E-X
pip install -r requirements.txt
```
**[Signed
commits](../CONTRIBUTING.md#signing-off-on-commits-developer-certificate-of-origin)**
- [x] Yes, I signed my commits.
<!--
Thank you for contributing to LocalAI!
Contributing Conventions
-------------------------
The draft above helps to give a quick overview of your PR.
Remember to remove this comment and to at least:
1. Include descriptive PR titles with [<component-name>] prepended. We
use [conventional
commits](https://www.conventionalcommits.org/en/v1.0.0/).
2. Build and test your changes before submitting a PR (`make build`).
3. Sign your commits
4. **Tag maintainer:** for a quicker response, tag the relevant
maintainer (see below).
5. **X/Twitter handle:** we announce bigger features on X/Twitter. If
your PR gets announced, and you'd like a mention, we'll gladly shout you
out!
By following the community's contribution conventions upfront, the
review process will
be accelerated and your PR merged more quickly.
If no one reviews your PR within a few days, please @-mention @mudler.
-->
Signed-off-by: GitHub <noreply@github.com>
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* fix: set image type
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* feat(conda):Add seperate conda env for exllama (#1149)
Add seperate env for exllama
Signed-off-by: Aisuko <urakiny@gmail.com>
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* Setup conda
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* Set image_type arg
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* ci: prepare only conda env in tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* Dockerfile: comment manual pip calls
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* conda: add conda to PATH
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* fixes
* add shebang
* Fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* file perms
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* debug
* Install new conda in the worker
* Disable GPU tests for now until the worker is back
* Rename workflows
* debug
* Fixup conda install
* fixup(wrapper): pass args
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
---------
Signed-off-by: GitHub <noreply@github.com>
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
Signed-off-by: Aisuko <urakiny@gmail.com>
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
Co-authored-by: Aisuko <urakiny@gmail.com>
2023-11-04 15:30:32 +01:00
2024-02-08 20:12:51 +01:00
# User-tunable image settings (overridable from the command line / env).
DOCKER_IMAGE ?= local-ai
IMAGE_TYPE ?= core
BASE_IMAGE ?= ubuntu:24.04

# Build the main LocalAI image, forwarding build configuration as build-args.
.PHONY: docker
docker:
	docker build \
		--build-arg BASE_IMAGE=$(BASE_IMAGE) \
		--build-arg IMAGE_TYPE=$(IMAGE_TYPE) \
		--build-arg GO_TAGS="$(GO_TAGS)" \
		--build-arg MAKEFLAGS="$(DOCKER_MAKEFLAGS)" \
		--build-arg BUILD_TYPE=$(BUILD_TYPE) \
		--build-arg CUDA_MAJOR_VERSION=$(CUDA_MAJOR_VERSION) \
		--build-arg CUDA_MINOR_VERSION=$(CUDA_MINOR_VERSION) \
		--build-arg UBUNTU_VERSION=$(UBUNTU_VERSION) \
		--build-arg UBUNTU_CODENAME=$(UBUNTU_CODENAME) \
		-t $(DOCKER_IMAGE) .
2024-05-13 02:37:52 -07:00
2026-01-06 14:26:42 +00:00
# Build the CUDA image tagged "-cuda-12".
# NOTE(review): the tag is hard-coded to -cuda-12, but CUDA_MAJOR_VERSION
# defaults to 13 at the top of this Makefile — callers appear expected to
# override CUDA_MAJOR_VERSION/CUDA_MINOR_VERSION (e.g. CUDA_MAJOR_VERSION=12);
# confirm against CI usage.
.PHONY: docker-cuda12
docker-cuda12:
	docker build \
		--build-arg CUDA_MAJOR_VERSION=$(CUDA_MAJOR_VERSION) \
		--build-arg CUDA_MINOR_VERSION=$(CUDA_MINOR_VERSION) \
		--build-arg BASE_IMAGE=$(BASE_IMAGE) \
		--build-arg IMAGE_TYPE=$(IMAGE_TYPE) \
		--build-arg GO_TAGS="$(GO_TAGS)" \
		--build-arg MAKEFLAGS="$(DOCKER_MAKEFLAGS)" \
		--build-arg BUILD_TYPE=$(BUILD_TYPE) \
		--build-arg UBUNTU_VERSION=$(UBUNTU_VERSION) \
		--build-arg UBUNTU_CODENAME=$(UBUNTU_CODENAME) \
		-t $(DOCKER_IMAGE)-cuda-12 .
2024-06-19 17:50:49 +02:00
2024-02-08 20:12:51 +01:00
# Build the Intel oneAPI variant: BASE_IMAGE and BUILD_TYPE are fixed here
# (oneapi-basekit base, BUILD_TYPE=intel) rather than taken from the env.
.PHONY: docker-image-intel
docker-image-intel:
	docker build \
		--build-arg BASE_IMAGE=intel/oneapi-basekit:2025.3.0-0-devel-ubuntu24.04 \
		--build-arg IMAGE_TYPE=$(IMAGE_TYPE) \
		--build-arg GO_TAGS="$(GO_TAGS)" \
		--build-arg MAKEFLAGS="$(DOCKER_MAKEFLAGS)" \
		--build-arg BUILD_TYPE=intel \
		--build-arg CUDA_MAJOR_VERSION=$(CUDA_MAJOR_VERSION) \
		--build-arg CUDA_MINOR_VERSION=$(CUDA_MINOR_VERSION) \
		--build-arg UBUNTU_VERSION=$(UBUNTU_VERSION) \
		--build-arg UBUNTU_CODENAME=$(UBUNTU_CODENAME) \
		-t $(DOCKER_IMAGE) .
2024-03-29 22:29:33 +01:00
2025-07-18 13:24:12 +02:00
########################################################
## Backends
########################################################
2026-01-04 21:12:50 +01:00
# Pattern rule for standard backends (docker-based):
# build the backend image (docker-build-%), export it to a local OCI tarball
# (docker-save-%), then install it into local-ai from that tarball.
backends/%: docker-build-% docker-save-% build
	./local-ai backends install "ocifile://$(abspath ./backend-images/$*.tar)"
2025-08-22 08:42:29 +02:00
2026-01-04 21:12:50 +01:00
# Darwin-specific backends (keep as explicit targets since they have special build logic)
2025-08-22 08:42:29 +02:00
# Build llama-cpp natively on macOS and install it from the local OCI tarball.
backends/llama-cpp-darwin: build
	bash ./scripts/build/llama-cpp-darwin.sh
	./local-ai backends install "ocifile://$(abspath ./backend-images/llama-cpp.tar)"

# Shared builder for python-based darwin backends; BACKEND env var selects which.
build-darwin-python-backend: build
	bash ./scripts/build/python-darwin.sh

# Shared builder for golang-based darwin backends; BACKEND/BUILD_TYPE select which.
build-darwin-go-backend: build
	bash ./scripts/build/golang-darwin.sh
2025-08-22 08:52:29 +02:00
# macOS-only backends: each drives the shared darwin builder with the right
# BACKEND (and BUILD_TYPE where needed), then installs the resulting tarball.
backends/mlx:
	BACKEND=mlx $(MAKE) build-darwin-python-backend
	./local-ai backends install "ocifile://$(abspath ./backend-images/mlx.tar)"

backends/diffuser-darwin:
	BACKEND=diffusers $(MAKE) build-darwin-python-backend
	./local-ai backends install "ocifile://$(abspath ./backend-images/diffusers.tar)"

backends/mlx-vlm:
	BACKEND=mlx-vlm $(MAKE) build-darwin-python-backend
	./local-ai backends install "ocifile://$(abspath ./backend-images/mlx-vlm.tar)"

backends/mlx-audio:
	BACKEND=mlx-audio $(MAKE) build-darwin-python-backend
	./local-ai backends install "ocifile://$(abspath ./backend-images/mlx-audio.tar)"

backends/mlx-distributed:
	BACKEND=mlx-distributed $(MAKE) build-darwin-python-backend
	./local-ai backends install "ocifile://$(abspath ./backend-images/mlx-distributed.tar)"

backends/stablediffusion-ggml-darwin:
	BACKEND=stablediffusion-ggml BUILD_TYPE=metal $(MAKE) build-darwin-go-backend
	./local-ai backends install "ocifile://$(abspath ./backend-images/stablediffusion-ggml.tar)"
2025-07-18 13:24:12 +02:00
# Directory target: ensure the backend-images output directory exists
# (real file target on purpose — do not mark .PHONY).
backend-images:
	mkdir -p backend-images
2026-01-04 21:12:50 +01:00
# Backend metadata: BACKEND_NAME | DOCKERFILE_TYPE | BUILD_CONTEXT | PROGRESS_FLAG | NEEDS_BACKEND_ARG
# llama-cpp is special - uses llama-cpp Dockerfile and doesn't need BACKEND arg
BACKEND_LLAMA_CPP = llama-cpp| llama-cpp| .| false| false
# Golang backends
BACKEND_PIPER = piper| golang| .| false| true
BACKEND_LOCAL_STORE = local-store| golang| .| false| true
BACKEND_HUGGINGFACE = huggingface| golang| .| false| true
BACKEND_SILERO_VAD = silero-vad| golang| .| false| true
BACKEND_STABLEDIFFUSION_GGML = stablediffusion-ggml| golang| .| --progress= plain| true
BACKEND_WHISPER = whisper| golang| .| false| true
2026-02-09 09:12:05 +01:00
BACKEND_VOXTRAL = voxtral| golang| .| false| true
2026-03-12 18:56:26 +01:00
BACKEND_ACESTEP_CPP = acestep-cpp| golang| .| false| true
2026-03-13 20:37:15 +00:00
BACKEND_OPUS = opus| golang| .| false| true
2026-01-04 21:12:50 +01:00
# Python backends with root context
BACKEND_RERANKERS = rerankers| python| .| false| true
BACKEND_TRANSFORMERS = transformers| python| .| false| true
2026-02-03 21:57:50 +01:00
BACKEND_OUTETTS = outetts| python| .| false| true
2026-01-04 21:12:50 +01:00
BACKEND_FASTER_WHISPER = faster-whisper| python| .| false| true
BACKEND_COQUI = coqui| python| .| false| true
2026-01-13 23:35:19 +01:00
BACKEND_RFDETR = rfdetr| python| .| false| true
BACKEND_KITTEN_TTS = kitten-tts| python| .| false| true
BACKEND_NEUTTS = neutts| python| .| false| true
BACKEND_KOKORO = kokoro| python| .| false| true
BACKEND_VLLM = vllm| python| .| false| true
2026-01-24 22:23:30 +01:00
BACKEND_VLLM_OMNI = vllm-omni| python| .| false| true
2026-01-13 23:35:19 +01:00
BACKEND_DIFFUSERS = diffusers| python| .| --progress= plain| true
BACKEND_CHATTERBOX = chatterbox| python| .| false| true
BACKEND_VIBEVOICE = vibevoice| python| .| --progress= plain| true
BACKEND_MOONSHINE = moonshine| python| .| false| true
BACKEND_POCKET_TTS = pocket-tts| python| .| false| true
2026-01-23 15:18:41 +01:00
BACKEND_QWEN_TTS = qwen-tts| python| .| false| true
2026-03-12 07:48:23 +01:00
BACKEND_FISH_SPEECH = fish-speech| python| .| false| true
2026-02-27 08:16:51 +01:00
BACKEND_FASTER_QWEN3_TTS = faster-qwen3-tts| python| .| false| true
2026-01-29 21:50:35 +01:00
BACKEND_QWEN_ASR = qwen-asr| python| .| false| true
2026-02-07 08:19:37 +01:00
BACKEND_NEMO = nemo| python| .| false| true
2026-01-28 14:44:04 +01:00
BACKEND_VOXCPM = voxcpm| python| .| false| true
feat(whisperx): add whisperx backend for transcription with speaker diarization (#8299)
* feat(proto): add speaker field to TranscriptSegment for diarization
Add speaker field to the gRPC TranscriptSegment message and map it
through the Go schema, enabling backends to return speaker labels.
Signed-off-by: eureka928 <meobius123@gmail.com>
* feat(whisperx): add whisperx backend for transcription with diarization
Add Python gRPC backend using WhisperX for speech-to-text with
word-level timestamps, forced alignment, and speaker diarization
via pyannote-audio when HF_TOKEN is provided.
Signed-off-by: eureka928 <meobius123@gmail.com>
* feat(whisperx): register whisperx backend in Makefile
Signed-off-by: eureka928 <meobius123@gmail.com>
* feat(whisperx): add whisperx meta and image entries to index.yaml
Signed-off-by: eureka928 <meobius123@gmail.com>
* ci(whisperx): add build matrix entries for CPU, CUDA 12/13, and ROCm
Signed-off-by: eureka928 <meobius123@gmail.com>
* fix(whisperx): unpin torch versions and use CPU index for cpu requirements
Address review feedback:
- Use --extra-index-url for CPU torch wheels to reduce size
- Remove torch version pins, let uv resolve compatible versions
Signed-off-by: eureka928 <meobius123@gmail.com>
* fix(whisperx): pin torch ROCm variant to fix CI build failure
Signed-off-by: eureka928 <meobius123@gmail.com>
* fix(whisperx): pin torch CPU variant to fix uv resolution failure
Pin torch==2.8.0+cpu so uv resolves the CPU wheel from the extra
index instead of picking torch==2.8.0+cu128 from PyPI, which pulls
unresolvable CUDA dependencies.
Signed-off-by: eureka928 <meobius123@gmail.com>
* fix(whisperx): use unsafe-best-match index strategy to fix uv resolution failure
uv's default first-match strategy finds torch on PyPI before checking
the extra index, causing it to pick torch==2.8.0+cu128 instead of the
CPU variant. This makes whisperx's transitive torch dependency
unresolvable. Using unsafe-best-match lets uv consider all indexes.
Signed-off-by: eureka928 <meobius123@gmail.com>
* fix(whisperx): drop +cpu local version suffix to fix uv resolution failure
PEP 440 ==2.8.0 matches 2.8.0+cpu from the extra index, avoiding the
issue where uv cannot locate an explicit +cpu local version specifier.
This aligns with the pattern used by all other CPU backends.
Signed-off-by: eureka928 <meobius123@gmail.com>
* fix(backends): drop +rocm local version suffixes from hipblas requirements to fix uv resolution
uv cannot resolve PEP 440 local version specifiers (e.g. +rocm6.4,
+rocm6.3) in pinned requirements. The --extra-index-url already points
to the correct ROCm wheel index and --index-strategy unsafe-best-match
(set in libbackend.sh) ensures the ROCm variant is preferred.
Applies the same fix as 7f5d72e8 (which resolved this for +cpu) across
all 14 hipblas requirements files.
Signed-off-by: eureka928 <meobius123@gmail.com>
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Signed-off-by: eureka928 <meobius123@gmail.com>
* revert: scope hipblas suffix fix to whisperx only
Reverts changes to non-whisperx hipblas requirements files per
maintainer review — other backends are building fine with the +rocm
local version suffix.
Signed-off-by: eureka928 <meobius123@gmail.com>
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Signed-off-by: eureka928 <meobius123@gmail.com>
---------
Signed-off-by: eureka928 <meobius123@gmail.com>
Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>
2026-02-02 10:33:12 -05:00
# Backend definitions, pipe-delimited fields consumed by generate-docker-build-target:
#   NAME|DOCKERFILE_TYPE|BUILD_CONTEXT|PROGRESS_FLAG|NEEDS_BACKEND_ARG
# NAME              -> image tag suffix and docker-build-<NAME> target name
# DOCKERFILE_TYPE   -> selects backend/Dockerfile.<TYPE>
# BUILD_CONTEXT     -> docker build context directory
# PROGRESS_FLAG     -> extra docker flag, or "false" for none
# NEEDS_BACKEND_ARG -> "true" to pass --build-arg BACKEND=<NAME>
BACKEND_WHISPERX = whisperx|python|.|false|true
BACKEND_ACE_STEP = ace-step|python|.|false|true
BACKEND_MLX_DISTRIBUTED = mlx-distributed|python|./|false|true
BACKEND_TRL = trl|python|.|false|true
BACKEND_LLAMA_CPP_QUANTIZATION = llama-cpp-quantization|python|.|false|true
2026-01-04 21:12:50 +01:00
# Helper function to build the docker image for a backend.
# Usage: $(call docker-build-backend,BACKEND_NAME,DOCKERFILE_TYPE,BUILD_CONTEXT,PROGRESS_FLAG,NEEDS_BACKEND_ARG)
#   $(1) NAME              -> image tag local-ai-backend:<NAME>
#   $(2) DOCKERFILE_TYPE   -> backend/Dockerfile.<TYPE>
#   $(3) BUILD_CONTEXT     -> docker build context
#   $(4) PROGRESS_FLAG     -> extra flag passed through verbatim unless it is "false"
#   $(5) NEEDS_BACKEND_ARG -> "true" adds --build-arg BACKEND=<NAME>
define docker-build-backend
	docker build $(if $(filter-out false,$(4)),$(4)) \
		--build-arg BUILD_TYPE=$(BUILD_TYPE) \
		--build-arg BASE_IMAGE=$(BASE_IMAGE) \
		--build-arg CUDA_MAJOR_VERSION=$(CUDA_MAJOR_VERSION) \
		--build-arg CUDA_MINOR_VERSION=$(CUDA_MINOR_VERSION) \
		--build-arg UBUNTU_VERSION=$(UBUNTU_VERSION) \
		--build-arg UBUNTU_CODENAME=$(UBUNTU_CODENAME) \
		$(if $(filter true,$(5)),--build-arg BACKEND=$(1)) \
		-t local-ai-backend:$(1) -f backend/Dockerfile.$(2) $(3)
endef
# Generate a docker-build-<NAME> target from a pipe-delimited backend definition.
# $(1) is a BACKEND_* value; fields are split on "|" and forwarded to
# docker-build-backend. The $$(call ...) is escaped so the helper expands when
# the recipe runs, not when the rule is eval'd.
# The generated target is declared .PHONY: it builds an image, not a file.
define generate-docker-build-target
.PHONY: docker-build-$(word 1,$(subst |, ,$(1)))
docker-build-$(word 1,$(subst |, ,$(1))):
	$$(call docker-build-backend,$(word 1,$(subst |, ,$(1))),$(word 2,$(subst |, ,$(1))),$(word 3,$(subst |, ,$(1))),$(word 4,$(subst |, ,$(1))),$(word 5,$(subst |, ,$(1))))
endef
# Generate all docker-build targets
$( eval $ ( call generate -docker -build -target ,$ ( BACKEND_LLAMA_CPP ) ) )
$( eval $ ( call generate -docker -build -target ,$ ( BACKEND_PIPER ) ) )
$( eval $ ( call generate -docker -build -target ,$ ( BACKEND_LOCAL_STORE ) ) )
$( eval $ ( call generate -docker -build -target ,$ ( BACKEND_HUGGINGFACE ) ) )
$( eval $ ( call generate -docker -build -target ,$ ( BACKEND_SILERO_VAD ) ) )
$( eval $ ( call generate -docker -build -target ,$ ( BACKEND_STABLEDIFFUSION_GGML ) ) )
$( eval $ ( call generate -docker -build -target ,$ ( BACKEND_WHISPER ) ) )
2026-02-09 09:12:05 +01:00
$( eval $ ( call generate -docker -build -target ,$ ( BACKEND_VOXTRAL ) ) )
2026-03-13 20:37:15 +00:00
$( eval $ ( call generate -docker -build -target ,$ ( BACKEND_OPUS ) ) )
2026-01-04 21:12:50 +01:00
$( eval $ ( call generate -docker -build -target ,$ ( BACKEND_RERANKERS ) ) )
$( eval $ ( call generate -docker -build -target ,$ ( BACKEND_TRANSFORMERS ) ) )
2026-02-03 21:57:50 +01:00
$( eval $ ( call generate -docker -build -target ,$ ( BACKEND_OUTETTS ) ) )
2026-01-04 21:12:50 +01:00
$( eval $ ( call generate -docker -build -target ,$ ( BACKEND_FASTER_WHISPER ) ) )
$( eval $ ( call generate -docker -build -target ,$ ( BACKEND_COQUI ) ) )
$( eval $ ( call generate -docker -build -target ,$ ( BACKEND_RFDETR ) ) )
$( eval $ ( call generate -docker -build -target ,$ ( BACKEND_KITTEN_TTS ) ) )
$( eval $ ( call generate -docker -build -target ,$ ( BACKEND_NEUTTS ) ) )
$( eval $ ( call generate -docker -build -target ,$ ( BACKEND_KOKORO ) ) )
$( eval $ ( call generate -docker -build -target ,$ ( BACKEND_VLLM ) ) )
2026-01-24 22:23:30 +01:00
$( eval $ ( call generate -docker -build -target ,$ ( BACKEND_VLLM_OMNI ) ) )
2026-01-04 21:12:50 +01:00
$( eval $ ( call generate -docker -build -target ,$ ( BACKEND_DIFFUSERS ) ) )
$( eval $ ( call generate -docker -build -target ,$ ( BACKEND_CHATTERBOX ) ) )
$( eval $ ( call generate -docker -build -target ,$ ( BACKEND_VIBEVOICE ) ) )
2026-01-07 21:44:35 +01:00
$( eval $ ( call generate -docker -build -target ,$ ( BACKEND_MOONSHINE ) ) )
2026-01-13 23:35:19 +01:00
$( eval $ ( call generate -docker -build -target ,$ ( BACKEND_POCKET_TTS ) ) )
2026-01-23 15:18:41 +01:00
$( eval $ ( call generate -docker -build -target ,$ ( BACKEND_QWEN_TTS ) ) )
2026-03-12 07:48:23 +01:00
$( eval $ ( call generate -docker -build -target ,$ ( BACKEND_FISH_SPEECH ) ) )
2026-02-27 08:16:51 +01:00
$( eval $ ( call generate -docker -build -target ,$ ( BACKEND_FASTER_QWEN 3_TTS ) ) )
2026-01-29 21:50:35 +01:00
$( eval $ ( call generate -docker -build -target ,$ ( BACKEND_QWEN_ASR ) ) )
2026-02-07 08:19:37 +01:00
$( eval $ ( call generate -docker -build -target ,$ ( BACKEND_NEMO ) ) )
2026-01-28 14:44:04 +01:00
$( eval $ ( call generate -docker -build -target ,$ ( BACKEND_VOXCPM ) ) )
feat(whisperx): add whisperx backend for transcription with speaker diarization (#8299)
* feat(proto): add speaker field to TranscriptSegment for diarization
Add speaker field to the gRPC TranscriptSegment message and map it
through the Go schema, enabling backends to return speaker labels.
Signed-off-by: eureka928 <meobius123@gmail.com>
* feat(whisperx): add whisperx backend for transcription with diarization
Add Python gRPC backend using WhisperX for speech-to-text with
word-level timestamps, forced alignment, and speaker diarization
via pyannote-audio when HF_TOKEN is provided.
Signed-off-by: eureka928 <meobius123@gmail.com>
* feat(whisperx): register whisperx backend in Makefile
Signed-off-by: eureka928 <meobius123@gmail.com>
* feat(whisperx): add whisperx meta and image entries to index.yaml
Signed-off-by: eureka928 <meobius123@gmail.com>
* ci(whisperx): add build matrix entries for CPU, CUDA 12/13, and ROCm
Signed-off-by: eureka928 <meobius123@gmail.com>
* fix(whisperx): unpin torch versions and use CPU index for cpu requirements
Address review feedback:
- Use --extra-index-url for CPU torch wheels to reduce size
- Remove torch version pins, let uv resolve compatible versions
Signed-off-by: eureka928 <meobius123@gmail.com>
* fix(whisperx): pin torch ROCm variant to fix CI build failure
Signed-off-by: eureka928 <meobius123@gmail.com>
* fix(whisperx): pin torch CPU variant to fix uv resolution failure
Pin torch==2.8.0+cpu so uv resolves the CPU wheel from the extra
index instead of picking torch==2.8.0+cu128 from PyPI, which pulls
unresolvable CUDA dependencies.
Signed-off-by: eureka928 <meobius123@gmail.com>
* fix(whisperx): use unsafe-best-match index strategy to fix uv resolution failure
uv's default first-match strategy finds torch on PyPI before checking
the extra index, causing it to pick torch==2.8.0+cu128 instead of the
CPU variant. This makes whisperx's transitive torch dependency
unresolvable. Using unsafe-best-match lets uv consider all indexes.
Signed-off-by: eureka928 <meobius123@gmail.com>
* fix(whisperx): drop +cpu local version suffix to fix uv resolution failure
PEP 440 ==2.8.0 matches 2.8.0+cpu from the extra index, avoiding the
issue where uv cannot locate an explicit +cpu local version specifier.
This aligns with the pattern used by all other CPU backends.
Signed-off-by: eureka928 <meobius123@gmail.com>
* fix(backends): drop +rocm local version suffixes from hipblas requirements to fix uv resolution
uv cannot resolve PEP 440 local version specifiers (e.g. +rocm6.4,
+rocm6.3) in pinned requirements. The --extra-index-url already points
to the correct ROCm wheel index and --index-strategy unsafe-best-match
(set in libbackend.sh) ensures the ROCm variant is preferred.
Applies the same fix as 7f5d72e8 (which resolved this for +cpu) across
all 14 hipblas requirements files.
Signed-off-by: eureka928 <meobius123@gmail.com>
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Signed-off-by: eureka928 <meobius123@gmail.com>
* revert: scope hipblas suffix fix to whisperx only
Reverts changes to non-whisperx hipblas requirements files per
maintainer review — other backends are building fine with the +rocm
local version suffix.
Signed-off-by: eureka928 <meobius123@gmail.com>
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Signed-off-by: eureka928 <meobius123@gmail.com>
---------
Signed-off-by: eureka928 <meobius123@gmail.com>
Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>
2026-02-02 10:33:12 -05:00
# docker-build targets for the most recently added backends.
$(eval $(call generate-docker-build-target,$(BACKEND_WHISPERX)))
$(eval $(call generate-docker-build-target,$(BACKEND_ACE_STEP)))
$(eval $(call generate-docker-build-target,$(BACKEND_ACESTEP_CPP)))
$(eval $(call generate-docker-build-target,$(BACKEND_MLX_DISTRIBUTED)))
$(eval $(call generate-docker-build-target,$(BACKEND_TRL)))
$(eval $(call generate-docker-build-target,$(BACKEND_LLAMA_CPP_QUANTIZATION)))
2026-01-04 21:12:50 +01:00
# Pattern rule for docker-save targets: exports image local-ai-backend:<name>
# to backend-images/<name>.tar. backend-images is an order-only prerequisite
# (after |) — the output directory must exist, but its mtime must not make
# already-saved tarballs look stale.
docker-save-%: | backend-images
	docker save local-ai-backend:$* -o backend-images/$*.tar
2025-07-18 13:24:12 +02:00
# Aggregate target: build every backend image that is not covered by a
# dedicated CI job. Phony — it produces images, not a file.
.PHONY: docker-build-backends
docker-build-backends: docker-build-llama-cpp docker-build-rerankers docker-build-vllm docker-build-vllm-omni docker-build-transformers docker-build-outetts docker-build-diffusers docker-build-kokoro docker-build-faster-whisper docker-build-coqui docker-build-chatterbox docker-build-vibevoice docker-build-moonshine docker-build-pocket-tts docker-build-qwen-tts docker-build-fish-speech docker-build-faster-qwen3-tts docker-build-qwen-asr docker-build-nemo docker-build-voxcpm docker-build-whisperx docker-build-ace-step docker-build-acestep-cpp docker-build-voxtral docker-build-mlx-distributed docker-build-trl docker-build-llama-cpp-quantization
2026-01-30 12:41:50 +01:00
########################################################
### Mock Backend for E2E Tests
########################################################

# Build the gRPC mock backend used by the e2e suite; requires generated
# protobuf Go code first.
.PHONY: build-mock-backend
build-mock-backend: protogen-go
	$(GOCMD) build -o tests/e2e/mock-backend/mock-backend ./tests/e2e/mock-backend

.PHONY: clean-mock-backend
clean-mock-backend:
	rm -f tests/e2e/mock-backend/mock-backend
2026-03-16 16:51:06 +00:00
########################################################
### UI E2E Test Server
########################################################

# Build the server binary the Playwright UI tests run against; needs the
# mock backend, the compiled React UI, and generated protobuf Go code.
.PHONY: build-ui-test-server
build-ui-test-server: build-mock-backend react-ui protogen-go
	$(GOCMD) build -o tests/e2e-ui/ui-test-server ./tests/e2e-ui

# Run the Playwright UI e2e suite locally (installs chromium on first run).
.PHONY: test-ui-e2e
test-ui-e2e: build-ui-test-server
	cd core/http/react-ui && npm install && npx playwright install --with-deps chromium && npx playwright test

# Run the same suite inside a container, for a reproducible environment.
.PHONY: test-ui-e2e-docker
test-ui-e2e-docker:
	docker build -t localai-ui-e2e -f tests/e2e-ui/Dockerfile .
	docker run --rm localai-ui-e2e

.PHONY: clean-ui-test-server
clean-ui-test-server:
	rm -f tests/e2e-ui/ui-test-server
2025-07-18 13:24:12 +02:00
########################################################
### END Backends
########################################################
2024-03-29 22:29:33 +01:00
# Regenerate the OpenAPI spec from source annotations (requires swag CLI).
.PHONY: swagger
swagger:
	swag init -g core/http/app.go --output swagger
2026-03-05 21:47:12 +01:00
# DEPRECATED: gen-assets is for the legacy Alpine.js UI. Remove when legacy UI is removed.
# Downloads/stages the static web assets listed in webui_static.yaml.
.PHONY: gen-assets
gen-assets:
	$(GOCMD) run core/dependencies_manager/manager.go webui_static.yaml core/http/static/assets
2024-06-13 00:47:16 +02:00
## Documentation

# Hugo requires this layout directory to exist before rendering.
docs/layouts/_default:
	mkdir -p $@

# Render the model-gallery page from gallery/index.yaml.
docs/static/gallery.html: docs/layouts/_default
	$(GOCMD) run ./.github/ci/modelslist.go ./gallery/index.yaml > $@

# Full static site build (minified) into docs/public.
docs/public: docs/layouts/_default docs/static/gallery.html
	cd docs && hugo --minify

.PHONY: docs-clean
docs-clean:
	rm -rf docs/public
	rm -rf docs/static/gallery.html

# Serve the documentation locally with live reload.
.PHONY: docs
docs: docs/static/gallery.html
	cd docs && hugo serve
2025-08-26 14:22:04 +02:00
########################################################
## Platform-specific builds
########################################################

## fyne cross-platform build
# Package the launcher binary into a macOS .dmg under dist/.
# Uses $(GOCMD) for consistency with the rest of this Makefile.
.PHONY: build-launcher-darwin
build-launcher-darwin: build-launcher
	$(GOCMD) run github.com/tiagomelo/macos-dmg-creator/cmd/createdmg@latest \
		--appName "LocalAI" \
		--appBinaryPath "$(LAUNCHER_BINARY_NAME)" \
		--bundleIdentifier "com.localai.launcher" \
		--iconPath "core/http/static/logo.png" \
		--outputDir "dist/"
build-launcher-linux :
2025-09-01 21:18:30 +01:00
cd cmd/launcher && go run fyne.io/tools/cmd/fyne@latest package -os linux -icon ../../core/http/static/logo.png --executable $( LAUNCHER_BINARY_NAME) -linux && mv launcher.tar.xz ../../$( LAUNCHER_BINARY_NAME) -linux.tar.xz