Blame: Dockerfile - mudler/LocalAI

mudler / LocalAI UNCLAIMED

:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more. Features: Generate Text, MCP, Audio, Video, Images, Voice Cloning, Distributed, P2P and decentralized inference

0 0 29 Go

Normal View History Raw

feat: Use ubuntu as base for container images, drop deprecated ggml-transformers backends (#1689) * cleanup backends * switch image to ubuntu 22.04 * adapt commands for ubuntu * transformers cleanup * no contrib on ubuntu * Change test model to gguf * ci: disable bark tests (too cpu-intensive) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * cleanup * refinements * use intel base image * Makefile: Add docker targets * Change test model --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2024-02-08 20:12:51 +01:00			`ARG BASE_IMAGE=ubuntu:22.04`
feat: better control of GRPC docker cache (#2070) Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> 2024-04-18 15:19:36 -05:00			`ARG GRPC_BASE_IMAGE=${BASE_IMAGE}`
ci: generate specific image for intel builds (#2374) ci: fix intel images until are fixed upstream Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2024-05-22 23:35:39 +02:00			`ARG INTEL_BASE_IMAGE=${BASE_IMAGE}`
fix: do not build from the same container (#434) Signed-off-by: mudler <mudler@mocaccino.org> 2023-05-30 15:53:37 +02:00
feat: Add backend gallery (#5607) * feat: Add backend gallery This PR add support to manage backends as similar to models. There is now available a backend gallery which can be used to install and remove extra backends. The backend gallery can be configured similarly as a model gallery, and API calls allows to install and remove new backends in runtime, and as well during the startup phase of LocalAI. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add backends docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * wip: Backend Dockerfile for python backends Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat: drop extras images, build python backends separately Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixup on all backends Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * test CI Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Tweaks Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Drop old backends leftovers Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fixup CI Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Move dockerfile upper Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fix proto Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Feature dropped for consistency - we prefer model galleries Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add missing packages in the build image Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * exllama is ponly available on cublas Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * pin torch on chatterbox Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fixups to index Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * CI Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Debug CI * Install accellerators deps Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add target arch * Add cuda minor version Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Use self-hosted runners Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * ci: use quay for test images Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixups for vllm and chatterbox Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Small fixups on CI Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chatterbox is only available for nvidia Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Simplify CI builds Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Adapt test, use qwen3 Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore(model gallery): add jina-reranker-v1-tiny-en-gguf Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix(gguf-parser): recover from potential panics that can happen while reading ggufs with gguf-parser Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Use reranker from llama.cpp in AIO images Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Limit concurrent jobs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> 2025-06-15 14:56:52 +02:00			`FROM ${BASE_IMAGE} AS requirements`
feat: add CuBLAS support in Docker images (#403) Signed-off-by: Sébastien Prud'homme <sebastien.prudhomme@gmail.com> 2023-05-29 23:12:27 +02:00
feat: Use ubuntu as base for container images, drop deprecated ggml-transformers backends (#1689) * cleanup backends * switch image to ubuntu 22.04 * adapt commands for ubuntu * transformers cleanup * no contrib on ubuntu * Change test model to gguf * ci: disable bark tests (too cpu-intensive) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * cleanup * refinements * use intel base image * Makefile: Add docker targets * Change test model --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2024-02-08 20:12:51 +01:00			`ENV DEBIAN_FRONTEND=noninteractive`
feat: add CuBLAS support in Docker images (#403) Signed-off-by: Sébastien Prud'homme <sebastien.prudhomme@gmail.com> 2023-05-29 23:12:27 +02:00
			`RUN apt-get update && \`
feat: cleanup Dockerfile and make final image a little smaller (#2146) * feat: cleanup Dockerfile and make final image a little smaller Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: add build-essential to final stage Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: more GRPC cache misses Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: correct for another cause of GRPC cache misses Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * feat: generate new GRPC cache automatically if needed Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: use new GRPC_MAKEFLAGS build arg in GRPC cache generation Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> --------- Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> 2024-04-27 12:48:20 -05:00			`apt-get install -y --no-install-recommends \`
feat: :warning: reduce images size and stop bundling sources (#5721) feat: reduce images size and stop bundling sources Do not copy sources anymore, and reduce packages of the base images by not using builder images. If needed to rebuild, just build the container image from scratch by following the docs. We will slowly try to migrate all backends to the gallery to keep the core small. This PR is a breaking change, it also sets the base folders to /models and /backends instead of /build/models and /build/backends. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> 2025-06-26 18:41:38 +02:00			`ca-certificates curl wget espeak-ng libgomp1 \`
			`python3 python-is-python3 ffmpeg && \`
feat: cleanup Dockerfile and make final image a little smaller (#2146) * feat: cleanup Dockerfile and make final image a little smaller Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: add build-essential to final stage Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: more GRPC cache misses Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: correct for another cause of GRPC cache misses Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * feat: generate new GRPC cache automatically if needed Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: use new GRPC_MAKEFLAGS build arg in GRPC cache generation Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> --------- Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> 2024-04-27 12:48:20 -05:00			`apt-get clean && \`
feat(images): do not install python deps in the core image (#2425) do not install python deps in the core image Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2024-05-27 22:07:48 +02:00			`rm -rf /var/lib/apt/lists/*`
feat: llama.cpp gRPC C++ backend (#1170) * wip: llama.cpp c++ gRPC server Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * make it work, attach it to the build process Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * update deps Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix: add protobuf dep Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * try fix protobuf on cmake * cmake: workarounds Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * add packages * cmake: use fixed version of grpc Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * cmake(grpc): install locally * install grpc Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * install required deps for grpc on debian bullseye Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * debug * debug * Fixups * no need to install cmake manually Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * ci: fixup macOS * use brew whenever possible Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * macOS fixups * debug * fix container build Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * workaround * try mac https://stackoverflow.com/questions/23905661/on-mac-g-clang-fails-to-search-usr-local-include-and-usr-local-lib-by-def * Disable temp. arm64 docker image builds --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2023-10-16 21:46:29 +02:00
feat: organize Dockerfile into distinct sections (#2181) Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> 2024-04-30 03:12:19 -05:00			`# The requirements-drivers target is for BUILD_TYPE specific items. If you need to install something specific to CUDA, or specific to ROCM, it goes here.`
feat: Add backend gallery (#5607) * feat: Add backend gallery This PR add support to manage backends as similar to models. There is now available a backend gallery which can be used to install and remove extra backends. The backend gallery can be configured similarly as a model gallery, and API calls allows to install and remove new backends in runtime, and as well during the startup phase of LocalAI. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add backends docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * wip: Backend Dockerfile for python backends Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat: drop extras images, build python backends separately Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixup on all backends Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * test CI Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Tweaks Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Drop old backends leftovers Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fixup CI Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Move dockerfile upper Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fix proto Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Feature dropped for consistency - we prefer model galleries Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add missing packages in the build image Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * exllama is ponly available on cublas Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * pin torch on chatterbox Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fixups to index Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * CI Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Debug CI * Install accellerators deps Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add target arch * Add cuda minor version Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Use self-hosted runners Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * ci: use quay for test images Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixups for vllm and chatterbox Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Small fixups on CI Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chatterbox is only available for nvidia Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Simplify CI builds Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Adapt test, use qwen3 Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore(model gallery): add jina-reranker-v1-tiny-en-gguf Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix(gguf-parser): recover from potential panics that can happen while reading ggufs with gguf-parser Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Use reranker from llama.cpp in AIO images Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Limit concurrent jobs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> 2025-06-15 14:56:52 +02:00			`FROM requirements AS requirements-drivers`
feat: organize Dockerfile into distinct sections (#2181) Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> 2024-04-30 03:12:19 -05:00
			`ARG BUILD_TYPE`
feat: Upgrade to CUDA 12.5 (#2601) Signed-off-by: Rene Leonhardt <65483435+reneleonhardt@users.noreply.github.com> 2024-06-19 17:50:49 +02:00			`ARG CUDA_MAJOR_VERSION=12`
fix(cuda): downgrade to 12.0 to increase compatibility range (#2994) * fix(cuda): downgrade to 12.0 to increase compatibility range Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * improve messaging Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2024-07-23 23:35:31 +02:00			`ARG CUDA_MINOR_VERSION=0`
feat(Dockerfile): allow to skip driver installation (#4447) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2024-12-22 21:28:38 +01:00			`ARG SKIP_DRIVERS=false`
feat: :warning: reduce images size and stop bundling sources (#5721) feat: reduce images size and stop bundling sources Do not copy sources anymore, and reduce packages of the base images by not using builder images. If needed to rebuild, just build the container image from scratch by following the docs. We will slowly try to migrate all backends to the gallery to keep the core small. This PR is a breaking change, it also sets the base folders to /models and /backends instead of /build/models and /build/backends. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> 2025-06-26 18:41:38 +02:00			`ARG TARGETARCH`
			`ARG TARGETVARIANT`
feat: organize Dockerfile into distinct sections (#2181) Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> 2024-04-30 03:12:19 -05:00			`ENV BUILD_TYPE=${BUILD_TYPE}`

feat(system): detect and allow to override capabilities (#5785) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-07-03 19:30:52 +02:00			`RUN mkdir -p /run/localai`
feat: do not bundle llama-cpp anymore (#5790) * Build llama.cpp separately Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * WIP Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * WIP Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * WIP Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Start to try to attach some tests Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add git and small fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix: correctly autoload external backends Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Try to run AIO tests Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Slightly update the Makefile helps Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Adapt auto-bumper Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Try to run linux test Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add llama-cpp into build pipelines Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add default capability (for cpu) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Drop llama-cpp specific logic from the backend loader Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * drop grpc install in ci for tests Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Pass by backends path for tests Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Build protogen at start Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix(tests): set backends path consistently Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Correctly configure the backends path Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Try to build for darwin Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * WIP Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Compile for metal on arm64/darwin Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Try to run build off from cross-arch Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add to the backend index nvidia-l4t and cpu's llama-cpp backends Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Build also darwin-x86 for llama-cpp Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Disable arm64 builds temporary Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Test backend build on PR Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fixup build backend reusable workflow Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * pass by skip drivers Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Use crane Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Skip drivers Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * x86 darwin Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add packaging step for llama.cpp Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fix leftover from bark-cpp extraction Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Try to fix hipblas build Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-07-18 13:24:12 +02:00			`RUN echo "default" > /run/localai/capability`
feat(system): detect and allow to override capabilities (#5785) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-07-03 19:30:52 +02:00
feat(vulkan): add vulkan support to the llama.cpp backend (#2648) feat(vulkan): add vulkan support to llama.cpp Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2024-06-24 20:04:58 +02:00			`# Vulkan requirements`
			`RUN <<EOT bash`
feat(Dockerfile): allow to skip driver installation (#4447) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2024-12-22 21:28:38 +01:00			`if [ "${BUILD_TYPE}" = "vulkan" ] && [ "${SKIP_DRIVERS}" = "false" ]; then`
feat(vulkan): add vulkan support to the llama.cpp backend (#2648) feat(vulkan): add vulkan support to llama.cpp Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2024-06-24 20:04:58 +02:00			`apt-get update && \`
			`apt-get install -y --no-install-recommends \`
fix: cleanup indentation and remove duplicate dockerfile stanza (#2889) Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> 2024-07-15 20:51:15 -05:00			`software-properties-common pciutils wget gpg-agent && \`
feat(vulkan): add vulkan support to the llama.cpp backend (#2648) feat(vulkan): add vulkan support to llama.cpp Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2024-06-24 20:04:58 +02:00			`wget -qO - https://packages.lunarg.com/lunarg-signing-key-pub.asc \| apt-key add - && \`
			`wget -qO /etc/apt/sources.list.d/lunarg-vulkan-jammy.list https://packages.lunarg.com/vulkan/lunarg-vulkan-jammy.list && \`
			`apt-get update && \`
fix: cleanup indentation and remove duplicate dockerfile stanza (#2889) Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> 2024-07-15 20:51:15 -05:00			`apt-get install -y \`
feat(vulkan): add vulkan support to the llama.cpp backend (#2648) feat(vulkan): add vulkan support to llama.cpp Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2024-06-24 20:04:58 +02:00			`vulkan-sdk && \`
			`apt-get clean && \`
feat(system): detect and allow to override capabilities (#5785) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-07-03 19:30:52 +02:00			`rm -rf /var/lib/apt/lists/* && \`
			`echo "vulkan" > /run/localai/capability`
feat(vulkan): add vulkan support to the llama.cpp backend (#2648) feat(vulkan): add vulkan support to llama.cpp Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2024-06-24 20:04:58 +02:00			`fi`
			`EOT`

feat: organize Dockerfile into distinct sections (#2181) Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> 2024-04-30 03:12:19 -05:00			`# CuBLAS requirements`
feat(build): add arm64 core containers (#2421) ci: add arm64 container images Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2024-05-28 10:34:59 +02:00			`RUN <<EOT bash`
feat(Dockerfile): allow to skip driver installation (#4447) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2024-12-22 21:28:38 +01:00			`if [ "${BUILD_TYPE}" = "cublas" ] && [ "${SKIP_DRIVERS}" = "false" ]; then`
feat(build): add arm64 core containers (#2421) ci: add arm64 container images Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2024-05-28 10:34:59 +02:00			`apt-get update && \`
			`apt-get install -y --no-install-recommends \`
fix: cleanup indentation and remove duplicate dockerfile stanza (#2889) Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> 2024-07-15 20:51:15 -05:00			`software-properties-common pciutils`
feat(build): add arm64 core containers (#2421) ci: add arm64 container images Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2024-05-28 10:34:59 +02:00			`if [ "amd64" = "$TARGETARCH" ]; then`
			`curl -O https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64/cuda-keyring_1.1-1_all.deb`
fix: cleanup indentation and remove duplicate dockerfile stanza (#2889) Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> 2024-07-15 20:51:15 -05:00			`fi`
feat(build): add arm64 core containers (#2421) ci: add arm64 container images Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2024-05-28 10:34:59 +02:00			`if [ "arm64" = "$TARGETARCH" ]; then`
			`curl -O https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/arm64/cuda-keyring_1.1-1_all.deb`
			`fi`
feat: organize Dockerfile into distinct sections (#2181) Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> 2024-04-30 03:12:19 -05:00			`dpkg -i cuda-keyring_1.1-1_all.deb && \`
			`rm -f cuda-keyring_1.1-1_all.deb && \`
			`apt-get update && \`
			`apt-get install -y --no-install-recommends \`
			`cuda-nvcc-${CUDA_MAJOR_VERSION}-${CUDA_MINOR_VERSION} \`
deps(whisper): update, add libcufft-dev (#2501) * arrow_up: Update ggerganov/whisper.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> * fix(build): add libcufft-dev Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com> 2024-06-06 08:41:04 +02:00			`libcufft-dev-${CUDA_MAJOR_VERSION}-${CUDA_MINOR_VERSION} \`
feat: organize Dockerfile into distinct sections (#2181) Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> 2024-04-30 03:12:19 -05:00			`libcurand-dev-${CUDA_MAJOR_VERSION}-${CUDA_MINOR_VERSION} \`
			`libcublas-dev-${CUDA_MAJOR_VERSION}-${CUDA_MINOR_VERSION} \`
			`libcusparse-dev-${CUDA_MAJOR_VERSION}-${CUDA_MINOR_VERSION} \`
			`libcusolver-dev-${CUDA_MAJOR_VERSION}-${CUDA_MINOR_VERSION} && \`
			`apt-get clean && \`
feat(system): detect and allow to override capabilities (#5785) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-07-03 19:30:52 +02:00			`rm -rf /var/lib/apt/lists/* && \`
			`echo "nvidia" > /run/localai/capability`
fix: cleanup indentation and remove duplicate dockerfile stanza (#2889) Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> 2024-07-15 20:51:15 -05:00			`fi`
			`EOT`
feat: organize Dockerfile into distinct sections (#2181) Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> 2024-04-30 03:12:19 -05:00
			`# If we are building with clblas support, we need the libraries for the builds`
feat(Dockerfile): allow to skip driver installation (#4447) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2024-12-22 21:28:38 +01:00			`RUN if [ "${BUILD_TYPE}" = "clblas" ] && [ "${SKIP_DRIVERS}" = "false" ]; then \`
feat: organize Dockerfile into distinct sections (#2181) Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> 2024-04-30 03:12:19 -05:00			`apt-get update && \`
			`apt-get install -y --no-install-recommends \`
			`libclblast-dev && \`
			`apt-get clean && \`
			`rm -rf /var/lib/apt/lists/* \`
feat(intel): add diffusers/transformers support (#1746) * feat(intel): add diffusers support * try to consume upstream container image * Debug * Manually install deps * Map transformers/hf cache dir to modelpath if not specified * fix(compel): update initialization, pass by all gRPC options * fix: add dependencies, implement transformers for xpu * base it from the oneapi image * Add pillow * set threads if specified when launching the API * Skip conda install if intel * defaults to non-intel * ci: add to pipelines * prepare compel only if enabled * Skip conda install if intel * fix cleanup * Disable compel by default * Install torch 2.1.0 with Intel * Skip conda on some setups * Detect python * Quiet output * Do not override system python with conda * Prefer python3 * Fixups * exllama2: do not install without conda (overrides pytorch version) * exllama/exllama2: do not install if not using cuda * Add missing dataset dependency * Small fixups, symlink to python, add requirements * Add neural_speed to the deps * correctly handle model offloading * fix: device_map == xpu * go back at calling python, fixed at dockerfile level * Exllama2 restricted to only nvidia gpus * Tokenizer to xpu 2024-03-07 14:37:45 +01:00			`; fi`

feat(Dockerfile): allow to skip driver installation (#4447) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2024-12-22 21:28:38 +01:00			`RUN if [ "${BUILD_TYPE}" = "hipblas" ] && [ "${SKIP_DRIVERS}" = "false" ]; then \`
feat: update ROCM and use smaller image (#2196) * feat: update ROCM and use smaller image Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: add call to ldconfig to fix AMDs broken library packages Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> --------- Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> 2024-05-03 11:46:49 -05:00			`apt-get update && \`
			`apt-get install -y --no-install-recommends \`
			`hipblas-dev \`
			`rocblas-dev && \`
			`apt-get clean && \`
			`rm -rf /var/lib/apt/lists/* && \`
feat(system): detect and allow to override capabilities (#5785) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-07-03 19:30:52 +02:00			`echo "amd" > /run/localai/capability && \`
feat: update ROCM and use smaller image (#2196) * feat: update ROCM and use smaller image Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: add call to ldconfig to fix AMDs broken library packages Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> --------- Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> 2024-05-03 11:46:49 -05:00			`# I have no idea why, but the ROCM lib packages don't trigger ldconfig after they install, which results in local-ai and others not being able`
			`# to locate the libraries. We run ldconfig ourselves to work around this packaging deficiency`
			`ldconfig \`
			`; fi`

feat: :warning: reduce images size and stop bundling sources (#5721) feat: reduce images size and stop bundling sources Do not copy sources anymore, and reduce packages of the base images by not using builder images. If needed to rebuild, just build the container image from scratch by following the docs. We will slowly try to migrate all backends to the gallery to keep the core small. This PR is a breaking change, it also sets the base folders to /models and /backends instead of /build/models and /build/backends. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> 2025-06-26 18:41:38 +02:00			`# Cuda`
			`ENV PATH=/usr/local/cuda/bin:${PATH}`

			`# HipBLAS requirements`
			`ENV PATH=/opt/rocm/bin:${PATH}`

			`###################################`
			`###################################`

			`# The requirements-core target is common to all images. It should not be placed in requirements-core unless every single build will use it.`
			`FROM requirements-drivers AS build-requirements`

			`ARG GO_VERSION=1.22.6`
			`ARG CMAKE_VERSION=3.26.4`
			`ARG CMAKE_FROM_SOURCE=false`
			`ARG TARGETARCH`
			`ARG TARGETVARIANT`


			`RUN apt-get update && \`
			`apt-get install -y --no-install-recommends \`
			`build-essential \`
			`ccache \`
			`ca-certificates espeak-ng \`
			`curl libssl-dev \`
			`git \`
			`git-lfs \`
			`unzip upx-ucl python3 python-is-python3 && \`
			`apt-get clean && \`
			`rm -rf /var/lib/apt/lists/*`

			`# Install CMake (the version in 22.04 is too old)`
			`RUN <<EOT bash`
fix: dockerfile typo (#5823) fix dockerfile typo Signed-off-by: LeonSijiaLu <leonsijialu1@gmail.com> 2025-07-18 08:59:33 -04:00			`if [ "${CMAKE_FROM_SOURCE}" = "true" ]; then`
feat: :warning: reduce images size and stop bundling sources (#5721) feat: reduce images size and stop bundling sources Do not copy sources anymore, and reduce packages of the base images by not using builder images. If needed to rebuild, just build the container image from scratch by following the docs. We will slowly try to migrate all backends to the gallery to keep the core small. This PR is a breaking change, it also sets the base folders to /models and /backends instead of /build/models and /build/backends. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> 2025-06-26 18:41:38 +02:00			`curl -L -s https://github.com/Kitware/CMake/releases/download/v${CMAKE_VERSION}/cmake-${CMAKE_VERSION}.tar.gz -o cmake.tar.gz && tar xvf cmake.tar.gz && cd cmake-${CMAKE_VERSION} && ./configure && make && make install`
			`else`
			`apt-get update && \`
			`apt-get install -y \`
			`cmake && \`
			`apt-get clean && \`
			`rm -rf /var/lib/apt/lists/*`
			`fi`
			`EOT`

			`# Install Go`
			`RUN curl -L -s https://go.dev/dl/go${GO_VERSION}.linux-${TARGETARCH}.tar.gz \| tar -C /usr/local -xz`
			`ENV PATH=$PATH:/root/go/bin:/usr/local/go/bin`

feat: refactor build process, drop embedded backends (#5875) * feat: split remaining backends and drop embedded backends - Drop silero-vad, huggingface, and stores backend from embedded binaries - Refactor Makefile and Dockerfile to avoid building grpc backends - Drop golang code that was used to embed backends - Simplify building by using goreleaser Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore(gallery): be specific with llama-cpp backend templates Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore(docs): update Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore(ci): minor fixes Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore: drop all ffmpeg references Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix: run protogen-go Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Always enable p2p mode Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Update gorelease file Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix(stores): do not always load Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fix linting issues Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Simplify Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Mac OS fixup Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-07-22 16:31:04 +02:00			`# Install grpc compilers`
feat: :warning: reduce images size and stop bundling sources (#5721) feat: reduce images size and stop bundling sources Do not copy sources anymore, and reduce packages of the base images by not using builder images. If needed to rebuild, just build the container image from scratch by following the docs. We will slowly try to migrate all backends to the gallery to keep the core small. This PR is a breaking change, it also sets the base folders to /models and /backends instead of /build/models and /build/backends. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> 2025-06-26 18:41:38 +02:00			`RUN go install google.golang.org/protobuf/cmd/protoc-gen-go@v1.34.2 && \`
feat: refactor build process, drop embedded backends (#5875) * feat: split remaining backends and drop embedded backends - Drop silero-vad, huggingface, and stores backend from embedded binaries - Refactor Makefile and Dockerfile to avoid building grpc backends - Drop golang code that was used to embed backends - Simplify building by using goreleaser Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore(gallery): be specific with llama-cpp backend templates Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore(docs): update Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore(ci): minor fixes Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore: drop all ffmpeg references Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix: run protogen-go Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Always enable p2p mode Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Update gorelease file Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix(stores): do not always load Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fix linting issues Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Simplify Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Mac OS fixup Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-07-22 16:31:04 +02:00			`go install google.golang.org/grpc/cmd/protoc-gen-go-grpc@1958fcbe2ca8bd93af633f11e97d44e567e945af`
feat: :warning: reduce images size and stop bundling sources (#5721) feat: reduce images size and stop bundling sources Do not copy sources anymore, and reduce packages of the base images by not using builder images. If needed to rebuild, just build the container image from scratch by following the docs. We will slowly try to migrate all backends to the gallery to keep the core small. This PR is a breaking change, it also sets the base folders to /models and /backends instead of /build/models and /build/backends. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> 2025-06-26 18:41:38 +02:00
			`COPY --chmod=644 custom-ca-certs/* /usr/local/share/ca-certificates/`
			`RUN update-ca-certificates`


			`# OpenBLAS requirements and stable diffusion`
			`RUN apt-get update && \`
			`apt-get install -y --no-install-recommends \`
			`libopenblas-dev && \`
			`apt-get clean && \`
			`rm -rf /var/lib/apt/lists/*`

			`RUN test -n "$TARGETARCH" \`
			\|\| (echo 'warn: missing $TARGETARCH, either set this `ARG` manually, or run using `docker buildkit`')

			`# Use the variables in subsequent instructions`
			`RUN echo "Target Architecture: $TARGETARCH"`
			`RUN echo "Target Variant: $TARGETVARIANT"`




			`WORKDIR /build`


Docker preserve sources (#658) 2023-06-26 16:34:03 -04:00			`###################################`
			`###################################`

ci: generate specific image for intel builds (#2374) ci: fix intel images until are fixed upstream Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2024-05-22 23:35:39 +02:00			`# Temporary workaround for Intel's repository to work correctly`
			`# https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/APT-Repository-not-working-signatures-invalid/m-p/1599436/highlight/true#M36143`
			`# This is a temporary workaround until Intel fixes their repository`
			`FROM ${INTEL_BASE_IMAGE} AS intel`
			`RUN wget -qO - https://repositories.intel.com/gpu/intel-graphics.key \| \`
			`gpg --yes --dearmor --output /usr/share/keyrings/intel-graphics.gpg`
			`RUN echo "deb [arch=amd64 signed-by=/usr/share/keyrings/intel-graphics.gpg] https://repositories.intel.com/gpu/ubuntu jammy/lts/2350 unified" > /etc/apt/sources.list.d/intel-graphics.list`
feat(build): adjust number of parallel make jobs (#1915) * feat(build): adjust number of parallel make jobs * fix: update make on MacOS from brew to support --output-sync argument * fix: cache grpc with version as part of key to improve validity of cache hits * fix: use gmake for tests-apple to use the updated GNU make version * fix: actually use the new make version for tests-apple * feat: parallelize tests-extra * feat: attempt to cache grpc build for docker images * fix: don't quote GRPC version * fix: don't cache go modules, we have limited cache space, better used elsewhere * fix: release with the same version of go that we test with * fix: don't fail on exporting cache layers * fix: remove deprecated BUILD_GRPC docker arg from Makefile 2024-03-29 16:32:40 -05:00			`RUN apt-get update && \`
feat: cleanup Dockerfile and make final image a little smaller (#2146) * feat: cleanup Dockerfile and make final image a little smaller Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: add build-essential to final stage Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: more GRPC cache misses Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: correct for another cause of GRPC cache misses Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * feat: generate new GRPC cache automatically if needed Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: use new GRPC_MAKEFLAGS build arg in GRPC cache generation Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> --------- Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> 2024-04-27 12:48:20 -05:00			`apt-get install -y --no-install-recommends \`
feat: split whisper from main binary (#5863) * feat: split whisper from main binary Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Cleanup makefile Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add backend builds (missing only darwin) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Test CI Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add whisper backend to test runs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Make sure we have runtime libs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Less grpc on the main Dockerfile Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fix hipblas build Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add whisper to index Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Re-enable CI Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Adapt auto-bumper Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-07-20 22:52:45 +02:00			`intel-oneapi-runtime-libs && \`
feat(build): adjust number of parallel make jobs (#1915) * feat(build): adjust number of parallel make jobs * fix: update make on MacOS from brew to support --output-sync argument * fix: cache grpc with version as part of key to improve validity of cache hits * fix: use gmake for tests-apple to use the updated GNU make version * fix: actually use the new make version for tests-apple * feat: parallelize tests-extra * feat: attempt to cache grpc build for docker images * fix: don't quote GRPC version * fix: don't cache go modules, we have limited cache space, better used elsewhere * fix: release with the same version of go that we test with * fix: don't fail on exporting cache layers * fix: remove deprecated BUILD_GRPC docker arg from Makefile 2024-03-29 16:32:40 -05:00			`apt-get clean && \`
			`rm -rf /var/lib/apt/lists/*`

			`###################################`
			`###################################`

feat: Initial Version of vscode DevContainer (#3217) initial version of devcontainer --------- Signed-off-by: Dave Lee <dave@gray101.com> 2024-08-14 03:06:41 -04:00			`# The builder-base target has the arguments, variables, and copies shared between full builder images and the uncompiled devcontainer`

feat: :warning: reduce images size and stop bundling sources (#5721) feat: reduce images size and stop bundling sources Do not copy sources anymore, and reduce packages of the base images by not using builder images. If needed to rebuild, just build the container image from scratch by following the docs. We will slowly try to migrate all backends to the gallery to keep the core small. This PR is a breaking change, it also sets the base folders to /models and /backends instead of /build/models and /build/backends. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> 2025-06-26 18:41:38 +02:00			`FROM build-requirements AS builder-base`
fix: do not build from the same container (#434) Signed-off-by: mudler <mudler@mocaccino.org> 2023-05-30 15:53:37 +02:00
feat: refactor build process, drop embedded backends (#5875) * feat: split remaining backends and drop embedded backends - Drop silero-vad, huggingface, and stores backend from embedded binaries - Refactor Makefile and Dockerfile to avoid building grpc backends - Drop golang code that was used to embed backends - Simplify building by using goreleaser Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore(gallery): be specific with llama-cpp backend templates Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore(docs): update Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore(ci): minor fixes Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore: drop all ffmpeg references Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix: run protogen-go Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Always enable p2p mode Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Update gorelease file Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix(stores): do not always load Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fix linting issues Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Simplify Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Mac OS fixup Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-07-22 16:31:04 +02:00			`ARG GO_TAGS=""`
ci: add GPU tests (#1095) * ci: test GPU Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * ci: show logs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Debug * debug Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * split extra/core images Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * split extra/core images Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * consider runner host dir Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2023-10-19 13:50:40 +02:00			`ARG GRPC_BACKENDS`
fix(make): allow to parallelize jobs (#1845) * fix: clean up Makefile dependencies to allow for parallel builds * refactor: remove old unused backend from Makefile * fix: finish removing legacy backend, update piper * fix: I broke llama... I fixed llama * feat: give the tests and builds a few threads * fix: ensure libraries are replaced before build, add dropreplace target * Fix image build workflows 2024-03-17 09:39:20 -05:00			`ARG MAKEFLAGS`
feat: Initial Version of vscode DevContainer (#3217) initial version of devcontainer --------- Signed-off-by: Dave Lee <dave@gray101.com> 2024-08-14 03:06:41 -04:00			`ARG LD_FLAGS="-s -w"`
feat: :warning: reduce images size and stop bundling sources (#5721) feat: reduce images size and stop bundling sources Do not copy sources anymore, and reduce packages of the base images by not using builder images. If needed to rebuild, just build the container image from scratch by following the docs. We will slowly try to migrate all backends to the gallery to keep the core small. This PR is a breaking change, it also sets the base folders to /models and /backends instead of /build/models and /build/backends. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> 2025-06-26 18:41:38 +02:00			`ARG TARGETARCH`
			`ARG TARGETVARIANT`
ci: add GPU tests (#1095) * ci: test GPU Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * ci: show logs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Debug * debug Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * split extra/core images Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * split extra/core images Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * consider runner host dir Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2023-10-19 13:50:40 +02:00			`ENV GRPC_BACKENDS=${GRPC_BACKENDS}`
fix: do not build from the same container (#434) Signed-off-by: mudler <mudler@mocaccino.org> 2023-05-30 15:53:37 +02:00			`ENV GO_TAGS=${GO_TAGS}`
fix(make): allow to parallelize jobs (#1845) * fix: clean up Makefile dependencies to allow for parallel builds * refactor: remove old unused backend from Makefile * fix: finish removing legacy backend, update piper * fix: I broke llama... I fixed llama * feat: give the tests and builds a few threads * fix: ensure libraries are replaced before build, add dropreplace target * Fix image build workflows 2024-03-17 09:39:20 -05:00			`ENV MAKEFLAGS=${MAKEFLAGS}`
fix: do not build from the same container (#434) Signed-off-by: mudler <mudler@mocaccino.org> 2023-05-30 15:53:37 +02:00			`ENV NVIDIA_DRIVER_CAPABILITIES=compute,utility`
			`ENV NVIDIA_REQUIRE_CUDA="cuda>=${CUDA_MAJOR_VERSION}.0"`
			`ENV NVIDIA_VISIBLE_DEVICES=all`
feat: Initial Version of vscode DevContainer (#3217) initial version of devcontainer --------- Signed-off-by: Dave Lee <dave@gray101.com> 2024-08-14 03:06:41 -04:00			`ENV LD_FLAGS=${LD_FLAGS}`
fix: do not build from the same container (#434) Signed-off-by: mudler <mudler@mocaccino.org> 2023-05-30 15:53:37 +02:00
feat: Initial Version of vscode DevContainer (#3217) initial version of devcontainer --------- Signed-off-by: Dave Lee <dave@gray101.com> 2024-08-14 03:06:41 -04:00			`RUN echo "GO_TAGS: $GO_TAGS" && echo "TARGETARCH: $TARGETARCH"`
Docker preserve sources (#658) 2023-06-26 16:34:03 -04:00
feat: Initial Version of vscode DevContainer (#3217) initial version of devcontainer --------- Signed-off-by: Dave Lee <dave@gray101.com> 2024-08-14 03:06:41 -04:00			`WORKDIR /build`
fix: dont commit generated files to git (#1993) * fix: initial work towards not committing generated files to the repository Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * feat: improve build docs Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: remove unused folder from .dockerignore and .gitignore Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: attempt to fix extra backend tests Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: attempt to fix other tests Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: more test fixes Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: fix apple tests Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: more extras tests fixes Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: add GOBIN to PATH in docker build Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: extra tests and Dockerfile corrections Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: remove build dependency checks Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: add golang protobuf compilers to tests-linux action Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: ensure protogen is run for extra backend installs Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: use newer protobuf Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: more missing protoc binaries Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: missing dependencies during docker build Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: don't install grpc compilers in the final stage if they aren't needed Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: python-grpc-tools in 22.04 repos is too old Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: add a couple of extra build dependencies to Makefile Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: unbreak container rebuild functionality Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> --------- Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> 2024-04-13 02:37:32 -05:00
fix: copy git to correctly display version in /version 2023-07-02 11:14:09 +02:00
feat: split whisper from main binary (#5863) * feat: split whisper from main binary Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Cleanup makefile Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add backend builds (missing only darwin) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Test CI Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add whisper backend to test runs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Make sure we have runtime libs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Less grpc on the main Dockerfile Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fix hipblas build Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add whisper to index Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Re-enable CI Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Adapt auto-bumper Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-07-20 22:52:45 +02:00			`# We need protoc installed, and the version in 22.04 is too old.`
feat(build): add arm64 core containers (#2421) ci: add arm64 container images Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2024-05-28 10:34:59 +02:00			`RUN <<EOT bash`
			`if [ "amd64" = "$TARGETARCH" ]; then`
chore(deps): Update Dockerfile (#2532) Signed-off-by: Rene Leonhardt <65483435+reneleonhardt@users.noreply.github.com> 2024-06-10 10:40:02 +02:00			`curl -L -s https://github.com/protocolbuffers/protobuf/releases/download/v27.1/protoc-27.1-linux-x86_64.zip -o protoc.zip && \`
feat(build): add arm64 core containers (#2421) ci: add arm64 container images Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2024-05-28 10:34:59 +02:00			`unzip -j -d /usr/local/bin protoc.zip bin/protoc && \`
			`rm protoc.zip`
			`fi`
			`if [ "arm64" = "$TARGETARCH" ]; then`
chore(deps): Update Dockerfile (#2532) Signed-off-by: Rene Leonhardt <65483435+reneleonhardt@users.noreply.github.com> 2024-06-10 10:40:02 +02:00			`curl -L -s https://github.com/protocolbuffers/protobuf/releases/download/v27.1/protoc-27.1-linux-aarch_64.zip -o protoc.zip && \`
feat(build): add arm64 core containers (#2421) ci: add arm64 container images Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2024-05-28 10:34:59 +02:00			`unzip -j -d /usr/local/bin protoc.zip bin/protoc && \`
			`rm protoc.zip`
			`fi`
			`EOT`
feat: cleanup Dockerfile and make final image a little smaller (#2146) * feat: cleanup Dockerfile and make final image a little smaller Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: add build-essential to final stage Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: more GRPC cache misses Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: correct for another cause of GRPC cache misses Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * feat: generate new GRPC cache automatically if needed Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: use new GRPC_MAKEFLAGS build arg in GRPC cache generation Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> --------- Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> 2024-04-27 12:48:20 -05:00
feat: devcontainer part 3 (#3318) * stash initial fixes, attempt to open branch inside container Signed-off-by: Dave Lee <dave@gray101.com> * add yq, from inside DC Signed-off-by: Dave Lee <dave@gray101.com> * stash progress, rebuild container Signed-off-by: Dave Lee <dave@gray101.com> * snap Signed-off-by: Dave Lee <dave@gray101.com> * split builder into builder-sd, will speed up devcontainer build times and potentially help caching in other situations. Signed-off-by: Dave Lee <dave@gray101.com> * fix yq Signed-off-by: Dave Lee <dave@gray101.com> * fix paths Signed-off-by: Dave Lee <dave@gray101.com> * fix paths - new folder to bypass the .dockerignore which _should_ exclude the other files Signed-off-by: Dave Lee <dave@gray101.com> * fix Signed-off-by: Dave Lee <dave@gray101.com> * fix ] Signed-off-by: Dave Lee <dave@gray101.com> --------- Signed-off-by: Dave Lee <dave@gray101.com> 2024-08-20 06:16:21 -04:00			`###################################`
			`###################################`

feat: Realtime API support reboot (#5392) * feat(realtime): Initial Realtime API implementation Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore: go mod tidy Signed-off-by: Richard Palethorpe <io@richiejp.com> * feat: Implement transcription only mode for realtime API Reduce the scope of the real time API for the initial realease and make transcription only mode functional. Signed-off-by: Richard Palethorpe <io@richiejp.com> * chore(build): Build backends on a separate layer to speed up core only changes Signed-off-by: Richard Palethorpe <io@richiejp.com> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: Richard Palethorpe <io@richiejp.com> Co-authored-by: Ettore Di Giacinto <mudler@localai.io> 2025-05-25 21:25:05 +01:00			`# Compile backends first in a separate stage`
			`FROM builder-base AS builder-backends`
feat: :warning: reduce images size and stop bundling sources (#5721) feat: reduce images size and stop bundling sources Do not copy sources anymore, and reduce packages of the base images by not using builder images. If needed to rebuild, just build the container image from scratch by following the docs. We will slowly try to migrate all backends to the gallery to keep the core small. This PR is a breaking change, it also sets the base folders to /models and /backends instead of /build/models and /build/backends. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> 2025-06-26 18:41:38 +02:00			`ARG TARGETARCH`
			`ARG TARGETVARIANT`
feat: devcontainer part 3 (#3318) * stash initial fixes, attempt to open branch inside container Signed-off-by: Dave Lee <dave@gray101.com> * add yq, from inside DC Signed-off-by: Dave Lee <dave@gray101.com> * stash progress, rebuild container Signed-off-by: Dave Lee <dave@gray101.com> * snap Signed-off-by: Dave Lee <dave@gray101.com> * split builder into builder-sd, will speed up devcontainer build times and potentially help caching in other situations. Signed-off-by: Dave Lee <dave@gray101.com> * fix yq Signed-off-by: Dave Lee <dave@gray101.com> * fix paths Signed-off-by: Dave Lee <dave@gray101.com> * fix paths - new folder to bypass the .dockerignore which _should_ exclude the other files Signed-off-by: Dave Lee <dave@gray101.com> * fix Signed-off-by: Dave Lee <dave@gray101.com> * fix ] Signed-off-by: Dave Lee <dave@gray101.com> --------- Signed-off-by: Dave Lee <dave@gray101.com> 2024-08-20 06:16:21 -04:00
Fix cleanup sonarqube findings (#2106) * fix: update dockerignore and gitignore to exclude sonarqube work dir Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: remove useless equality check Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: use sonarqube Dockerfile recommendations Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> --------- Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> 2024-04-23 11:43:00 -05:00			`WORKDIR /build`
fix(initializer): do select backends that exist (#2694) we were not checking if the binary exists before picking these up from the asset dir. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2024-07-01 22:50:36 +02:00
feat: Realtime API support reboot (#5392) * feat(realtime): Initial Realtime API implementation Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore: go mod tidy Signed-off-by: Richard Palethorpe <io@richiejp.com> * feat: Implement transcription only mode for realtime API Reduce the scope of the real time API for the initial realease and make transcription only mode functional. Signed-off-by: Richard Palethorpe <io@richiejp.com> * chore(build): Build backends on a separate layer to speed up core only changes Signed-off-by: Richard Palethorpe <io@richiejp.com> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: Richard Palethorpe <io@richiejp.com> Co-authored-by: Ettore Di Giacinto <mudler@localai.io> 2025-05-25 21:25:05 +01:00			`COPY ./Makefile .`
			`COPY ./backend ./backend`
			`COPY ./go.mod .`
			`COPY ./go.sum .`
			`COPY ./.git ./.git`

			`# Some of the Go backends use libs from the main src, we could further optimize the caching by building the CPP backends before here`
			`COPY ./pkg/grpc ./pkg/grpc`
			`COPY ./pkg/utils ./pkg/utils`
			`COPY ./pkg/langchain ./pkg/langchain`
fix: speedup and improve cachability of docker build of `builder-sd` (#3430) fix: speedup and improve cachability of docker build of `builder-sd` (#3430) --------- Signed-off-by: Dave Lee <dave@gray101.com> 2024-09-10 02:57:16 -04:00
feat: Realtime API support reboot (#5392) * feat(realtime): Initial Realtime API implementation Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore: go mod tidy Signed-off-by: Richard Palethorpe <io@richiejp.com> * feat: Implement transcription only mode for realtime API Reduce the scope of the real time API for the initial realease and make transcription only mode functional. Signed-off-by: Richard Palethorpe <io@richiejp.com> * chore(build): Build backends on a separate layer to speed up core only changes Signed-off-by: Richard Palethorpe <io@richiejp.com> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: Richard Palethorpe <io@richiejp.com> Co-authored-by: Ettore Di Giacinto <mudler@localai.io> 2025-05-25 21:25:05 +01:00			`RUN ls -l ./`
feat: refactor build process, drop embedded backends (#5875) * feat: split remaining backends and drop embedded backends - Drop silero-vad, huggingface, and stores backend from embedded binaries - Refactor Makefile and Dockerfile to avoid building grpc backends - Drop golang code that was used to embed backends - Simplify building by using goreleaser Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore(gallery): be specific with llama-cpp backend templates Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore(docs): update Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore(ci): minor fixes Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore: drop all ffmpeg references Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix: run protogen-go Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Always enable p2p mode Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Update gorelease file Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix(stores): do not always load Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fix linting issues Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Simplify Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Mac OS fixup Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-07-22 16:31:04 +02:00			`RUN make protogen-go`
feat: Realtime API support reboot (#5392) * feat(realtime): Initial Realtime API implementation Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore: go mod tidy Signed-off-by: Richard Palethorpe <io@richiejp.com> * feat: Implement transcription only mode for realtime API Reduce the scope of the real time API for the initial realease and make transcription only mode functional. Signed-off-by: Richard Palethorpe <io@richiejp.com> * chore(build): Build backends on a separate layer to speed up core only changes Signed-off-by: Richard Palethorpe <io@richiejp.com> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: Richard Palethorpe <io@richiejp.com> Co-authored-by: Ettore Di Giacinto <mudler@localai.io> 2025-05-25 21:25:05 +01:00
			`# The builder target compiles LocalAI. This target is not the target that will be uploaded to the registry.`
			`# Adjustments to the build process should likely be made here.`
			`FROM builder-backends AS builder`

			`WORKDIR /build`

			`COPY . .`
fix: speedup and improve cachability of docker build of `builder-sd` (#3430) fix: speedup and improve cachability of docker build of `builder-sd` (#3430) --------- Signed-off-by: Dave Lee <dave@gray101.com> 2024-09-10 02:57:16 -04:00
fix(initializer): do select backends that exist (#2694) we were not checking if the binary exists before picking these up from the asset dir. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2024-07-01 22:50:36 +02:00			`## Build the binary`
fix(arm64): do not build instructions which are not available (#5318) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-05-05 17:30:00 +02:00			`## If we're on arm64 AND using cublas/hipblas, skip some of the llama-compat backends to save space`
			`## Otherwise just run the normal build`
feat: do not bundle llama-cpp anymore (#5790) * Build llama.cpp separately Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * WIP Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * WIP Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * WIP Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Start to try to attach some tests Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add git and small fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix: correctly autoload external backends Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Try to run AIO tests Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Slightly update the Makefile helps Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Adapt auto-bumper Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Try to run linux test Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add llama-cpp into build pipelines Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add default capability (for cpu) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Drop llama-cpp specific logic from the backend loader Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * drop grpc install in ci for tests Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Pass by backends path for tests Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Build protogen at start Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix(tests): set backends path consistently Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Correctly configure the backends path Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Try to build for darwin Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * WIP Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Compile for metal on arm64/darwin Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Try to run build off from cross-arch Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add to the backend index nvidia-l4t and cpu's llama-cpp backends Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Build also darwin-x86 for llama-cpp Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Disable arm64 builds temporary Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Test backend build on PR Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fixup build backend reusable workflow Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * pass by skip drivers Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Use crane Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Skip drivers Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * x86 darwin Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add packaging step for llama.cpp Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fix leftover from bark-cpp extraction Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Try to fix hipblas build Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2025-07-18 13:24:12 +02:00			`RUN make build`
fix: do not build from the same container (#434) Signed-off-by: mudler <mudler@mocaccino.org> 2023-05-30 15:53:37 +02:00
Docker preserve sources (#658) 2023-06-26 16:34:03 -04:00			`###################################`
			`###################################`

feat: Initial Version of vscode DevContainer (#3217) initial version of devcontainer --------- Signed-off-by: Dave Lee <dave@gray101.com> 2024-08-14 03:06:41 -04:00			`# The devcontainer target is not used on CI. It is a target for developers to use locally -`
			`# rather than copying files it mounts them locally and leaves building to the developer`

			`FROM builder-base AS devcontainer`

feat: devcontainer part 3 (#3318) * stash initial fixes, attempt to open branch inside container Signed-off-by: Dave Lee <dave@gray101.com> * add yq, from inside DC Signed-off-by: Dave Lee <dave@gray101.com> * stash progress, rebuild container Signed-off-by: Dave Lee <dave@gray101.com> * snap Signed-off-by: Dave Lee <dave@gray101.com> * split builder into builder-sd, will speed up devcontainer build times and potentially help caching in other situations. Signed-off-by: Dave Lee <dave@gray101.com> * fix yq Signed-off-by: Dave Lee <dave@gray101.com> * fix paths Signed-off-by: Dave Lee <dave@gray101.com> * fix paths - new folder to bypass the .dockerignore which _should_ exclude the other files Signed-off-by: Dave Lee <dave@gray101.com> * fix Signed-off-by: Dave Lee <dave@gray101.com> * fix ] Signed-off-by: Dave Lee <dave@gray101.com> --------- Signed-off-by: Dave Lee <dave@gray101.com> 2024-08-20 06:16:21 -04:00			`COPY .devcontainer-scripts /.devcontainer-scripts`
feat: Initial Version of vscode DevContainer (#3217) initial version of devcontainer --------- Signed-off-by: Dave Lee <dave@gray101.com> 2024-08-14 03:06:41 -04:00
feat: devcontainer part 3 (#3318) * stash initial fixes, attempt to open branch inside container Signed-off-by: Dave Lee <dave@gray101.com> * add yq, from inside DC Signed-off-by: Dave Lee <dave@gray101.com> * stash progress, rebuild container Signed-off-by: Dave Lee <dave@gray101.com> * snap Signed-off-by: Dave Lee <dave@gray101.com> * split builder into builder-sd, will speed up devcontainer build times and potentially help caching in other situations. Signed-off-by: Dave Lee <dave@gray101.com> * fix yq Signed-off-by: Dave Lee <dave@gray101.com> * fix paths Signed-off-by: Dave Lee <dave@gray101.com> * fix paths - new folder to bypass the .dockerignore which _should_ exclude the other files Signed-off-by: Dave Lee <dave@gray101.com> * fix Signed-off-by: Dave Lee <dave@gray101.com> * fix ] Signed-off-by: Dave Lee <dave@gray101.com> --------- Signed-off-by: Dave Lee <dave@gray101.com> 2024-08-20 06:16:21 -04:00			`RUN apt-get update && \`
			`apt-get install -y --no-install-recommends \`
feat: :warning: reduce images size and stop bundling sources (#5721) feat: reduce images size and stop bundling sources Do not copy sources anymore, and reduce packages of the base images by not using builder images. If needed to rebuild, just build the container image from scratch by following the docs. We will slowly try to migrate all backends to the gallery to keep the core small. This PR is a breaking change, it also sets the base folders to /models and /backends instead of /build/models and /build/backends. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> 2025-06-26 18:41:38 +02:00			`ssh less`
test: preliminary tests and merge fix for authv2 (#3584) * add api key to existing app tests, add preliminary auth test Signed-off-by: Dave Lee <dave@gray101.com> * small fix, run test Signed-off-by: Dave Lee <dave@gray101.com> * status on non-opaque Signed-off-by: Dave Lee <dave@gray101.com> * tweak auth error Signed-off-by: Dave Lee <dave@gray101.com> * exp Signed-off-by: Dave Lee <dave@gray101.com> * quick fix on real laptop Signed-off-by: Dave Lee <dave@gray101.com> * add downloader version that allows providing an auth header Signed-off-by: Dave Lee <dave@gray101.com> * stash some devcontainer fixes during testing Signed-off-by: Dave Lee <dave@gray101.com> * s2 Signed-off-by: Dave Lee <dave@gray101.com> * s Signed-off-by: Dave Lee <dave@gray101.com> * done with experiment Signed-off-by: Dave Lee <dave@gray101.com> * done with experiment Signed-off-by: Dave Lee <dave@gray101.com> * after merge fix Signed-off-by: Dave Lee <dave@gray101.com> * rename and fix Signed-off-by: Dave Lee <dave@gray101.com> --------- Signed-off-by: Dave Lee <dave@gray101.com> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com> 2024-09-24 03:32:48 -04:00			`# For the devcontainer, leave apt functional in case additional devtools are needed at runtime.`
feat: devcontainer part 3 (#3318) * stash initial fixes, attempt to open branch inside container Signed-off-by: Dave Lee <dave@gray101.com> * add yq, from inside DC Signed-off-by: Dave Lee <dave@gray101.com> * stash progress, rebuild container Signed-off-by: Dave Lee <dave@gray101.com> * snap Signed-off-by: Dave Lee <dave@gray101.com> * split builder into builder-sd, will speed up devcontainer build times and potentially help caching in other situations. Signed-off-by: Dave Lee <dave@gray101.com> * fix yq Signed-off-by: Dave Lee <dave@gray101.com> * fix paths Signed-off-by: Dave Lee <dave@gray101.com> * fix paths - new folder to bypass the .dockerignore which _should_ exclude the other files Signed-off-by: Dave Lee <dave@gray101.com> * fix Signed-off-by: Dave Lee <dave@gray101.com> * fix ] Signed-off-by: Dave Lee <dave@gray101.com> --------- Signed-off-by: Dave Lee <dave@gray101.com> 2024-08-20 06:16:21 -04:00
feat: Initial Version of vscode DevContainer (#3217) initial version of devcontainer --------- Signed-off-by: Dave Lee <dave@gray101.com> 2024-08-14 03:06:41 -04:00			`RUN go install github.com/go-delve/delve/cmd/dlv@latest`

feat: devcontainer part 3 (#3318) * stash initial fixes, attempt to open branch inside container Signed-off-by: Dave Lee <dave@gray101.com> * add yq, from inside DC Signed-off-by: Dave Lee <dave@gray101.com> * stash progress, rebuild container Signed-off-by: Dave Lee <dave@gray101.com> * snap Signed-off-by: Dave Lee <dave@gray101.com> * split builder into builder-sd, will speed up devcontainer build times and potentially help caching in other situations. Signed-off-by: Dave Lee <dave@gray101.com> * fix yq Signed-off-by: Dave Lee <dave@gray101.com> * fix paths Signed-off-by: Dave Lee <dave@gray101.com> * fix paths - new folder to bypass the .dockerignore which _should_ exclude the other files Signed-off-by: Dave Lee <dave@gray101.com> * fix Signed-off-by: Dave Lee <dave@gray101.com> * fix ] Signed-off-by: Dave Lee <dave@gray101.com> --------- Signed-off-by: Dave Lee <dave@gray101.com> 2024-08-20 06:16:21 -04:00			`RUN go install github.com/mikefarah/yq/v4@latest`

feat: Initial Version of vscode DevContainer (#3217) initial version of devcontainer --------- Signed-off-by: Dave Lee <dave@gray101.com> 2024-08-14 03:06:41 -04:00			`###################################`
			`###################################`

feat: organize Dockerfile into distinct sections (#2181) Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> 2024-04-30 03:12:19 -05:00			`# This is the final target. The result of this target will be the image uploaded to the registry.`
			`# If you cannot find a more suitable place for an addition, this layer is a suitable place for it.`
			`FROM requirements-drivers`
Dockerfile: unify duplicated requirements into single step (#580) 2023-06-13 01:39:38 -05:00
			`ENV HEALTHCHECK_ENDPOINT=http://localhost:8080/readyz`
feat: add ffmpeg images (#492) Signed-off-by: mudler <mudler@mocaccino.org> 2023-06-04 14:00:21 +02:00
feat: Upgrade to CUDA 12.5 (#2601) Signed-off-by: Rene Leonhardt <65483435+reneleonhardt@users.noreply.github.com> 2024-06-19 17:50:49 +02:00			`ARG CUDA_MAJOR_VERSION=12`
ci: add GPU tests (#1095) * ci: test GPU Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * ci: show logs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Debug * debug Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * split extra/core images Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * split extra/core images Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * consider runner host dir Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2023-10-19 13:50:40 +02:00			`ENV NVIDIA_DRIVER_CAPABILITIES=compute,utility`
			`ENV NVIDIA_REQUIRE_CUDA="cuda>=${CUDA_MAJOR_VERSION}.0"`
			`ENV NVIDIA_VISIBLE_DEVICES=all`

feat: :warning: reduce images size and stop bundling sources (#5721) feat: reduce images size and stop bundling sources Do not copy sources anymore, and reduce packages of the base images by not using builder images. If needed to rebuild, just build the container image from scratch by following the docs. We will slowly try to migrate all backends to the gallery to keep the core small. This PR is a breaking change, it also sets the base folders to /models and /backends instead of /build/models and /build/backends. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> 2025-06-26 18:41:38 +02:00			`WORKDIR /`
feat: add ffmpeg images (#492) Signed-off-by: mudler <mudler@mocaccino.org> 2023-06-04 14:00:21 +02:00
feat: :warning: reduce images size and stop bundling sources (#5721) feat: reduce images size and stop bundling sources Do not copy sources anymore, and reduce packages of the base images by not using builder images. If needed to rebuild, just build the container image from scratch by following the docs. We will slowly try to migrate all backends to the gallery to keep the core small. This PR is a breaking change, it also sets the base folders to /models and /backends instead of /build/models and /build/backends. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> 2025-06-26 18:41:38 +02:00			`COPY ./entrypoint.sh .`
fix: handle grpc and llama-cpp with REBUILD=true (#1328) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2023-11-25 08:48:24 +01:00
feat: llama.cpp gRPC C++ backend (#1170) * wip: llama.cpp c++ gRPC server Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * make it work, attach it to the build process Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * update deps Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix: add protobuf dep Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * try fix protobuf on cmake * cmake: workarounds Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * add packages * cmake: use fixed version of grpc Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * cmake(grpc): install locally * install grpc Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * install required deps for grpc on debian bullseye Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * debug * debug * Fixups * no need to install cmake manually Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * ci: fixup macOS * use brew whenever possible Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * macOS fixups * debug * fix container build Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * workaround * try mac https://stackoverflow.com/questions/23905661/on-mac-g-clang-fails-to-search-usr-local-include-and-usr-local-lib-by-def * Disable temp. arm64 docker image builds --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2023-10-16 21:46:29 +02:00			`# Copy the binary`
fix: do not build from the same container (#434) Signed-off-by: mudler <mudler@mocaccino.org> 2023-05-30 15:53:37 +02:00			`COPY --from=builder /build/local-ai ./`
feat: add vall-e-x (#1007) Description This PR fixes #985 Notes for Reviewers [Signed commits](../CONTRIBUTING.md#signing-off-on-commits-developer-certificate-of-origin) - [ ] Yes, I signed my commits. <!-- Thank you for contributing to LocalAI! Contributing Conventions: 1. Include descriptive PR titles with [<component-name>] prepended. 2. Build and test your changes before submitting a PR. 3. Sign your commits By following the community's contribution conventions upfront, the review process will be accelerated and your PR merged more quickly. --> Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2023-09-04 19:25:23 +02:00
Update Dockerfile Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> 2024-01-09 08:55:43 +01:00			`# Make sure the models directory exists`
feat: :warning: reduce images size and stop bundling sources (#5721) feat: reduce images size and stop bundling sources Do not copy sources anymore, and reduce packages of the base images by not using builder images. If needed to rebuild, just build the container image from scratch by following the docs. We will slowly try to migrate all backends to the gallery to keep the core small. This PR is a breaking change, it also sets the base folders to /models and /backends instead of /build/models and /build/backends. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> 2025-06-26 18:41:38 +02:00			`RUN mkdir -p /models /backends`
Update Dockerfile Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> 2024-01-09 08:55:43 +01:00
image: add HEALTHCHECK (#388) Signed-off-by: mudler <mudler@mocaccino.org> 2023-05-26 18:34:02 +02:00			`# Define the health check command`
fix: correctly handle errors from App constructor (#430) Signed-off-by: mudler <mudler@mocaccino.org> 2023-05-30 12:00:30 +02:00			`HEALTHCHECK --interval=1m --timeout=10m --retries=10 \`
Fix cleanup sonarqube findings (#2106) * fix: update dockerignore and gitignore to exclude sonarqube work dir Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: remove useless equality check Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: use sonarqube Dockerfile recommendations Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> --------- Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> 2024-04-23 11:43:00 -05:00			`CMD curl -f ${HEALTHCHECK_ENDPOINT} \|\| exit 1`
fix: gpu fetch device info (#2403) * fix: gpu fetch device info Signed-off-by: Sertac Ozercan <sozercan@gmail.com> * use pciutils package Signed-off-by: Sertac Ozercan <sozercan@gmail.com> --------- Signed-off-by: Sertac Ozercan <sozercan@gmail.com> 2024-05-26 00:56:06 -07:00
feat: :warning: reduce images size and stop bundling sources (#5721) feat: reduce images size and stop bundling sources Do not copy sources anymore, and reduce packages of the base images by not using builder images. If needed to rebuild, just build the container image from scratch by following the docs. We will slowly try to migrate all backends to the gallery to keep the core small. This PR is a breaking change, it also sets the base folders to /models and /backends instead of /build/models and /build/backends. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> 2025-06-26 18:41:38 +02:00			`VOLUME /models /backends`
Add EXPOSE to Dockerfile (#107) 2023-04-27 18:45:24 +02:00			`EXPOSE 8080`
feat: :warning: reduce images size and stop bundling sources (#5721) feat: reduce images size and stop bundling sources Do not copy sources anymore, and reduce packages of the base images by not using builder images. If needed to rebuild, just build the container image from scratch by following the docs. We will slowly try to migrate all backends to the gallery to keep the core small. This PR is a breaking change, it also sets the base folders to /models and /backends instead of /build/models and /build/backends. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> 2025-06-26 18:41:38 +02:00			`ENTRYPOINT [ "/entrypoint.sh" ]`