module github.com/mudler/LocalAI

go 1.23.8

toolchain go1.24.5

require (
	dario.cat/mergo v1.0.1
	github.com/Masterminds/sprig/v3 v3.3.0
	github.com/alecthomas/kong v0.9.0
	github.com/charmbracelet/glamour v0.7.0
	github.com/chasefleming/elem-go v0.26.0
	github.com/containerd/containerd v1.7.19
	github.com/dave-gray101/v2keyauth v0.0.0-20240624150259-c45d584d25e2
	github.com/fsnotify/fsnotify v1.7.0
	github.com/ggerganov/whisper.cpp/bindings/go v0.0.0-20240626202019-c118733a29ad
	github.com/go-audio/wav v1.1.0
	github.com/go-skynet/go-llama.cpp v0.0.0-20240314183750-6a8041ef6b46
	github.com/gofiber/fiber/v2 v2.52.5
	github.com/gofiber/swagger v1.0.0
	github.com/gofiber/template/html/v2 v2.1.2
	github.com/gofiber/websocket/v2 v2.2.1
	github.com/gofrs/flock v0.12.1
	github.com/google/go-containerregistry v0.19.2
	github.com/google/uuid v1.6.0
	github.com/gpustack/gguf-parser-go v0.17.0
	github.com/hpcloud/tail v1.0.0
	github.com/ipfs/go-log v1.0.5
	github.com/jaypipes/ghw v0.12.0
	github.com/joho/godotenv v1.5.1
	github.com/klauspost/cpuid/v2 v2.2.10
	github.com/libp2p/go-libp2p v0.43.0
	github.com/mholt/archiver/v3 v3.5.1
	github.com/microcosm-cc/bluemonday v1.0.26
	github.com/mudler/edgevpn v0.31.0
	github.com/mudler/go-processmanager v0.0.0-20240820160718-8b802d3ecf82
	github.com/nikolalohinski/gonja/v2 v2.3.2
	github.com/onsi/ginkgo/v2 v2.23.3
	github.com/onsi/gomega v1.36.2
	github.com/otiai10/copy v1.14.1
	github.com/otiai10/openaigo v1.7.0
	github.com/phayes/freeport v0.0.0-20220201140144-74d24b5ae9f5
	github.com/prometheus/client_golang v1.22.0
	github.com/rs/zerolog v1.33.0
	github.com/russross/blackfriday v1.6.0
	github.com/sashabaranov/go-openai v1.26.2
	github.com/schollz/progressbar/v3 v3.14.4
	github.com/shirou/gopsutil/v3 v3.24.5
	github.com/streamer45/silero-vad-go v0.2.1
	github.com/stretchr/testify v1.10.0
	github.com/swaggo/swag v1.16.3
	github.com/testcontainers/testcontainers-go v0.35.0
	github.com/tmc/langchaingo v0.1.12
	github.com/valyala/fasthttp v1.55.0
	go.opentelemetry.io/otel v1.35.0
	go.opentelemetry.io/otel/exporters/prometheus v0.50.0
	go.opentelemetry.io/otel/metric v1.35.0
	go.opentelemetry.io/otel/sdk/metric v1.28.0
	google.golang.org/grpc v1.67.1
	google.golang.org/protobuf v1.36.6
	gopkg.in/yaml.v2 v2.4.0
	gopkg.in/yaml.v3 v3.0.1
	oras.land/oras-go/v2 v2.5.0
)
require (
	github.com/containerd/platforms v0.2.1 // indirect
	github.com/cpuguy83/dockercfg v0.3.2 // indirect
	github.com/distribution/reference v0.6.0 // indirect
	github.com/dustin/go-humanize v1.0.1 // indirect
	github.com/fasthttp/websocket v1.5.8 // indirect
	github.com/felixge/httpsnoop v1.0.4 // indirect
	github.com/go-task/slim-sprig/v3 v3.0.0 // indirect
	github.com/json-iterator/go v1.1.12 // indirect
	github.com/libp2p/go-yamux/v5 v5.0.1 // indirect
	github.com/magiconair/properties v1.8.7 // indirect
	github.com/moby/docker-image-spec v1.3.1 // indirect
	github.com/moby/patternmatcher v0.6.0 // indirect
	github.com/moby/sys/user v0.1.0 // indirect
	github.com/modern-go/concurrent v0.0.0-20180306012644-bacd9c7ef1dd // indirect
	github.com/modern-go/reflect2 v1.0.2 // indirect
	github.com/morikuni/aec v1.0.0 // indirect
	github.com/munnerz/goautoneg v0.0.0-20191010083416-a7dc8b61c822 // indirect
	github.com/otiai10/mint v1.6.3 // indirect
	github.com/pion/datachannel v1.5.10 // indirect
	github.com/pion/dtls/v2 v2.2.12 // indirect
	github.com/pion/dtls/v3 v3.0.6 // indirect
	github.com/pion/ice/v4 v4.0.10 // indirect
	github.com/pion/interceptor v0.1.40 // indirect
	github.com/pion/logging v0.2.3 // indirect
	github.com/pion/mdns/v2 v2.0.7 // indirect
	github.com/pion/randutil v0.1.0 // indirect
	github.com/pion/rtcp v1.2.15 // indirect
	github.com/pion/rtp v1.8.19 // indirect
	github.com/pion/sctp v1.8.39 // indirect
	github.com/pion/sdp/v3 v3.0.13 // indirect
	github.com/pion/srtp/v3 v3.0.6 // indirect
	github.com/pion/stun v0.6.1 // indirect
	github.com/pion/stun/v3 v3.0.0 // indirect
	github.com/pion/transport/v2 v2.2.10 // indirect
	github.com/pion/transport/v3 v3.0.7 // indirect
	github.com/pion/turn/v4 v4.0.2 // indirect
	github.com/pion/webrtc/v4 v4.1.2 // indirect
	github.com/rs/dnscache v0.0.0-20230804202142-fc85eb664529 // indirect
	github.com/savsgio/gotils v0.0.0-20240303185622-093b76447511 // indirect
	github.com/shirou/gopsutil/v4 v4.24.7 // indirect
	github.com/wlynxg/anet v0.0.5 // indirect
	go.opentelemetry.io/auto/sdk v1.1.0 // indirect
	go.opentelemetry.io/contrib/instrumentation/net/http/otelhttp v0.56.0 // indirect
	go.uber.org/mock v0.5.2 // indirect
	golang.org/x/time v0.12.0 // indirect
)
require (
github . com / Azure / go - ansiterm v0 . 0.0 - 20230124172434 - 306776 ec8161 / / indirect
2024-06-23 01:24:36 -07:00
github . com / KyleBanks / depth v1 . 2.1 / / indirect
github . com / Masterminds / goutils v1 . 1.1 / / indirect
2024-11-20 17:18:52 +04:00
github . com / Masterminds / semver / v3 v3 . 3.0 / / indirect
2024-06-23 01:24:36 -07:00
github . com / Microsoft / go - winio v0 . 6.2 / / indirect
2024-07-12 21:54:08 +02:00
github . com / Microsoft / hcsshim v0 . 11.7 / / indirect
2024-06-23 01:24:36 -07:00
github . com / StackExchange / wmi v1 . 2.1 / / indirect
github . com / alecthomas / chroma / v2 v2 . 8.0 / / indirect
2024-07-12 21:54:08 +02:00
github . com / andybalholm / brotli v1 . 1.0 / / indirect
2024-06-23 01:24:36 -07:00
github . com / aymanbagabas / go - osc52 / v2 v2 . 0.1 / / indirect
github . com / aymerick / douceur v0 . 2.0 / / indirect
feat(llama.cpp): Totally decentralized, private, distributed, p2p inference (#2343)
* feat(llama.cpp): Enable decentralized, distributed inference
As https://github.com/mudler/LocalAI/pull/2324 introduced distributed inferencing thanks to
@rgerganov implementation in https://github.com/ggerganov/llama.cpp/pull/6829 in upstream llama.cpp, now
it is possible to distribute the workload to remote llama.cpp gRPC server.
This changeset now uses mudler/edgevpn to establish a secure, distributed network between the nodes using a shared token.
The token is generated automatically when starting the server with the `--p2p` flag, and can be used by starting the workers
with `local-ai worker p2p-llama-cpp-rpc` by passing the token via environment variable (TOKEN) or with args (--token).
As per how mudler/edgevpn works, a network is established between the server and the workers with dht and mdns discovery protocols,
the llama.cpp rpc server is automatically started and exposed to the underlying p2p network so the API server can connect on.
When the HTTP server is started, it will discover the workers in the network and automatically create the port-forwards to the service locally.
Then llama.cpp is configured to use the services.
This feature is behind the "p2p" GO_FLAGS
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* go mod tidy
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* ci: add p2p tag
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* better message
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2024-05-20 19:17:59 +02:00
github . com / benbjohnson / clock v1 . 3.5 / / indirect
2024-06-23 01:24:36 -07:00
github . com / beorn7 / perks v1 . 0.1 / / indirect
feat(llama.cpp): Totally decentralized, private, distributed, p2p inference (#2343)
* feat(llama.cpp): Enable decentralized, distributed inference
As https://github.com/mudler/LocalAI/pull/2324 introduced distributed inferencing thanks to
@rgerganov implementation in https://github.com/ggerganov/llama.cpp/pull/6829 in upstream llama.cpp, now
it is possible to distribute the workload to remote llama.cpp gRPC server.
This changeset now uses mudler/edgevpn to establish a secure, distributed network between the nodes using a shared token.
The token is generated automatically when starting the server with the `--p2p` flag, and can be used by starting the workers
with `local-ai worker p2p-llama-cpp-rpc` by passing the token via environment variable (TOKEN) or with args (--token).
As per how mudler/edgevpn works, a network is established between the server and the workers with dht and mdns discovery protocols,
the llama.cpp rpc server is automatically started and exposed to the underlying p2p network so the API server can connect on.
When the HTTP server is started, it will discover the workers in the network and automatically create the port-forwards to the service locally.
Then llama.cpp is configured to use the services.
This feature is behind the "p2p" GO_FLAGS
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* go mod tidy
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* ci: add p2p tag
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* better message
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2024-05-20 19:17:59 +02:00
github . com / c - robinson / iplib v1 . 0.8 / / indirect
2024-06-23 01:24:36 -07:00
github . com / cenkalti / backoff / v4 v4 . 3.0 / / indirect
2024-07-12 21:54:08 +02:00
github . com / cespare / xxhash / v2 v2 . 3.0 / / indirect
feat(llama.cpp): Totally decentralized, private, distributed, p2p inference (#2343)
* feat(llama.cpp): Enable decentralized, distributed inference
As https://github.com/mudler/LocalAI/pull/2324 introduced distributed inferencing thanks to
@rgerganov implementation in https://github.com/ggerganov/llama.cpp/pull/6829 in upstream llama.cpp, now
it is possible to distribute the workload to remote llama.cpp gRPC server.
This changeset now uses mudler/edgevpn to establish a secure, distributed network between the nodes using a shared token.
The token is generated automatically when starting the server with the `--p2p` flag, and can be used by starting the workers
with `local-ai worker p2p-llama-cpp-rpc` by passing the token via environment variable (TOKEN) or with args (--token).
As per how mudler/edgevpn works, a network is established between the server and the workers with dht and mdns discovery protocols,
the llama.cpp rpc server is automatically started and exposed to the underlying p2p network so the API server can connect on.
When the HTTP server is started, it will discover the workers in the network and automatically create the port-forwards to the service locally.
Then llama.cpp is configured to use the services.
This feature is behind the "p2p" GO_FLAGS
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* go mod tidy
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* ci: add p2p tag
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* better message
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2024-05-20 19:17:59 +02:00
github . com / containerd / cgroups v1 . 1.0 / / indirect
2024-06-23 01:24:36 -07:00
github . com / containerd / continuity v0 . 4.2 / / indirect
2024-06-22 08:17:41 +02:00
github . com / containerd / errdefs v0 . 1.0 / / indirect
github . com / containerd / log v0 . 1.0 / / indirect
github . com / containerd / stargz - snapshotter / estargz v0 . 14.3 / / indirect
2024-11-20 17:18:52 +04:00
github . com / creachadair / otp v0 . 5.0 / / indirect
2024-06-23 01:24:36 -07:00
github . com / davecgh / go - spew v1 . 1.1 / / indirect
feat(llama.cpp): Totally decentralized, private, distributed, p2p inference (#2343)
* feat(llama.cpp): Enable decentralized, distributed inference
As https://github.com/mudler/LocalAI/pull/2324 introduced distributed inferencing thanks to
@rgerganov implementation in https://github.com/ggerganov/llama.cpp/pull/6829 in upstream llama.cpp, now
it is possible to distribute the workload to remote llama.cpp gRPC server.
This changeset now uses mudler/edgevpn to establish a secure, distributed network between the nodes using a shared token.
The token is generated automatically when starting the server with the `--p2p` flag, and can be used by starting the workers
with `local-ai worker p2p-llama-cpp-rpc` by passing the token via environment variable (TOKEN) or with args (--token).
As per how mudler/edgevpn works, a network is established between the server and the workers with dht and mdns discovery protocols,
the llama.cpp rpc server is automatically started and exposed to the underlying p2p network so the API server can connect on.
When the HTTP server is started, it will discover the workers in the network and automatically create the port-forwards to the service locally.
Then llama.cpp is configured to use the services.
This feature is behind the "p2p" GO_FLAGS
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* go mod tidy
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* ci: add p2p tag
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* better message
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2024-05-20 19:17:59 +02:00
github . com / davidlazar / go - crypto v0 . 0.0 - 20200604182044 - b73af7476f6c / / indirect
2025-07-23 12:40:32 +02:00
github . com / decred / dcrd / dcrec / secp256k1 / v4 v4 . 4.0 / / indirect
2024-07-12 21:54:08 +02:00
github . com / dlclark / regexp2 v1 . 10.0 / / indirect
github . com / docker / cli v27 . 0.3 + incompatible / / indirect
2024-06-22 08:17:41 +02:00
github . com / docker / distribution v2 . 8.2 + incompatible / / indirect
2025-05-04 16:42:42 +02:00
github . com / docker / docker v27 . 1.1 + incompatible
2024-06-22 08:17:41 +02:00
github . com / docker / docker - credential - helpers v0 . 7.0 / / indirect
2025-05-25 21:25:05 +01:00
github . com / docker / go - connections v0 . 5.0
2024-06-23 01:24:36 -07:00
github . com / docker / go - units v0 . 5.0 / / indirect
github . com / dsnet / compress v0 . 0.2 - 0.20210315054119 - f66993602bf5 / / indirect
2024-07-12 21:54:08 +02:00
github . com / flynn / noise v1 . 1.0 / / indirect
feat(llama.cpp): Totally decentralized, private, distributed, p2p inference (#2343)
* feat(llama.cpp): Enable decentralized, distributed inference
As https://github.com/mudler/LocalAI/pull/2324 introduced distributed inferencing thanks to
@rgerganov implementation in https://github.com/ggerganov/llama.cpp/pull/6829 in upstream llama.cpp, now
it is possible to distribute the workload to remote llama.cpp gRPC server.
This changeset now uses mudler/edgevpn to establish a secure, distributed network between the nodes using a shared token.
The token is generated automatically when starting the server with the `--p2p` flag, and can be used by starting the workers
with `local-ai worker p2p-llama-cpp-rpc` by passing the token via environment variable (TOKEN) or with args (--token).
As per how mudler/edgevpn works, a network is established between the server and the workers with dht and mdns discovery protocols,
the llama.cpp rpc server is automatically started and exposed to the underlying p2p network so the API server can connect on.
When the HTTP server is started, it will discover the workers in the network and automatically create the port-forwards to the service locally.
Then llama.cpp is configured to use the services.
This feature is behind the "p2p" GO_FLAGS
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* go mod tidy
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* ci: add p2p tag
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* better message
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2024-05-20 19:17:59 +02:00
github . com / francoispqt / gojay v1 . 2.13 / / indirect
2024-06-23 01:24:36 -07:00
github . com / ghodss / yaml v1 . 0.0 / / indirect
2024-11-20 18:13:42 +01:00
github . com / go - audio / audio v1 . 0.0
2024-06-23 01:24:36 -07:00
github . com / go - audio / riff v1 . 0.0 / / indirect
2024-07-12 21:54:08 +02:00
github . com / go - logr / logr v1 . 4.2 / / indirect
2024-06-23 01:24:36 -07:00
github . com / go - logr / stdr v1 . 2.2 / / indirect
2024-08-23 18:27:14 -04:00
github . com / go - ole / go - ole v1 . 3.0 / / indirect
2024-06-23 01:24:36 -07:00
github . com / go - openapi / jsonpointer v0 . 21.0 / / indirect
github . com / go - openapi / jsonreference v0 . 21.0 / / indirect
github . com / go - openapi / spec v0 . 21.0 / / indirect
github . com / go - openapi / swag v0 . 23.0 / / indirect
2024-07-12 21:54:08 +02:00
github . com / gofiber / contrib / fiberzerolog v1 . 0.2
2024-06-23 01:24:36 -07:00
github . com / gofiber / template v1 . 8.3 / / indirect
github . com / gofiber / utils v1 . 1.0 / / indirect
github . com / gogo / protobuf v1 . 3.2 / / indirect
2024-06-22 08:17:41 +02:00
github . com / golang / groupcache v0 . 0.0 - 20210331224755 - 41 bb18bfe9da / / indirect
2024-07-12 21:54:08 +02:00
github . com / golang / snappy v0 . 0.4 / / indirect
2025-02-17 16:51:22 +01:00
github . com / google / btree v1 . 1.3 / / indirect
2025-08-08 16:23:18 +02:00
github . com / google / go - cmp v0 . 7.0 / / indirect
feat(llama.cpp): Totally decentralized, private, distributed, p2p inference (#2343)
* feat(llama.cpp): Enable decentralized, distributed inference
As https://github.com/mudler/LocalAI/pull/2324 introduced distributed inferencing thanks to
@rgerganov implementation in https://github.com/ggerganov/llama.cpp/pull/6829 in upstream llama.cpp, now
it is possible to distribute the workload to remote llama.cpp gRPC server.
This changeset now uses mudler/edgevpn to establish a secure, distributed network between the nodes using a shared token.
The token is generated automatically when starting the server with the `--p2p` flag, and can be used by starting the workers
with `local-ai worker p2p-llama-cpp-rpc` by passing the token via environment variable (TOKEN) or with args (--token).
As per how mudler/edgevpn works, a network is established between the server and the workers with dht and mdns discovery protocols,
the llama.cpp rpc server is automatically started and exposed to the underlying p2p network so the API server can connect on.
When the HTTP server is started, it will discover the workers in the network and automatically create the port-forwards to the service locally.
Then llama.cpp is configured to use the services.
This feature is behind the "p2p" GO_FLAGS
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* go mod tidy
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* ci: add p2p tag
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* better message
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2024-05-20 19:17:59 +02:00
github . com / google / gopacket v1 . 1.19 / / indirect
2025-02-17 16:51:22 +01:00
github . com / google / pprof v0 . 0.0 - 20250208200701 - d0013a598941 / / indirect
2024-06-23 01:24:36 -07:00
github . com / gorilla / css v1 . 0.1 / / indirect
2024-07-14 01:26:17 +02:00
github . com / gorilla / websocket v1 . 5.3 / / indirect
2024-07-12 21:54:08 +02:00
github . com / hashicorp / golang - lru v1 . 0.2 / / indirect
feat(llama.cpp): Totally decentralized, private, distributed, p2p inference (#2343)
* feat(llama.cpp): Enable decentralized, distributed inference
As https://github.com/mudler/LocalAI/pull/2324 introduced distributed inferencing thanks to
@rgerganov implementation in https://github.com/ggerganov/llama.cpp/pull/6829 in upstream llama.cpp, now
it is possible to distribute the workload to remote llama.cpp gRPC server.
This changeset now uses mudler/edgevpn to establish a secure, distributed network between the nodes using a shared token.
The token is generated automatically when starting the server with the `--p2p` flag, and can be used by starting the workers
with `local-ai worker p2p-llama-cpp-rpc` by passing the token via environment variable (TOKEN) or with args (--token).
As per how mudler/edgevpn works, a network is established between the server and the workers with dht and mdns discovery protocols,
the llama.cpp rpc server is automatically started and exposed to the underlying p2p network so the API server can connect on.
When the HTTP server is started, it will discover the workers in the network and automatically create the port-forwards to the service locally.
Then llama.cpp is configured to use the services.
This feature is behind the "p2p" GO_FLAGS
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* go mod tidy
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* ci: add p2p tag
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* better message
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2024-05-20 19:17:59 +02:00
github . com / hashicorp / golang - lru / v2 v2 . 0.7 / / indirect
2025-05-02 17:40:26 +02:00
github . com / henvic / httpretty v0 . 1.4 / / indirect
2024-11-20 17:18:52 +04:00
github . com / huandu / xstrings v1 . 5.0 / / indirect
2024-07-12 21:54:08 +02:00
github . com / huin / goupnp v1 . 3.0 / / indirect
2025-08-08 16:23:18 +02:00
github . com / ipfs / boxo v0 . 30.0 / / indirect
2025-02-17 16:51:22 +01:00
github . com / ipfs / go - cid v0 . 5.0 / / indirect
2025-08-08 16:23:18 +02:00
github . com / ipfs / go - datastore v0 . 8.2 / / indirect
github . com / ipfs / go - log / v2 v2 . 6.0 / / indirect
2024-08-26 20:19:27 +02:00
github . com / ipld / go - ipld - prime v0 . 21.0 / / indirect
feat(llama.cpp): Totally decentralized, private, distributed, p2p inference (#2343)
* feat(llama.cpp): Enable decentralized, distributed inference
As https://github.com/mudler/LocalAI/pull/2324 introduced distributed inferencing thanks to
@rgerganov implementation in https://github.com/ggerganov/llama.cpp/pull/6829 in upstream llama.cpp, now
it is possible to distribute the workload to remote llama.cpp gRPC server.
This changeset now uses mudler/edgevpn to establish a secure, distributed network between the nodes using a shared token.
The token is generated automatically when starting the server with the `--p2p` flag, and can be used by starting the workers
with `local-ai worker p2p-llama-cpp-rpc` by passing the token via environment variable (TOKEN) or with args (--token).
As per how mudler/edgevpn works, a network is established between the server and the workers with dht and mdns discovery protocols,
the llama.cpp rpc server is automatically started and exposed to the underlying p2p network so the API server can connect on.
When the HTTP server is started, it will discover the workers in the network and automatically create the port-forwards to the service locally.
Then llama.cpp is configured to use the services.
This feature is behind the "p2p" GO_FLAGS
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* go mod tidy
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* ci: add p2p tag
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* better message
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2024-05-20 19:17:59 +02:00
github . com / jackpal / go - nat - pmp v1 . 0.2 / / indirect
2024-06-23 01:24:36 -07:00
github . com / jaypipes / pcidb v1 . 0.0 / / indirect
feat(llama.cpp): Totally decentralized, private, distributed, p2p inference (#2343)
* feat(llama.cpp): Enable decentralized, distributed inference
As https://github.com/mudler/LocalAI/pull/2324 introduced distributed inferencing thanks to
@rgerganov implementation in https://github.com/ggerganov/llama.cpp/pull/6829 in upstream llama.cpp, now
it is possible to distribute the workload to remote llama.cpp gRPC server.
This changeset now uses mudler/edgevpn to establish a secure, distributed network between the nodes using a shared token.
The token is generated automatically when starting the server with the `--p2p` flag, and can be used by starting the workers
with `local-ai worker p2p-llama-cpp-rpc` by passing the token via environment variable (TOKEN) or with args (--token).
As per how mudler/edgevpn works, a network is established between the server and the workers with dht and mdns discovery protocols,
the llama.cpp rpc server is automatically started and exposed to the underlying p2p network so the API server can connect on.
When the HTTP server is started, it will discover the workers in the network and automatically create the port-forwards to the service locally.
Then llama.cpp is configured to use the services.
This feature is behind the "p2p" GO_FLAGS
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* go mod tidy
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* ci: add p2p tag
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* better message
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2024-05-20 19:17:59 +02:00
github . com / jbenet / go - temp - err - catcher v0 . 1.0 / / indirect
2024-06-23 01:24:36 -07:00
github . com / josharian / intern v1 . 0.0 / / indirect
2025-08-08 16:23:18 +02:00
github . com / klauspost / compress v1 . 18.0 / / indirect
2024-06-23 01:24:36 -07:00
github . com / klauspost / pgzip v1 . 2.5 / / indirect
2025-08-08 16:23:18 +02:00
github . com / koron / go - ssdp v0 . 0.6 / / indirect
	github.com/libp2p/go-buffer-pool v0.1.0 // indirect
	github.com/libp2p/go-cidranger v1.1.0 // indirect
	github.com/libp2p/go-flow-metrics v0.2.0 // indirect
	github.com/libp2p/go-libp2p-asn-util v0.4.1 // indirect
	github.com/libp2p/go-libp2p-kad-dht v0.33.1 // indirect
	github.com/libp2p/go-libp2p-kbucket v0.7.0 // indirect
	github.com/libp2p/go-libp2p-pubsub v0.14.2 // indirect
	github.com/libp2p/go-libp2p-record v0.3.1 // indirect
	github.com/libp2p/go-libp2p-routing-helpers v0.7.5 // indirect
	github.com/libp2p/go-msgio v0.3.0 // indirect
	github.com/libp2p/go-netroute v0.2.2 // indirect
	github.com/libp2p/go-reuseport v0.4.0 // indirect
	github.com/libp2p/zeroconf/v2 v2.2.0 // indirect
	github.com/lucasb-eyer/go-colorful v1.2.0 // indirect
	github.com/lufia/plan9stats v0.0.0-20240819163618-b1d8f4d146e7 // indirect
	github.com/mailru/easyjson v0.7.7 // indirect
	github.com/marten-seemann/tcp v0.0.0-20210406111302-dfbc87cc63fd // indirect
	github.com/mattn/go-colorable v0.1.14 // indirect
	github.com/mattn/go-isatty v0.0.20 // indirect
	github.com/mattn/go-runewidth v0.0.15 // indirect
	github.com/miekg/dns v1.1.66 // indirect
	github.com/mikioh/tcpinfo v0.0.0-20190314235526-30a79bb1804b // indirect
	github.com/mikioh/tcpopt v0.0.0-20190314235656-172688c1accc // indirect
	github.com/minio/sha256-simd v1.0.1 // indirect
	github.com/mitchellh/colorstring v0.0.0-20190213212951-d06e56a500db // indirect
	github.com/mitchellh/copystructure v1.2.0 // indirect
	github.com/mitchellh/go-homedir v1.1.0 // indirect
	github.com/mitchellh/reflectwalk v1.0.2 // indirect
	github.com/moby/sys/sequential v0.5.0 // indirect
	github.com/moby/term v0.5.0 // indirect
	github.com/mr-tron/base58 v1.2.0 // indirect
	github.com/mudler/go-piper v0.0.0-20241023091659-2494246fd9fc
	github.com/mudler/water v0.0.0-20250808092830-dd90dcf09025 // indirect
	github.com/muesli/reflow v0.3.0 // indirect
	github.com/muesli/termenv v0.15.2 // indirect
	github.com/multiformats/go-base32 v0.1.0 // indirect
	github.com/multiformats/go-base36 v0.2.0 // indirect
	github.com/multiformats/go-multiaddr v0.16.0
	github.com/multiformats/go-multiaddr-dns v0.4.1 // indirect
	github.com/multiformats/go-multiaddr-fmt v0.1.0 // indirect
	github.com/multiformats/go-multibase v0.2.0 // indirect
	github.com/multiformats/go-multicodec v0.9.1 // indirect
	github.com/multiformats/go-multihash v0.2.3 // indirect
	github.com/multiformats/go-multistream v0.6.1 // indirect
	github.com/multiformats/go-varint v0.0.7 // indirect
	github.com/nwaples/rardecode v1.1.0 // indirect
	github.com/olekukonko/tablewriter v0.0.5 // indirect
	github.com/opencontainers/go-digest v1.0.0 // indirect
	github.com/opencontainers/image-spec v1.1.0
	github.com/opentracing/opentracing-go v1.2.0 // indirect
	github.com/pbnjay/memory v0.0.0-20210728143218-7b4eea64cf58 // indirect
	github.com/peterbourgon/diskv v2.0.1+incompatible // indirect
	github.com/philhofer/fwd v1.1.2 // indirect
	github.com/pierrec/lz4/v4 v4.1.2 // indirect
	github.com/pkg/errors v0.9.1
	github.com/pkoukk/tiktoken-go v0.1.6 // indirect
	github.com/pmezard/go-difflib v1.0.0 // indirect
	github.com/polydawn/refmt v0.89.0 // indirect
	github.com/power-devops/perfstat v0.0.0-20240221224432-82ca36839d55 // indirect
	github.com/prometheus/client_model v0.6.2 // indirect
	github.com/prometheus/common v0.64.0 // indirect
	github.com/prometheus/procfs v0.16.1 // indirect
	github.com/quic-go/qpack v0.5.1 // indirect
	github.com/quic-go/quic-go v0.54.0 // indirect
	github.com/quic-go/webtransport-go v0.9.0 // indirect
	github.com/rivo/uniseg v0.4.7 // indirect
	github.com/shoenig/go-m1cpu v0.1.6 // indirect
	github.com/shopspring/decimal v1.4.0 // indirect
	github.com/sirupsen/logrus v1.9.3 // indirect
	github.com/smallnest/ringbuffer v0.0.0-20241116012123-461381446e3d // indirect
	github.com/songgao/packets v0.0.0-20160404182456-549a10cd4091 // indirect
	github.com/spaolacci/murmur3 v1.1.0 // indirect
	github.com/spf13/cast v1.7.0 // indirect
	github.com/swaggo/files/v2 v2.0.0 // indirect
	github.com/tinylib/msgp v1.1.8 // indirect
	github.com/tklauser/go-sysconf v0.3.14 // indirect
	github.com/tklauser/numcpus v0.8.0 // indirect
	github.com/ulikunitz/xz v0.5.9 // indirect
	github.com/valyala/bytebufferpool v1.0.0 // indirect
	github.com/valyala/tcplisten v1.0.0 // indirect
	github.com/vbatts/tar-split v0.11.3 // indirect
	github.com/vishvananda/netlink v1.3.0 // indirect
	github.com/vishvananda/netns v0.0.5 // indirect
	github.com/whyrusleeping/go-keyspace v0.0.0-20160322163242-5b898ac5add1 // indirect
	github.com/xi2/xz v0.0.0-20171230120015-48954b6210f8 // indirect
	github.com/yuin/goldmark v1.5.4 // indirect
	github.com/yuin/goldmark-emoji v1.0.2 // indirect
	github.com/yusufpapurcu/wmi v1.2.4 // indirect
	go.opencensus.io v0.24.0 // indirect
	go.opentelemetry.io/otel/sdk v1.31.0 // indirect
	go.opentelemetry.io/otel/trace v1.35.0 // indirect
	go.uber.org/dig v1.19.0 // indirect
	go.uber.org/fx v1.24.0 // indirect
	go.uber.org/multierr v1.11.0 // indirect
	go.uber.org/zap v1.27.0 // indirect
	golang.org/x/crypto v0.39.0 // indirect
	golang.org/x/exp v0.0.0-20250606033433-dcc06ee1d476 // indirect
	golang.org/x/mod v0.25.0 // indirect
	golang.org/x/net v0.41.0 // indirect
	golang.org/x/sync v0.15.0 // indirect
	golang.org/x/sys v0.33.0 // indirect
	golang.org/x/term v0.32.0 // indirect
	golang.org/x/text v0.26.0 // indirect
	golang.org/x/tools v0.34.0 // indirect
	golang.zx2c4.com/wintun v0.0.0-20230126152724-0fa3db229ce2 // indirect
	golang.zx2c4.com/wireguard v0.0.0-20250521234502-f333402bd9cb // indirect
	golang.zx2c4.com/wireguard/windows v0.5.3 // indirect
	gonum.org/v1/gonum v0.16.0 // indirect
	google.golang.org/genproto/googleapis/rpc v0.0.0-20241007155032-5fefd90f89a9 // indirect
	gopkg.in/fsnotify.v1 v1.4.7 // indirect
	gopkg.in/tomb.v1 v1.0.0-20141024135613-dd632973f1e7 // indirect
	howett.net/plist v1.0.0 // indirect
	lukechampine.com/blake3 v1.4.1 // indirect
)