118 Commits

Author SHA1 Message Date
Yannic Kilcher
46520c3688 deployment workflows for bye 2023-11-25 15:34:09 +01:00
Oliver Stanley
6042103608 Remove outdated LLaMA worker image (#3282) 2023-06-03 07:48:18 +02:00
someone13574
70f30a6d6f Message drafts (#3044)
closes #2931 (slightly changed goal based on advice from the discord,
generate full messages, not 'x' tokens. Full messages are more useful
data)

- [x] Create draft selection UI
- [x] Draft inference
- [x] Option to regenerate drafts and serve 3 new ones
- [x] Remember last viewed sibling message
- [x] Store selected draft training data for RLHF
- ~~[ ] Disable drafts when queue is too long / server is under load~~
(Suggested to leave to next PR in the discord)
- [x] Draft markdown rendering
- [x] 'Used plugin' UI for drafts
- [x] Resolve merge conflicts

---------

Co-authored-by: notmd <33456881+notmd@users.noreply.github.com>
Co-authored-by: notmd <notmd1811@gmail.com>
2023-05-31 20:51:31 +02:00
Andreas Köpf
750c59df4a Add missing build-backend-worker dependency to deploy-to-node (#3225)
The backend-worker container needs to be built before deploy-to-node
runs.
2023-05-24 18:01:59 +02:00
Graeme Harris
195614fe25 Added celery worker container to github actions (#3186)
Added worker and worker beat to ansible playbook
2023-05-24 15:24:40 +02:00
notmd
ded5acaae3 send flagged message notification to discord (#2641)
Send a notification to Discord so the mod team can quickly take action.
This is a temporal workaround for
https://github.com/LAION-AI/Open-Assistant/issues/968. Also we don't
need a UI for it.


![image](https://user-images.githubusercontent.com/33456881/232369181-19e3ec0d-5089-48f7-9d5c-88eb97eab8d3.png)
2023-05-05 21:26:37 +01:00
Quarks-1
abad9d1a96 Dependabot addition (#2684)
Added dependabot file to implement weekly dependency version checks for
Node JS

changes in this PR:

adds a new file under `.github/` called `depndabot.yml`.

---------

Co-authored-by: notmd <33456881+notmd@users.noreply.github.com>
2023-04-22 12:14:05 +02:00
Yannic Kilcher
d899c6c82a Added max messages and max message length settings for inference (#2774)
Returning 413 both when chat exceeds num messages allowed, and when the
individual message is too long
2023-04-22 10:56:02 +02:00
Yannic Kilcher
b4895f4f91 Added "current announcement" to web (minimal implementation) (#2571)
Co-authored-by: notmd <tinhmeo10@gmail.com>
2023-04-16 07:56:59 +07:00
Yannic Kilcher
0bb0beef84 Gunicorn worker image for inference (#2553) 2023-04-15 20:23:05 +00:00
Yannic Kilcher
3389b43568 Remove safety deployment dependency in release playbook (#2540)
the dependency is not needed as the safety server must be deployed
separately
2023-04-15 20:33:35 +02:00
Yannic Kilcher
bbabb9c4cb Added enable inference safety to playbook (#2528)
will not have any effect until refactor of safety to the central server
(right now it's in worker)
2023-04-15 18:40:19 +02:00
Yannic Kilcher
ed8bb2afe0 Added playbook variables for google auth (#2529) 2023-04-14 23:27:38 +00:00
Oliver Stanley
50cb3b0ef5 Add safety server to inference (#2449)
Close #2447.

---------

Co-authored-by: Shahules786 <Shahules786@gmail.com>
2023-04-13 19:41:47 +01:00
Andreas Köpf
d984b6eabc Add gh vars config for rewiew/ranking counts (#2479) 2023-04-12 21:03:27 +02:00
Yannic Kilcher
5a17d937fc Added docker image for standalone worker (#2300) 2023-04-09 22:05:29 +02:00
Andrew Maguire
f5066981bb update docusaurus to latest + add models yt video as blogpost (#2373)
- update docusaurus to latest
- add models yt video as a blog post
2023-04-09 15:07:35 +02:00
Yannic Kilcher
9ed9134da8 Updated nginx config for prod, including streaming headers (#2239) 2023-04-06 10:10:29 +02:00
hzj5790
3ecc705388 Disable Initial Prompt Task for en and es Locales (#1849)
To help clear the prompt backlog, as per
https://github.com/LAION-AI/Open-Assistant/issues/1659, and Andreas's
suggestions in https://github.com/LAION-AI/Open-Assistant/pull/1824,
this disables the initial-prompt task for en and es locales.

---------

Co-authored-by: Oliver Stanley <olivergestanley@gmail.com>
2023-04-04 22:48:16 +02:00
AbdBarho
7ee3ac077a Add missing variables to deployment job (#2297)
Fixes a regression introduced in
dcc4f25649
2023-04-02 19:51:04 +02:00
AbdBarho
dcc4f25649 Use trusted clients for inference auth (#2278)
Fixes #2285 

Changes in the PR:
* revert auth callbacks to refer back to the inference server (same as
it was before), the callback urls in the corresponding apps must be
updated.
* Added trusted clients which can communicate with API keys, these keys
need to be added to the github env / secrets, I don't have the access
rights to do that
* Drop all auth code from website

Changes here conflict with #2221, although not significantly, only the
urls, maybe this could be addressed as part of #2286
2023-04-02 19:01:08 +02:00
Yannic Kilcher
da02bc80c9 Fixed bugs in deployment notebook (Sorry 🙃) (#2219) 2023-03-25 22:26:55 +01:00
Yannic Kilcher
520f0fc004 Adjusted deployment notebooks for inference (#2213) 2023-03-25 21:56:54 +01:00
Andreas Köpf
bcd2fb3a7d Revert "fix: ghcr.io build for mulitplatform. includes Apple silicon" (#2199)
Broke builds, see
https://github.com/LAION-AI/Open-Assistant/actions/runs/4515995123

This reverts commit fa4c42d89f.
2023-03-25 00:20:49 +01:00
James Melvin Ebenezer
fa4c42d89f fix: ghcr.io build for mulitplatform. includes Apple silicon (#2151)
fixes #2081 

Test Evidence:
Since this is a bit tricky to fix and test on this repo
I tested by building postgres images on my private repo actions.

Co-authored-by: James Melvin <melvin@gameface.ai>
2023-03-24 23:56:46 +01:00
Yannic Kilcher
bcaa7a066d building separate llama image for inference (#2161) 2023-03-21 21:32:40 +01:00
Yannic Kilcher
dd0657ab7a added hf server 2023-03-19 15:39:31 +01:00
Yannic Kilcher
28d09fc15c enable chat via flag 2023-03-19 13:33:22 +01:00
Yannic Kilcher
75fac1a07c added hf token to deploy 2023-03-19 00:59:45 +01:00
Yannic Kilcher
0e803dd275 added inference server host to dev 2023-03-19 00:10:07 +01:00
Yannic Kilcher
1452861285 removed worker build from release 2023-03-18 12:06:25 +01:00
Yannic Kilcher
d834095859 Inference-deploy (#2109) 2023-03-18 11:58:59 +01:00
Yannic Kilcher
3a747f6c0d Revert "fix: added platform support for ghcr.io images to be run on Apple Sil…" (#2080)
Reverts LAION-AI/Open-Assistant#1763
2023-03-15 20:13:28 +01:00
James Melvin Ebenezer
3bc0b1a3fa fix: added platform support for ghcr.io images to be run on Apple Sil… (#1763)
Added support for multiple platforms for ghcr.io images. 

When running this image on an x86_64 / amd64 machine, the amd64 variant
is pulled and run.

Reference:

https://docs.docker.com/engine/reference/commandline/buildx_build/#platform
https://docs.docker.com/build/building/multi-platform/

Co-authored-by: James Melvin <melvin@gameface.ai>
2023-03-15 19:47:30 +01:00
Andreas Köpf
d485439991 Create oasst_data module (#2009)
To allow convenient imports/processing of OA data by 3rd party projects
the oasst-jsonl data file format schemas are moved into a separate new
python module called oasst_data. Code related to reading oasst-jsonl
files will be added later.
2023-03-08 14:44:29 +01:00
Andreas Köpf
097dd7153b Add TreeManagerConfiguration.max_prompt_lottery_waiting parameter (#1889)
- add lang column to message_tree_state, create multi-column index on
`state`, `lang`
- stop generating initial prompt tasks for languages with more than
`max_prompt_lottery_waiting ` prompts in `prompt_lottery_waiting` state
- add `MAX_PROMPT_LOTTERY_WAITING` workflow variable
2023-02-26 13:14:09 +01:00
Keith Stevens
22f477d21e Read the signin env variables from server props (#1614)
* Read the signin env variables from server props so that they always use the runtime variables

* Rename the env variables to be private

* Updating docker and ansible changes to reflect env variable name change

* Disabling captcha during CI due to failures and cleaning up a build step
2023-02-16 18:42:21 +07:00
Andreas Köpf
acb4052578 move env vars to outer scope 2023-02-15 21:07:58 +01:00
Andreas Köpf
1634d2fa86 show frontend build env vars 2023-02-15 20:35:43 +01:00
Andreas Köpf
6266734118 add WEB_NEXT_PUBLIC_* env vars to build-frontend gh action 2023-02-15 17:17:42 +01:00
Andreas Köpf
b0250b1abc add public backend & cors vars 2023-02-14 22:57:40 +01:00
AbdBarho
805e8bbeb2 Add our current architecture to the blog (#1483)
* Add our current architecture to the blog

* Update docs/blog/2023-02-11-architecture.md

Co-authored-by: Andrew Maguire <andrewm4894@gmail.com>

---------

Co-authored-by: Andrew Maguire <andrewm4894@gmail.com>
2023-02-14 01:19:40 +01:00
Gareth Davidson
a314ee92ba tripped up by dupe code :/ 2023-02-12 22:33:17 +00:00
AbdBarho
759eb563fe Use docker compose profiles for all services (#1418) 2023-02-12 20:57:35 +01:00
Yannic Kilcher
97f1bea066 added inference image builds 2023-02-10 22:59:45 +01:00
Andreas Köpf
cbc3eba37d add env for mods 2023-02-10 09:11:15 +01:00
Yannic Kilcher
c878d12069 added backend url 2023-02-05 01:50:30 +01:00
Yannic Kilcher
26f3fd4453 added maintenance mode variable 2023-02-05 01:34:34 +01:00
Saurav Maheshkar
926adcd710 feat(ci): enable pip caching in CI (#1129) 2023-02-05 00:50:51 +01:00
Andreas Köpf
100ecbe2d5 Limit initial prompts, ensure max_active_trees = growing trees 2023-02-04 23:23:16 +01:00