Commit Graph

  • e4f9db37db Fix context_params struct layout Andrei Betlen 2023-07-15 15:34:55 -04:00
  • bdf32df255 Add additional direnv directory to gitignore Andrei Betlen 2023-07-15 15:34:32 -04:00
  • d0572f4fca Merge branch 'custom_rope' into main Andrei Betlen 2023-07-15 15:11:43 -04:00
  • f0797a6054 Merge branch main into custom_rope Andrei Betlen 2023-07-15 15:11:01 -04:00
  • f72b6e9b73 Update llama.cpp Andrei Betlen 2023-07-15 15:01:08 -04:00
  • 15e0e0a937 Merge pull request #390 from SubhranshuSharma/main Andrei 2023-07-14 16:53:23 -04:00
  • 118b7f6d5c fix: tensor_split should be optional list Andrei Betlen 2023-07-14 16:52:48 -04:00
  • 25b3494e11 Minor fix to tensor_split parameter Andrei Betlen 2023-07-14 16:40:53 -04:00
  • e6c67c8f7d Update llama.cpp Andrei Betlen 2023-07-14 16:40:31 -04:00
  • 82b11c8c16 Merge pull request #460 from shouyiwang/tensor_split Andrei 2023-07-14 16:33:54 -04:00
  • 579f526246 Resolve merge conflicts Shouyi Wang 2023-07-14 14:37:01 +10:00
  • 6705f9b6c6 Bump version v0.1.71 Andrei Betlen 2023-07-13 23:32:06 -04:00
  • de4cc5a233 bugfix: pydantic v2 fields Andrei Betlen 2023-07-13 23:25:12 -04:00
  • 896ab7b88a Update llama.cpp Andrei Betlen 2023-07-13 23:24:55 -04:00
  • 7bb0024cd0 Fix uvicorn dependency Andrei Betlen 2023-07-12 19:31:43 -04:00
  • 3f8f276f9f Add bindings for custom_rope randoentity 2023-07-09 09:05:16 +02:00
  • f6c9d17f6b Merge branch 'main' of github.com:abetlen/llama_cpp_python into main v0.1.70 Andrei Betlen 2023-07-09 18:20:06 -04:00
  • 8e0f6253db Bump version Andrei Betlen 2023-07-09 18:20:04 -04:00
  • c988c2ac0b Bump version Andrei Betlen 2023-07-09 18:19:37 -04:00
  • df3d545938 Update changelog Andrei Betlen 2023-07-09 18:13:41 -04:00
  • a86bfdf0a5 bugfix: truncate completion max_tokens to fit context length by default Andrei Betlen 2023-07-09 18:13:29 -04:00
  • 6f70cc4b7d bugfix: pydantic settings missing / changed fields Andrei Betlen 2023-07-09 18:03:31 -04:00
  • 0f3c474a49 Bump version v0.1.69 Andrei Betlen 2023-07-09 11:44:29 -04:00
  • 9aa64163db Update llama.cpp Andrei Betlen 2023-07-09 11:40:59 -04:00
  • 9f21f548a5 Add tensor split Shouyi Wang 2023-07-07 19:22:10 +10:00
  • 99f064e681 docker: Add libopenblas to simple image Andrei Betlen 2023-07-09 01:36:39 -04:00
  • 00da643929 Update llama.cpp Andrei Betlen 2023-07-08 20:30:34 -04:00
  • 3c85c41573 docker: update path to dockerfile Andrei Betlen 2023-07-08 04:04:11 -04:00
  • 1f5e748a7e docker: fix docker build action args Andrei Betlen 2023-07-08 04:00:43 -04:00
  • 9e153fd11d docker: update context path Andrei Betlen 2023-07-08 03:44:51 -04:00
  • 5b7d76608d docker: add checkout action to dockerfile Andrei Betlen 2023-07-08 03:43:17 -04:00
  • 3a2635b9e1 Update docker workflow for new simple image Andrei Betlen 2023-07-08 03:37:28 -04:00
  • 670fe4b701 Update changelog Andrei Betlen 2023-07-08 03:37:12 -04:00
  • 24724202ee Merge pull request #64 from jm12138/add_unlimited_max_tokens Andrei 2023-07-08 02:38:06 -04:00
  • 5d756de314 Merge branch 'main' into add_unlimited_max_tokens Andrei 2023-07-08 02:37:38 -04:00
  • 236c4cf442 Merge pull request #456 from AgentJ-WR/patch-1 Andrei 2023-07-08 02:32:20 -04:00
  • 7952ca50c9 Merge pull request #452 from audreyfeldroy/update-macos-metal-gpu-step-4 Andrei 2023-07-08 02:32:09 -04:00
  • b8e0bed295 Merge pull request #453 from wu-qing-157/main Andrei 2023-07-08 02:31:52 -04:00
  • d6e6aad927 bugfix: fix compatibility bug with openai api on last token Andrei Betlen 2023-07-08 00:06:11 -04:00
  • 4f2b5d0b53 Format Andrei Betlen 2023-07-08 00:05:10 -04:00
  • ea4fbadab3 Show how to adjust context window in README.md AgentJ-WR 2023-07-07 23:24:57 -04:00
  • 34c505edf2 perf: convert pointer to byref Andrei Betlen 2023-07-07 22:54:07 -04:00
  • 52753b77f5 Upgrade fastapi to 0.100.0 and pydantic v2 Andrei Betlen 2023-07-07 21:38:46 -04:00
  • 11eae75211 perf: avoid allocating new buffers during sampling Andrei Betlen 2023-07-07 19:28:53 -04:00
  • 7887376bff Update llama.cpp Andrei Betlen 2023-07-07 19:06:54 -04:00
  • a14d8a9b3f perf: assign to candidates data structure instead Andrei Betlen 2023-07-07 18:58:43 -04:00
  • 9e61661518 fix indexing token_logprobs after sorting wu-qing-157 2023-07-07 10:18:49 +00:00
  • d270ec231a Update macOS Metal GPU step 4 Audrey Roy Greenfeld 2023-07-07 11:15:04 +01:00
  • ca11673061 Add universal docker image Andrei Betlen 2023-07-07 03:38:51 -04:00
  • 57d8ec3899 Add setting to control request interruption Andrei Betlen 2023-07-07 03:37:23 -04:00
  • cc542b4452 Update llama.cpp Andrei Betlen 2023-07-07 03:04:54 -04:00
  • 4c7cdcca00 Add interruptible streaming requests for llama-cpp-python server. Closes #183 Andrei Betlen 2023-07-07 03:04:17 -04:00
  • 98ae4e58a3 Update llama.cpp Andrei Betlen 2023-07-06 17:57:56 -04:00
  • a1b2d5c09b Bump version v0.1.68 Andrei Betlen 2023-07-05 01:06:46 -04:00
  • b994296c75 Update llama.cpp Andrei Betlen 2023-07-05 01:00:14 -04:00
  • 058b134ab6 Merge pull request #443 from abetlen/dependabot/pip/mkdocs-material-9.1.18 Andrei 2023-07-05 00:40:46 -04:00
  • 9261a52916 Bump mkdocs-material from 9.1.17 to 9.1.18 dependabot[bot] 2023-07-04 21:21:09 +00:00
  • 5e0a6b664d Merge pull request #442 from abetlen/dependabot/pip/typing-extensions-4.7.1 Andrei 2023-07-04 17:19:36 -04:00
  • f1b442337d Bump typing-extensions from 4.6.3 to 4.7.1 dependabot[bot] 2023-07-04 18:22:53 +00:00
  • 2379ed5809 Merge pull request #444 from abetlen/dependabot/pip/fastapi-0.99.1 Andrei 2023-07-04 14:21:44 -04:00
  • bf1dc6693b Merge pull request #436 from mikeyang01/main Andrei 2023-07-04 14:21:30 -04:00
  • fb02077e3f Bump fastapi from 0.98.0 to 0.99.1 dependabot[bot] 2023-07-03 20:55:32 +00:00
  • c8d0647caa Update README.md Mike 2023-06-30 16:42:13 +08:00
  • 28ec88cfb4 Merge pull request #435 from vladkens/patch-1 Andrei 2023-06-29 21:49:39 -04:00
  • 485eee7bef Update README.md vladkens 2023-06-30 00:48:21 +03:00
  • c67f786360 Update llama.cpp Andrei Betlen 2023-06-29 01:08:15 -04:00
  • e34f4414cf Hotfix: logits_all bug v0.1.67 Andrei Betlen 2023-06-29 00:57:27 -04:00
  • 4d1eb88b13 Bump version Andrei Betlen 2023-06-29 00:46:15 -04:00
  • a2ede37bd5 Load logits directly into scores buffer Andrei Betlen 2023-06-29 00:45:46 -04:00
  • b95b0ffbeb Use pre-allocated buffers to store input_ids and scores Andrei Betlen 2023-06-29 00:40:47 -04:00
  • a5e059c053 Free model when llama is unloaded. Closes #434 Andrei Betlen 2023-06-28 23:58:55 -04:00
  • 442213b070 Add stopping criteria and logits processor to docs Andrei Betlen 2023-06-28 21:07:58 -04:00
  • a3766591bb Update docs Andrei Betlen 2023-06-27 13:02:30 -04:00
  • 530599a467 Merge branch 'main' of github.com:abetlen/llama_cpp_python into main Andrei Betlen 2023-06-27 12:45:33 -04:00
  • dae983342a Update docs Andrei Betlen 2023-06-27 12:45:31 -04:00
  • e2a251352c Merge pull request #432 from abetlen/dependabot/pip/pytest-7.4.0 Andrei 2023-06-26 23:40:48 -04:00
  • 89f9e435ba Bump pytest from 7.3.2 to 7.4.0 dependabot[bot] 2023-06-26 23:26:11 +00:00
  • 2718dfbe3c Merge pull request #430 from abetlen/dependabot/pip/fastapi-0.98.0 Andrei 2023-06-26 19:25:09 -04:00
  • dcf1834294 Bump fastapi from 0.97.0 to 0.98.0 dependabot[bot] 2023-06-26 21:23:22 +00:00
  • 7d01749c8c Merge pull request #429 from abetlen/dependabot/pip/mkdocs-material-9.1.17 Andrei 2023-06-26 17:22:23 -04:00
  • e03c3806f8 Bump mkdocs-material from 9.1.16 to 9.1.17 dependabot[bot] 2023-06-26 21:16:53 +00:00
  • e18fe74bd7 Merge pull request #431 from abetlen/dependabot/pip/numpy-1.24.4 Andrei 2023-06-26 17:15:51 -04:00
  • c9a8b7eb43 Bump numpy from 1.24.3 to 1.24.4 dependabot[bot] 2023-06-26 21:03:34 +00:00
  • 452929404f Updated docs link Andrei Betlen 2023-06-26 16:35:38 -04:00
  • 66b8b979a5 Update readthedocs setup Andrei Betlen 2023-06-26 16:31:16 -04:00
  • 155dedf28f Add readthedocsc config Andrei Betlen 2023-06-26 16:25:17 -04:00
  • 5193af297b Bump version v0.1.66 Andrei Betlen 2023-06-26 08:53:54 -04:00
  • 3379dc40a1 Merge branch 'main' of github.com:abetlen/llama_cpp_python into main Andrei Betlen 2023-06-26 08:50:48 -04:00
  • 952228407e Update llama.cpp Andrei Betlen 2023-06-26 08:50:38 -04:00
  • b4a3db3e54 Update type signature Andrei Betlen 2023-06-26 08:50:30 -04:00
  • 628e3fb3df Merge pull request #370 from Okabintaro/fix-state-pickle Andrei 2023-06-26 08:46:59 -04:00
  • 5eb4ebb041 Merge branch 'main' into fix-state-pickle Andrei 2023-06-26 08:45:02 -04:00
  • 04d9218b92 Merge pull request #420 from samfundev/main Andrei 2023-06-26 08:10:43 -04:00
  • d788fb49bf Only concatenate after all batches are done samfundev 2023-06-24 15:51:46 -04:00
  • 877ca6d016 Merge branch 'main' into fix-state-pickle Andrei 2023-06-23 15:13:07 -04:00
  • b6f9388436 Merge pull request #402 from abetlen/dependabot/pip/mkdocs-material-9.1.16 Andrei 2023-06-23 10:10:32 -04:00
  • 0952d533fe Merge pull request #415 from lexin4ever/patch-1 Andrei 2023-06-23 10:09:38 -04:00
  • 282698b6d3 server: pass seed param from command line to llama Alexey 2023-06-23 00:19:24 +04:00
  • 3e7eae4796 Bump Version v0.1.65 Andrei Betlen 2023-06-20 11:25:44 -04:00
  • e37798777e Update llama.cpp Andrei Betlen 2023-06-20 11:25:10 -04:00