Commit Graph

  • bf57b13dda remove models that require huggingface auth from ci YeAnbang 2024-05-29 02:10:37 +00:00
  • 0bbac158ed fix datasets version YeAnbang 2024-05-29 02:03:51 +00:00
  • 62eb28b929 remove duplicated test YeAnbang 2024-05-29 01:45:58 +00:00
  • b8b5cacf38 fix transformers version YeAnbang 2024-05-29 00:57:32 +00:00
  • 1b880ce095 [pre-commit.ci] auto fixes from pre-commit.com hooks pre-commit-ci[bot] 2024-05-28 08:02:42 +00:00
  • b1031f7244 fix ci YeAnbang 2024-05-28 08:06:36 +00:00
  • 7ae87b3159 fix training script YeAnbang 2024-05-28 08:00:18 +00:00
  • 0b4a33548c moupdate ci tests, st ci test cases passed, tp failed in generation for ppo, sp is buggy YeAnbang 2024-05-28 07:58:08 +00:00
  • 7e65b71815 run pre-commit YeAnbang 2024-05-28 03:14:37 +00:00
  • 929e1e3da4 upgrade ppo dpo rm script YeAnbang 2024-05-28 03:04:39 +00:00
  • 7a7e86987d upgrade colossal-chat support tp_group>1, add sp for sft YeAnbang 2024-05-27 05:55:57 +00:00
  • 73e88a5553 [shardformer] fix import (#5788) Hongxin Liu 2024-06-06 19:09:50 +08:00
  • 5ead00ffc5 [misc] update requirements (#5787) Hongxin Liu 2024-06-06 15:55:34 +08:00
  • a1e39f4c0d [install]fix setup (#5786) flybird11111 2024-06-06 11:47:48 +08:00
  • 2d1a785b71 fix setup fix-setup flybird11111 2024-06-06 02:44:41 +00:00
  • b9d646fe9e [misc] fix dist logger (#5782) Hongxin Liu 2024-06-05 15:04:22 +08:00
  • c46e09715c Allow building cuda extension without a device. (#5535) Charles Coulombe 2024-06-05 02:26:30 -04:00
  • 3f7e3131d9 [gemini] optimize reduce scatter d2h copy (#5760) botbw 2024-06-05 14:23:13 +08:00
  • 10a19e22c6 [hotfix] fix testcase in test_fx/test_tracer (#5779) duanjunwen 2024-06-05 11:29:32 +08:00
  • 80c3c8789b [Test/CI] remove test cases to reduce CI duration (#5753) botbw 2024-06-05 11:29:04 +08:00
  • 79f7a7b211 [misc] Accelerate CI for zero and dist optim (#5758) Edenzzzz 2024-06-05 11:25:19 +08:00
  • 50b4c8e8cf [hotfix] fix llama flash attention forward (#5777) flybird11111 2024-06-05 10:56:47 +08:00
  • b45000f839 [Inference]Add Streaming LLM (#5745) yuehuayingxueluo 2024-06-05 10:51:19 +08:00
  • ee6fd38373 [devops] fix docker ci (#5780) Hongxin Liu 2024-06-04 17:47:39 +08:00
  • 32f4187806 [misc] update dockerfile (#5776) Hongxin Liu 2024-06-04 16:15:41 +08:00
  • e22b82755d [CI/tests] simplify some test case to reduce testing time (#5755) Haze188 2024-06-04 13:57:54 +08:00
  • 406443200f [Hotfix] Add missing init file in inference.executor (#5774) Yuanheng Zhao 2024-06-03 22:29:39 +08:00
  • 1b76564e16 [test] Fix/fix testcase (#5770) duanjunwen 2024-06-03 15:26:01 +08:00
  • 3f2be80530 fix (#5765) flybird11111 2024-06-03 11:25:18 +08:00
  • 68359ed1e1 [release] update version (#5752) v0.3.8 Hongxin Liu 2024-05-31 19:40:26 +08:00
  • 677cbfacf8 [Fix/Example] Fix Llama Inference Loading Data Type (#5763) Yuanheng Zhao 2024-05-30 13:48:46 +08:00
  • 023ea13cb5 Merge pull request #5749 from hpcaitech/prefetch botbw 2024-05-29 15:35:54 +08:00
  • 154720ba6e [chore] refactor profiler utils hxwang 2024-05-28 12:41:42 +00:00
  • 8547562884 [chore] remove unnecessary assert since compute list might not be recorded hxwang 2024-05-28 05:16:02 +00:00
  • e5e3320948 [bug] continue fix hxwang 2024-05-28 02:41:23 +00:00
  • 936dd96dbb [bug] workaround for idx fix hxwang 2024-05-28 02:33:12 +00:00
  • e0dde8fda5 Merge pull request #5754 from Hz188/prefetch botbw 2024-05-27 14:59:21 +08:00
  • 157b4cc357 Merge branch 'prefetch' into prefetch botbw 2024-05-27 14:58:57 +08:00
  • 87665d7922 correct argument help message genghaozhe 2024-05-27 06:03:53 +00:00
  • 4d097def96 [Gemini] add some code for reduce-scatter overlap, chunk prefetch in llama benchmark. (#5751) Haze188 2024-05-25 23:00:13 +08:00
  • b9269d962d add args.prefetch_num for benchmark genghaozhe 2024-05-25 14:55:50 +00:00
  • fba04e857b [bugs] fix args.profile=False DummyProfiler errro genghaozhe 2024-05-25 14:55:09 +00:00
  • b96c6390f4 [inference] Fix running time of test_continuous_batching (#5750) Yuanheng Zhao 2024-05-24 19:34:15 +08:00
  • 5f8c0a0ac3 [Feature] auto-cast optimizers to distributed version (#5746) Edenzzzz 2024-05-24 17:24:16 +08:00
  • ca674549e0 [chore] remove unnecessary test & changes hxwang 2024-05-24 06:09:36 +00:00
  • ff507b755e Merge branch 'main' of github.com:hpcaitech/ColossalAI into prefetch hxwang 2024-05-24 04:05:07 +00:00
  • 63c057cd8e [example] add profile util for llama hxwang 2024-05-24 03:59:36 +00:00
  • 2fc85abf43 [gemini] async grad chunk reduce (all-reduce&reduce-scatter) (#5713) botbw 2024-05-24 10:31:16 +08:00
  • 85946d4236 [Inference]Fix readme and example for API server (#5742) Jianghai 2024-05-24 10:03:05 +08:00
  • 15d21a077a Merge remote-tracking branch 'origin/main' into prefetch hxwang 2024-05-23 15:49:33 +00:00
  • 4647ec28c8 [inference] release (#5747) binmakeswell 2024-05-23 17:44:06 +08:00
  • df6747603f [Colossal-Inference] (v0.1.0) Merge pull request #5739 from hpcaitech/feature/colossal-infer Yuanheng Zhao 2024-05-22 14:31:09 +08:00
  • 498f42c45b [NFC] fix requirements (#5744) feature/colossal-infer Yuanheng Zhao 2024-05-22 12:08:49 +08:00
  • bd38fe6b91 [NFC] Fix code factors on inference triton kernels (#5743) Yuanheng Zhao 2024-05-21 22:12:15 +08:00
  • c2c8c9cf17 [ci] Temporary fix for build on pr (#5741) Yuanheng Zhao 2024-05-21 18:20:57 +08:00
  • 13c06d36a3 [bug] fix early return (#5740) botbw 2024-05-21 14:21:58 +08:00
  • c06208e72c Merge pull request #5737 from yuanheng-zhao/inference/sync/main Yuanheng Zhao 2024-05-21 11:26:37 +08:00
  • 22ce873c3f [Shardformer] Add parallel output for shardformer models(bloom, falcon) (#5702) Haze188 2024-05-21 11:07:13 +08:00
  • 83716e9feb Merge pull request #5738 from botbw/prefetch Haze188 2024-05-21 10:40:56 +08:00
  • b3c0e6d871 [pre-commit.ci] auto fixes from pre-commit.com hooks pre-commit-ci[bot] 2024-05-21 02:09:14 +00:00
  • 137a7c341b [chore] fix init error hxwang 2024-05-21 02:07:21 +00:00
  • 8633c15da9 [sync] Sync feature/colossal-infer with main Yuanheng Zhao 2024-05-20 15:50:53 +00:00
  • d8b1ea4ac9 [doc] Update Inference Readme (#5736) Yuanheng Zhao 2024-05-20 22:50:04 +08:00
  • bdf9a001d6 [Fix/Inference] Add unsupported auto-policy error message (#5730) Yuanheng Zhao 2024-05-20 22:49:18 +08:00
  • f5b7de38a4 Merge pull request #5733 from Hz188/feature/prefetch botbw 2024-05-20 15:31:34 +08:00
  • 90d8d0183c remove personal comments genghaozhe 2024-05-20 07:28:20 +00:00
  • bfcb2d1ff8 refactor the code structure to solve the circular import genghaozhe 2024-05-20 07:25:24 +00:00
  • a280517dd9 remove unrelated file genghaozhe 2024-05-20 05:25:35 +00:00
  • 3b363d44cc Merge branch 'feature/prefetch' of https://github.com/Hz188/ColossalAI into feature/prefetch genghaozhe 2024-05-20 05:23:40 +00:00
  • 1ec92d29af remove perf log, unrelated file and so on genghaozhe 2024-05-20 05:21:26 +00:00
  • 5c6c5d6be3 remove comments genghaozhe 2024-05-20 05:15:51 +00:00
  • df63db7e63 remote comments genghaozhe 2024-05-20 05:15:51 +00:00
  • 7416e4943b fix conflicts to beautify the code genghaozhe 2024-05-20 04:09:51 +00:00
  • f5a5287f87 Merge pull request #5731 from botbw/prefetch botbw 2024-05-20 12:04:33 +08:00
  • d22bf30ca6 implement auto policy prefetch and modify a little origin code. genghaozhe 2024-05-20 04:01:53 +00:00
  • f1918e18a5 [pre-commit.ci] auto fixes from pre-commit.com hooks pre-commit-ci[bot] 2024-05-20 03:00:06 +00:00
  • a55a9e298b [gemini] init auto policy prefetch hxwang 2024-05-20 02:21:17 +00:00
  • 283c407a19 [Inference] Fix Inference Generation Config and Sampling (#5710) Yuanheng Zhao 2024-05-19 15:08:42 +08:00
  • c5ddf17c76 Merge branch 'hpcaitech:feature/prefetch' into feature/prefetch Haze188 2024-05-17 18:58:53 +08:00
  • 06a3a100b3 remove unrelated code genghaozhe 2024-05-17 10:57:49 +00:00
  • 3d625ca836 add some todo Message genghaozhe 2024-05-17 10:55:28 +00:00
  • 9d83c6d715 [lazy] fix lazy cls init (#5720) flybird11111 2024-05-17 18:18:59 +08:00
  • 9690981601 Merge pull request #5722 from botbw/prefetch botbw 2024-05-17 13:46:18 +08:00
  • e57812c672 [chore] Update placement_policy.py botbw 2024-05-17 13:42:18 +08:00
  • 8bcfe360fd [example] Update Inference Example (#5725) Yuanheng Zhao 2024-05-17 11:28:53 +08:00
  • 013690a86b remove set(all_chunks) genghaozhe 2024-05-16 09:57:51 +00:00
  • 6efbadba25 [chore] remove debugging info hxwang 2024-05-16 16:46:39 +08:00
  • 20701d4533 [chore] remove print hxwang 2024-05-16 16:45:50 +08:00
  • f45f8a2aa7 [gemini] maxprefetch means maximum work to keep hxwang 2024-05-16 16:12:53 +08:00
  • fc2248cf99 Merge branch 'prefetch' of github.com:botbw/ColossalAI into feature/prefetch genghaozhe 2024-05-16 08:05:32 +00:00
  • 5470e5f94e a commit for fake push test genghaozhe 2024-05-16 08:03:40 +00:00
  • 6bbe956316 [pre-commit.ci] auto fixes from pre-commit.com hooks pre-commit-ci[bot] 2024-05-16 07:26:19 +00:00
  • 82b25524ff Merge branch 'prefetch' of github.com:botbw/ColossalAI into prefetch hxwang 2024-05-16 07:25:22 +00:00
  • 1f6b57099c Merge branch 'prefetch' of github.com:botbw/ColossalAI into botbw-prefetch genghaozhe 2024-05-16 07:23:40 +00:00
  • 2e68eebdfe [chore] refactor & sync hxwang 2024-05-16 07:22:10 +00:00
  • 2011b1356a [misc] Update PyTorch version in docs (#5724) binmakeswell 2024-05-16 13:54:32 +08:00
  • 5bedea6e10 [pre-commit.ci] auto fixes from pre-commit.com hooks pre-commit-ci[bot] 2024-05-16 05:20:00 +00:00
  • 4148ceed9f [gemini] use compute_chunk to find next chunk hxwang 2024-05-16 13:17:26 +08:00
  • b2e9745888 [chore] sync hxwang 2024-05-16 04:45:06 +00:00
  • a8d459f99a [Inference] Delete duplicated package (#5723) 傅剑寒 2024-05-16 10:49:03 +08:00