Commit Graph

  • acc295dc7b Clean up README.md by removing badge and whitespace main Mark Saroufim 2025-09-01 10:21:46 -07:00
  • 77f55b9121 silence destroy_proces_group() warning (#1387) Mark Saroufim 2025-08-25 13:18:39 -07:00
  • 6079ce615e silence destroy_proces_group() warning no_pg_destroy_warning Mark Saroufim 2025-08-25 13:16:59 -07:00
  • 746c0a2d7e Modernize distributed/rpc/pipeline (#1385) Mark Saroufim 2025-08-25 13:01:44 -07:00
  • 7fce8bb0da Unbreak torch/distributed/rpc/batch (#1386) Mark Saroufim 2025-08-25 13:01:33 -07:00
  • 3b1a2d16c6 Merge branch 'main' into update_batch_rpc update_batch_rpc Mark Saroufim 2025-08-25 12:55:52 -07:00
  • 47980ab8ee update Mark Saroufim 2025-08-25 12:55:03 -07:00
  • e5f34d1ca8 More RPC examples Mark Saroufim 2025-08-25 12:54:14 -07:00
  • 1a24d26a3e Fix More RPC examples Mark Saroufim 2025-08-25 12:53:50 -07:00
  • f9799096c2 update update_rpc Mark Saroufim 2025-08-25 12:24:51 -07:00
  • fce10a508c update Mark Saroufim 2025-08-25 12:22:28 -07:00
  • 89e1215636 Modernize RPC example Mark Saroufim 2025-08-25 12:10:45 -07:00
  • f1723eb8f3 Clean up commented paths in run_python_examples.sh (#1384) Mark Saroufim 2025-08-23 10:32:38 -07:00
  • ef95195574 Clean up commented paths in run_python_examples.sh msaroufim-patch-2 Mark Saroufim 2025-08-23 09:18:43 -07:00
  • 7de8d218df Comment out super resolution paths and run command Mark Saroufim 2025-08-23 00:03:28 -07:00
  • 4a3b6e34f6 Revert "Add Swin Transformer Example " (#1383) Mark Saroufim 2025-08-22 23:21:04 -07:00
  • 302283698e Revert "Add Swin Transformer Example (#1346)" revert-1346-main Mark Saroufim 2025-08-22 23:20:55 -07:00
  • 0e5d2ed8c2 Deploying to gh-pages from @ 89c7d7588d 🚀 gh-pages msaroufim 2025-08-23 06:12:12 +00:00
  • 89c7d7588d Upgrade to Gymnasium rather than Gym (#1381) Mark Towers 2025-08-23 07:11:42 +01:00
  • 32fdb4952f Fix argument help descriptions to match default values (#1281) Nariaki Tateiwa 2025-08-23 15:11:21 +09:00
  • 28d16ffaa5 Add AdamW optimizer support for World Language Model example (#1380) Sang Park 2025-08-23 15:08:25 +09:00
  • 993a98a67a Fix invalid characters in filename in neural_style.py (line 119 in train function) (#1294) Innoy 2025-08-23 14:06:24 +08:00
  • 892eca0a94 Add Swin Transformer Example (#1346) Sumantro Mukherjee 2025-08-23 11:30:14 +05:30
  • a630ec6965 Fix license in environment.py Mark Saroufim 2025-08-13 11:32:39 -07:00
  • 28e586404f Add accelerate API support for Word Language Model example (#1345) Francisco R Castro Garcia 2025-08-13 12:31:07 -06:00
  • 5a4ca9269f Adding torch accelerator and requirements file to FSDP2 example (#1375) Dianana 2025-08-05 19:37:50 -06:00
  • e9a4e7510c Add accelerator API to RPC distributed examples: ddp_rpc, parameter_server, rnn (#1371) Jaime Fraustro 2025-07-27 13:15:30 -06:00
  • 99f5c4e224 mark fsdp1 as deprecated and point to fsdp2 (#1374) Wei (Will) Feng 2025-07-20 23:48:25 -07:00
  • 890a2f0af3 Deploying to gh-pages from @ 8c1555d83c 🚀 soumith 2025-07-16 15:34:22 +00:00
  • 2174f26a9c fsdp_tp_example fsdp1-> fsdp2 Jiani Wang 2025-07-15 11:22:25 -07:00
  • 8c1555d83c remove manual n_heads change Jiani Wang 2025-07-15 13:17:11 -07:00
  • 58370dca54 Bump torch from 1.9.0 to 2.2.0 in /distributed/rpc/batch dependabot[bot] 2024-07-25 11:22:06 +00:00
  • bd1254b79e Fixed "--help" output in README.md Anthony Smith 2024-12-31 13:08:49 -06:00
  • 86ed9e411c Enhance README and examples for Tensor Parallelism - Added installation instructions and example running commands to README.md. - Update files to have a better organization jafraustro 2025-07-10 09:51:46 -07:00
  • 698a89e5fc Add torch.accelerator API to mingGPT example (#34) gera-aldama 2025-07-11 13:18:00 -06:00
  • f84bcb398d add min_gpu verification similart tu tensor_parallel_example.py example jafraustro 2025-07-11 08:17:29 -07:00
  • eab402d04d Update requirements.txt to specify torch version >=2.7 jafraustro 2025-07-10 09:01:04 -07:00
  • 330201265c Refactor DDP example to use Accelerator API jafraustro 2025-07-08 07:54:43 -07:00
  • d86d205660 Switch to torchrun for distributed launches jafraustro 2025-07-08 07:53:39 -07:00
  • d47f0f34f1 Add accelerator API to GCN example. jafraustro 2025-07-08 08:12:11 -07:00
  • c04a5a10ba Update .github/workflows/main_distributed.yaml _githubsgi 2025-07-09 10:20:44 -07:00
  • d3849110ac Update .github/workflows/main_distributed.yaml _githubsgi 2025-07-09 10:20:24 -07:00
  • 26409468df Update .github/workflows/main_distributed.yaml _githubsgi 2025-07-09 10:20:08 -07:00
  • 997c123c0e Updating Python to 3.10, fsdp_tp_example.py to accelerator Goswami, Subrata 2025-07-07 16:49:53 -07:00
  • 3fc7853882 Updating comments in multiple files. Goswami, Subrata 2025-07-01 12:10:19 -07:00
  • bac5d0f5e8 Removing pre's from requirement.txt, ... Goswami, Subrata 2025-07-01 11:17:21 -07:00
  • a5b6297a21 Bumping up Python from 3.8 to 3.9 in wf yaml. sg 2025-07-01 03:16:29 +00:00
  • fe39e70a63 Bumping up Pyhon version to 3.9 . sg 2025-06-30 19:08:29 +00:00
  • 56cc1a6cc3 Updating CI torch version. sg 2025-06-27 01:58:20 +00:00
  • 3055e3ea56 Moving CommDebugMode code to a seperate PR. sg 2025-06-25 00:55:51 +00:00
  • f8c79f153a cuda to accelerator subrata goswami 2025-06-11 23:31:19 +00:00
  • 6bfe651404 Cuda to accelerator, +CommDebugMode sg 2025-06-11 22:24:55 +00:00
  • d0b7e37094 Update main.py tutorial, set persistent_workers, make GPU execution much faster Max Buckley 2025-07-09 21:50:20 +02:00
  • 42068585f7 Fixed use_accel not defined issue (#1363) BL 2025-07-07 13:31:18 -05:00
  • 16554e58b3 imagenet: fix typo addressing args.gpu (#1361) Dmitry Rogozhkin 2025-06-30 11:52:25 -07:00
  • 8d408d2a50 Revert "Add Differentiable Physics: Mass-Spring System example" (#1360) Mark Saroufim 2025-06-25 13:20:01 -07:00
  • ac6b73ef9c Revert "Add Differentiable Physics: Mass-Spring System example (#1359)" revert-1359-add-mass-spring-example Mark Saroufim 2025-06-25 13:19:49 -07:00
  • 7c35995e4c Add Differentiable Physics: Mass-Spring System example (#1359) Abhishek Nandy 2025-06-25 21:16:09 +05:30
  • 6f616144df Add accelerate API support for Super Resolution example (#1358) Francisco R Castro Garcia 2025-06-23 19:17:56 -06:00
  • 2944a9dcc5 Change run_python_examples.sh to run word_language_model/generate.py after each call to word_language_model/main.py Francisco R Castro G 2025-05-20 00:04:45 +00:00
  • 6595d7b461 Update word_language_model/generate.py to remove duplicates, use abc order Francisco R Castro Garcia 2025-05-19 17:55:34 -06:00
  • a86bda00a1 Fix torch>= 2.6 generate.py compatibility in Word_language_example Francisco R Castro G 2025-05-19 22:14:49 +00:00
  • 37986af69a Removing unneeded print Edgar Romo Montiel 2025-06-13 17:53:32 -07:00
  • dcc2474838 Add Accelerator Api to Imagenet Example eromomon 2025-05-19 16:27:21 -07:00
  • 3ddcc89e48 fix typo which make tgt tensor data wrong (#1356) Wenzhe Zhou 2025-06-16 14:05:24 +08:00
  • fcce71c1f0 Restore default CI configuration for VAE and Siamese examples using accelerator API (#1342) eromomon 2025-06-15 17:00:35 -06:00
  • 5cc81aae65 save model in global rank 0 in multinode (#1357) IceCreamMilkyTea 2025-06-16 06:58:17 +08:00
  • abfa4f9cc4 Fix super_resolution example for torch>=2.6 (#1350) Francisco R Castro Garcia 2025-06-14 13:40:15 -06:00
  • ac7e960b35 Support torch>=2.6 in word_language_model example (#1347) Dmitry Rogozhkin 2025-05-17 20:20:56 -07:00
  • cc8e404cbc Use torch.accelerator in DCGAN example (#1344) Dianana 2025-05-13 13:14:47 -06:00
  • f2674984ab Deploying to gh-pages from @ 7028a2ecc9 🚀 msaroufim 2025-05-13 19:09:00 +00:00
  • 7028a2ecc9 This PR Improve docs build ci (#1336) oepeng 2025-05-13 21:08:26 +02:00
  • 70922969e7 FSDP2 example code for tutorial (#1343) Wei (Will) Feng 2025-05-09 10:12:08 -07:00
  • fbdea94a37 Update on "FSDP2 example" gh/weifengpy/1/head Wei Feng 2025-05-08 16:36:02 -07:00
  • 9789d443e3 FSDP2 example gh/weifengpy/1/orig Wei Feng 2025-05-08 16:36:02 -07:00
  • 4c78ad122c Update base for Update on "FSDP2 example" gh/weifengpy/1/base Wei Feng 2025-05-08 16:36:02 -07:00
  • f55919e5e2 Update on "FSDP2 example" Wei Feng 2025-05-07 21:03:00 -07:00
  • 7bd6449ffe Update base for Update on "FSDP2 example" Wei Feng 2025-05-07 21:03:00 -07:00
  • 7d43f13e5e Update on "FSDP2 example" Wei Feng 2025-05-07 20:59:53 -07:00
  • fe27fef5b4 Update base for Update on "FSDP2 example" Wei Feng 2025-05-07 20:59:53 -07:00
  • 098ac2c122 Update on "FSDP2 example" Wei Feng 2025-05-07 19:49:11 -07:00
  • ff5706f9c5 Update base for Update on "FSDP2 example" Wei Feng 2025-05-07 19:49:11 -07:00
  • 4d2b62e6ad Update on "FSDP2 example" Wei Feng 2025-05-05 19:53:31 -07:00
  • 259172f190 Update base for Update on "FSDP2 example" Wei Feng 2025-05-05 19:53:31 -07:00
  • 48eb64b99a args gh/weifengpy/2/orig Wei Feng 2025-05-05 19:51:11 -07:00
  • d14ce566f8 args gh/weifengpy/2/head Wei Feng 2025-05-05 19:51:09 -07:00
  • 9708f36d66 FSDP2 example Wei Feng 2025-05-05 19:45:18 -07:00
  • e5d4d63235 FSDP2 example gh/weifengpy/2/base Wei Feng 2025-05-05 19:45:16 -07:00
  • 54e132ece5 Use torch.accelerator API in VAE example (#1338) eromomon 2025-05-02 10:40:02 -06:00
  • 12dc18e872 Use torch.accelerator API in Siamese Network example (#1337) eromomon 2025-05-01 23:39:26 -06:00
  • b7aebb5ad2 Update tensor_parallel_example.py (#1324) nrothGIT 2025-04-30 20:03:08 -04:00
  • 2c435c7868 Use torch.accelerator API in GAT example (#1335) Jaime Fraustro 2025-04-30 16:28:55 -06:00
  • 65722fe3ce Use torch.accelerator API in mnist examples (#1334) Jaime Fraustro 2025-04-30 08:02:42 -06:00
  • d00125af85 Deploying to gh-pages from @ 6967ff5676 🚀 msaroufim 2025-04-29 18:48:30 +00:00
  • 6967ff5676 Update doc-build.yml and requirements for documentation build (#1333) Jaime Fraustro 2025-04-29 12:47:49 -06:00
  • 65afde6aab Use torch.accelerator API in Fast Neural Style example (#1327) Dmitry Rogozhkin 2025-04-28 12:46:10 -07:00
  • 00ef8a78e4 Respect each example requirements and use uv (#1330) Dmitry Rogozhkin 2025-04-25 21:28:00 -07:00
  • 8393ceb2ba Add support for Intel GPU to Fast Neural Style example (#1318) eromomon 2025-04-18 11:30:03 -06:00
  • 5dfeb46902 fix typo (#1272) Kosi Asuzu 2025-02-09 16:56:58 +00:00
  • 7e907f781f Update doc-build.yml svekars-patch-1 Svetlana Karslioglu 2025-01-29 14:29:53 -08:00