
The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.

# Upgrading
# Migrating
<!---
TODO: update ml-agents-env package version before release
--->
## Migrating to the ml-agents-envs 0.30.0 package
- Python 3.10.12 is now the minimum supported version of Python (see the [Python end-of-life schedule](https://endoflife.date/python)). Please update your Python installation to 3.10.12 or higher.
- The `gym-unity` package has been refactored into the `ml-agents-envs` package. Please update your imports accordingly.
  - Example:
    - Before:
      ```python
      from gym_unity.unity_gym_env import UnityToGymWrapper
      ```
    - After:
      ```python
      from mlagents_envs.envs.unity_gym_env import UnityToGymWrapper
      ```
## Migrating the package to version 3.x
- The minimum version of Unity officially supported by ML-Agents is now 6000.0. If you run
  into issues, please consider deleting your project's Library folder and reopening your
  project.
## Migrating the package to version 2.x
- The minimum version of Unity officially supported by ML-Agents is now 2022.3 LTS. If you run
  into issues, please consider deleting your project's Library folder and reopening your
  project.
- If you used any of the APIs that were deprecated before version 2.0, you need to use their replacements; these
  deprecated APIs have been removed. See the migration steps below for specific API replacements.
### Deprecated methods removed
| **Deprecated API** | **Suggested Replacement** |
|:-------:|:------:|
| `IActuator ActuatorComponent.CreateActuator()` | `IActuator[] ActuatorComponent.CreateActuators()` |
| `IActionReceiver.PackActions(in float[] destination)` | none |
| `Agent.CollectDiscreteActionMasks(DiscreteActionMasker actionMasker)` | `Agent.WriteDiscreteActionMask(IDiscreteActionMask actionMask)` |
| `Agent.Heuristic(float[] actionsOut)` | `Agent.Heuristic(in ActionBuffers actionsOut)` |
| `Agent.OnActionReceived(float[] vectorAction)` | `Agent.OnActionReceived(ActionBuffers actions)` |
| `Agent.GetAction()` | `Agent.GetStoredActionBuffers()` |
| `BrainParameters.SpaceType`, `VectorActionSize`, `VectorActionSpaceType`, and `NumActions` | `BrainParameters.ActionSpec` |
| `ObservationWriter.AddRange(IEnumerable<float> data, int writeOffset = 0)` | `ObservationWriter.AddList(IList<float> data, int writeOffset = 0)` |
| `SensorComponent.IsVisual()` and `IsVector()` | none |
| `VectorSensor.AddObservation(IEnumerable<float> observation)` | `VectorSensor.AddObservation(IList<float> observation)` |
| `SideChannelsManager` | `SideChannelManager` |
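For example, a minimal sketch of an `Agent` subclass updated to the `ActionBuffers`-based overrides (the class, fields, and input axes here are hypothetical) might look like:
```csharp
using Unity.MLAgents;
using Unity.MLAgents.Actuators;
using UnityEngine;

public class MyAgent : Agent  // hypothetical example agent
{
    // Before: public override void OnActionReceived(float[] vectorAction)
    public override void OnActionReceived(ActionBuffers actions)
    {
        // Continuous and discrete actions now live in separate segments.
        var moveX = actions.ContinuousActions[0];
        var jump = actions.DiscreteActions[0];
        // ... apply moveX and jump to the agent ...
    }

    // Before: public override void Heuristic(float[] actionsOut)
    public override void Heuristic(in ActionBuffers actionsOut)
    {
        var continuousOut = actionsOut.ContinuousActions;
        continuousOut[0] = Input.GetAxis("Horizontal");
        var discreteOut = actionsOut.DiscreteActions;
        discreteOut[0] = Input.GetKey(KeyCode.Space) ? 1 : 0;
    }
}
```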
### IDiscreteActionMask changes
- The interface for disabling specific discrete actions has changed. `IDiscreteActionMask.WriteMask()` was removed
  and replaced with `SetActionEnabled()`. Instead of passing an IEnumerable of indices to disable, you now call
  `SetActionEnabled()` for each index you want to disable (or enable). As an example, if you overrode
  `Agent.WriteDiscreteActionMask()` with something that looked like:
```csharp
public override void WriteDiscreteActionMask(IDiscreteActionMask actionMask)
{
    var branch = 2;
    var actionsToDisable = new[] {1, 3};
    actionMask.WriteMask(branch, actionsToDisable);
}
```
the equivalent code would now be
```csharp
public override void WriteDiscreteActionMask(IDiscreteActionMask actionMask)
{
    var branch = 2;
    actionMask.SetActionEnabled(branch, 1, false);
    actionMask.SetActionEnabled(branch, 3, false);
}
```
### IActuator changes
- The `IActuator` interface now implements `IHeuristicProvider`. Please add the corresponding `Heuristic(in ActionBuffers)`
method to your custom Actuator classes.
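A rough sketch of a custom actuator satisfying the new interface (the actuator name and action layout are hypothetical):
```csharp
using Unity.MLAgents.Actuators;

// Hypothetical actuator exposing a single two-way discrete branch.
public class JumpActuator : IActuator
{
    public ActionSpec ActionSpec => ActionSpec.MakeDiscrete(2); // branch 0: {no-op, jump}
    public string Name => "JumpActuator";

    public void OnActionReceived(ActionBuffers actionBuffers)
    {
        if (actionBuffers.DiscreteActions[0] == 1)
        {
            // ... trigger the jump ...
        }
    }

    // Newly required now that IActuator implements IHeuristicProvider.
    public void Heuristic(in ActionBuffers actionBuffersOut)
    {
        var discreteActions = actionBuffersOut.DiscreteActions;
        discreteActions[0] = 0; // default to no-op when driven manually
    }

    public void WriteDiscreteActionMask(IDiscreteActionMask actionMask) { }
    public void ResetData() { }
}
```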
### ISensor and SensorComponent changes
- The `ISensor.GetObservationShape()` method and `ITypedSensor`
and `IDimensionPropertiesSensor` interfaces were removed, and `GetObservationSpec()` was added. You can use
`ObservationSpec.Vector()` or `ObservationSpec.Visual()` to generate `ObservationSpec`s that are equivalent to
the previous shape. For example, if your old ISensor looked like:
```csharp
public override int[] GetObservationShape()
{
    return new[] { m_Height, m_Width, m_NumChannels };
}
```
the equivalent code would now be
```csharp
public override ObservationSpec GetObservationSpec()
{
    return ObservationSpec.Visual(m_Height, m_Width, m_NumChannels);
}
```
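Similarly, a sensor that previously returned a 1-D shape would move to `ObservationSpec.Vector()` (a sketch; `m_ObservationSize` is a hypothetical field):
```csharp
// Before:
// public override int[] GetObservationShape()
// {
//     return new[] { m_ObservationSize };
// }
public override ObservationSpec GetObservationSpec()
{
    return ObservationSpec.Vector(m_ObservationSize);
}
```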
- The `ISensor.GetCompressionType()` method and `ISparseChannelSensor` interface were removed,
and `GetCompressionSpec()` was added. You can use `CompressionSpec.Default()` or
`CompressionSpec.Compressed()` to generate `CompressionSpec`s that are equivalent to
the previous values. For example, if your old ISensor looked like:
```csharp
public virtual SensorCompressionType GetCompressionType()
{
    return SensorCompressionType.None;
}
```
the equivalent code would now be
```csharp
public CompressionSpec GetCompressionSpec()
{
    return CompressionSpec.Default();
}
```
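And if your sensor previously returned a compressed type, the equivalent would be `CompressionSpec.Compressed()` (a sketch, assuming PNG compression):
```csharp
// Before: return SensorCompressionType.PNG;
public CompressionSpec GetCompressionSpec()
{
    return CompressionSpec.Compressed(SensorCompressionType.PNG);
}
```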
- The abstract method `SensorComponent.GetObservationShape()` was removed.
- The abstract method `SensorComponent.CreateSensor()` was replaced with `CreateSensors()`, which returns an `ISensor[]`.
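For a component that previously created a single sensor, the update is mechanical. A minimal sketch (`MySensor` is a placeholder for your own `ISensor` implementation):
```csharp
using Unity.MLAgents.Sensors;

public class MySensorComponent : SensorComponent
{
    // Before: public override ISensor CreateSensor() { return new MySensor(); }
    public override ISensor[] CreateSensors()
    {
        return new ISensor[] { new MySensor() };
    }
}
```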
### Match3 integration changes
The Match-3 integration utilities were moved from `com.unity.ml-agents.extensions` to `com.unity.ml-agents`.
The `AbstractBoard` interface was changed:
* `AbstractBoard` no longer contains `Rows`, `Columns`, `NumCellTypes`, and `NumSpecialTypes` fields.
* `public abstract BoardSize GetMaxBoardSize()` was added as an abstract method. `BoardSize` is a new struct that
contains `Rows`, `Columns`, `NumCellTypes`, and `NumSpecialTypes` fields, with the same meanings as the old
`AbstractBoard` fields.
* `public virtual BoardSize GetCurrentBoardSize()` is an optional method; by default it returns `GetMaxBoardSize()`. If
you wish to use a single behavior to work with multiple board sizes, override `GetCurrentBoardSize()` to return the
current `BoardSize`. The values returned by `GetCurrentBoardSize()` must be less than or equal to the corresponding
values from `GetMaxBoardSize()`.
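A sketch of a board migrated to the new interface (the dimensions are hypothetical, and the remaining `AbstractBoard` members are stubbed out for brevity):
```csharp
using Unity.MLAgents.Integrations.Match3;

public class MyBoard : AbstractBoard
{
    public override BoardSize GetMaxBoardSize()
    {
        // Replaces the old Rows/Columns/NumCellTypes/NumSpecialTypes fields.
        return new BoardSize { Rows = 9, Columns = 9, NumCellTypes = 6, NumSpecialTypes = 2 };
    }

    public override BoardSize GetCurrentBoardSize()
    {
        // Optional override; every field must be <= its GetMaxBoardSize() value.
        return new BoardSize { Rows = 7, Columns = 7, NumCellTypes = 6, NumSpecialTypes = 2 };
    }

    // Stubs for the other abstract members, for brevity.
    public override int GetCellType(int row, int col) { return 0; }
    public override int GetSpecialType(int row, int col) { return 0; }
    public override bool IsMoveValid(Move m) { return true; }
    public override bool MakeMove(Move m) { return true; }
}
```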
### GridSensor changes
The sensor configuration has changed:
* The sensor implementation has been refactored, and existing GridSensors created with the extensions package
  will not work in newer versions; errors may appear when loading the old sensor in a scene.
  You'll need to remove the old sensor and create a new GridSensor.
* These parameter names have changed but still refer to the same concepts in the sensor: `GridNumSide` -> `GridSize`,
  `RotateToAgent` -> `RotateWithAgent`, `ObserveMask` -> `ColliderMask`, `DetectableObjects` -> `DetectableTags`.
* The `DepthType` (`ChannelBase`/`ChannelHot`) option and `ChannelDepth` are removed. The default is now
  one-hot encoding of the detected tag. If you were using the original GridSensor without overriding any methods,
  switching to the new GridSensor will produce a similar effect for training, although the actual observations
  will differ slightly.
To create your own GridSensor implementation with custom data:
* Derive from `GridSensorBase` instead of `GridSensor`. Besides overriding
  `GetObjectData()`, you will also need to consider overriding `GetCellObservationSize()`, `IsDataNormalized()`,
  and `GetProcessCollidersMethod()` according to the data you collect. You'll also need to override
  `GridSensorComponent.GetGridSensors()` to return your custom GridSensor; see the sketch after this list.
* The input argument `tagIndex` in `GetObjectData()` has changed from 1-indexed to 0-indexed, and its
  data type changed from `float` to `int`; the index of the first detectable tag is now 0 instead of 1.
  `normalizedDistance` was removed from the input arguments.
* The observation data should be written to the input `dataBuffer` instead of creating and returning a new array.
* The constraint that all data must be normalized has been removed; instead, specify whether your data is
  normalized in `IsDataNormalized()`. Sensors with non-normalized data cannot use the PNG compression type.
* The sensor no longer further encodes the data received from `GetObjectData()`; the values
  it receives are the observations sent to the trainer.
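As a rough sketch of these points (constructor and method signatures are assumed from the notes above; consult `GridSensorBase` in the package source for the exact forms), a sensor that counts detected objects per tag might look like:
```csharp
using UnityEngine;
using Unity.MLAgents.Sensors;

// Hypothetical custom grid sensor that counts detected objects per tag.
public class CountingGridSensor : GridSensorBase
{
    readonly int m_NumTags;

    public CountingGridSensor(
        string name, Vector3 cellScale, Vector3Int gridSize,
        string[] detectableTags, SensorCompressionType compression)
        : base(name, cellScale, gridSize, detectableTags, compression)
    {
        m_NumTags = detectableTags.Length;
    }

    protected override int GetCellObservationSize()
    {
        // One observation slot per detectable tag.
        return m_NumTags;
    }

    protected override bool IsDataNormalized()
    {
        // Counts are unbounded, so the data is not normalized
        // (which also rules out PNG compression).
        return false;
    }

    protected override void GetObjectData(GameObject detectedObject, int tagIndex, float[] dataBuffer)
    {
        // tagIndex is now 0-indexed; write directly into the provided buffer.
        dataBuffer[tagIndex] += 1f;
    }
}
```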
### LSTM models from previous releases no longer supported
The way that Sentis processes LSTM (recurrent neural networks) has changed. As a result, models
trained with previous versions of ML-Agents will not be usable at inference if they were trained with a `memory`
setting in the `.yaml` config file.
If you want to use a model that has a recurrent neural network in this release of ML-Agents, you need to train
the model using the python trainer from this release.
## Migrating to Release 13
### Implementing IHeuristic in your IActuator implementations
- If you have any custom actuators, you can now implement the `IHeuristicProvider` interface to have your actuator
handle the generation of actions when an Agent is running in heuristic mode.
- `VectorSensor.AddObservation(IEnumerable<float>)` is deprecated. Use `VectorSensor.AddObservation(IList<float>)`
instead.
- `ObservationWriter.AddRange()` is deprecated. Use `ObservationWriter.AddList()` instead.
- `ActuatorComponent.CreateActuator()` is deprecated. Please override `ActuatorComponent.CreateActuators()`
  instead. Since `ActuatorComponent.CreateActuator()` is abstract, you will still need to override it in your
  class until it is removed; it is only ever called if you don't override `ActuatorComponent.CreateActuators()`.
  You can suppress the warnings by surrounding the method with the following pragma:
```csharp
#pragma warning disable 672
public IActuator CreateActuator() { ... }
#pragma warning restore 672
```
## Migrating to Release 11
### Agent virtual method deprecation
- `Agent.CollectDiscreteActionMasks()` was deprecated and should be replaced with `Agent.WriteDiscreteActionMask()`.
- `Agent.Heuristic(float[])` was deprecated and should be replaced with `Agent.Heuristic(ActionBuffers)`.
- `Agent.OnActionReceived(float[])` was deprecated and should be replaced with `Agent.OnActionReceived(ActionBuffers)`.
- `Agent.GetAction()` was deprecated and should be replaced with `Agent.GetStoredActionBuffers()`.
The default implementation of these will continue to call the deprecated versions where appropriate. However, the
deprecated versions may not be compatible with continuous and discrete actions on the same Agent.
### BrainParameters field and method deprecation
- `BrainParameters.VectorActionSize` was deprecated; you can now set `BrainParameters.ActionSpec.NumContinuousActions`
or `BrainParameters.ActionSpec.BranchSizes` instead.
- `BrainParameters.VectorActionSpaceType` was deprecated, since both continuous and discrete actions can now be used.
- `BrainParameters.NumActions()` was deprecated. Use `BrainParameters.ActionSpec.NumContinuousActions` and
`BrainParameters.ActionSpec.NumDiscreteActions` instead.
## Migrating from Release 7 to latest
### Important changes
- Some trainer files were moved. If you were using the `TrainerFactory` class, it was moved to
the `trainers/trainer` folder.
- The `components` folder containing `bc` and `reward_signals` code was moved to the `trainers/tf`
  folder.
### Steps to Migrate
- Replace `from mlagents.trainers.trainer_util import TrainerFactory` with `from mlagents.trainers.trainer import TrainerFactory`.
- Replace `from mlagents.trainers.trainer_util import handle_existing_directories` with `from mlagents.trainers.directory_utils import validate_existing_directories`.
- Replace `mlagents.trainers.components` with `mlagents.trainers.tf.components` in your import statements.
## Migrating from Release 3 to Release 7
### Important changes
- The Parameter Randomization feature has been merged with the Curriculum feature. It is now possible to specify a sampler
in the lesson of a Curriculum. Curriculum has been refactored and is now specified at the level of the parameter, not the
behavior. More information is available
[here](https://github.com/Unity-Technologies/ml-agents/blob/release_22_docs/docs/Training-ML-Agents.md). (#4160)
### Steps to Migrate
- The configuration format for curriculum and parameter randomization has changed. To upgrade your configuration files,
  an upgrade script has been provided. Run `python -m mlagents.trainers.upgrade_config -h` to see the script usage. Note that you must upgrade to or install the current version of ML-Agents before running the script. To update manually:
- If your config file used a `parameter_randomization` section, rename that section to `environment_parameters`
- If your config file used a `curriculum` section, you will need to rewrite your curriculum with this [format](Training-ML-Agents.md#curriculum).
## Migrating from Release 1 to Release 3
### Important changes
- Training artifacts (trained models, summaries) are now found under `results/`
instead of `summaries/` and `models/`.
- Trainer configuration, curriculum configuration, and parameter randomization
configuration have all been moved to a single YAML file. (#3791)
- Trainer configuration format has changed, and using a "default" behavior name has
been deprecated. (#3936)
- `max_step` in the `TerminalStep` and `TerminalSteps` objects was renamed `interrupted`.
- On the UnityEnvironment API, `get_behavior_names()` and `get_behavior_specs()` methods were combined into the property `behavior_specs` that contains a mapping from behavior names to behavior spec.
- `use_visual` and `allow_multiple_visual_obs` in the `UnityToGymWrapper` constructor
were replaced by `allow_multiple_obs` which allows one or more visual observations and
vector observations to be used simultaneously.
- `--save-freq` has been removed from the CLI and is now configurable in the trainer configuration
file.
- `--lesson` has been removed from the CLI. Lessons will resume when using `--resume`.
To start at a different lesson, modify your Curriculum configuration.
### Steps to Migrate
- To upgrade your configuration files, an upgrade script has been provided. Run
  `python -m mlagents.trainers.upgrade_config -h` to see the script usage. Note that you must
  upgrade to or install the current version of ML-Agents before running the script.
To do it manually, copy your `<BehaviorName>` sections from `trainer_config.yaml` into a separate trainer configuration file, under a `behaviors` section.
The `default` section is no longer needed. This new file should be specific to your environment, and not contain
configurations for multiple environments (unless they have the same Behavior Names).
- You will need to reformat your trainer settings as per the [example](Training-ML-Agents.md).
- If your training uses [curriculum](Training-ML-Agents.md#curriculum-learning), move those configurations under a `curriculum` section.
- If your training uses [parameter randomization](Training-ML-Agents.md#environment-parameter-randomization), move
the contents of the sampler config to `parameter_randomization` in the main trainer configuration.
- If you are using `UnityEnvironment` directly, replace `max_step` with `interrupted`
in the `TerminalStep` and `TerminalSteps` objects.
- Replace usage of `get_behavior_names()` and `get_behavior_specs()` in UnityEnvironment with `behavior_specs`.
- If you use the `UnityToGymWrapper`, remove `use_visual` and `allow_multiple_visual_obs`
from the constructor and add `allow_multiple_obs = True` if the environment contains either
both visual and vector observations or multiple visual observations.
- If you were setting `--save-freq` in the CLI, add a `checkpoint_interval` value in your
trainer configuration, and set it equal to `save-freq * n_agents_in_scene`.
## Migrating from 0.15 to Release 1
### Important changes
- The `MLAgents` C# namespace was renamed to `Unity.MLAgents`, and other nested
namespaces were similarly renamed (#3843).
- The `--load` and `--train` command-line flags have been deprecated and
replaced with `--resume` and `--inference`.
- Running with the same `--run-id` twice will now throw an error.
- The `play_against_current_self_ratio` self-play trainer hyperparameter has
  been renamed to `play_against_latest_model_ratio`.
- Removed the multi-agent gym option from the gym wrapper. For multi-agent
scenarios, use the [Low Level Python API](Python-LLAPI.md).
- The low level Python API has changed. You can look at the document
[Low Level Python API documentation](Python-LLAPI.md) for more information. If
you use `mlagents-learn` for training, this should be a transparent change.
- The obsolete `Agent` methods `GiveModel`, `Done`, `InitializeAgent`,
`AgentAction` and `AgentReset` have been removed.
- The signature of `Agent.Heuristic()` was changed to take a `float[]` as a
parameter, instead of returning the array. This was done to prevent a common
source of error where users would return arrays of the wrong size.
- The SideChannel API has changed (#3833, #3660):
- Introduced the `SideChannelManager` to register, unregister and access side
channels.
  - `EnvironmentParameters` replaces the default `FloatProperties`. You can
    access the `EnvironmentParameters` with
    `Academy.Instance.EnvironmentParameters` in C#. If you were previously
    creating a `UnityEnvironment` in Python and passing it a
    `FloatPropertiesChannel`, create an `EnvironmentParametersChannel` instead.
  - `SideChannel.OnMessageReceived` is now a protected method (was public).
- SideChannel IncomingMessages methods now take an optional default argument,
which is used when trying to read more data than the message contains.
- Added a feature to allow sending stats from C# environments to TensorBoard
  (and other Python `StatsWriters`). To do this from your code, use
  `Academy.Instance.StatsRecorder.Add(key, value)` (#3660); see the example
  after this list.
- `num_updates` and `train_interval` for SAC have been replaced with
`steps_per_update`.
- The `UnityEnv` class from the `gym-unity` package was renamed
  `UnityToGymWrapper` and no longer creates the `UnityEnvironment`. Instead,
  the `UnityEnvironment` must be passed as input to the constructor of
  `UnityToGymWrapper`.
- Public fields and properties on several classes were renamed to follow Unity's
C# style conventions. All public fields and properties now use "PascalCase"
instead of "camelCase"; for example, `Agent.maxStep` was renamed to
`Agent.MaxStep`. For a full list of changes, see the pull request. (#3828)
- `WriteAdapter` was renamed to `ObservationWriter`. (#3834)
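A minimal sketch of recording a custom stat from C#, as referenced in the list
above; the component, stat key, and value are illustrative:

```csharp
using Unity.MLAgents;
using UnityEngine;

public class CargoStats : MonoBehaviour
{
    void FixedUpdate()
    {
        // Sends a custom metric through the stats side channel; mlagents-learn
        // forwards it to TensorBoard and any other configured StatsWriters.
        Academy.Instance.StatsRecorder.Add("Environment/CargoDelivered", 1.0f);
    }
}
```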
### Steps to Migrate
- In C# code, replace `using MLAgents` with `using Unity.MLAgents`. Replace
  other nested namespaces such as `using MLAgents.Sensors` with
  `using Unity.MLAgents.Sensors` (a before/after sketch appears after this
  list).
- Replace the `--load` flag with `--resume` when calling `mlagents-learn`, and
  drop the `--train` flag, since training now happens by default. To run
  without training, use `--inference`.
- To force-overwrite files from a pre-existing run, add the `--force`
  command-line flag. Example invocations appear after this list.
- The Jupyter notebooks have been removed from the repository.
- If your Agent class overrides `Heuristic()`, change the signature to
  `public override void Heuristic(float[] actionsOut)` and assign values to
  `actionsOut` instead of returning an array (see the sketch after this list).
- If you used `SideChannels`, you must:
  - Replace `Academy.FloatProperties` with
    `Academy.Instance.EnvironmentParameters` (sketch after this list).
- `Academy.RegisterSideChannel` and `Academy.UnregisterSideChannel` were
    removed. Use `SideChannelManager.RegisterSideChannel` and
    `SideChannelManager.UnregisterSideChannel` instead (sketch after this
    list).
- Set `steps_per_update` to be roughly equal to the number of agents in your
  environment, multiplied by your old `num_updates` and divided by your old
  `train_interval`. For example, with 20 agents, `num_updates: 1`, and
  `train_interval: 2`, set `steps_per_update: 10`.
- Replace `UnityEnv` with `UnityToGymWrapper` in your code. The constructor no
  longer takes a file name; pass a fully constructed `UnityEnvironment`
  instead.
- Update uses of "camelCase" fields and properties to "PascalCase".
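For the namespace step above, a minimal before/after sketch:

```csharp
// Before (0.15):
// using MLAgents;
// using MLAgents.Sensors;

// After (Release 1):
using Unity.MLAgents;
using Unity.MLAgents.Sensors;
```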
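For the command-line steps above, example invocations; the config path and run
id are illustrative:

```sh
# Before: mlagents-learn config/trainer_config.yaml --run-id=MyRun --train --load
# After: training is the default, and --resume continues a previous run.
mlagents-learn config/trainer_config.yaml --run-id=MyRun --resume

# Run in inference mode instead of training:
mlagents-learn config/trainer_config.yaml --run-id=MyRun --inference

# Start over, overwriting previous results for the same run id:
mlagents-learn config/trainer_config.yaml --run-id=MyRun --force
```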
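For the `Heuristic` step above, a minimal sketch of the new signature; the
input axes are illustrative:

```csharp
using Unity.MLAgents;
using UnityEngine;

public class MyAgent : Agent
{
    // Before: public override float[] Heuristic() returned a new array.
    // After: write into the buffer supplied by the caller.
    public override void Heuristic(float[] actionsOut)
    {
        actionsOut[0] = Input.GetAxis("Horizontal");
        actionsOut[1] = Input.GetAxis("Vertical");
    }
}
```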
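For the `EnvironmentParameters` step above, a sketch of reading a reset
parameter in C#; the parameter name and default value are illustrative:

```csharp
using Unity.MLAgents;
using UnityEngine;

public class ParamReader : MonoBehaviour
{
    void Start()
    {
        // Before: Academy.Instance.FloatProperties.GetPropertyWithDefault(...)
        var gravity =
            Academy.Instance.EnvironmentParameters.GetWithDefault("gravity", 9.81f);
        Physics.gravity = new Vector3(0f, -gravity, 0f);
    }
}
```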
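For the `SideChannelManager` step above, a sketch of registering and
unregistering a channel; `MyStringChannel` stands in for your own `SideChannel`
subclass:

```csharp
using Unity.MLAgents.SideChannels;
using UnityEngine;

public class ChannelRegistration : MonoBehaviour
{
    MyStringChannel m_Channel; // illustrative custom SideChannel subclass

    void Awake()
    {
        m_Channel = new MyStringChannel();
        // Before: Academy.RegisterSideChannel(m_Channel);
        SideChannelManager.RegisterSideChannel(m_Channel);
    }

    void OnDestroy()
    {
        // Before: Academy.UnregisterSideChannel(m_Channel);
        SideChannelManager.UnregisterSideChannel(m_Channel);
    }
}
```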
## Migrating from 0.14 to 0.15
### Important changes
- The `Agent.CollectObservations()` virtual method now takes a `VectorSensor`
  as its argument. The `Agent.AddVectorObs()` methods were removed.
- `SetActionMask` was renamed to `SetMask` and must now be called on the
  `DiscreteActionMasker` argument of the `CollectDiscreteActionMasks` virtual
  method.
- We consolidated our API for `DiscreteActionMasker`. `SetMask` takes two
  arguments: the branch index and the list of masked actions for that branch.
- The `Monitor` class has been moved to the Examples Project. (It was prone to
errors during testing)
- The `MLAgents.Sensors` namespace has been introduced. All sensors classes are
part of the `MLAgents.Sensors` namespace.
- The `MLAgents.SideChannels` namespace has been introduced. All side channel
classes are part of the `MLAgents.SideChannels` namespace.
- The interface for `RayPerceptionSensor.PerceiveStatic()` was changed to take
an input class and write to an output class, and the method was renamed to
`Perceive()`.
- The method `GetStepCount()` on the Agent class has been replaced with the
  property getter `StepCount`.
- The `--multi-gpu` option has been removed temporarily.
- `AgentInfo.actionMasks` has been renamed to `AgentInfo.discreteActionMasks`.
- `BrainParameters` and `SpaceType` have been removed from the public API.
- `BehaviorParameters` has been removed from the public API.
- `DecisionRequester` has been made internal (you can still use the
  `DecisionRequester` component from the Inspector). `RepeatAction` was renamed
  `TakeActionsBetweenDecisions` for clarity.
- The following methods in the `Agent` class have been renamed. The original
method names will be removed in a later release:
- `InitializeAgent()` was renamed to `Initialize()`
- `AgentAction()` was renamed to `OnActionReceived()`
- `AgentReset()` was renamed to `OnEpisodeBegin()`
- `Done()` was renamed to `EndEpisode()`
- `GiveModel()` was renamed to `SetModel()`
- The `IFloatProperties` interface has been removed.
- The interface for SideChannels was changed:
  - In C#, `OnMessageReceived` now takes an `IncomingMessage` argument, and
    `QueueMessageToSend` takes an `OutgoingMessage` argument.
  - In Python, `on_message_received` now takes an `IncomingMessage` argument,
    and `queue_message_to_send` takes an `OutgoingMessage` argument.
- Automatic stepping for Academy is now controlled from the
  `AutomaticSteppingEnabled` property.
### Steps to Migrate
- Add `using MLAgents.Sensors;` in addition to `using MLAgents;` at the top of
  your Agent's script.
- Replace your Agent's implementation of `CollectObservations()` with
  `CollectObservations(VectorSensor sensor)`. In addition, replace all calls to
  `AddVectorObs()` with `sensor.AddObservation()` or
  `sensor.AddOneHotObservation()` on the `VectorSensor` passed as argument
  (sketch after this list).
- Replace your calls to `SetActionMask` on your Agent with
  `DiscreteActionMasker.SetMask` in `CollectDiscreteActionMasks` (sketch after
  this list).
- If you call `RayPerceptionSensor.PerceiveStatic()` manually, add your inputs
to a `RayPerceptionInput`. To get the previous float array output, iterate
through `RayPerceptionOutput.rayOutputs` and call
`RayPerceptionOutput.RayOutput.ToFloatArray()`.
- Replace all calls to `Agent.GetStepCount()` with `Agent.StepCount`.
- We strongly recommend replacing the following methods with their new
  equivalents, as they will be removed in a later release (sketches after this
  list):
- `InitializeAgent()` to `Initialize()`
- `AgentAction()` to `OnActionReceived()`
- `AgentReset()` to `OnEpisodeBegin()`
- `Done()` to `EndEpisode()`
- `GiveModel()` to `SetModel()`
- Replace `IFloatProperties` variables with `FloatPropertiesChannel` variables.
- If you implemented custom `SideChannels`, update the signatures of your
  methods, and add your data to the `OutgoingMessage` or read it from the
  `IncomingMessage` (sketch after this list).
- Replace calls to `Academy.EnableAutomaticStepping()` /
  `Academy.DisableAutomaticStepping()` with
  `Academy.AutomaticSteppingEnabled = true/false` (sketch after this list).
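For the `CollectObservations` step above, a minimal sketch; the observed values
are illustrative:

```csharp
using MLAgents;
using MLAgents.Sensors;
using UnityEngine;

public class MyAgent : Agent
{
    // Before: public override void CollectObservations()
    //         { AddVectorObs(transform.localPosition); }
    public override void CollectObservations(VectorSensor sensor)
    {
        sensor.AddObservation(transform.localPosition);
        sensor.AddObservation(transform.localRotation);
    }
}
```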
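For the action-masking step above, a minimal sketch; the branch index and
masked actions are illustrative (on some 0.15.x builds the method may still be
named `SetActionMask`):

```csharp
using MLAgents;

public class MaskedAgent : Agent
{
    public override void CollectDiscreteActionMasks(DiscreteActionMasker actionMasker)
    {
        // Forbid actions 1 and 2 on branch 0 for this decision.
        actionMasker.SetMask(0, new[] { 1, 2 });
    }
}
```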
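For the renaming step above, a sketch of an Agent using the new method names;
the empty bodies are placeholders:

```csharp
using MLAgents;

public class RenamedAgent : Agent
{
    public override void Initialize() { }     // was InitializeAgent()
    public override void OnEpisodeBegin() { } // was AgentReset()

    public override void OnActionReceived(float[] vectorAction) // was AgentAction()
    {
        // Call EndEpisode() where you previously called Done(), and
        // SetModel() where you previously called GiveModel().
    }
}
```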
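For the custom `SideChannels` step above, a sketch of the updated message-based
signatures; the class, its payload, and the omitted channel-ID setup are
illustrative:

```csharp
using MLAgents.SideChannels;
using UnityEngine;

public class MyStringChannel : SideChannel
{
    // Channel-ID setup omitted for brevity.

    // Before: public override void OnMessageReceived(byte[] data)
    public override void OnMessageReceived(IncomingMessage msg)
    {
        Debug.Log("Received from Python: " + msg.ReadString());
    }

    public void SendToPython(string text)
    {
        using (var msg = new OutgoingMessage())
        {
            msg.WriteString(text);
            QueueMessageToSend(msg); // before: took a byte[]
        }
    }
}
```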
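For the automatic-stepping step above, a sketch of taking manual control;
stepping once per `FixedUpdate` is just one possible policy:

```csharp
using MLAgents;
using UnityEngine;

public class ManualStepper : MonoBehaviour
{
    void Awake()
    {
        // Replaces the old DisableAutomaticStepping() call.
        Academy.Instance.AutomaticSteppingEnabled = false;
    }

    void FixedUpdate()
    {
        // Step the Academy manually whenever the simulation is ready.
        Academy.Instance.EnvironmentStep();
    }
}
```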
## Migrating from 0.13 to 0.14
### Important changes
- The `UnitySDK` folder has been split into a Unity Package
(`com.unity.ml-agents`) and an examples project (`Project`). Please follow the
[Installation Guide](Installation.md) to get up and running with this new repo
structure.
- Several changes were made to how agents are reset and marked as done:
- Calling `Done()` on the Agent will now reset it immediately and call the
`AgentReset` virtual method. (This is to simplify the previous logic in
which the Agent had to wait for the next `EnvironmentStep` to reset)
- The "Reset on Done" setting in AgentParameters was removed; this is now
effectively always true. `AgentOnDone` virtual method on the Agent has been
removed.
- The `Decision Period` and `On Demand decision` checkboxes have been removed
  from the Agent. On-demand decisions are now the default (call
  `RequestDecision` on the Agent manually).
- The Academy class was changed to a singleton, and its virtual methods were
removed.
- Trainer steps are now counted per-Agent, not per-environment as in previous
  versions. For instance, if you have 10 Agents in the scene, 20 environment
  steps now correspond to 200 steps as printed in the terminal and in
  TensorBoard.
- Curriculum config files are now YAML formatted and all curricula for a
training run are combined into a single file.
- The `--num-runs` command-line option has been removed from `mlagents-learn`.
- Several fields on the Agent were removed or made private in order to simplify
the interface.
- The `agentParameters` field of the Agent has been removed. (Contained only
`maxStep` information)
- `maxStep` is now a public field on the Agent. (Was moved from
`agentParameters`)
- The `Info` field of the Agent has been made private. (Was only used
internally and not meant to be modified outside of the Agent)
- The `GetReward()` method on the Agent has been removed. (It was being
confused with `GetCumulativeReward()`)
- The `AgentAction` struct no longer contains a `value` field. (Value
estimates were not set during inference)
- The `GetValueEstimate()` method on the Agent has been removed.
- The `UpdateValueAction()` method on the Agent has been removed.
- The deprecated `RayPerception3D` and `RayPerception2D` classes were removed,
and the `legacyHitFractionBehavior` argument was removed from
`RayPerceptionSensor.PerceiveStatic()`.
- `RayPerceptionSensor` was inconsistent in how it handled scale on the
  Agent's transform. It now scales the ray length and sphere size for casting
  as the transform's scale changes.
### Steps to Migrate
- Follow the instructions on how to install the `com.unity.ml-agents` package
into your project in the [Installation Guide](Installation.md).
- If your Agent implemented `AgentOnDone` and did not have the checkbox
`Reset On Done` checked in the inspector, you must call the code that was in
`AgentOnDone` manually.
- If you give your Agent a reward or penalty at the end of an episode (e.g.
  for reaching a goal or falling off of a platform), make sure you call
  `AddReward()` or `SetReward()` _before_ calling `Done()`. Previously, the
  order didn't matter. A sketch appears after this list.
- If you were not using `On Demand Decision` for your Agent, you **must** add a
`DecisionRequester` component to your Agent GameObject and set its
`Decision Period` field to the old `Decision Period` of the Agent.
- If you have a class that inherits from Academy:
- If the class didn't override any of the virtual methods and didn't store any
additional data, you can just remove the old script from the scene.
- If the class had additional data, create a new MonoBehaviour and store the
data in the new MonoBehaviour instead.
- If the class overrode the virtual methods, create a new MonoBehaviour and
move the logic to it:
    - Move the `InitializeAcademy()` code to `MonoBehaviour.Awake()`.
    - Move the `AcademyStep()` code to `MonoBehaviour.FixedUpdate()`.
    - Move the `OnDestroy()` code to `MonoBehaviour.OnDestroy()`.
    - Move the `AcademyReset()` code to a new method and add it to the
      `Academy.OnEnvironmentReset` action (sketch after this list).
- Multiply `max_steps` and `summary_freq` in your `trainer_config.yaml` by the
  number of Agents in the scene (sketch after this list).
- Combine curriculum configs into a single file. See
[the WallJump curricula](https://github.com/Unity-Technologies/ml-agents/blob/0.14.1/config/curricula/wall_jump.yaml) for an example of
the new curriculum config format. A tool like https://www.json2yaml.com may be
useful to help with the conversion.
- If you have a model trained which uses RayPerceptionSensor and has non-1.0
scale in the Agent's transform, it must be retrained.
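For the reward-ordering step above, a minimal sketch; the trigger logic and
reward value are illustrative:

```csharp
using MLAgents;
using UnityEngine;

public class PlatformAgent : Agent
{
    void OnTriggerEnter(Collider other)
    {
        if (other.CompareTag("goal"))
        {
            AddReward(1.0f); // must now come before Done()
            Done();          // resets the Agent immediately and calls AgentReset()
        }
    }
}
```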
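For the Academy-subclass step above, a sketch of the MonoBehaviour mapping; the
class name is illustrative:

```csharp
using MLAgents;
using UnityEngine;

public class LevelManager : MonoBehaviour
{
    void Awake()
    {
        // Formerly InitializeAcademy(); also hook up the reset callback here.
        Academy.Instance.OnEnvironmentReset += EnvironmentReset;
    }

    void FixedUpdate()
    {
        // Formerly AcademyStep().
    }

    void OnDestroy()
    {
        // Formerly the Academy's OnDestroy() logic.
    }

    void EnvironmentReset()
    {
        // Formerly AcademyReset().
    }
}
```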
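For the scaling step above, a before/after sketch of `trainer_config.yaml`,
assuming 10 Agents in the scene; the behavior name and original values are
illustrative:

```yaml
MyBehavior:
  # Before: max_steps: 5.0e4, summary_freq: 1000
  max_steps: 5.0e5     # 5.0e4 * 10 Agents
  summary_freq: 10000  # 1000 * 10 Agents
```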
## Migrating from ML-Agents Toolkit v0.12.0 to v0.13.0
### Important Changes
- The low level Python API has changed. Refer to the
[Low Level Python API documentation](Python-LLAPI.md) for more information. This
should only affect you if you're writing a custom trainer; if you use
`mlagents-learn` for training, this should be a transparent change.
- `reset()` on the Low-Level Python API no longer takes a `train_mode`
  argument. To modify the performance/speed of the engine, you must use an
  `EngineConfigurationChannel` (see the sketch after this list).
- `reset()` on the Low-Level Python API no longer takes a `config` argument.
  `UnityEnvironment` no longer has a `reset_parameters` field. To modify float
  properties in the environment, you must use a `FloatPropertiesChannel`, also
  shown in the sketch after this list. For more information, refer to the
[Low Level Python API documentation](Python-LLAPI.md).
- `CustomResetParameters` are now removed.
- The Academy no longer has a `Training Configuration` or
  `Inference Configuration` field in the inspector. To modify the configuration
  from the Low-Level Python API, use an `EngineConfigurationChannel`. To modify
  it during training, use the new command line arguments `--width`, `--height`,
  `--quality-level`, `--time-scale` and `--target-frame-rate` in
  `mlagents-learn`.
- The Academy no longer has a `Default Reset Parameters` field in the inspector.
The Academy class no longer has a `ResetParameters`. To access shared float
properties with Python, use the new `FloatProperties` field on the Academy.
- Offline Behavioral Cloning has been removed. To learn from demonstrations, use
the GAIL and Behavioral Cloning features with either PPO or SAC.
- `mlagents.envs` was renamed to `mlagents_envs`. The previous repo layout
depended on [PEP420](https://www.python.org/dev/peps/pep-0420/), which caused
problems with some of our tooling such as mypy and pylint.
- The officially supported Unity version is now 2022.3 LTS. If you run into
  issues, please consider deleting your Library folder and reopening your
  project. You will need to install the Sentis package into your project in
  order for ML-Agents to compile correctly.
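
As a rough illustration of the side channel workflow described above, here is a
minimal sketch. The channel and method names follow the `mlagents_envs` API of
this era (`FloatPropertiesChannel` was replaced by
`EnvironmentParametersChannel` in later releases), so check them against your
installed version:

```python
from mlagents_envs.environment import UnityEnvironment
from mlagents_envs.side_channel.engine_configuration_channel import (
    EngineConfigurationChannel,
)
from mlagents_envs.side_channel.float_properties_channel import (
    FloatPropertiesChannel,
)

engine_channel = EngineConfigurationChannel()
float_props = FloatPropertiesChannel()

# Side channels must be registered when the environment is created.
env = UnityEnvironment(base_port=5004,
                       side_channels=[engine_channel, float_props])

# Replaces the old `train_mode` argument of reset().
engine_channel.set_configuration_parameters(time_scale=20.0, quality_level=1)

# Replaces the old `config` argument of reset() and the
# `reset_parameters` field of UnityEnvironment.
float_props.set_property("gravity", -9.81)

env.reset()
env.close()
```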
### Steps to Migrate
- If you had a custom `Training Configuration` in the Academy inspector, you
  will need to pass your custom configuration at every training run using the
  new command line arguments `--width`, `--height`, `--quality-level`,
  `--time-scale` and `--target-frame-rate` (see the example after this list).
- If you were using `--slow` in `mlagents-learn`, you will need to pass your old
`Inference Configuration` of the Academy inspector with the new command line
arguments `--width`, `--height`, `--quality-level`, `--time-scale` and
`--target-frame-rate` instead.
- Any imports from `mlagents.envs` should be replaced with `mlagents_envs`.
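
For example, a run that previously relied on a custom `Training Configuration`
might now be launched as follows (all values are illustrative):

```sh
mlagents-learn config/trainer_config.yaml --env=3DBall --train \
    --width=1280 --height=720 --quality-level=1 \
    --time-scale=20 --target-frame-rate=-1
```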
## Migrating from ML-Agents Toolkit v0.11.0 to v0.12.0
### Important Changes
- Text actions and observations, and custom action and observation protos have
been removed.
- RayPerception3D and RayPerception2D are marked deprecated, and will be removed
in a future release. They can be replaced by RayPerceptionSensorComponent3D
and RayPerceptionSensorComponent2D.
- The `Use Heuristic` checkbox in Behavior Parameters has been replaced with a
`Behavior Type` dropdown menu. This has the following options:
  - `Default` corresponds to the previous unchecked behavior, meaning that
    Agents will train if they connect to a Python trainer; otherwise they will
    perform inference.
- `Heuristic Only` means the Agent will always use the `Heuristic()` method.
This corresponds to having "Use Heuristic" selected in 0.11.0.
- `Inference Only` means the Agent will always perform inference.
- ML-Agents was upgraded to use Sentis 1.2.0-exp.2, which is installed via the
  Package Manager.
### Steps to Migrate
- We [fixed a bug](https://github.com/Unity-Technologies/ml-agents/pull/2823) in
  `RayPerception3D.Perceive()` that was causing the `endOffset` to be used
  incorrectly. However, this may produce different behavior from previous
  versions if you use a non-zero `startOffset`. To reproduce the old behavior,
  increase the value of `endOffset` by `startOffset`. You can verify that your
  raycasts are performing as expected in the Scene view using the debug rays.
- If you use RayPerception3D, replace it with RayPerceptionSensorComponent3D
  (and similarly for 2D). The settings, such as ray angles and detectable tags,
  are now configured on the component. RayPerception3D contributed
  `(# of rays) * (# of tags + 2)` to the State Size in Behavior Parameters; this
  is no longer necessary, so reduce the State Size by that amount (for example,
  5 rays and 3 detectable tags contributed 5 × (3 + 2) = 25). Making this change
  will require retraining your model, since the observations that
  RayPerceptionSensorComponent3D produces are different from the old behavior.
- If you see messages such as
`The type or namespace 'Sentis' could not be found` or
`The type or namespace 'Google' could not be found`, you will need to
[install the Sentis preview package](Installation.md#package-installation).
## Migrating from ML-Agents Toolkit v0.10 to v0.11.0
### Important Changes
- The definition of the gRPC service has changed.
- The online BC training feature has been removed.
- The BroadcastHub has been deprecated. If there is a training Python process,
all LearningBrains in the scene will automatically be trained. If there is no
Python process, inference will be used.
- The Brain ScriptableObjects have been deprecated. The Brain Parameters are now
  on the Agent and are referred to as Behavior Parameters. Make sure the
  Behavior Parameters component is attached to the Agent GameObject.
- To use a heuristic behavior, implement the `Heuristic()` method in the Agent
class and check the `use heuristic` checkbox in the Behavior Parameters.
- Several changes were made to the setup for visual observations (i.e. using
Cameras or RenderTextures):
- Camera resolutions are no longer stored in the Brain Parameters.
  - AgentParameters no longer stores lists of Cameras and RenderTextures.
  - To add visual observations to an Agent, you must now attach a
    CameraSensorComponent or RenderTextureSensorComponent to the agent. The
    corresponding Camera or RenderTexture can be added to these in the editor,
    and the resolution and color/grayscale are configured on the component
    itself.
### Steps to Migrate
- In order to be able to train, make sure both your ML-Agents Python package and
  UnitySDK code come from the v0.11 release (see the example after this list).
  Training will not work, for example, if you update the ML-Agents Python
  package but only update the API Version in UnitySDK.
- If your Agents used visual observations, you must add a CameraSensorComponent
corresponding to each old Camera in the Agent's camera list (and similarly for
RenderTextures).
- Since Brain ScriptableObjects have been removed, you will need to delete all
the Brain ScriptableObjects from your `Assets` folder. Then, add a
`Behavior Parameters` component to each `Agent` GameObject. You will then need
to complete the fields on the new `Behavior Parameters` component with the
BrainParameters of the old Brain.
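
For example, if you installed via PyPI, one way to pin the Python package to the
matching release is:

```sh
pip3 install mlagents==0.11.0
```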
## Migrating from ML-Agents Toolkit v0.9 to v0.10
### Important Changes
- We have updated the C# code in our repository to be in line with Unity Coding
Conventions. This has changed the name of some public facing classes and
enums.
- The example environments have been updated. If you were using these
environments to benchmark your training, please note that the resulting
rewards may be slightly different in v0.10.
### Steps to Migrate
- `UnitySDK/Assets/ML-Agents/Scripts/Communicator.cs` and its class
`Communicator` have been renamed to
`UnitySDK/Assets/ML-Agents/Scripts/ICommunicator.cs` and `ICommunicator`
respectively.
- The `SpaceType` enums `discrete` and `continuous` have been renamed to
  `Discrete` and `Continuous`.
- We have removed the `Done` call as well as the capacity to set `Max Steps` on
  the Academy. Therefore an AcademyReset will never be triggered from C# (only
  from Python). If you want to reset the simulation after a fixed number of
  steps, or when an event in the simulation occurs, we recommend looking at our
  multi-agent example environments (such as FoodCollector), where groups of
  Agents are reset through an "Area" object.
- The import for `mlagents.envs.UnityEnvironment` was removed. If you are using
the Python API, change `from mlagents_envs import UnityEnvironment` to
`from mlagents_envs.environment import UnityEnvironment`.
## Migrating from ML-Agents Toolkit v0.8 to v0.9
### Important Changes
- We have changed the way reward signals (including Curiosity) are defined in
the `trainer_config.yaml`.
- When using multiple environments, every "step" is recorded in TensorBoard.
- The steps in the command line console correspond to a single step of a single
  environment. Previously, each step corresponded to one step for all
  environments (i.e., `num_envs` steps).
### Steps to Migrate
- If you were overriding any of the following parameters in your config file,
  remove them from the top-level config and follow the steps below (see the
  example after this list):
  - `gamma`: Define a new `extrinsic` reward signal and set its `gamma` to your
    old gamma.
  - `use_curiosity`, `curiosity_strength`, `curiosity_enc_size`: Define a
    `curiosity` reward signal and set its `strength` to `curiosity_strength`,
    and `encoding_size` to `curiosity_enc_size`. Give it the same `gamma` as
your `extrinsic` signal to mimic previous behavior.
- TensorBoards generated when running multiple environments in v0.8 are not
comparable to those generated in v0.9 in terms of step count. Multiply your
v0.8 step count by `num_envs` for an approximate comparison. You may need to
change `max_steps` in your config as appropriate as well.
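
A sketch of the new layout in `trainer_config.yaml`, assuming a Brain named
`3DBallLearning` and illustrative values:

```yaml
3DBallLearning:
  reward_signals:
    extrinsic:
      strength: 1.0
      gamma: 0.99        # your old top-level gamma
    curiosity:
      strength: 0.01     # your old curiosity_strength
      gamma: 0.99        # same gamma as the extrinsic signal
      encoding_size: 128 # your old curiosity_enc_size
```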
## Migrating from ML-Agents Toolkit v0.7 to v0.8
### Important Changes
- We have split the Python packages into two separate packages `ml-agents` and
`ml-agents-envs`.
- The `--worker-id` option of `learn.py` has been removed; use `--base-port`
  instead if you'd like to run multiple instances of `learn.py` (see the example
  after the migration steps below).
### Steps to Migrate
- If you are installing via PyPI, there is no change.
- If you intend to make modifications to `ml-agents` or `ml-agents-envs`, please
  check the Installing for Development section of the
  [Installation documentation](Installation.md).
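
For example, two concurrent instances might now be launched on separate port
ranges like this (run IDs and ports are illustrative):

```sh
mlagents-learn config/trainer_config.yaml --env=3DBall --train --run-id=first --base-port=5005
mlagents-learn config/trainer_config.yaml --env=3DBall --train --run-id=second --base-port=5010
```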
## Migrating from ML-Agents Toolkit v0.6 to v0.7
### Important Changes
- We no longer support TFS (TensorFlowSharp) and are now using
  [Sentis](Inference-Engine.md) for inference.
### Steps to Migrate
- Make sure to remove the `ENABLE_TENSORFLOW` flag in your Unity Project
  settings.
## Migrating from ML-Agents Toolkit v0.5 to v0.6
### Important Changes
- Brains are now Scriptable Objects instead of MonoBehaviours.
- You can no longer modify the type of a Brain. If you want to switch between
  `PlayerBrain` and `LearningBrain` for multiple agents, you will need to assign
  a new Brain to each agent separately. **Note:** You can pass the same Brain to
  multiple agents in a scene by leveraging Unity's prefab system, or find all
  the agents in a scene by searching the `Hierarchy` window for the word
  `Agent`.
- We replaced the **Internal** and **External** Brain with **Learning Brain**.
  When you need to train a model, drag the Learning Brain into the
  `Broadcast Hub` inside the `Academy` and check the `Control` checkbox.
- We removed the `Broadcast` checkbox from the Brain; to use the broadcast
  functionality, drag the Brain into the `Broadcast Hub`.
- When training multiple Brains at the same time, each model is now stored into
a separate model file rather than in the same file under different graph
scopes.
- The **Learning Brain** graph scope, placeholder names, output names and custom
placeholders can no longer be modified.
### Steps to Migrate
- To update a scene from v0.5 to v0.6, you must:
- Remove the `Brain` GameObjects in the scene. (Delete all of the Brain
GameObjects under Academy in the scene.)
- Create new `Brain` Scriptable Objects using `Assets -> Create -> ML-Agents`
for each type of the Brain you plan to use, and put the created files under
a folder called Brains within your project.
- Edit their `Brain Parameters` to be the same as the parameters used in the
`Brain` GameObjects.
  - Agents have a `Brain` field in the Inspector; drag the appropriate Brain
    ScriptableObject into it.
  - The Academy has a `Broadcast Hub` field in the inspector, which is a list of
    the Brains used in the scene. To train or control your Brain from the
    `mlagents-learn` Python script, drag the relevant `LearningBrain`
    ScriptableObjects used in your scene into entries in this list.
## Migrating from ML-Agents Toolkit v0.4 to v0.5
### Important
- The Unity project `unity-environment` has been renamed `UnitySDK`.
- The `python` folder has been renamed to `ml-agents`. It now contains two
  packages, `mlagents.envs` and `mlagents.trainers`. `mlagents.envs` can be used
  to interact directly with a Unity environment, while `mlagents.trainers`
  contains the classes for training agents.
- The supported Unity version has changed from `2017.1 or later` to
  `2017.4 or later`. 2017.4 is an LTS (Long Term Support) version that helps us
  maintain good quality and support. Earlier versions of Unity might still work,
  but you may encounter the
  [error](FAQ.md#instance-of-corebraininternal-couldnt-be-created) listed in the
  FAQ.
### Unity API
- Discrete Actions now use [branches](https://arxiv.org/abs/1711.08946). You can
now specify concurrent discrete actions. You will need to update the Brain
Parameters in the Brain Inspector in all your environments that use discrete
actions. Refer to the
[discrete action documentation](Learning-Environment-Design-Agents.md#discrete-action-space)
for more information.
### Python API
- In order to run a training session, you can now use the command
`mlagents-learn` instead of `python3 learn.py` after installing the `mlagents`
packages. This change is documented
[here](Training-ML-Agents.md#training-with-mlagents-learn). For example, if we
previously ran
```sh
python3 learn.py 3DBall --train
```
  from the `python` subdirectory (renamed to the `ml-agents` subdirectory in
  v0.5), we now run
```sh
mlagents-learn config/trainer_config.yaml --env=3DBall --train
```
from the root directory where we installed the ML-Agents Toolkit.
- It is now required to specify the path to the yaml trainer configuration file
when running `mlagents-learn`. For an example trainer configuration file, see
[trainer_config.yaml](https://github.com/Unity-Technologies/ml-agents/blob/0.5.0a/config/trainer_config.yaml). An example of passing a
trainer configuration to `mlagents-learn` is shown above.
- The environment name is now passed through the `--env` option.
- Curriculum learning has been changed. In summary:
  - Curriculum files for the same environment must now be placed into a folder.
    Each curriculum file should be named after the Brain whose curriculum it
    specifies (see the layout sketch after this list).
- `min_lesson_length` now specifies the minimum number of episodes in a lesson
and affects reward thresholding.
- It is no longer necessary to specify the `Max Steps` of the Academy to use
curriculum learning.
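
A hypothetical folder layout (the folder and file names are illustrative; what
matters is that each JSON file is named after a Brain):

```
curricula/
    wall-jump/
        BigWallBrain.json
        SmallWallBrain.json
```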
## Migrating from ML-Agents Toolkit v0.3 to v0.4
### Unity API
- `using MLAgents;` needs to be added in all of the C# scripts that use
ML-Agents.
### Python API
- We've changed some of the Python package dependencies in the
  `requirements.txt` file. Make sure to run `pip3 install -e .` within your
  `ml-agents/python` folder to update your Python packages.
## Migrating from ML-Agents Toolkit v0.2 to v0.3
There are a large number of new features and improvements in the ML-Agents
toolkit v0.3 which change both the training process and Unity API in ways which
will cause incompatibilities with environments made using older versions. This
page is designed to highlight those changes for users familiar with v0.1 or v0.2
in order to ensure a smooth transition.
### Important
- The ML-Agents Toolkit is no longer compatible with Python 2.
### Python Training
- The training script `ppo.py` and `PPO.ipynb` Python notebook have been
replaced with a single `learn.py` script as the launching point for training
with ML-Agents. For more information on using `learn.py`, see
[here](Training-ML-Agents.md#training-with-mlagents-learn).
- Hyperparameters for training Brains are now stored in the
`trainer_config.yaml` file. For more information on using this file, see
[here](Training-ML-Agents.md#training-configurations).
### Unity API
- Modifications to an Agent's rewards must now be done using either
`AddReward()` or `SetReward()`.
- Setting an Agent to done now requires the use of the `Done()` method.
- `CollectStates()` has been replaced by `CollectObservations()`, which now no
longer returns a list of floats.
- To collect observations, call `AddVectorObs()` within `CollectObservations()`.
Note that you can call `AddVectorObs()` with floats, integers, lists and
arrays of floats, Vector3 and Quaternions.
- `AgentStep()` has been replaced by `AgentAction()`.
- `WaitTime()` has been removed.
- The `Frame Skip` field of the Academy is replaced by the Agent's
`Decision Frequency` field, enabling the Agent to make decisions at different
frequencies.
- The names of the inputs in the Internal Brain have been changed. You must
replace `state` with `vector_observation` and `observation` with
`visual_observation`. In addition, you must remove the `epsilon` placeholder.
### Semantics
In order to more closely align with the terminology used in the Reinforcement
Learning field, and to be more descriptive, we have changed the names of some of
the concepts used in ML-Agents. The changes are highlighted in the table below.
| Old - v0.2 and earlier | New - v0.3 and later |
| ---------------------- | -------------------- |
| State | Vector Observation |
| Observation | Visual Observation |
| Action | Vector Action |
| N/A | Text Observation |
| N/A | Text Action |