Blame: ci/docker/runtime_functions.sh - apache/mxnet

[MXNET-286] Removed OpenMP from armv6 builds (#10469) * Removed OpenMP from armv6 builds * Added comment about OpenMP switching off for armv6 build * Removed github api call for version for OpenBLAS in the docker build for armv6

2018-07-19 17:18:04 +02:00

[MXNET-73] Add Armv6 ci build (#10172) * Initial add for ARMv6 * Fixed Armv6 build * Introduced workaround for fortran lib linking * Added comments to armv6 ci build

2018-03-22 17:04:15 +01:00

build_armv6() {

2018-04-11 14:31:18 +02:00

    # We do not need OpenMP, since most armv6 systems have only 1 core

[MXNET-73] Add Armv6 ci build (#10172) * Initial add for ARMv6 * Fixed Armv6 build * Introduced workaround for fortran lib linking * Added comments to armv6 ci build

2018-03-22 17:04:15 +01:00

    cmake \

[ARM] improvements to ARMv7 based builds. (#11245) Fix build with OpenCV 2. Native RPi build. Openblas compilation fixes and version pinning. (@lebeg) Disabled bundled OpenMP for cross compilation. (@lebeg) Build logic refinements.

2018-06-15 02:32:01 +02:00

        -DCMAKE_TOOLCHAIN_FILE=${CMAKE_TOOLCHAIN_FILE} \

[MXNET-73] Add Armv6 ci build (#10172) * Initial add for ARMv6 * Fixed Armv6 build * Introduced workaround for fortran lib linking * Added comments to armv6 ci build

2018-03-22 17:04:15 +01:00

        -DUSE_CUDA=OFF \

[MXNET-286] Removed OpenMP from armv6 builds (#10469) * Removed OpenMP from armv6 builds * Added comment about OpenMP switching off for armv6 build * Removed github api call for version for OpenBLAS in the docker build for armv6

2018-04-11 14:31:18 +02:00

        -DUSE_OPENMP=OFF \

[MXNET-287] ARMv6 build with 8-10 times bigger file size (#10439)

2018-04-06 14:57:03 +02:00

        -DCMAKE_BUILD_TYPE=Release \

[MXNET-73] Add Armv6 ci build (#10172) * Initial add for ARMv6 * Fixed Armv6 build * Introduced workaround for fortran lib linking * Added comments to armv6 ci build

2018-03-22 17:04:15 +01:00

        -DUSE_LAPACK=OFF \

Remove USE_MKL_IF_AVAILABLE flag (#20004) USE_BLAS cmake option can be set to choose a particular BLAS. For example, USE_BLAS=mkl or USE_BLAS=open. If USE_BLAS is not specified, we search if MKL is available and use MKL if it is available. Otherwise OpenBLAS is used.

2021-03-12 20:44:24 +01:00

        -DUSE_BLAS=Open \

[MXNET-287] ARMv6 build with 8-10 times bigger file size (#10439)

2018-04-06 14:57:03 +02:00

        -DBUILD_CPP_EXAMPLES=OFF \

[MXNET-73] Add Armv6 ci build (#10172) * Initial add for ARMv6 * Fixed Armv6 build * Introduced workaround for fortran lib linking * Added comments to armv6 ci build

2018-03-22 17:04:15 +01:00

        -G Ninja /work/mxnet

[ARM] improvements to ARMv7 based builds. (#11245) Fix build with OpenCV 2. Native RPi build. Openblas compilation fixes and version pinning. (@lebeg) Disabled bundled OpenMP for cross compilation. (@lebeg) Build logic refinements.

2018-06-15 02:32:01 +02:00

Reduce load on CI due to excessive log flood (#17629)

2020-02-19 21:22:57 -08:00

    ninja

[ARM] improvements to ARMv7 based builds. (#11245) Fix build with OpenCV 2. Native RPi build. Openblas compilation fixes and version pinning. (@lebeg) Disabled bundled OpenMP for cross compilation. (@lebeg) Build logic refinements.

2018-06-15 02:32:01 +02:00

    build_wheel

[MXNET-73] Add Armv6 ci build (#10172) * Initial add for ARMv6 * Fixed Armv6 build * Introduced workaround for fortran lib linking * Added comments to armv6 ci build

2018-03-22 17:04:15 +01:00
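The comment in build_armv6 justifies `-DUSE_OPENMP=OFF` by the core count of typical ARMv6 systems. As a minimal sketch of that reasoning only (this is not part of runtime_functions.sh, and the helper name `openmp_flag_for_cores` is hypothetical), the flag could be derived from a known core count of the target:

```shell
# Hypothetical helper, not part of runtime_functions.sh.
# $1: number of CPU cores on the *target* system.
# Prints the CMake OpenMP flag matching that count:
# OFF for a single core, ON for anything larger.
openmp_flag_for_cores() {
    if [ "$1" -le 1 ]; then
        echo "-DUSE_OPENMP=OFF"
    else
        echo "-DUSE_OPENMP=ON"
    fi
}

# Example: a single-core ARMv6 target versus a quad-core board.
openmp_flag_for_cores 1
openmp_flag_for_cores 4
```

The CI function hard-codes the flag instead. That fits a cross-compiled build, where the core count of the build host says nothing about the target device.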

CI docker revamp; Add Jetson, Raspberry and CentOS 7 build [MXNET-42][MXNET-43] (#9995) * Start dockerfile revamp * Fix lint * Fix lint * Fix lint * Add python unit test * Add new dependency * Add user creation * Fix file permission * Determine USER_ID automatically * REmove ENV command in dockerfile * Remove python nose timer, improve useradd * ENable nvidia docker * Add remaining unittests * Add CentOS 7 unittests * Add integration tests * Add TVM and LLVM to dockerfile * Add ARMv7 and ARMv8 to build * Fix amalgamation build * Improvements and android_arm64 fixes, missing removing pthread * Jetson fix (install unzip) * Build jetson with make until issue with libomp.so is resolved #10011 * Fix Amalgamation builds * Fix Jetson build by switching to cmake * Assign CentOS gpu test to gpu instance * Fix R builds * Assign jobs to right docker containers * Fix missing file permissions inside docker * Create homedir on centos * Enable lapack * Fix Lapack on Cent OS 7 * Disable MXNET_MKLDNN_DEBUG * Delete Dockerfiles * Last general refinements before finish * Remove docker_multiarch folder, superseeded by the new ci scripts * Address review comments * Fix Caffe Integrationstest * Fix deploy stage * Address review comments * Address review comments * Enable script to run on Mac

2018-03-09 23:33:10 +01:00

build_armv7() {

[ARM] improvements to ARMv7 based builds. (#11245) Fix build with OpenCV 2. Native RPi build. Openblas compilation fixes and version pinning. (@lebeg) Disabled bundled OpenMP for cross compilation. (@lebeg) Build logic refinements.

2018-06-15 02:32:01 +02:00

[MXNET-472] ccache for docker builds (#11151) * [MXNET-472] Add ccache support to docker builds * Added ccache stages to all containers * Refactored ccache installs in docker images * Reformatted build.py * Added ccache install to android docker builds * Improved setting ccache directory and max cache size * Added ccache to cmake based docker builds * Removed unnessesary yum install from centos7 ccache build * Added default compilers for ccache for docker builds * Added README comment about ccache mapping into docker builds * Reverted not working ccache configurations * Added comments about ccache installation * Move install scripts * Update ubuntu_r.sh

2018-06-08 03:12:30 +02:00

    cmake \

[ARM] improvements to ARMv7 based builds. (#11245) Fix build with OpenCV 2. Native RPi build. Openblas compilation fixes and version pinning. (@lebeg) Disabled bundled OpenMP for cross compilation. (@lebeg) Build logic refinements.

2018-06-15 02:32:01 +02:00

        -DCMAKE_TOOLCHAIN_FILE=${CMAKE_TOOLCHAIN_FILE} \

Remove USE_MKL_IF_AVAILABLE flag (#20004) USE_BLAS cmake option can be set to choose a particular BLAS. For example, USE_BLAS=mkl or USE_BLAS=open. If USE_BLAS is not specified, we search if MKL is available and use MKL if it is available. Otherwise OpenBLAS is used.

2021-03-12 20:44:24 +01:00

        -DUSE_BLAS=Open \

[ARM] improvements to ARMv7 based builds. (#11245) Fix build with OpenCV 2. Native RPi build. Openblas compilation fixes and version pinning. (@lebeg) Disabled bundled OpenMP for cross compilation. (@lebeg) Build logic refinements.

2018-06-15 02:32:01 +02:00

        -DBUILD_CPP_EXAMPLES=OFF \

CI docker revamp; Add Jetson, Raspberry and CentOS 7 build [MXNET-42][MXNET-43] (#9995) * Start dockerfile revamp * Fix lint * Fix lint * Fix lint * Add python unit test * Add new dependency * Add user creation * Fix file permission * Determine USER_ID automatically * REmove ENV command in dockerfile * Remove python nose timer, improve useradd * ENable nvidia docker * Add remaining unittests * Add CentOS 7 unittests * Add integration tests * Add TVM and LLVM to dockerfile * Add ARMv7 and ARMv8 to build * Fix amalgamation build * Improvements and android_arm64 fixes, missing removing pthread * Jetson fix (install unzip) * Build jetson with make until issue with libomp.so is resolved #10011 * Fix Amalgamation builds * Fix Jetson build by switching to cmake * Assign CentOS gpu test to gpu instance * Fix R builds * Assign jobs to right docker containers * Fix missing file permissions inside docker * Create homedir on centos * Enable lapack * Fix Lapack on Cent OS 7 * Disable MXNET_MKLDNN_DEBUG * Delete Dockerfiles * Last general refinements before finish * Remove docker_multiarch folder, superseeded by the new ci scripts * Address review comments * Fix Caffe Integrationstest * Fix deploy stage * Address review comments * Address review comments * Enable script to run on Mac

2018-03-09 23:33:10 +01:00

        -G Ninja /work/mxnet

[ARM] improvements to ARMv7 based builds. (#11245) Fix build with OpenCV 2. Native RPi build. Openblas compilation fixes and version pinning. (@lebeg) Disabled bundled OpenMP for cross compilation. (@lebeg) Build logic refinements.

2018-06-15 02:32:01 +02:00

Reduce load on CI due to excessive log flood (#17629)

2020-02-19 21:22:57 -08:00

    ninja

[ARM] improvements to ARMv7 based builds. (#11245) Fix build with OpenCV 2. Native RPi build. Openblas compilation fixes and version pinning. (@lebeg) Disabled bundled OpenMP for cross compilation. (@lebeg) Build logic refinements.

2018-06-15 02:32:01 +02:00

    build_wheel

CI docker revamp; Add Jetson, Raspberry and CentOS 7 build [MXNET-42][MXNET-43] (#9995) * Start dockerfile revamp * Fix lint * Fix lint * Fix lint * Add python unit test * Add new dependency * Add user creation * Fix file permission * Determine USER_ID automatically * REmove ENV command in dockerfile * Remove python nose timer, improve useradd * ENable nvidia docker * Add remaining unittests * Add CentOS 7 unittests * Add integration tests * Add TVM and LLVM to dockerfile * Add ARMv7 and ARMv8 to build * Fix amalgamation build * Improvements and android_arm64 fixes, missing removing pthread * Jetson fix (install unzip) * Build jetson with make until issue with libomp.so is resolved #10011 * Fix Amalgamation builds * Fix Jetson build by switching to cmake * Assign CentOS gpu test to gpu instance * Fix R builds * Assign jobs to right docker containers * Fix missing file permissions inside docker * Create homedir on centos * Enable lapack * Fix Lapack on Cent OS 7 * Disable MXNET_MKLDNN_DEBUG * Delete Dockerfiles * Last general refinements before finish * Remove docker_multiarch folder, superseeded by the new ci scripts * Address review comments * Fix Caffe Integrationstest * Fix deploy stage * Address review comments * Address review comments * Enable script to run on Mac

2018-03-09 23:33:10 +01:00
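The message of #20004, quoted with each `-DUSE_BLAS=Open` line, describes a selection order: an explicit `USE_BLAS` value wins; otherwise MKL is used if it is found, and OpenBLAS if it is not. The real logic lives in the CMake files, which are not shown here. The following is only a shell sketch of that order; the function `choose_blas` and its "yes"/"no" second argument are assumptions made for illustration.

```shell
# Hypothetical illustration of the selection order described in #20004.
# $1: requested USE_BLAS value (may be empty)
# $2: "yes" if an MKL installation was found, anything else otherwise
choose_blas() {
    if [ -n "$1" ]; then
        # An explicit request always takes precedence.
        echo "$1"
    elif [ "$2" = "yes" ]; then
        echo "mkl"
    else
        echo "open"
    fi
}

choose_blas "Open" "yes"   # explicit choice, as in the ARM builds
choose_blas "" "yes"       # nothing requested, MKL found
choose_blas "" "no"        # nothing requested, MKL not found
```

Because the ARM build functions pass `-DUSE_BLAS=Open` explicitly, the first branch is the one that applies to them according to that description.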

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2018-07-19 17:18:04 +02:00

build_armv8() {

2020-02-28 11:59:22 -08:00

    cd /work/build

[MXNET-472] ccache for docker builds (#11151) * [MXNET-472] Add ccache support to docker builds * Added ccache stages to all containers * Refactored ccache installs in docker images * Reformatted build.py * Added ccache install to android docker builds * Improved setting ccache directory and max cache size * Added ccache to cmake based docker builds * Removed unnessesary yum install from centos7 ccache build * Added default compilers for ccache for docker builds * Added README comment about ccache mapping into docker builds * Reverted not working ccache configurations * Added comments about ccache installation * Move install scripts * Update ubuntu_r.sh

2018-06-08 03:12:30 +02:00

    cmake \

Switch to C++17 and modernize toolchain + CI (#17984) As per #17968, require C++17 compatible compiler. For cuda code, use C++14 mode introduced in Cuda 9. C++17 support for Cuda will be available in Cuda 11. Switching to C++17 requires modernizing the toolchain, which exposed a number of technical debt issues in the codebase. All blocking issues are fixed as part of this PR. See the full list below. This PR contains the following specific changes: Switch CI pipeline to use gcc7 on Ubuntu and CentOS Switch CD pipeline to CentOS 7 with https://www.softwarecollections.org/en/scls/rhscl/devtoolset-7/ This enables us to build with gcc7 C++17 compiler while keeping a relatively old glibc requirement for distribution. Simplify ARM Edge builds Switch to standard Ubuntu / Debian cross-compilation toolchain for ARMv7, ARMv8 Switch to https://toolchains.bootlin.com/ toolchain for ARMv6 (the Debian ARMv6 toolchain is for ARMv4 + ARMv5 + ARMv6, but we wish to only target ARMv6 and make use of ARMv6 features) Remove reliance on dockcross for cross compilation. Simplify Jetson build Use standard Ubuntu / Debian cross-compilation toolchain for ARMv8 Upgrade to Cuda 10 and Jetpack 4.3 Simplify build setup Simplify QEMU ARM virtualization test setup on CI Remove complex "Virtual Machine in Docker" logic and run a QEMU based Docker container instead based on arm32v7/ubuntu Fix out of bounds vector accesses in SoftmaxGradOpType MKLDNNFCBackward Fix use of non-standard rand_r function (which is not available on anymore on newer Android toolchains and shouldn't be use in any case). Fix reproducibility of RNN with Dropout Fix reproducibility of DGL Graph Sampling Operators Update tests for Android Edge build to NDK19. The previously used standalone toolchain is obsolete. 
Those Dockerfiles that required refactoring as part of the effort were refactored based on the following consideration Maximize the use of system dependencies provided by the distribution instead of manually installing dependencies from source or from third party vendors. This reduces the complexity of the installation process and essentially pins the dependency versions, increasing CI stability. Further, Dockerfile build speed is improved. To facilitate this, use recent distribution versions. We still ensure backwards compatibility via CentOS7 based build and test stages Minimize the number of layers in the Dockerfile. Don't have 5 different script files executed, each calling apt-get update and install, but just execute once. Speeds up the build and reduces image size. Keep each Dockerfile simple and tailored to a purpose, instead of running 20 scripts to install dependencies for every thinkable scenario, which is unmaintainable. Some more small changes: Remove outdated references to Cuda 7 and Cuda 8 in various files. Remove C++03 support in mshadow Disable broken tests NumpyBooleanAssignForwardCPU #17990 test_init.test_rsp_const_init #17988 quantized_elemwise_mul #18034 List of squashed commits * cpp standard * Remove leftover files of Cuda 7 and Cuda 8 support * thrust 1.9.8 for clang10 * compiler warnings * Disable broken test_init.test_rsp_const_init * Disable tests invoking NumpyBooleanAssignForwardCPU * Fix out of bounds access in SoftmaxGradOpType * Use CentOS 7 for staticbuilds CentOS 7 fullfills the requirements for PEP 599 manylinux-2014 and provides a C++17 toolchain. 
* Fix MKLDNNFCBackward * Update edge toolchain * Support platforms without rand_r * Cleanup random.h * Greatly simplify qemu setup * Remove unused functions in Jenkins_steps.groovy * Skip quantized_elemwise_mul due QuantizedElemwiseMulOpShape bug * Fix R package installation https://github.com/apache/incubator-mxnet/issues/18042 * Fix centos ccache * Fix GPU Makefile staticbuild on CentOS7 * CentOS7 NCCL * CentOS7 staticbuild fix link with libculibos

2020-04-14 10:29:29 -07:00

        -DCMAKE_TOOLCHAIN_FILE=${CMAKE_TOOLCHAIN_FILE} \

Enable OpenMP for armv8 builds (#12273)

2018-08-22 15:24:19 +02:00

        -DUSE_OPENMP=ON \

Switch to C++17 and modernize toolchain + CI (#17984) As per #17968, require C++17 compatible compiler. For cuda code, use C++14 mode introduced in Cuda 9. C++17 support for Cuda will be available in Cuda 11. Switching to C++17 requires modernizing the toolchain, which exposed a number of technical debt issues in the codebase. All blocking issues are fixed as part of this PR. See the full list below. This PR contains the following specific changes: Switch CI pipeline to use gcc7 on Ubuntu and CentOS Switch CD pipeline to CentOS 7 with https://www.softwarecollections.org/en/scls/rhscl/devtoolset-7/ This enables us to build with gcc7 C++17 compiler while keeping a relatively old glibc requirement for distribution. Simplify ARM Edge builds Switch to standard Ubuntu / Debian cross-compilation toolchain for ARMv7, ARMv8 Switch to https://toolchains.bootlin.com/ toolchain for ARMv6 (the Debian ARMv6 toolchain is for ARMv4 + ARMv5 + ARMv6, but we wish to only target ARMv6 and make use of ARMv6 features) Remove reliance on dockcross for cross compilation. Simplify Jetson build Use standard Ubuntu / Debian cross-compilation toolchain for ARMv8 Upgrade to Cuda 10 and Jetpack 4.3 Simplify build setup Simplify QEMU ARM virtualization test setup on CI Remove complex "Virtual Machine in Docker" logic and run a QEMU based Docker container instead based on arm32v7/ubuntu Fix out of bounds vector accesses in SoftmaxGradOpType MKLDNNFCBackward Fix use of non-standard rand_r function (which is not available on anymore on newer Android toolchains and shouldn't be use in any case). Fix reproducibility of RNN with Dropout Fix reproducibility of DGL Graph Sampling Operators Update tests for Android Edge build to NDK19. The previously used standalone toolchain is obsolete. 
Those Dockerfiles that required refactoring as part of the effort were refactored based on the following consideration Maximize the use of system dependencies provided by the distribution instead of manually installing dependencies from source or from third party vendors. This reduces the complexity of the installation process and essentially pins the dependency versions, increasing CI stability. Further, Dockerfile build speed is improved. To facilitate this, use recent distribution versions. We still ensure backwards compatibility via CentOS7 based build and test stages Minimize the number of layers in the Dockerfile. Don't have 5 different script files executed, each calling apt-get update and install, but just execute once. Speeds up the build and reduces image size. Keep each Dockerfile simple and tailored to a purpose, instead of running 20 scripts to install dependencies for every thinkable scenario, which is unmaintainable. Some more small changes: Remove outdated references to Cuda 7 and Cuda 8 in various files. Remove C++03 support in mshadow Disable broken tests NumpyBooleanAssignForwardCPU #17990 test_init.test_rsp_const_init #17988 quantized_elemwise_mul #18034 List of squashed commits * cpp standard * Remove leftover files of Cuda 7 and Cuda 8 support * thrust 1.9.8 for clang10 * compiler warnings * Disable broken test_init.test_rsp_const_init * Disable tests invoking NumpyBooleanAssignForwardCPU * Fix out of bounds access in SoftmaxGradOpType * Use CentOS 7 for staticbuilds CentOS 7 fullfills the requirements for PEP 599 manylinux-2014 and provides a C++17 toolchain. 
* Fix MKLDNNFCBackward * Update edge toolchain * Support platforms without rand_r * Cleanup random.h * Greatly simplify qemu setup * Remove unused functions in Jenkins_steps.groovy * Skip quantized_elemwise_mul due QuantizedElemwiseMulOpShape bug * Fix R package installation https://github.com/apache/incubator-mxnet/issues/18042 * Fix centos ccache * Fix GPU Makefile staticbuild on CentOS7 * CentOS7 NCCL * CentOS7 staticbuild fix link with libculibos

2020-04-14 10:29:29 -07:00

        -DUSE_LAPACK=OFF \

Remove USE_MKL_IF_AVAILABLE flag (#20004) USE_BLAS cmake option can be set to choose a particular BLAS. For example, USE_BLAS=mkl or USE_BLAS=open. If USE_BLAS is not specified, we search if MKL is available and use MKL if it is available. Otherwise OpenBLAS is used.

2021-03-12 20:44:24 +01:00

        -DUSE_BLAS=Open \

Switch to C++17 and modernize toolchain + CI (#17984) As per #17968, require C++17 compatible compiler. For cuda code, use C++14 mode introduced in Cuda 9. C++17 support for Cuda will be available in Cuda 11. Switching to C++17 requires modernizing the toolchain, which exposed a number of technical debt issues in the codebase. All blocking issues are fixed as part of this PR. See the full list below. This PR contains the following specific changes: Switch CI pipeline to use gcc7 on Ubuntu and CentOS Switch CD pipeline to CentOS 7 with https://www.softwarecollections.org/en/scls/rhscl/devtoolset-7/ This enables us to build with gcc7 C++17 compiler while keeping a relatively old glibc requirement for distribution. Simplify ARM Edge builds Switch to standard Ubuntu / Debian cross-compilation toolchain for ARMv7, ARMv8 Switch to https://toolchains.bootlin.com/ toolchain for ARMv6 (the Debian ARMv6 toolchain is for ARMv4 + ARMv5 + ARMv6, but we wish to only target ARMv6 and make use of ARMv6 features) Remove reliance on dockcross for cross compilation. Simplify Jetson build Use standard Ubuntu / Debian cross-compilation toolchain for ARMv8 Upgrade to Cuda 10 and Jetpack 4.3 Simplify build setup Simplify QEMU ARM virtualization test setup on CI Remove complex "Virtual Machine in Docker" logic and run a QEMU based Docker container instead based on arm32v7/ubuntu Fix out of bounds vector accesses in SoftmaxGradOpType MKLDNNFCBackward Fix use of non-standard rand_r function (which is not available on anymore on newer Android toolchains and shouldn't be use in any case). Fix reproducibility of RNN with Dropout Fix reproducibility of DGL Graph Sampling Operators Update tests for Android Edge build to NDK19. The previously used standalone toolchain is obsolete. 
Those Dockerfiles that required refactoring as part of the effort were refactored based on the following consideration Maximize the use of system dependencies provided by the distribution instead of manually installing dependencies from source or from third party vendors. This reduces the complexity of the installation process and essentially pins the dependency versions, increasing CI stability. Further, Dockerfile build speed is improved. To facilitate this, use recent distribution versions. We still ensure backwards compatibility via CentOS7 based build and test stages Minimize the number of layers in the Dockerfile. Don't have 5 different script files executed, each calling apt-get update and install, but just execute once. Speeds up the build and reduces image size. Keep each Dockerfile simple and tailored to a purpose, instead of running 20 scripts to install dependencies for every thinkable scenario, which is unmaintainable. Some more small changes: Remove outdated references to Cuda 7 and Cuda 8 in various files. Remove C++03 support in mshadow Disable broken tests NumpyBooleanAssignForwardCPU #17990 test_init.test_rsp_const_init #17988 quantized_elemwise_mul #18034 List of squashed commits * cpp standard * Remove leftover files of Cuda 7 and Cuda 8 support * thrust 1.9.8 for clang10 * compiler warnings * Disable broken test_init.test_rsp_const_init * Disable tests invoking NumpyBooleanAssignForwardCPU * Fix out of bounds access in SoftmaxGradOpType * Use CentOS 7 for staticbuilds CentOS 7 fullfills the requirements for PEP 599 manylinux-2014 and provides a C++17 toolchain. 
* Fix MKLDNNFCBackward * Update edge toolchain * Support platforms without rand_r * Cleanup random.h * Greatly simplify qemu setup * Remove unused functions in Jenkins_steps.groovy * Skip quantized_elemwise_mul due QuantizedElemwiseMulOpShape bug * Fix R package installation https://github.com/apache/incubator-mxnet/issues/18042 * Fix centos ccache * Fix GPU Makefile staticbuild on CentOS7 * CentOS7 NCCL * CentOS7 staticbuild fix link with libculibos

2020-04-14 10:29:29 -07:00

        -DCMAKE_BUILD_TYPE=Release \

CI docker revamp; Add Jetson, Raspberry and CentOS 7 build [MXNET-42][MXNET-43] (#9995) * Start dockerfile revamp * Fix lint * Fix lint * Fix lint * Add python unit test * Add new dependency * Add user creation * Fix file permission * Determine USER_ID automatically * REmove ENV command in dockerfile * Remove python nose timer, improve useradd * ENable nvidia docker * Add remaining unittests * Add CentOS 7 unittests * Add integration tests * Add TVM and LLVM to dockerfile * Add ARMv7 and ARMv8 to build * Fix amalgamation build * Improvements and android_arm64 fixes, missing removing pthread * Jetson fix (install unzip) * Build jetson with make until issue with libomp.so is resolved #10011 * Fix Amalgamation builds * Fix Jetson build by switching to cmake * Assign CentOS gpu test to gpu instance * Fix R builds * Assign jobs to right docker containers * Fix missing file permissions inside docker * Create homedir on centos * Enable lapack * Fix Lapack on Cent OS 7 * Disable MXNET_MKLDNN_DEBUG * Delete Dockerfiles * Last general refinements before finish * Remove docker_multiarch folder, superseeded by the new ci scripts * Address review comments * Fix Caffe Integrationstest * Fix deploy stage * Address review comments * Address review comments * Enable script to run on Mac

2018-03-09 23:33:10 +01:00

        -G Ninja /work/mxnet

Reduce load on CI due to excessive log flood (#17629)

2020-02-19 21:22:57 -08:00

    ninja

[MXNET-57] Android ARMv7 support (#11382) Increase API Level to 27 and update NDK to 17b

2018-07-02 19:35:22 +02:00

    build_wheel

CI docker revamp; Add Jetson, Raspberry and CentOS 7 build [MXNET-42][MXNET-43] (#9995) * Start dockerfile revamp * Fix lint * Fix lint * Fix lint * Add python unit test * Add new dependency * Add user creation * Fix file permission * Determine USER_ID automatically * REmove ENV command in dockerfile * Remove python nose timer, improve useradd * ENable nvidia docker * Add remaining unittests * Add CentOS 7 unittests * Add integration tests * Add TVM and LLVM to dockerfile * Add ARMv7 and ARMv8 to build * Fix amalgamation build * Improvements and android_arm64 fixes, missing removing pthread * Jetson fix (install unzip) * Build jetson with make until issue with libomp.so is resolved #10011 * Fix Amalgamation builds * Fix Jetson build by switching to cmake * Assign CentOS gpu test to gpu instance * Fix R builds * Assign jobs to right docker containers * Fix missing file permissions inside docker * Create homedir on centos * Enable lapack * Fix Lapack on Cent OS 7 * Disable MXNET_MKLDNN_DEBUG * Delete Dockerfiles * Last general refinements before finish * Remove docker_multiarch folder, superseeded by the new ci scripts * Address review comments * Fix Caffe Integrationstest * Fix deploy stage * Address review comments * Address review comments * Enable script to run on Mac

2018-03-09 23:33:10 +01:00

[MXNET-57] Add android64 build (#11188) (#11055) * Fixes sources for android64 Add initial android64 build logic. Remove pthread when linking MXNet in Android. Bionic provides built-in support for threads. * Simplify and fix android64 build

2018-07-19 17:18:04 +02:00

2018-06-15 19:33:33 +02:00

build_android_armv7() {

CI docker revamp; Add Jetson, Raspberry and CentOS 7 build [MXNET-42][MXNET-43] (#9995) * Start dockerfile revamp * Fix lint * Fix lint * Fix lint * Add python unit test * Add new dependency * Add user creation * Fix file permission * Determine USER_ID automatically * REmove ENV command in dockerfile * Remove python nose timer, improve useradd * ENable nvidia docker * Add remaining unittests * Add CentOS 7 unittests * Add integration tests * Add TVM and LLVM to dockerfile * Add ARMv7 and ARMv8 to build * Fix amalgamation build * Improvements and android_arm64 fixes, missing removing pthread * Jetson fix (install unzip) * Build jetson with make until issue with libomp.so is resolved #10011 * Fix Amalgamation builds * Fix Jetson build by switching to cmake * Assign CentOS gpu test to gpu instance * Fix R builds * Assign jobs to right docker containers * Fix missing file permissions inside docker * Create homedir on centos * Enable lapack * Fix Lapack on Cent OS 7 * Disable MXNET_MKLDNN_DEBUG * Delete Dockerfiles * Last general refinements before finish * Remove docker_multiarch folder, superseeded by the new ci scripts * Address review comments * Fix Caffe Integrationstest * Fix deploy stage * Address review comments * Address review comments * Enable script to run on Mac

2018-03-09 23:33:10 +01:00

    set -ex

cmake: x86 options only on x86 and remove manual specification on CI (#18588) Use CMAKE_SYSTEM_PROCESSOR to detect target architecture and make x86 related options available only when compiling for x86. Remove the code turning these options manually off on CI. Remove ANDROID cmake option which was used to decide if -lpthread needs to be specified explicitly (on most Linux systems) or not (on Android). Instead auto-detect the behavior.

2020-06-19 14:46:27 -07:00

    # ANDROID_ABI and ANDROID_STL are options of the CMAKE_TOOLCHAIN_FILE
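The #18588 message attached to this line says that x86-related options are offered only when `CMAKE_SYSTEM_PROCESSOR` indicates an x86 target. That check is done in CMake and is not part of this script. A rough shell analogue, with a hypothetical function name and an assumed list of processor names, might look like this:

```shell
# Hypothetical shell analogue of the CMake-side check described in #18588.
# $1: a processor name, such as a value of CMAKE_SYSTEM_PROCESSOR.
# Returns 0 for names assumed to denote x86, 1 for everything else.
is_x86_processor() {
    case "$1" in
        x86_64|amd64|AMD64|i?86|x86) return 0 ;;
        *) return 1 ;;
    esac
}

for proc in x86_64 armv7l aarch64; do
    if is_x86_processor "$proc"; then
        echo "$proc: x86 options available"
    else
        echo "$proc: x86 options hidden"
    fi
done
```

With detection of this kind in the build system, the CI functions no longer need to switch such options off by hand, which is the cleanup the commit message describes.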

[MXNET-472] ccache for docker builds (#11151) * [MXNET-472] Add ccache support to docker builds * Added ccache stages to all containers * Refactored ccache installs in docker images * Reformatted build.py * Added ccache install to android docker builds * Improved setting ccache directory and max cache size * Added ccache to cmake based docker builds * Removed unnessesary yum install from centos7 ccache build * Added default compilers for ccache for docker builds * Added README comment about ccache mapping into docker builds * Reverted not working ccache configurations * Added comments about ccache installation * Move install scripts * Update ubuntu_r.sh

2018-06-08 03:12:30 +02:00

    cmake \

Switch to C++17 and modernize toolchain + CI (#17984)
As per #17968, require C++17 compatible compiler. For cuda code, use C++14 mode introduced in Cuda 9. C++17 support for Cuda will be available in Cuda 11.
Switching to C++17 requires modernizing the toolchain, which exposed a number of technical debt issues in the codebase. All blocking issues are fixed as part of this PR. See the full list below.
This PR contains the following specific changes:
- Switch CI pipeline to use gcc7 on Ubuntu and CentOS
- Switch CD pipeline to CentOS 7 with https://www.softwarecollections.org/en/scls/rhscl/devtoolset-7/ This enables us to build with gcc7 C++17 compiler while keeping a relatively old glibc requirement for distribution.
- Simplify ARM Edge builds
  - Switch to standard Ubuntu / Debian cross-compilation toolchain for ARMv7, ARMv8
  - Switch to https://toolchains.bootlin.com/ toolchain for ARMv6 (the Debian ARMv6 toolchain is for ARMv4 + ARMv5 + ARMv6, but we wish to only target ARMv6 and make use of ARMv6 features)
  - Remove reliance on dockcross for cross compilation.
- Simplify Jetson build
  - Use standard Ubuntu / Debian cross-compilation toolchain for ARMv8
  - Upgrade to Cuda 10 and Jetpack 4.3
  - Simplify build setup
- Simplify QEMU ARM virtualization test setup on CI
  - Remove complex "Virtual Machine in Docker" logic and run a QEMU based Docker container instead based on arm32v7/ubuntu
- Fix out of bounds vector accesses in
  - SoftmaxGradOpType
  - MKLDNNFCBackward
- Fix use of non-standard rand_r function (which is no longer available on newer Android toolchains and shouldn't be used in any case).
- Fix reproducibility of RNN with Dropout
- Fix reproducibility of DGL Graph Sampling Operators
- Update tests for Android Edge build to NDK19. The previously used standalone toolchain is obsolete.
- Those Dockerfiles that required refactoring as part of the effort were refactored based on the following considerations:
  - Maximize the use of system dependencies provided by the distribution instead of manually installing dependencies from source or from third party vendors. This reduces the complexity of the installation process and essentially pins the dependency versions, increasing CI stability. Further, Dockerfile build speed is improved. To facilitate this, use recent distribution versions. We still ensure backwards compatibility via CentOS7 based build and test stages
  - Minimize the number of layers in the Dockerfile. Don't have 5 different script files executed, each calling apt-get update and install, but just execute once. Speeds up the build and reduces image size.
  - Keep each Dockerfile simple and tailored to a purpose, instead of running 20 scripts to install dependencies for every thinkable scenario, which is unmaintainable.
- Some more small changes:
  - Remove outdated references to Cuda 7 and Cuda 8 in various files.
  - Remove C++03 support in mshadow
- Disable broken tests
  - NumpyBooleanAssignForwardCPU #17990
  - test_init.test_rsp_const_init #17988
  - quantized_elemwise_mul #18034
List of squashed commits
* cpp standard
* Remove leftover files of Cuda 7 and Cuda 8 support
* thrust 1.9.8 for clang10
* compiler warnings
* Disable broken test_init.test_rsp_const_init
* Disable tests invoking NumpyBooleanAssignForwardCPU
* Fix out of bounds access in SoftmaxGradOpType
* Use CentOS 7 for staticbuilds. CentOS 7 fulfills the requirements for PEP 599 manylinux-2014 and provides a C++17 toolchain.
* Fix MKLDNNFCBackward
* Update edge toolchain
* Support platforms without rand_r
* Cleanup random.h
* Greatly simplify qemu setup
* Remove unused functions in Jenkins_steps.groovy
* Skip quantized_elemwise_mul due to QuantizedElemwiseMulOpShape bug
* Fix R package installation https://github.com/apache/incubator-mxnet/issues/18042
* Fix centos ccache
* Fix GPU Makefile staticbuild on CentOS7
* CentOS7 NCCL
* CentOS7 staticbuild fix link with libculibos

2020-04-14 10:29:29 -07:00

        -DCMAKE_TOOLCHAIN_FILE=${CMAKE_TOOLCHAIN_FILE} \

Remove USE_MKL_IF_AVAILABLE flag (#20004) USE_BLAS cmake option can be set to choose a particular BLAS. For example, USE_BLAS=mkl or USE_BLAS=open. If USE_BLAS is not specified, we search if MKL is available and use MKL if it is available. Otherwise OpenBLAS is used.

2021-03-12 20:44:24 +01:00

        -DUSE_BLAS=Open \

Switch to C++17 and modernize toolchain + CI (#17984)

2020-04-14 10:29:29 -07:00

        -DUSE_OPENCV=OFF \

CI docker revamp; Add Jetson, Raspberry and CentOS 7 build [MXNET-42][MXNET-43] (#9995) * Start dockerfile revamp * Fix lint * Fix lint * Fix lint * Add python unit test * Add new dependency * Add user creation * Fix file permission * Determine USER_ID automatically * Remove ENV command in dockerfile * Remove python nose timer, improve useradd * Enable nvidia docker * Add remaining unittests * Add CentOS 7 unittests * Add integration tests * Add TVM and LLVM to dockerfile * Add ARMv7 and ARMv8 to build * Fix amalgamation build * Improvements and android_arm64 fixes, missing removing pthread * Jetson fix (install unzip) * Build jetson with make until issue with libomp.so is resolved #10011 * Fix Amalgamation builds * Fix Jetson build by switching to cmake * Assign CentOS gpu test to gpu instance * Fix R builds * Assign jobs to right docker containers * Fix missing file permissions inside docker * Create homedir on centos * Enable lapack * Fix Lapack on Cent OS 7 * Disable MXNET_MKLDNN_DEBUG * Delete Dockerfiles * Last general refinements before finish * Remove docker_multiarch folder, superseded by the new ci scripts * Address review comments * Fix Caffe integration test * Fix deploy stage * Address review comments * Address review comments * Enable script to run on Mac

2018-03-09 23:33:10 +01:00

        -G Ninja /work/mxnet

Reduce load on CI due to excessive log flood (#17629)

2020-02-19 21:22:57 -08:00

    ninja

CI docker revamp; Add Jetson, Raspberry and CentOS 7 build [MXNET-42][MXNET-43] (#9995)

2018-03-09 23:33:10 +01:00
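
The USE_BLAS behaviour described in the commit message for #20004 can be sketched in shell. This is a minimal illustration rather than code from runtime_functions.sh: blas_cmake_flag is a hypothetical helper, and the configure command is only echoed, not executed.

```shell
#!/usr/bin/env bash
# Sketch only: blas_cmake_flag is a hypothetical helper, not part of the CI script.
# With an argument it prints the cmake option that pins a BLAS implementation.
# Without one it prints nothing, so the build falls back to auto-detection
# (MKL if it is found, otherwise OpenBLAS), as the commit message describes.
blas_cmake_flag() {
    local choice="${1:-}"
    if [ -n "$choice" ]; then
        printf '%s\n' "-DUSE_BLAS=${choice}"
    fi
}

# Dry run: print the configure command instead of running cmake.
echo cmake $(blas_cmake_flag Open) -G Ninja /work/mxnet
# prints: cmake -DUSE_BLAS=Open -G Ninja /work/mxnet
```

Passing the option explicitly, as the CI functions here do with -DUSE_BLAS=Open, keeps the build result independent of whatever BLAS libraries happen to be installed in the image.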

[MXNET-57] Add android64 build (#11188) (#11055) * Fixes sources for android64 Add initial android64 build logic. Remove pthread when linking MXNet in Android. Bionic provides built-in support for threads. * Simplify and fix android64 build

2018-07-19 17:18:04 +02:00

build_android_armv8() {

2018-06-15 19:33:33 +02:00

    set -ex

cmake: x86 options only on x86 and remove manual specification on CI (#18588) Use CMAKE_SYSTEM_PROCESSOR to detect target architecture and make x86 related options available only when compiling for x86. Remove the code turning these options manually off on CI. Remove ANDROID cmake option which was used to decide if -lpthread needs to be specified explicitly (on most Linux systems) or not (on Android). Instead auto-detect the behavior.

2020-06-19 14:46:27 -07:00

    # ANDROID_ABI and ANDROID_STL are options of the CMAKE_TOOLCHAIN_FILE

Switch to C++17 and modernize toolchain + CI (#17984)

2020-04-14 10:29:29 -07:00

    cmake \

Remove USE_MKL_IF_AVAILABLE flag (#20004)

2021-03-12 20:44:24 +01:00

        -DUSE_BLAS=Open \

Switch to C++17 and modernize toolchain + CI (#17984)

2020-04-14 10:29:29 -07:00

        -DUSE_OPENCV=OFF \

[MXNET-57] Add android64 build (#11188) (#11055)

2018-06-15 19:33:33 +02:00

        -G Ninja /work/mxnet

Reduce load on CI due to excessive log flood (#17629)

2020-02-19 21:22:57 -08:00

    ninja

[MXNET-57] Add android64 build (#11188) (#11055)

2018-06-15 19:33:33 +02:00
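
The comment in build_android_armv8 notes that ANDROID_ABI and ANDROID_STL are options of the CMAKE_TOOLCHAIN_FILE. A minimal sketch of how such options might be assembled follows. The helper name android_toolchain_args is hypothetical, the values arm64-v8a and c++_shared are assumptions chosen for a 64-bit ARMv8 target, and the fallback toolchain path is a placeholder.

```shell
#!/usr/bin/env bash
# Sketch only: android_toolchain_args is a hypothetical helper.
# ANDROID_ABI and ANDROID_STL are interpreted by the Android NDK's CMake
# toolchain file, not by the project's own CMakeLists.txt, so they only take
# effect when CMAKE_TOOLCHAIN_FILE points at that toolchain file.
android_toolchain_args() {
    local abi="${1:-arm64-v8a}"    # assumed default: 64-bit ARMv8
    local stl="${2:-c++_shared}"   # assumed default: shared libc++ runtime
    printf '%s\n' \
        "-DCMAKE_TOOLCHAIN_FILE=${CMAKE_TOOLCHAIN_FILE:-/path/to/android.toolchain.cmake}" \
        "-DANDROID_ABI=${abi}" \
        "-DANDROID_STL=${stl}"
}

# Dry run: list the arguments that would be handed to cmake, one per line.
android_toolchain_args
```

Keeping the toolchain-specific options next to CMAKE_TOOLCHAIN_FILE makes it clear that they are meaningless for non-Android builds.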

CI docker revamp; Add Jetson, Raspberry and CentOS 7 build [MXNET-42][MXNET-43] (#9995)

2018-03-09 23:33:10 +01:00

build_centos7_cpu() {

CI: Switch to cmake builds for majority of tests (#17645)
The following Makefile based builds are preserved:
1) staticbuild scripts
2) Docs builds. Language binding specific build logic requires further changes
3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it.
4) MKL builds. Waiting for fix of #17641
All Makefile based builds are marked with a "Makefile" postfix in the title.
Improvements to CMake build
- Enable -Werror for RelWithDebInfo build in analogy to "make DEV=1" build
- Add USE_LIBJPEG_TURBO to CMake build
- Improve finding Python 3 executable
Changes to CI setup
- Install protobuf and zmq where missing
- Install up-to-date CMake on Centos 7
- Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor
Code changes
- Disable warnings introduced by GCC7 via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

    set -ex

Switch to C++17 and modernize toolchain + CI (#17984)

2020-04-14 10:29:29 -07:00

    source /opt/rh/devtoolset-7/enable

Opt in to newer GCC C++ ABI on RedHat Developer Toolset (#19182) Target version 11, which first appeared in G++7 and which is supported by default by any G++ version up until 10 (the most recent version at time of writing) due to their respective default value of -fabi-compat-version=11. Generate aliases for ABI version 7, which first appeared in G++ 4.8 and is the G++ version shipped by default on a non-EOL system (RHEL7 based systems such as Amazon Linux 1).

2020-09-19 18:42:34 -07:00

    # Opt in to newer GCC C++ ABI. devtoolset defaults to ABI Version 2.

CI: Switch to cmake builds for majority of tests (#17645)

2020-02-28 11:59:22 -08:00

    cmake \

Change inner mxnet flags nomenclature for oneDNN library (#19944) This change includes: * changing MXNET_USE_MKLDNN flag name to MXNET_USE_ONEDNN * changing USE_MKLDNN flag name to USE_ONEDNN * changing 3rdparty/mkldnn folder name to 3rdparty/onednn * changing include/mkldnn folder name to include/onednn * changing MKLDNN occurrences in build and documentation files to ONEDNN * adding Bartosz Kuncer to contributors list

2021-03-15 17:32:37 +01:00

        -DUSE_ONEDNN=OFF \

CI: Switch to cmake builds for majority of tests (#17645)

2020-02-28 11:59:22 -08:00

        -DUSE_DIST_KVSTORE=ON \

External Operators 2 (#19431) * initial commit * license fix * changed path var, formatting * add test to linux stages in ci * disable test on osx stage in ci * cleaned up example CMakeLists.txt removed -shared from GPU * moved windows check Co-authored-by: Ubuntu <ubuntu@ip-172-31-6-220.us-west-2.compute.internal> Co-authored-by: Manu Seth <sethman@amazon.com>

2020-11-13 23:13:06 -08:00

        -DBUILD_EXTENSION_PATH=/work/mxnet/example/extensions/lib_external_ops \

Enable Large Tensor Support by default (#18625) Test with Large Tensor Support disabled on CentOS builds

2020-11-17 12:09:42 -08:00

        -DUSE_INT64_TENSOR_SIZE=OFF \

Remove USE_MKL_IF_AVAILABLE flag (#20004)

2021-03-12 20:44:24 +01:00

        -DUSE_BLAS=Open \

CI: Switch to cmake builds for majority of tests (#17645)

2020-02-28 11:59:22 -08:00

        -G Ninja /work/mxnet
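
The ABI opt-in described in the commit message for #19182 can be sketched as follows. The ABI versions 11 and 7 are taken from that commit message. The helper name abi_cxxflags and the use of the CXXFLAGS environment variable to carry the flags are assumptions made for illustration.

```shell
#!/usr/bin/env bash
# Sketch only: abi_cxxflags is a hypothetical helper.
# -fabi-version selects the C++ ABI version that G++ targets.
# -fabi-compat-version asks G++ to also emit aliases for symbols whose mangled
# names differ under the older ABI version, so older binaries can still link.
abi_cxxflags() {
    local target="${1:-11}"   # ABI version first available in G++ 7
    local compat="${2:-7}"    # ABI version first available in G++ 4.8
    printf '%s\n' "-fabi-version=${target} -fabi-compat-version=${compat}"
}

# Enable the Developer Toolset only where it is installed.
if [ -f /opt/rh/devtoolset-7/enable ]; then
    . /opt/rh/devtoolset-7/enable
fi

# CMake reads CXXFLAGS from the environment when it first configures a build
# tree, so exporting the flags before calling cmake is one way to apply them.
export CXXFLAGS="$(abi_cxxflags)"
echo "CXXFLAGS=${CXXFLAGS}"
```

Setting the flags once in the environment, before the cmake call, applies them to every C++ target in the build without touching the individual cmake options.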

Change *_mkldnn* test and build scenarios names to *_onednn* (#20034)

2021-03-24 15:15:32 +01:00

build_centos7_onednn() {

[MXNET-138]Create CentOS gcc4.8 & MKLDNN ci-test (#10218) * create centos gcc4.8 & mkldnn ci-test * make test * add missing slash * check gcc version * remove make test

2018-03-26 19:05:47 +08:00

    set -ex

CI: Switch to cmake builds for majority of tests (#17645)

2020-02-28 11:59:22 -08:00

    cd /work/build

Switch to C++17 and modernize toolchain + CI (#17984)

2020-04-14 10:29:29 -07:00

    source /opt/rh/devtoolset-7/enable

Opt in to newer GCC C++ ABI on RedHat Developer Toolset (#19182) Target version 11, which first appeared in G++7 and which is supported by default by any G++ version up until 10 (the most recent version at time of writing) due to their respective default value of -fabi-compat-version=11. Generate aliases for ABI version 7, which first appeared in G++ 4.8 and is the G++ version shipped by default on a non-EOL system (RHEL7 based systems such as Amazon Linux 1).

2020-09-19 18:42:34 -07:00

    # Opt in to newer GCC C++ ABI. devtoolset defaults to ABI Version 2.
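
This listing does not show the line that performs the opt-in. The flag values below are an assumption reconstructed from the #19182 commit message (target ABI version 11, generate aliases for ABI version 7). A minimal sketch of such an opt-in, not necessarily what the script itself does:

```shell
# Assumed flag values, taken from the commit message rather than from the script.
# -fabi-version selects the C++ ABI version that G++ targets.
# -fabi-compat-version makes G++ also emit aliases for symbols whose
# mangled name differs under the given older ABI version.
ABI_FLAGS="-fabi-version=11 -fabi-compat-version=7"

# Append to any CXXFLAGS that are already set so existing flags are preserved.
export CXXFLAGS="${CXXFLAGS:+$CXXFLAGS }$ABI_FLAGS"
echo "CXXFLAGS=$CXXFLAGS"
```

Exporting the variable before the `cmake` call lets CMake pick the flags up from the environment when it first configures the build directory.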

Remove USE_MKL_IF_AVAILABLE flag (#20004) USE_BLAS cmake option can be set to choose a particular BLAS. For example, USE_BLAS=mkl or USE_BLAS=open. If USE_BLAS is not specified, we search if MKL is available and use MKL if it is available. Otherwise OpenBLAS is used.

2021-03-12 20:44:24 +01:00

    cmake -DUSE_BLAS=Open \

Change inner mxnet flags nomenclature for oneDNN library (#19944) This change includes: * changing MXNET_USE_MKLDNN flag name to MXNET_USE_ONEDNN * changing USE_MKLDNN flag name to USE_ONEDNN * changing 3rdparty/mkldnn folder name to 3rdparty/onednn * changing include/mkldnn folder name to include/onednn * changing MKLDNN occurrences in build and documentation files to ONEDNN * adding Bartosz Kuncer to contributors list

2021-03-15 17:32:37 +01:00

        -DUSE_ONEDNN=ON \

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

        -DUSE_CUDA=OFF \

Enable Large Tensor Support by default (#18625) Test with Large Tensor Support disabled on CentOS builds

2020-11-17 12:09:42 -08:00

        -DUSE_INT64_TENSOR_SIZE=OFF \

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

        -G Ninja /work/mxnet
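
The configure call above pins OpenBLAS explicitly with `-DUSE_BLAS=Open`. The #20004 commit message describes what happens when the option is left unset: MKL is used if it is found, otherwise OpenBLAS. That rule can be sketched in shell as follows. The function `choose_blas` and its arguments are hypothetical names for illustration only; the real selection happens inside the CMake build, not in this script.

```shell
# Hypothetical helper that mirrors the selection rule from the commit message.
# $1: value passed as USE_BLAS (may be empty)
# $2: "yes" if an MKL installation was detected, anything else otherwise
choose_blas() {
    requested="$1"
    mkl_found="$2"
    if [ -n "$requested" ]; then
        # An explicit choice always wins.
        echo "$requested"
    elif [ "$mkl_found" = "yes" ]; then
        echo "mkl"
    else
        echo "open"
    fi
}

choose_blas "Open" "yes"   # explicit request, prints Open
choose_blas "" "yes"       # unset and MKL present, prints mkl
choose_blas "" "no"        # unset and no MKL, prints open
```

Passing the option explicitly, as the CI functions here do, keeps the build independent of whatever BLAS libraries happen to be installed in the image.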

[MXNET-138]Create CentOS gcc4.8 & MKLDNN ci-test (#10218) * create centos gcc4.8 & mkldnn ci-test * make test * add missing slash * check gcc version * remove make test

2018-03-26 19:05:47 +08:00

CI docker revamp; Add Jetson, Raspberry and CentOS 7 build [MXNET-42][MXNET-43] (#9995) * Start dockerfile revamp * Fix lint * Fix lint * Fix lint * Add python unit test * Add new dependency * Add user creation * Fix file permission * Determine USER_ID automatically * REmove ENV command in dockerfile * Remove python nose timer, improve useradd * ENable nvidia docker * Add remaining unittests * Add CentOS 7 unittests * Add integration tests * Add TVM and LLVM to dockerfile * Add ARMv7 and ARMv8 to build * Fix amalgamation build * Improvements and android_arm64 fixes, missing removing pthread * Jetson fix (install unzip) * Build jetson with make until issue with libomp.so is resolved #10011 * Fix Amalgamation builds * Fix Jetson build by switching to cmake * Assign CentOS gpu test to gpu instance * Fix R builds * Assign jobs to right docker containers * Fix missing file permissions inside docker * Create homedir on centos * Enable lapack * Fix Lapack on Cent OS 7 * Disable MXNET_MKLDNN_DEBUG * Delete Dockerfiles * Last general refinements before finish * Remove docker_multiarch folder, superseeded by the new ci scripts * Address review comments * Fix Caffe Integrationstest * Fix deploy stage * Address review comments * Address review comments * Enable script to run on Mac

2018-03-09 23:33:10 +01:00

build_centos7_gpu() {

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

    cd /work/build

Switch to C++17 and modernize toolchain + CI (#17984) As per #17968, require C++17 compatible compiler. For cuda code, use C++14 mode introduced in Cuda 9. C++17 support for Cuda will be available in Cuda 11. Switching to C++17 requires modernizing the toolchain, which exposed a number of technical debt issues in the codebase. All blocking issues are fixed as part of this PR. See the full list below. This PR contains the following specific changes: Switch CI pipeline to use gcc7 on Ubuntu and CentOS Switch CD pipeline to CentOS 7 with https://www.softwarecollections.org/en/scls/rhscl/devtoolset-7/ This enables us to build with gcc7 C++17 compiler while keeping a relatively old glibc requirement for distribution. Simplify ARM Edge builds Switch to standard Ubuntu / Debian cross-compilation toolchain for ARMv7, ARMv8 Switch to https://toolchains.bootlin.com/ toolchain for ARMv6 (the Debian ARMv6 toolchain is for ARMv4 + ARMv5 + ARMv6, but we wish to only target ARMv6 and make use of ARMv6 features) Remove reliance on dockcross for cross compilation. Simplify Jetson build Use standard Ubuntu / Debian cross-compilation toolchain for ARMv8 Upgrade to Cuda 10 and Jetpack 4.3 Simplify build setup Simplify QEMU ARM virtualization test setup on CI Remove complex "Virtual Machine in Docker" logic and run a QEMU based Docker container instead based on arm32v7/ubuntu Fix out of bounds vector accesses in SoftmaxGradOpType MKLDNNFCBackward Fix use of non-standard rand_r function (which is not available on anymore on newer Android toolchains and shouldn't be use in any case). Fix reproducibility of RNN with Dropout Fix reproducibility of DGL Graph Sampling Operators Update tests for Android Edge build to NDK19. The previously used standalone toolchain is obsolete. 
Those Dockerfiles that required refactoring as part of the effort were refactored based on the following consideration Maximize the use of system dependencies provided by the distribution instead of manually installing dependencies from source or from third party vendors. This reduces the complexity of the installation process and essentially pins the dependency versions, increasing CI stability. Further, Dockerfile build speed is improved. To facilitate this, use recent distribution versions. We still ensure backwards compatibility via CentOS7 based build and test stages Minimize the number of layers in the Dockerfile. Don't have 5 different script files executed, each calling apt-get update and install, but just execute once. Speeds up the build and reduces image size. Keep each Dockerfile simple and tailored to a purpose, instead of running 20 scripts to install dependencies for every thinkable scenario, which is unmaintainable. Some more small changes: Remove outdated references to Cuda 7 and Cuda 8 in various files. Remove C++03 support in mshadow Disable broken tests NumpyBooleanAssignForwardCPU #17990 test_init.test_rsp_const_init #17988 quantized_elemwise_mul #18034 List of squashed commits * cpp standard * Remove leftover files of Cuda 7 and Cuda 8 support * thrust 1.9.8 for clang10 * compiler warnings * Disable broken test_init.test_rsp_const_init * Disable tests invoking NumpyBooleanAssignForwardCPU * Fix out of bounds access in SoftmaxGradOpType * Use CentOS 7 for staticbuilds CentOS 7 fullfills the requirements for PEP 599 manylinux-2014 and provides a C++17 toolchain. 
* Fix MKLDNNFCBackward * Update edge toolchain * Support platforms without rand_r * Cleanup random.h * Greatly simplify qemu setup * Remove unused functions in Jenkins_steps.groovy * Skip quantized_elemwise_mul due QuantizedElemwiseMulOpShape bug * Fix R package installation https://github.com/apache/incubator-mxnet/issues/18042 * Fix centos ccache * Fix GPU Makefile staticbuild on CentOS7 * CentOS7 NCCL * CentOS7 staticbuild fix link with libculibos

2020-04-14 10:29:29 -07:00

    source /opt/rh/devtoolset-7/enable

Opt in to newer GCC C++ ABI on RedHat Developer Toolset (#19182) Target version 11, which first appeared in G++7 and which is supported by default by any G++ version up until 10 (the most recent version at time of writing) due to their respective default value of -fabi-compat-version=11. Generate aliases for ABI version 7, which first appeared in G++ 4.8 and is the G++ version shipped by default on a non-EOL system (RHEL7 based systems such as Amazon Linux 1).

2020-09-19 18:42:34 -07:00

    # Opt in to newer GCC C++ ABI. devtoolset defaults to ABI Version 2.

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

    cmake \

Remove USE_MKL_IF_AVAILABLE flag (#20004) USE_BLAS cmake option can be set to choose a particular BLAS. For example, USE_BLAS=mkl or USE_BLAS=open. If USE_BLAS is not specified, we search if MKL is available and use MKL if it is available. Otherwise OpenBLAS is used.

2021-03-12 20:44:24 +01:00

        -DUSE_BLAS=Open \

Change inner mxnet flags nomenclature for oneDNN library (#19944) This change includes: * changing MXNET_USE_MKLDNN flag name to MXNET_USE_ONEDNN * changing USE_MKLDNN flag name to USE_ONEDNN * changing 3rdparty/mkldnn folder name to 3rdparty/onednn * changing include/mkldnn folder name to include/onednn * changing MKLDNN occurrences in build and documentation files to ONEDNN * adding Bartosz Kuncer to contributors list

2021-03-15 17:32:37 +01:00

        -DUSE_ONEDNN=ON \

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

        -DUSE_CUDA=ON \

[FEATURE] Add g5 instance to CI (#20876) * Add g5 instance to jenkinsfiles where both p3 and g4 are mentioned * Remove reference to non-existent restricted-mxnetlinux-gpu-g5 * Enable unittest job on g5 * Fix Jenkinsfile_unix_gpu syntax * Include A10G arch 86 in build for g5 * Update is_TF32_enabled() for SM arch > 80 * Remove gpu arch 86 from centos builds on cuda 10 * Fix test_convolution_{grouping,dilated_impulse_response}, test_np_linalg_qr * Fix test_convolution_grouping on A100 * Fix test_rnn_unroll_variant_length * Fix test_convolution_dilated_impulse_response * Skip test_np_standard_binary_funcs test of 0-dim array broadcast * Temporarily add '-s' to pytest cpu tests * Revert "Temporarily add '-s' to pytest cpu tests" This reverts commit 4a9056a26f8c210497e3b5ed2318e30c8c2dbc5e. * Improve test_rnn_layers_fp{16,32} invocation * Pin MarkupSafe==2.0.1 to avoid soft_unicode import failure * Run test_rnn_layers_fp32 only when cuDNN is present * Fix potential out-of-bounds write in count_sketch.cu * Revert "Pin MarkupSafe==2.0.1 to avoid soft_unicode import failure" This reverts commit ae17b1f2af787427740c66a05ee1fb733ea56dd3.

2022-03-08 15:00:20 -08:00

        -DMXNET_CUDA_ARCH="$CI_CMAKE_CUDA10_ARCH" \

Enable Large Tensor Support by default (#18625) Test with Large Tensor Support disabled on CentOS builds

2020-11-17 12:09:42 -08:00

        -DUSE_DIST_KVSTORE=ON \

External Operators 2 (#19431) * initial commit * license fix * changed path var, formatting * add test to linux stages in ci * disable test on osx stage in ci * cleaned up example CMakeLists.txt removed -shared from GPU * moved windows check Co-authored-by: Ubuntu <ubuntu@ip-172-31-6-220.us-west-2.compute.internal> Co-authored-by: Manu Seth <sethman@amazon.com>

2020-11-13 23:13:06 -08:00

        -DBUILD_EXTENSION_PATH=/work/mxnet/example/extensions/lib_external_ops \

Enable Large Tensor Support by default (#18625) Test with Large Tensor Support disabled on CentOS builds

2020-11-17 12:09:42 -08:00

        -DUSE_INT64_TENSOR_SIZE=OFF \

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

        -G Ninja /work/mxnet

CI docker revamp; Add Jetson, Raspberry and CentOS 7 build [MXNET-42][MXNET-43] (#9995) * Start dockerfile revamp * Fix lint * Fix lint * Fix lint * Add python unit test * Add new dependency * Add user creation * Fix file permission * Determine USER_ID automatically * REmove ENV command in dockerfile * Remove python nose timer, improve useradd * ENable nvidia docker * Add remaining unittests * Add CentOS 7 unittests * Add integration tests * Add TVM and LLVM to dockerfile * Add ARMv7 and ARMv8 to build * Fix amalgamation build * Improvements and android_arm64 fixes, missing removing pthread * Jetson fix (install unzip) * Build jetson with make until issue with libomp.so is resolved #10011 * Fix Amalgamation builds * Fix Jetson build by switching to cmake * Assign CentOS gpu test to gpu instance * Fix R builds * Assign jobs to right docker containers * Fix missing file permissions inside docker * Create homedir on centos * Enable lapack * Fix Lapack on Cent OS 7 * Disable MXNET_MKLDNN_DEBUG * Delete Dockerfiles * Last general refinements before finish * Remove docker_multiarch folder, superseeded by the new ci scripts * Address review comments * Fix Caffe Integrationstest * Fix deploy stage * Address review comments * Address review comments * Enable script to run on Mac

2018-03-09 23:33:10 +01:00

[MXNET-472] ccache for docker builds (#11151) * [MXNET-472] Add ccache support to docker builds * Added ccache stages to all containers * Refactored ccache installs in docker images * Reformatted build.py * Added ccache install to android docker builds * Improved setting ccache directory and max cache size * Added ccache to cmake based docker builds * Removed unnessesary yum install from centos7 ccache build * Added default compilers for ccache for docker builds * Added README comment about ccache mapping into docker builds * Reverted not working ccache configurations * Added comments about ccache installation * Move install scripts * Update ubuntu_r.sh

2018-06-08 03:12:30 +02:00

build_ubuntu_cpu() {

CI docker revamp; Add Jetson, Raspberry and CentOS 7 build [MXNET-42][MXNET-43] (#9995) * Start dockerfile revamp * Fix lint * Fix lint * Fix lint * Add python unit test * Add new dependency * Add user creation * Fix file permission * Determine USER_ID automatically * REmove ENV command in dockerfile * Remove python nose timer, improve useradd * ENable nvidia docker * Add remaining unittests * Add CentOS 7 unittests * Add integration tests * Add TVM and LLVM to dockerfile * Add ARMv7 and ARMv8 to build * Fix amalgamation build * Improvements and android_arm64 fixes, missing removing pthread * Jetson fix (install unzip) * Build jetson with make until issue with libomp.so is resolved #10011 * Fix Amalgamation builds * Fix Jetson build by switching to cmake * Assign CentOS gpu test to gpu instance * Fix R builds * Assign jobs to right docker containers * Fix missing file permissions inside docker * Create homedir on centos * Enable lapack * Fix Lapack on Cent OS 7 * Disable MXNET_MKLDNN_DEBUG * Delete Dockerfiles * Last general refinements before finish * Remove docker_multiarch folder, superseeded by the new ci scripts * Address review comments * Fix Caffe Integrationstest * Fix deploy stage * Address review comments * Address review comments * Enable script to run on Mac

2018-03-09 23:33:10 +01:00

build_ubuntu_cpu_openblas() {

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

    set -ex

use CC=gcc-7 CXX=g++-7 for all unix CI builds (#19701) * use CC=gcc-7 CXX=g++-7 for all unix CI builds * install gcc-7 and g++-7 * remove apt install cmake in favor of existing pip3 installation

2020-12-22 17:33:36 -08:00

    CXXFLAGS="-Wno-error=strict-overflow" CC=gcc-7 CXX=g++-7 cmake \

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

        -DCMAKE_BUILD_TYPE="RelWithDebInfo" \

CI: Re-enable code coverage for CPU builds (#17889)

2020-04-04 01:51:56 +00:00

        -DENABLE_TESTCOVERAGE=ON \

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

        -DUSE_TVM_OP=ON \

Remove USE_MKL_IF_AVAILABLE flag (#20004) USE_BLAS cmake option can be set to choose a particular BLAS. For example, USE_BLAS=mkl or USE_BLAS=open. If USE_BLAS is not specified, we search if MKL is available and use MKL if it is available. Otherwise OpenBLAS is used.

2021-03-12 20:44:24 +01:00

        -DUSE_BLAS=Open \

Change inner mxnet flags nomenclature for oneDNN library (#19944) This change includes: * changing MXNET_USE_MKLDNN flag name to MXNET_USE_ONEDNN * changing USE_MKLDNN flag name to USE_ONEDNN * changing 3rdparty/mkldnn folder name to 3rdparty/onednn * changing include/mkldnn folder name to include/onednn * changing MKLDNN occurrences in build and documentation files to ONEDNN * adding Bartosz Kuncer to contributors list

2021-03-15 17:32:37 +01:00

        -DUSE_ONEDNN=OFF \

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

        -DUSE_CUDA=OFF \

External Operators 2 (#19431) * initial commit * license fix * changed path var, formatting * add test to linux stages in ci * disable test on osx stage in ci * cleaned up example CMakeLists.txt removed -shared from GPU * moved windows check Co-authored-by: Ubuntu <ubuntu@ip-172-31-6-220.us-west-2.compute.internal> Co-authored-by: Manu Seth <sethman@amazon.com>

2020-11-13 23:13:06 -08:00

        -DBUILD_EXTENSION_PATH=/work/mxnet/example/extensions/lib_external_ops \

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

        -G Ninja /work/mxnet

Attempt to fix website build pipeline (#20634) * set nproc for build_ubuntu_cpu_openblas * set nproc for build_ubuntu_gpu Co-authored-by: Wei Chu <weichu@amazon.com>

2021-10-05 18:08:58 -07:00

    ninja -j$(($(nproc)/2))
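
The line above runs Ninja with half as many jobs as there are cores. Shell integer division rounds down, so the expression evaluates to 0 on a single-core host, which is unlikely to be the intended job count. The sketch below computes the same value and clamps it to at least one job. The clamp and the variable names `cores` and `jobs` are additions for illustration and are not in the script.

```shell
# Half the available cores, as in the script line above.
cores=$(nproc)
jobs=$(( cores / 2 ))

# Integer division yields 0 when only one core is available.
# Clamp to at least one job (illustrative addition, not in the original).
if [ "$jobs" -lt 1 ]; then
    jobs=1
fi

echo "would run: ninja -j${jobs}"
```

Limiting the job count this way trades build time for lower peak memory use, which matches the stated aim of the #20634 commit: stabilising the website build pipeline.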

							

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

Add Intel MKL blas to Jenkins (#13607) * add mkl blas to Jenkins * add mkl install script * fix bug in mkl script * remove python2 ut and add cpu-mkl node

2018-12-12 03:57:09 +08:00

build_ubuntu_cpu_mkl() {

Fix MKL static link & default to static link on unix (#17751) * Fix MKL static link & default to static link on unix Fixes https://github.com/apache/incubator-mxnet/issues/17641 * Test cmake MKL build on CI

2020-03-04 08:55:01 -08:00

    cd /work/build

use CC=gcc-7 CXX=g++-7 for all unix CI builds (#19701) * use CC=gcc-7 CXX=g++-7 for all unix CI builds * install gcc-7 and g++-7 * remove apt install cmake in favor of existing pip3 installation

2020-12-22 17:33:36 -08:00

    CC=gcc-7 CXX=g++-7 cmake \

Fix MKL static link & default to static link on unix (#17751) * Fix MKL static link & default to static link on unix Fixes https://github.com/apache/incubator-mxnet/issues/17641 * Test cmake MKL build on CI

2020-03-04 08:55:01 -08:00

        -DCMAKE_BUILD_TYPE="RelWithDebInfo" \

Disable test coverage in MKL builds (#18443) * Disable test coverage in MKL builds * Enable test parallelization * Set OMP_NUM_THREADS * Fix * Fix unpack_and_init

2020-07-15 00:57:38 +00:00

        -DENABLE_TESTCOVERAGE=OFF \

Change inner mxnet flags nomenclature for oneDNN library (#19944) This change includes: * changing MXNET_USE_MKLDNN flag name to MXNET_USE_ONEDNN * changing USE_MKLDNN flag name to USE_ONEDNN * changing 3rdparty/mkldnn folder name to 3rdparty/onednn * changing include/mkldnn folder name to include/onednn * changing MKLDNN occurences in build and documentation files to ONEDNN * adding Bartosz Kuncer to contributors list

2021-03-15 17:32:37 +01:00

        -DUSE_ONEDNN=OFF \

Fix MKL static link & default to static link on unix (#17751) * Fix MKL static link & default to static link on unix Fixes https://github.com/apache/incubator-mxnet/issues/17641 * Test cmake MKL build on CI

2020-03-04 08:55:01 -08:00

        -DUSE_CUDA=OFF \

[PERFORMANCE] [master] Layer normalization code from Marian for CPU (#19602)

* Layer normalization code from Marian
* Remove MKL version of LayerNorm.

Experiment with OMP_NUM_THREADS=4, times in s, c5.12xlarge

| batch x channel | New code | MKL |
|---|---|---|
| 1x 32 | 0.0000288 | 0.0000278 |
| 128x 32 | 0.0000308 | 0.0000311 |
| 2560x 32 | 0.0000712 | 0.0000672 |
| 4096x 32 | 0.0000946 | 0.0000910 |
| 8192x 32 | 0.0001597 | 0.0001523 |
| 16384x 32 | 0.0002905 | 0.0002619 |
| 1x 64 | 0.0000264 | 0.0000256 |
| 128x 64 | 0.0000339 | 0.0000330 |
| 2560x 64 | 0.0000829 | 0.0000972 |
| 4096x 64 | 0.0001137 | 0.0001356 |
| 8192x 64 | 0.0002027 | 0.0002435 |
| 16384x 64 | 0.0003715 | 0.0004639 |
| 1x 128 | 0.0000262 | 0.0000263 |
| 128x 128 | 0.0000325 | 0.0000389 |
| 2560x 128 | 0.0001074 | 0.0001580 |
| 4096x 128 | 0.0001505 | 0.0002336 |
| 8192x 128 | 0.0002861 | 0.0004481 |
| 16384x 128 | 0.0005648 | 0.0008613 |
| 1x 256 | 0.0000273 | 0.0000276 |
| 128x 256 | 0.0000390 | 0.0000431 |
| 2560x 256 | 0.0001533 | 0.0002811 |
| 4096x 256 | 0.0002258 | 0.0004300 |
| 8192x 256 | 0.0004300 | 0.0008464 |
| 16384x 256 | 0.0010436 | 0.0017613 |
| 1x 512 | 0.0000256 | 0.0000302 |
| 128x 512 | 0.0000408 | 0.0000551 |
| 2560x 512 | 0.0002444 | 0.0005225 |
| 4096x 512 | 0.0003828 | 0.0008147 |
| 8192x 512 | 0.0008832 | 0.0017192 |
| 16384x 512 | 0.0058463 | 0.0074497 |
| 1x 768 | 0.0000252 | 0.0000308 |
| 128x 768 | 0.0000450 | 0.0000676 |
| 2560x 768 | 0.0003440 | 0.0007719 |
| 4096x 768 | 0.0005890 | 0.0013346 |
| 8192x 768 | 0.0014946 | 0.0026145 |
| 16384x 768 | 0.0089495 | 0.0113557 |
| 1x 1024 | 0.0000285 | 0.0000308 |
| 128x 1024 | 0.0000487 | 0.0000786 |
| 2560x 1024 | 0.0004614 | 0.0010190 |
| 4096x 1024 | 0.0008083 | 0.0017376 |
| 8192x 1024 | 0.0059020 | 0.0075588 |
| 16384x 1024 | 0.0116553 | 0.0146855 |

Benchmark program

```python
import mxnet as mx
import time

def time_procedure(shape, count):
    data = mx.nd.random_uniform(shape=shape, low=-1.0, high=1.0)
    factors = mx.nd.random_uniform(shape=(shape[-1],))
    mx.nd.waitall()
    begin = time.time()
    for i in range(0, count):
        out = mx.nd.LayerNorm(data, factors, factors)
        mx.nd.waitall()
    return (time.time() - begin) / count

count = 200

for channel in [32, 64, 128, 256, 512, 768, 1024]:
    for batch in [1, 128, 2560, 4096, 8192, 16384]:
        s = (batch, channel)
        timing = time_procedure(s, count)
        print("{:5d}x{:5d} | {:.7f}".format(s[0], s[1], timing))
```

* Enable pragma omp simd on MSVC
* Fix MSVC error C3016: 'j': index variable in OpenMP 'for' statement must have signed integral type
* Try to make MSVC happy since it doesn't have ssize_t
* Revert "Remove MKL version of LayerNorm." This reverts commit 740c4726c3068ac30b3809cd6280fa7e91af8c52.
* Restore MKL layer normalization code, but it isn't called yet
* Pull division out of the hot loop
* Option to use MKL version requested by @samskalicky
* Add -DUSE_MKL_LAYERNORM=ON to ubuntu MKL CPU test

Co-authored-by: Kenneth Heafield <kheafiel@amazon.com>

2021-01-04 08:57:05 +00:00

        -DUSE_MKL_LAYERNORM=ON \

Fix MKL static link & default to static link on unix (#17751) * Fix MKL static link & default to static link on unix Fixes https://github.com/apache/incubator-mxnet/issues/17641 * Test cmake MKL build on CI

2020-03-04 08:55:01 -08:00

        -DUSE_BLAS=MKL \

External Operators 2 (#19431) * initial commit * license fix * changed path var, formatting * add test to linux stages in ci * disable test on osx stage in ci * cleaned up example CMakeLists.txt removed -shared from GPU * moved windows check Co-authored-by: Ubuntu <ubuntu@ip-172-31-6-220.us-west-2.compute.internal> Co-authored-by: Manu Seth <sethman@amazon.com>

2020-11-13 23:13:06 -08:00

        -DBUILD_EXTENSION_PATH=/work/mxnet/example/extensions/lib_external_ops \

Fix MKL static link & default to static link on unix (#17751) * Fix MKL static link & default to static link on unix Fixes https://github.com/apache/incubator-mxnet/issues/17641 * Test cmake MKL build on CI

2020-03-04 08:55:01 -08:00

        -GNinja /work/mxnet

Add Intel MKL blas to Jenkins (#13607) * add mkl blas to Jenkins * add mkl install script * fix bug in mkl script * remove python2 ut and add cpu-mkl node

2018-12-12 03:57:09 +08:00

Add unit test stage for mxnet cpu in debug mode (#11974)

2018-08-03 00:37:12 +02:00

build_ubuntu_cpu_cmake_debug() {

use CC=gcc-7 CXX=g++-7 for all unix CI builds (#19701) * use CC=gcc-7 CXX=g++-7 for all unix CI builds * install gcc-7 and g++-7 * remove apt install cmake in favor of existing pip3 installation

2020-12-22 17:33:36 -08:00

    CC=gcc-7 CXX=g++-7 cmake \

CI: Re-enable code coverage for CPU builds (#17889)

2020-04-04 01:51:56 +00:00

        -DCMAKE_BUILD_TYPE=Debug \

Add unit test stage for mxnet cpu in debug mode (#11974)

2018-08-03 00:37:12 +02:00

        -DUSE_CUDA=OFF \

Enable tvm_op for ci (#15889) * enable tvm_op for ci * specify python3 bin * move rpath to top * move tvm op dep forward * add ldd debug info * add libtvm_runtime.so to mx_lib_cython * add ldd debug for py3 * fix libtvm_runtime path for cmake * cp libtvm_runtime.so when make rpkg * add libtvm_runtime.so to scala-pkg * add python3 to cmake in unix-gpu build * hack: add cuda to ld_path in cmake * add LD_LIBRARY_PATH into cmake tvm op * add /usr/local/cuda/compat to Dockerfile.build.ubuntu_gpu_cu101 LD_LIBRARY_PATH * remove unused codes * remove USE_TVM_OP from build_ubuntu_cpu_large_tensor

2019-09-05 14:39:38 +08:00

        -DUSE_TVM_OP=ON \

Remove USE_MKL_IF_AVAILABLE flag (#20004) USE_BLAS cmake option can be set to choose a particular BLAS. For example, USE_BLAS=mkl or USE_BLAS=open. If USE_BLAS is not specified, we search if MKL is available and use MKL if it is available. Otherwise OpenBLAS is used.

2021-03-12 20:44:24 +01:00

        -DUSE_BLAS=Open \

Add unit test stage for mxnet cpu in debug mode (#11974)

2018-08-03 00:37:12 +02:00

        -DUSE_OPENMP=OFF \

Add CPU test coverage and refine cmake builds (#13338)

2019-01-08 14:01:44 +01:00

        -DUSE_SIGNAL_HANDLER=ON \

Add unit test stage for mxnet cpu in debug mode (#11974)

2018-08-03 00:37:12 +02:00

        -G Ninja \

Reduce load on CI due to excessive log flood (#17629)

2020-02-19 21:22:57 -08:00

    ninja
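The debug build above pins `-DUSE_BLAS=Open` explicitly. According to the commit message for #20004, leaving `USE_BLAS` unset makes the CMake build look for MKL and fall back to OpenBLAS when MKL is not found. A rough shell sketch of that decision order follows. `choose_blas` is a hypothetical helper, and its second argument merely stands in for whatever MKL detection the CMake scripts actually perform.

```shell
# Hypothetical illustration of the selection order described in #20004.
# $1: explicit USE_BLAS value (may be empty)
# $2: "yes" if MKL was found (assumed to be decided elsewhere)
choose_blas() {
    local requested="$1"
    local mkl_found="$2"
    if [ -n "$requested" ]; then
        echo "$requested"          # an explicit choice always wins
    elif [ "$mkl_found" = "yes" ]; then
        echo "mkl"                 # unset and MKL available: use MKL
    else
        echo "open"                # unset and no MKL: use OpenBLAS
    fi
}

choose_blas "Open" "yes"   # prints Open
choose_blas ""     "yes"   # prints mkl
choose_blas ""     "no"    # prints open
```

Passing the flag explicitly, as the CI functions here do, keeps the result independent of whether MKL happens to be installed in the build image.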

Add unit test stage for mxnet cpu in debug mode (#11974)

2018-08-03 00:37:12 +02:00

Add test pipeline for USE_TVM_OP=OFF on Unix (#16450) * Add no tvm op build and test pipelines * Add steps * Modify display * Fix ci * Try to fix one unit test * Fix * Address cr

2019-10-19 08:19:36 -07:00

build_ubuntu_cpu_cmake_no_tvm_op() {

use CC=gcc-7 CXX=g++-7 for all unix CI builds (#19701) * use CC=gcc-7 CXX=g++-7 for all unix CI builds * install gcc-7 and g++-7 * remove apt install cmake in favor of existing pip3 installation

2020-12-22 17:33:36 -08:00

    CC=gcc-7 CXX=g++-7 cmake \

Add test pipeline for USE_TVM_OP=OFF on Unix (#16450) * Add no tvm op build and test pipelines * Add steps * Modify display * Fix ci * Try to fix one unit test * Fix * Address cr

2019-10-19 08:19:36 -07:00

        -DUSE_CUDA=OFF \

Remove USE_MKL_IF_AVAILABLE flag (#20004) USE_BLAS cmake option can be set to choose a particular BLAS. For example, USE_BLAS=mkl or USE_BLAS=open. If USE_BLAS is not specified, we search if MKL is available and use MKL if it is available. Otherwise OpenBLAS is used.

2021-03-12 20:44:24 +01:00

        -DUSE_BLAS=Open \

Add test pipeline for USE_TVM_OP=OFF on Unix (#16450) * Add no tvm op build and test pipelines * Add steps * Modify display * Fix ci * Try to fix one unit test * Fix * Address cr

2019-10-19 08:19:36 -07:00

        -DUSE_OPENMP=OFF \

External Operators 2 (#19431) * initial commit * license fix * changed path var, formatting * add test to linux stages in ci * disable test on osx stage in ci * cleaned up example CMakeLists.txt removed -shared from GPU * moved windows check Co-authored-by: Ubuntu <ubuntu@ip-172-31-6-220.us-west-2.compute.internal> Co-authored-by: Manu Seth <sethman@amazon.com>

2020-11-13 23:13:06 -08:00

        -DBUILD_EXTENSION_PATH=/work/mxnet/example/extensions/lib_external_ops \

Add test pipeline for USE_TVM_OP=OFF on Unix (#16450) * Add no tvm op build and test pipelines * Add steps * Modify display * Fix ci * Try to fix one unit test * Fix * Address cr

2019-10-19 08:19:36 -07:00

        -G Ninja \

Reduce load on CI due to excessive log flood (#17629)

2020-02-19 21:22:57 -08:00

    ninja

Add test pipeline for USE_TVM_OP=OFF on Unix (#16450) * Add no tvm op build and test pipelines * Add steps * Modify display * Fix ci * Try to fix one unit test * Fix * Address cr

2019-10-19 08:19:36 -07:00

[MXNET-953] - Add ASAN sanitizer, Enable in CI (#12370) * Add ASAN sanitizer to CI * [MXNET-953] Use gcc8 via cmake options Use CMake to specify the compiler used for ASAN. This is more portable and doesn't rely on env vars. * [MXNET-953] Don't fail build on leak detection * [MXNET-953] Include deps required by scala test

2018-09-19 07:06:33 -07:00

build_ubuntu_cpu_cmake_asan() {

Remove USE_MKL_IF_AVAILABLE flag (#20004) USE_BLAS cmake option can be set to choose a particular BLAS. For example, USE_BLAS=mkl or USE_BLAS=open. If USE_BLAS is not specified, we search if MKL is available and use MKL if it is available. Otherwise OpenBLAS is used.

2021-03-12 20:44:24 +01:00

        -DUSE_BLAS=Open \

Change inner mxnet flags nomenclature for oneDNN library (#19944) This change includes: * changing MXNET_USE_MKLDNN flag name to MXNET_USE_ONEDNN * changing USE_MKLDNN flag name to USE_ONEDNN * changing 3rdparty/mkldnn folder name to 3rdparty/onednn * changing include/mkldnn folder name to include/onednn * changing MKLDNN occurences in build and documentation files to ONEDNN * adding Bartosz Kuncer to contributors list

2021-03-15 17:32:37 +01:00

        -DUSE_ONEDNN=OFF \

[MXNET-953] - Add ASAN sanitizer, Enable in CI (#12370) * Add ASAN sanitizer to CI * [MXNET-953] Use gcc8 via cmake options Use CMake to specify the compiler used for ASAN. This is more portable and doesn't rely on env vars. * [MXNET-953] Don't fail build on leak detection * [MXNET-953] Include deps required by scala test

2018-09-19 07:06:33 -07:00

        -DUSE_OPENMP=OFF \

[CI] Test gcc8 -WError build CI (#17752) * Disable printing warnings for 3rdparty/openmp target * Remove unused build_amzn_linux_cpu * Ignore -Wclass-memaccess for gcc8 where needed * Fix uninitialized variables in DeformablePSROIPoolingOp * Update dmlc-core to ignore -Wmaybe-uninitialized in optional.h * Test gcc8 -WError build on CI

2020-03-03 21:01:59 -08:00

build_ubuntu_cpu_gcc8_werror() {

Update Ubuntu images used on CI to 20.04 (#19588) * Update Ubuntu images used on CI to 20.04 This helps ensure MXNet to work well on recent Linux distributions (while ensuring it continues to work well on ancient distributions based on the CentOS7 CI pipeline) * Preserve Ubuntu 18.04 images for TensorRT pipeline as NVidia failed to make TensorRT available for Ubuntu 20.04 * Temporarily disable NVML on CI [2020-12-03T18:33:10.380Z] OSError: /work/mxnet/python/mxnet/../../build/libmxnet.so: undefined symbol: nvmlDeviceGetComputeRunningProcesses_v2

2020-12-03 19:58:14 -07:00

    CC=gcc-8 CXX=g++-8 cmake \

Remove USE_MKL_IF_AVAILABLE flag (#20004) USE_BLAS cmake option can be set to choose a particular BLAS. For example, USE_BLAS=mkl or USE_BLAS=open. If USE_BLAS is not specified, we search if MKL is available and use MKL if it is available. Otherwise OpenBLAS is used.

2021-03-12 20:44:24 +01:00

        -DUSE_BLAS=Open \

[CI] Test gcc8 -WError build CI (#17752) * Disable printing warnings for 3rdparty/openmp target * Remove unused build_amzn_linux_cpu * Ignore -Wclass-memaccess for gcc8 where needed * Fix uninitialized variables in DeformablePSROIPoolingOp * Update dmlc-core to ignore -Wmaybe-uninitialized in optional.h * Test gcc8 -WError build on CI

2020-03-03 21:01:59 -08:00

        -DUSE_CUDA=OFF \

CI: Test clang10 cpu & gpu builds with -WError (#17830) * Fix Wunused-variable * Fix Wreturn-std-move * Fix Wunused-const-variable * Fix Winconsistent-missing-override * Fix Wdelete-non-abstract-non-virtual-dtor * Fix Wrange-loop-construct * Disable Wpass-failed=transform-warning warning: loop not unrolled: the optimizer was unable to perform the requested transformation; the transformation might be disabled or specified as part of an unsupported transformation ordering * Fix Wimplicit-int-float-conversion 'float' changes value from 2147483647 to 2147483648 * Fix Wunused-lambda-capture * Fix Wundefined-var-template * cuda: --expt-relaxed-constexpr warning: calling a constexpr __host__ function from a __host__ __device__ function is not allowed. The experimental flag '--expt-relaxed-constexpr' can be used to allow this. * Fix Wrange-loop-construct avoiding extra copies * Fix Wunused-private-field * Fix Wwritable-strings * Enable Clang10 -WError checking on CI * Fix -WError with mkldnn -Wliteral-conversion, -Wabsolute-value, -Wunused-private-field, -Wimplicit-int-float-conversion * Fix shuffle_op.cc * Fix use of old binutils * Print traceback on exception in OpWrapperGenerator.py * USE_CPP_PACKAGE=OFF for gpu clang10 werror build

2020-03-17 21:36:50 -07:00

build_ubuntu_cpu_clang10_werror() {

Remove USE_MKL_IF_AVAILABLE flag (#20004) USE_BLAS cmake option can be set to choose a particular BLAS. For example, USE_BLAS=mkl or USE_BLAS=open. If USE_BLAS is not specified, we search if MKL is available and use MKL if it is available. Otherwise OpenBLAS is used.

2021-03-12 20:44:24 +01:00

       -DUSE_BLAS=Open \

CI: Test clang10 cpu & gpu builds with -WError (#17830) * Fix Wunused-variable * Fix Wreturn-std-move * Fix Wunused-const-variable * Fix Winconsistent-missing-override * Fix Wdelete-non-abstract-non-virtual-dtor * Fix Wrange-loop-construct * Disable Wpass-failed=transform-warning warning: loop not unrolled: the optimizer was unable to perform the requested transformation; the transformation might be disabled or specified as part of an unsupported transformation ordering * Fix Wimplicit-int-float-conversion 'float' changes value from 2147483647 to 2147483648 * Fix Wunused-lambda-capture * Fix Wundefined-var-template * cuda: --expt-relaxed-constexpr warning: calling a constexpr __host__ function from a __host__ __device__ function is not allowed. The experimental flag '--expt-relaxed-constexpr' can be used to allow this. * Fix Wrange-loop-construct avoiding extra copies * Fix Wunused-private-field * Fix Wwritable-strings * Enable Clang10 -WError checking on CI * Fix -WError with mkldnn -Wliteral-conversion, -Wabsolute-value, -Wunused-private-field, -Wimplicit-int-float-conversion * Fix shuffle_op.cc * Fix use of old binutils * Print traceback on exception in OpWrapperGenerator.py * USE_CPP_PACKAGE=OFF for gpu clang10 werror build

2020-03-17 21:36:50 -07:00

       -DUSE_CUDA=OFF \

Switch to C++17 and modernize toolchain + CI (#17984) As per #17968, require C++17 compatible compiler. For cuda code, use C++14 mode introduced in Cuda 9. C++17 support for Cuda will be available in Cuda 11. Switching to C++17 requires modernizing the toolchain, which exposed a number of technical debt issues in the codebase. All blocking issues are fixed as part of this PR. See the full list below. This PR contains the following specific changes: Switch CI pipeline to use gcc7 on Ubuntu and CentOS Switch CD pipeline to CentOS 7 with https://www.softwarecollections.org/en/scls/rhscl/devtoolset-7/ This enables us to build with gcc7 C++17 compiler while keeping a relatively old glibc requirement for distribution. Simplify ARM Edge builds Switch to standard Ubuntu / Debian cross-compilation toolchain for ARMv7, ARMv8 Switch to https://toolchains.bootlin.com/ toolchain for ARMv6 (the Debian ARMv6 toolchain is for ARMv4 + ARMv5 + ARMv6, but we wish to only target ARMv6 and make use of ARMv6 features) Remove reliance on dockcross for cross compilation. Simplify Jetson build Use standard Ubuntu / Debian cross-compilation toolchain for ARMv8 Upgrade to Cuda 10 and Jetpack 4.3 Simplify build setup Simplify QEMU ARM virtualization test setup on CI Remove complex "Virtual Machine in Docker" logic and run a QEMU based Docker container instead based on arm32v7/ubuntu Fix out of bounds vector accesses in SoftmaxGradOpType MKLDNNFCBackward Fix use of non-standard rand_r function (which is not available on anymore on newer Android toolchains and shouldn't be use in any case). Fix reproducibility of RNN with Dropout Fix reproducibility of DGL Graph Sampling Operators Update tests for Android Edge build to NDK19. The previously used standalone toolchain is obsolete. 
Those Dockerfiles that required refactoring as part of the effort were refactored based on the following consideration Maximize the use of system dependencies provided by the distribution instead of manually installing dependencies from source or from third party vendors. This reduces the complexity of the installation process and essentially pins the dependency versions, increasing CI stability. Further, Dockerfile build speed is improved. To facilitate this, use recent distribution versions. We still ensure backwards compatibility via CentOS7 based build and test stages Minimize the number of layers in the Dockerfile. Don't have 5 different script files executed, each calling apt-get update and install, but just execute once. Speeds up the build and reduces image size. Keep each Dockerfile simple and tailored to a purpose, instead of running 20 scripts to install dependencies for every thinkable scenario, which is unmaintainable. Some more small changes: Remove outdated references to Cuda 7 and Cuda 8 in various files. Remove C++03 support in mshadow Disable broken tests NumpyBooleanAssignForwardCPU #17990 test_init.test_rsp_const_init #17988 quantized_elemwise_mul #18034 List of squashed commits * cpp standard * Remove leftover files of Cuda 7 and Cuda 8 support * thrust 1.9.8 for clang10 * compiler warnings * Disable broken test_init.test_rsp_const_init * Disable tests invoking NumpyBooleanAssignForwardCPU * Fix out of bounds access in SoftmaxGradOpType * Use CentOS 7 for staticbuilds CentOS 7 fullfills the requirements for PEP 599 manylinux-2014 and provides a C++17 toolchain. 
* Fix MKLDNNFCBackward * Update edge toolchain * Support platforms without rand_r * Cleanup random.h * Greatly simplify qemu setup * Remove unused functions in Jenkins_steps.groovy * Skip quantized_elemwise_mul due QuantizedElemwiseMulOpShape bug * Fix R package installation https://github.com/apache/incubator-mxnet/issues/18042 * Fix centos ccache * Fix GPU Makefile staticbuild on CentOS7 * CentOS7 NCCL * CentOS7 staticbuild fix link with libculibos

2020-04-14 10:29:29 -07:00

    # Workaround https://github.com/thrust/thrust/issues/1072

CI: Test clang10 cpu & gpu builds with -WError (#17830) * Fix Wunused-variable * Fix Wreturn-std-move * Fix Wunused-const-variable * Fix Winconsistent-missing-override * Fix Wdelete-non-abstract-non-virtual-dtor * Fix Wrange-loop-construct * Disable Wpass-failed=transform-warning warning: loop not unrolled: the optimizer was unable to perform the requested transformation; the transformation might be disabled or specified as part of an unsupported transformation ordering * Fix Wimplicit-int-float-conversion 'float' changes value from 2147483647 to 2147483648 * Fix Wunused-lambda-capture * Fix Wundefined-var-template * cuda: --expt-relaxed-constexpr warning: calling a constexpr __host__ function from a __host__ __device__ function is not allowed. The experimental flag '--expt-relaxed-constexpr' can be used to allow this. * Fix Wrange-loop-construct avoiding extra copies * Fix Wunused-private-field * Fix Wwritable-strings * Enable Clang10 -WError checking on CI * Fix -WError with mkldnn -Wliteral-conversion, -Wabsolute-value, -Wunused-private-field, -Wimplicit-int-float-conversion * Fix shuffle_op.cc * Fix use of old binutils * Print traceback on exception in OpWrapperGenerator.py * USE_CPP_PACKAGE=OFF for gpu clang10 werror build

2020-03-17 21:36:50 -07:00

    CXX=clang++-10 CC=clang-10 cmake \

Remove USE_MKL_IF_AVAILABLE flag (#20004) USE_BLAS cmake option can be set to choose a particular BLAS. For example, USE_BLAS=mkl or USE_BLAS=open. If USE_BLAS is not specified, we search if MKL is available and use MKL if it is available. Otherwise OpenBLAS is used.

2021-03-12 20:44:24 +01:00

       -DUSE_BLAS=Open \

CI: Test clang10 cpu & gpu builds with -WError (#17830) * Fix Wunused-variable * Fix Wreturn-std-move * Fix Wunused-const-variable * Fix Winconsistent-missing-override * Fix Wdelete-non-abstract-non-virtual-dtor * Fix Wrange-loop-construct * Disable Wpass-failed=transform-warning warning: loop not unrolled: the optimizer was unable to perform the requested transformation; the transformation might be disabled or specified as part of an unsupported transformation ordering * Fix Wimplicit-int-float-conversion 'float' changes value from 2147483647 to 2147483648 * Fix Wunused-lambda-capture * Fix Wundefined-var-template * cuda: --expt-relaxed-constexpr warning: calling a constexpr __host__ function from a __host__ __device__ function is not allowed. The experimental flag '--expt-relaxed-constexpr' can be used to allow this. * Fix Wrange-loop-construct avoiding extra copies * Fix Wunused-private-field * Fix Wwritable-strings * Enable Clang10 -WError checking on CI * Fix -WError with mkldnn -Wliteral-conversion, -Wabsolute-value, -Wunused-private-field, -Wimplicit-int-float-conversion * Fix shuffle_op.cc * Fix use of old binutils * Print traceback on exception in OpWrapperGenerator.py * USE_CPP_PACKAGE=OFF for gpu clang10 werror build

2020-03-17 21:36:50 -07:00

       -DUSE_CUDA=ON \

Update Ubuntu images used on CI to 20.04 (#19588) * Update Ubuntu images used on CI to 20.04 This helps ensure MXNet to work well on recent Linux distributions (while ensuring it continues to work well on ancient distributions based on the CentOS7 CI pipeline) * Preserve Ubuntu 18.04 images for TensorRT pipeline as NVidia failed to make TensorRT available for Ubuntu 20.04 * Temporarily disable NVML on CI [2020-12-03T18:33:10.380Z] OSError: /work/mxnet/python/mxnet/../../build/libmxnet.so: undefined symbol: nvmlDeviceGetComputeRunningProcesses_v2

2020-12-03 19:58:14 -07:00

       -DUSE_NVML=OFF \

CI: Test clang10 cpu & gpu builds with -WError (#17830) * Fix Wunused-variable * Fix Wreturn-std-move * Fix Wunused-const-variable * Fix Winconsistent-missing-override * Fix Wdelete-non-abstract-non-virtual-dtor * Fix Wrange-loop-construct * Disable Wpass-failed=transform-warning warning: loop not unrolled: the optimizer was unable to perform the requested transformation; the transformation might be disabled or specified as part of an unsupported transformation ordering * Fix Wimplicit-int-float-conversion 'float' changes value from 2147483647 to 2147483648 * Fix Wunused-lambda-capture * Fix Wundefined-var-template * cuda: --expt-relaxed-constexpr warning: calling a constexpr __host__ function from a __host__ __device__ function is not allowed. The experimental flag '--expt-relaxed-constexpr' can be used to allow this. * Fix Wrange-loop-construct avoiding extra copies * Fix Wunused-private-field * Fix Wwritable-strings * Enable Clang10 -WError checking on CI * Fix -WError with mkldnn -Wliteral-conversion, -Wabsolute-value, -Wunused-private-field, -Wimplicit-int-float-conversion * Fix shuffle_op.cc * Fix use of old binutils * Print traceback on exception in OpWrapperGenerator.py * USE_CPP_PACKAGE=OFF for gpu clang10 werror build

2020-03-17 21:36:50 -07:00

       -DMXNET_CUDA_ARCH="$CI_CMAKE_CUDA_ARCH" \

Switch to C++17 and modernize toolchain + CI (#17984) As per #17968, require C++17 compatible compiler. For cuda code, use C++14 mode introduced in Cuda 9. C++17 support for Cuda will be available in Cuda 11. Switching to C++17 requires modernizing the toolchain, which exposed a number of technical debt issues in the codebase. All blocking issues are fixed as part of this PR. See the full list below. This PR contains the following specific changes: Switch CI pipeline to use gcc7 on Ubuntu and CentOS Switch CD pipeline to CentOS 7 with https://www.softwarecollections.org/en/scls/rhscl/devtoolset-7/ This enables us to build with gcc7 C++17 compiler while keeping a relatively old glibc requirement for distribution. Simplify ARM Edge builds Switch to standard Ubuntu / Debian cross-compilation toolchain for ARMv7, ARMv8 Switch to https://toolchains.bootlin.com/ toolchain for ARMv6 (the Debian ARMv6 toolchain is for ARMv4 + ARMv5 + ARMv6, but we wish to only target ARMv6 and make use of ARMv6 features) Remove reliance on dockcross for cross compilation. Simplify Jetson build Use standard Ubuntu / Debian cross-compilation toolchain for ARMv8 Upgrade to Cuda 10 and Jetpack 4.3 Simplify build setup Simplify QEMU ARM virtualization test setup on CI Remove complex "Virtual Machine in Docker" logic and run a QEMU based Docker container instead based on arm32v7/ubuntu Fix out of bounds vector accesses in SoftmaxGradOpType MKLDNNFCBackward Fix use of non-standard rand_r function (which is not available on anymore on newer Android toolchains and shouldn't be use in any case). Fix reproducibility of RNN with Dropout Fix reproducibility of DGL Graph Sampling Operators Update tests for Android Edge build to NDK19. The previously used standalone toolchain is obsolete. 
Those Dockerfiles that required refactoring as part of the effort were refactored based on the following consideration Maximize the use of system dependencies provided by the distribution instead of manually installing dependencies from source or from third party vendors. This reduces the complexity of the installation process and essentially pins the dependency versions, increasing CI stability. Further, Dockerfile build speed is improved. To facilitate this, use recent distribution versions. We still ensure backwards compatibility via CentOS7 based build and test stages Minimize the number of layers in the Dockerfile. Don't have 5 different script files executed, each calling apt-get update and install, but just execute once. Speeds up the build and reduces image size. Keep each Dockerfile simple and tailored to a purpose, instead of running 20 scripts to install dependencies for every thinkable scenario, which is unmaintainable. Some more small changes: Remove outdated references to Cuda 7 and Cuda 8 in various files. Remove C++03 support in mshadow Disable broken tests NumpyBooleanAssignForwardCPU #17990 test_init.test_rsp_const_init #17988 quantized_elemwise_mul #18034 List of squashed commits * cpp standard * Remove leftover files of Cuda 7 and Cuda 8 support * thrust 1.9.8 for clang10 * compiler warnings * Disable broken test_init.test_rsp_const_init * Disable tests invoking NumpyBooleanAssignForwardCPU * Fix out of bounds access in SoftmaxGradOpType * Use CentOS 7 for staticbuilds CentOS 7 fullfills the requirements for PEP 599 manylinux-2014 and provides a C++17 toolchain. 
* Fix MKLDNNFCBackward * Update edge toolchain * Support platforms without rand_r * Cleanup random.h * Greatly simplify qemu setup * Remove unused functions in Jenkins_steps.groovy * Skip quantized_elemwise_mul due QuantizedElemwiseMulOpShape bug * Fix R package installation https://github.com/apache/incubator-mxnet/issues/18042 * Fix centos ccache * Fix GPU Makefile staticbuild on CentOS7 * CentOS7 NCCL * CentOS7 staticbuild fix link with libculibos

2020-04-14 10:29:29 -07:00

build_ubuntu_cpu_clang6() {

CI docker revamp; Add Jetson, Raspberry and CentOS 7 build [MXNET-42][MXNET-43] (#9995) * Start dockerfile revamp * Fix lint * Fix lint * Fix lint * Add python unit test * Add new dependency * Add user creation * Fix file permission * Determine USER_ID automatically * REmove ENV command in dockerfile * Remove python nose timer, improve useradd * ENable nvidia docker * Add remaining unittests * Add CentOS 7 unittests * Add integration tests * Add TVM and LLVM to dockerfile * Add ARMv7 and ARMv8 to build * Fix amalgamation build * Improvements and android_arm64 fixes, missing removing pthread * Jetson fix (install unzip) * Build jetson with make until issue with libomp.so is resolved #10011 * Fix Amalgamation builds * Fix Jetson build by switching to cmake * Assign CentOS gpu test to gpu instance * Fix R builds * Assign jobs to right docker containers * Fix missing file permissions inside docker * Create homedir on centos * Enable lapack * Fix Lapack on Cent OS 7 * Disable MXNET_MKLDNN_DEBUG * Delete Dockerfiles * Last general refinements before finish * Remove docker_multiarch folder, superseeded by the new ci scripts * Address review comments * Fix Caffe Integrationstest * Fix deploy stage * Address review comments * Address review comments * Enable script to run on Mac

2018-03-09 23:33:10 +01:00

    set -ex

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

    cd /work/build

Support static link to openblas & autodetect LP64 vs ILP64 settings (#19174) This PR forces static link to libopenblas.a on all CI pipelines to avoid the name clashing issue (except for windows where visual studio does not support static link to openblas according to this https://github.com/xianyi/OpenBLAS/wiki/How-to-use-OpenBLAS-in-Microsoft-Visual-Studio see bottom). On ubuntu and centos7 we changed from apt get/ yum openblas and lapack to building openblas(blas+lapack) from source. Also we add support LAPACKE api for non-mkl builds. The background is: In mxnet we wrap lapack functions in c_lapack_api.h so that we hide the differences in the underlying blas/lapack libraries such as mkl, openblas, atlas, and accelerate. For mkl we use/wrap the LAPACKE (https://www.netlib.org/lapack/lapacke.html) c interfaces which are the cleanest. For the rest of the libraries we wrap the old CLAPACK interfaces. Openblas is by default built with lapack support and the binary will contain a full set of LAPACKE functions. Thus this poc adds a build option with which we can use the LAPACKE apis even for openblas. This change will make ilp64 blas/lapack support easier as now we have the same wrapping logic or both mkl and openblas with USE_LAPACKE_INTERFACE = ON. Support for ilp64 mkl #19067 will automatically mean support for ilp64 openblas. Known issue: linux distros ship openblas binaries WITHOUT lapack so this option can only work if we build openblas from source. This is not a problem for our mxnet binary distributions as we always use our own openblas build there. However, For users building mxnet from source they must be advised against using this option unless they also build openblas from source

2020-12-04 11:00:18 -08:00

    export OpenBLAS_HOME=/usr/local/openblas-clang/

Switch to C++17 and modernize toolchain + CI (#17984) As per #17968, require C++17 compatible compiler. For cuda code, use C++14 mode introduced in Cuda 9. C++17 support for Cuda will be available in Cuda 11. Switching to C++17 requires modernizing the toolchain, which exposed a number of technical debt issues in the codebase. All blocking issues are fixed as part of this PR. See the full list below. This PR contains the following specific changes: Switch CI pipeline to use gcc7 on Ubuntu and CentOS Switch CD pipeline to CentOS 7 with https://www.softwarecollections.org/en/scls/rhscl/devtoolset-7/ This enables us to build with gcc7 C++17 compiler while keeping a relatively old glibc requirement for distribution. Simplify ARM Edge builds Switch to standard Ubuntu / Debian cross-compilation toolchain for ARMv7, ARMv8 Switch to https://toolchains.bootlin.com/ toolchain for ARMv6 (the Debian ARMv6 toolchain is for ARMv4 + ARMv5 + ARMv6, but we wish to only target ARMv6 and make use of ARMv6 features) Remove reliance on dockcross for cross compilation. Simplify Jetson build Use standard Ubuntu / Debian cross-compilation toolchain for ARMv8 Upgrade to Cuda 10 and Jetpack 4.3 Simplify build setup Simplify QEMU ARM virtualization test setup on CI Remove complex "Virtual Machine in Docker" logic and run a QEMU based Docker container instead based on arm32v7/ubuntu Fix out of bounds vector accesses in SoftmaxGradOpType MKLDNNFCBackward Fix use of non-standard rand_r function (which is not available on anymore on newer Android toolchains and shouldn't be use in any case). Fix reproducibility of RNN with Dropout Fix reproducibility of DGL Graph Sampling Operators Update tests for Android Edge build to NDK19. The previously used standalone toolchain is obsolete. 
Those Dockerfiles that required refactoring as part of the effort were refactored based on the following consideration Maximize the use of system dependencies provided by the distribution instead of manually installing dependencies from source or from third party vendors. This reduces the complexity of the installation process and essentially pins the dependency versions, increasing CI stability. Further, Dockerfile build speed is improved. To facilitate this, use recent distribution versions. We still ensure backwards compatibility via CentOS7 based build and test stages Minimize the number of layers in the Dockerfile. Don't have 5 different script files executed, each calling apt-get update and install, but just execute once. Speeds up the build and reduces image size. Keep each Dockerfile simple and tailored to a purpose, instead of running 20 scripts to install dependencies for every thinkable scenario, which is unmaintainable. Some more small changes: Remove outdated references to Cuda 7 and Cuda 8 in various files. Remove C++03 support in mshadow Disable broken tests NumpyBooleanAssignForwardCPU #17990 test_init.test_rsp_const_init #17988 quantized_elemwise_mul #18034 List of squashed commits * cpp standard * Remove leftover files of Cuda 7 and Cuda 8 support * thrust 1.9.8 for clang10 * compiler warnings * Disable broken test_init.test_rsp_const_init * Disable tests invoking NumpyBooleanAssignForwardCPU * Fix out of bounds access in SoftmaxGradOpType * Use CentOS 7 for staticbuilds CentOS 7 fullfills the requirements for PEP 599 manylinux-2014 and provides a C++17 toolchain. 
* Fix MKLDNNFCBackward * Update edge toolchain * Support platforms without rand_r * Cleanup random.h * Greatly simplify qemu setup * Remove unused functions in Jenkins_steps.groovy * Skip quantized_elemwise_mul due QuantizedElemwiseMulOpShape bug * Fix R package installation https://github.com/apache/incubator-mxnet/issues/18042 * Fix centos ccache * Fix GPU Makefile staticbuild on CentOS7 * CentOS7 NCCL * CentOS7 staticbuild fix link with libculibos

2020-04-14 10:29:29 -07:00

    CXX=clang++-6.0 CC=clang-6.0 cmake \

Remove USE_MKL_IF_AVAILABLE flag (#20004) USE_BLAS cmake option can be set to choose a particular BLAS. For example, USE_BLAS=mkl or USE_BLAS=open. If USE_BLAS is not specified, we search if MKL is available and use MKL if it is available. Otherwise OpenBLAS is used.

2021-03-12 20:44:24 +01:00

        -DUSE_BLAS=Open \

Change inner mxnet flags nomenclature for oneDNN library (#19944) This change includes: * changing MXNET_USE_MKLDNN flag name to MXNET_USE_ONEDNN * changing USE_MKLDNN flag name to USE_ONEDNN * changing 3rdparty/mkldnn folder name to 3rdparty/onednn * changing include/mkldnn folder name to include/onednn * changing MKLDNN occurences in build and documentation files to ONEDNN * adding Bartosz Kuncer to contributors list

2021-03-15 17:32:37 +01:00

        -DUSE_ONEDNN=OFF \

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

        -DUSE_CUDA=OFF \

CI docker revamp; Add Jetson, Raspberry and CentOS 7 build [MXNET-42][MXNET-43] (#9995) * Start dockerfile revamp * Fix lint * Fix lint * Fix lint * Add python unit test * Add new dependency * Add user creation * Fix file permission * Determine USER_ID automatically * REmove ENV command in dockerfile * Remove python nose timer, improve useradd * ENable nvidia docker * Add remaining unittests * Add CentOS 7 unittests * Add integration tests * Add TVM and LLVM to dockerfile * Add ARMv7 and ARMv8 to build * Fix amalgamation build * Improvements and android_arm64 fixes, missing removing pthread * Jetson fix (install unzip) * Build jetson with make until issue with libomp.so is resolved #10011 * Fix Amalgamation builds * Fix Jetson build by switching to cmake * Assign CentOS gpu test to gpu instance * Fix R builds * Assign jobs to right docker containers * Fix missing file permissions inside docker * Create homedir on centos * Enable lapack * Fix Lapack on Cent OS 7 * Disable MXNET_MKLDNN_DEBUG * Delete Dockerfiles * Last general refinements before finish * Remove docker_multiarch folder, superseeded by the new ci scripts * Address review comments * Fix Caffe Integrationstest * Fix deploy stage * Address review comments * Address review comments * Enable script to run on Mac

2018-03-09 23:33:10 +01:00
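The BLAS selection rule described in the commit message for #20004 (an explicit `USE_BLAS` value wins; otherwise MKL is used if it is found, else OpenBLAS) can be restated as a small shell sketch. This is not MXNet's CMake logic: `choose_blas` and its `mkl_available` argument are hypothetical names used only to illustrate the precedence.

```shell
#!/usr/bin/env bash
# Illustrative only: mirrors the rule from the commit message, not the real CMake code.
# choose_blas REQUESTED MKL_AVAILABLE
#   REQUESTED      value passed as USE_BLAS (may be empty)
#   MKL_AVAILABLE  "yes" if an MKL installation was detected (hypothetical probe result)
choose_blas() {
    local requested="$1"
    local mkl_available="$2"
    if [ -n "$requested" ]; then
        # An explicit USE_BLAS value always takes precedence.
        echo "$requested"
    elif [ "$mkl_available" = "yes" ]; then
        # No explicit choice: prefer MKL when it is present.
        echo "mkl"
    else
        # Otherwise fall back to OpenBLAS.
        echo "open"
    fi
}

choose_blas "Open" "yes"   # explicit choice, as in the CI functions here
choose_blas "" "yes"       # unset, MKL detected
choose_blas "" "no"        # unset, no MKL
```

The CI functions in this file always pass `-DUSE_BLAS=Open`, so they take the first branch and never depend on whether MKL happens to be installed in the image.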

CI: Test clang10 cpu & gpu builds with -WError (#17830) * Fix Wunused-variable * Fix Wreturn-std-move * Fix Wunused-const-variable * Fix Winconsistent-missing-override * Fix Wdelete-non-abstract-non-virtual-dtor * Fix Wrange-loop-construct * Disable Wpass-failed=transform-warning warning: loop not unrolled: the optimizer was unable to perform the requested transformation; the transformation might be disabled or specified as part of an unsupported transformation ordering * Fix Wimplicit-int-float-conversion 'float' changes value from 2147483647 to 2147483648 * Fix Wunused-lambda-capture * Fix Wundefined-var-template * cuda: --expt-relaxed-constexpr warning: calling a constexpr __host__ function from a __host__ __device__ function is not allowed. The experimental flag '--expt-relaxed-constexpr' can be used to allow this. * Fix Wrange-loop-construct avoiding extra copies * Fix Wunused-private-field * Fix Wwritable-strings * Enable Clang10 -WError checking on CI * Fix -WError with mkldnn -Wliteral-conversion, -Wabsolute-value, -Wunused-private-field, -Wimplicit-int-float-conversion * Fix shuffle_op.cc * Fix use of old binutils * Print traceback on exception in OpWrapperGenerator.py * USE_CPP_PACKAGE=OFF for gpu clang10 werror build

2020-03-17 21:36:50 -07:00

build_ubuntu_cpu_clang100() {

CI docker revamp; Add Jetson, Raspberry and CentOS 7 build [MXNET-42][MXNET-43] (#9995) * Start dockerfile revamp * Fix lint * Fix lint * Fix lint * Add python unit test * Add new dependency * Add user creation * Fix file permission * Determine USER_ID automatically * REmove ENV command in dockerfile * Remove python nose timer, improve useradd * ENable nvidia docker * Add remaining unittests * Add CentOS 7 unittests * Add integration tests * Add TVM and LLVM to dockerfile * Add ARMv7 and ARMv8 to build * Fix amalgamation build * Improvements and android_arm64 fixes, missing removing pthread * Jetson fix (install unzip) * Build jetson with make until issue with libomp.so is resolved #10011 * Fix Amalgamation builds * Fix Jetson build by switching to cmake * Assign CentOS gpu test to gpu instance * Fix R builds * Assign jobs to right docker containers * Fix missing file permissions inside docker * Create homedir on centos * Enable lapack * Fix Lapack on Cent OS 7 * Disable MXNET_MKLDNN_DEBUG * Delete Dockerfiles * Last general refinements before finish * Remove docker_multiarch folder, superseeded by the new ci scripts * Address review comments * Fix Caffe Integrationstest * Fix deploy stage * Address review comments * Address review comments * Enable script to run on Mac

2018-03-09 23:33:10 +01:00

    set -ex

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

    cd /work/build

Support static link to openblas & autodetect LP64 vs ILP64 settings (#19174) This PR forces static link to libopenblas.a on all CI pipelines to avoid the name clashing issue (except for windows where visual studio does not support static link to openblas according to this https://github.com/xianyi/OpenBLAS/wiki/How-to-use-OpenBLAS-in-Microsoft-Visual-Studio see bottom). On ubuntu and centos7 we changed from apt get/ yum openblas and lapack to building openblas(blas+lapack) from source. Also we add support LAPACKE api for non-mkl builds. The background is: In mxnet we wrap lapack functions in c_lapack_api.h so that we hide the differences in the underlying blas/lapack libraries such as mkl, openblas, atlas, and accelerate. For mkl we use/wrap the LAPACKE (https://www.netlib.org/lapack/lapacke.html) c interfaces which are the cleanest. For the rest of the libraries we wrap the old CLAPACK interfaces. Openblas is by default built with lapack support and the binary will contain a full set of LAPACKE functions. Thus this poc adds a build option with which we can use the LAPACKE apis even for openblas. This change will make ilp64 blas/lapack support easier as now we have the same wrapping logic or both mkl and openblas with USE_LAPACKE_INTERFACE = ON. Support for ilp64 mkl #19067 will automatically mean support for ilp64 openblas. Known issue: linux distros ship openblas binaries WITHOUT lapack so this option can only work if we build openblas from source. This is not a problem for our mxnet binary distributions as we always use our own openblas build there. However, For users building mxnet from source they must be advised against using this option unless they also build openblas from source

2020-12-04 11:00:18 -08:00

    export OpenBLAS_HOME=/usr/local/openblas-clang/

CI: Test clang10 cpu & gpu builds with -WError (#17830) * Fix Wunused-variable * Fix Wreturn-std-move * Fix Wunused-const-variable * Fix Winconsistent-missing-override * Fix Wdelete-non-abstract-non-virtual-dtor * Fix Wrange-loop-construct * Disable Wpass-failed=transform-warning warning: loop not unrolled: the optimizer was unable to perform the requested transformation; the transformation might be disabled or specified as part of an unsupported transformation ordering * Fix Wimplicit-int-float-conversion 'float' changes value from 2147483647 to 2147483648 * Fix Wunused-lambda-capture * Fix Wundefined-var-template * cuda: --expt-relaxed-constexpr warning: calling a constexpr __host__ function from a __host__ __device__ function is not allowed. The experimental flag '--expt-relaxed-constexpr' can be used to allow this. * Fix Wrange-loop-construct avoiding extra copies * Fix Wunused-private-field * Fix Wwritable-strings * Enable Clang10 -WError checking on CI * Fix -WError with mkldnn -Wliteral-conversion, -Wabsolute-value, -Wunused-private-field, -Wimplicit-int-float-conversion * Fix shuffle_op.cc * Fix use of old binutils * Print traceback on exception in OpWrapperGenerator.py * USE_CPP_PACKAGE=OFF for gpu clang10 werror build

2020-03-17 21:36:50 -07:00

    CXX=clang++-10 CC=clang-10 cmake \

Remove USE_MKL_IF_AVAILABLE flag (#20004) USE_BLAS cmake option can be set to choose a particular BLAS. For example, USE_BLAS=mkl or USE_BLAS=open. If USE_BLAS is not specified, we search if MKL is available and use MKL if it is available. Otherwise OpenBLAS is used.

2021-03-12 20:44:24 +01:00

       -DUSE_BLAS=Open \

Change inner mxnet flags nomenclature for oneDNN library (#19944) This change includes: * changing MXNET_USE_MKLDNN flag name to MXNET_USE_ONEDNN * changing USE_MKLDNN flag name to USE_ONEDNN * changing 3rdparty/mkldnn folder name to 3rdparty/onednn * changing include/mkldnn folder name to include/onednn * changing MKLDNN occurences in build and documentation files to ONEDNN * adding Bartosz Kuncer to contributors list

2021-03-15 17:32:37 +01:00

       -DUSE_ONEDNN=OFF \

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

       -DUSE_CUDA=OFF \

CI docker revamp; Add Jetson, Raspberry and CentOS 7 build [MXNET-42][MXNET-43] (#9995) * Start dockerfile revamp * Fix lint * Fix lint * Fix lint * Add python unit test * Add new dependency * Add user creation * Fix file permission * Determine USER_ID automatically * REmove ENV command in dockerfile * Remove python nose timer, improve useradd * ENable nvidia docker * Add remaining unittests * Add CentOS 7 unittests * Add integration tests * Add TVM and LLVM to dockerfile * Add ARMv7 and ARMv8 to build * Fix amalgamation build * Improvements and android_arm64 fixes, missing removing pthread * Jetson fix (install unzip) * Build jetson with make until issue with libomp.so is resolved #10011 * Fix Amalgamation builds * Fix Jetson build by switching to cmake * Assign CentOS gpu test to gpu instance * Fix R builds * Assign jobs to right docker containers * Fix missing file permissions inside docker * Create homedir on centos * Enable lapack * Fix Lapack on Cent OS 7 * Disable MXNET_MKLDNN_DEBUG * Delete Dockerfiles * Last general refinements before finish * Remove docker_multiarch folder, superseeded by the new ci scripts * Address review comments * Fix Caffe Integrationstest * Fix deploy stage * Address review comments * Address review comments * Enable script to run on Mac

2018-03-09 23:33:10 +01:00

[MXNET-859] Add a clang-tidy stage to CI (#12282)

2018-08-27 13:18:13 +02:00

build_ubuntu_cpu_clang_tidy() {

Support static link to openblas & autodetect LP64 vs ILP64 settings (#19174) This PR forces static link to libopenblas.a on all CI pipelines to avoid the name clashing issue (except for windows where visual studio does not support static link to openblas according to this https://github.com/xianyi/OpenBLAS/wiki/How-to-use-OpenBLAS-in-Microsoft-Visual-Studio see bottom). On ubuntu and centos7 we changed from apt get/ yum openblas and lapack to building openblas(blas+lapack) from source. Also we add support LAPACKE api for non-mkl builds. The background is: In mxnet we wrap lapack functions in c_lapack_api.h so that we hide the differences in the underlying blas/lapack libraries such as mkl, openblas, atlas, and accelerate. For mkl we use/wrap the LAPACKE (https://www.netlib.org/lapack/lapacke.html) c interfaces which are the cleanest. For the rest of the libraries we wrap the old CLAPACK interfaces. Openblas is by default built with lapack support and the binary will contain a full set of LAPACKE functions. Thus this poc adds a build option with which we can use the LAPACKE apis even for openblas. This change will make ilp64 blas/lapack support easier as now we have the same wrapping logic or both mkl and openblas with USE_LAPACKE_INTERFACE = ON. Support for ilp64 mkl #19067 will automatically mean support for ilp64 openblas. Known issue: linux distros ship openblas binaries WITHOUT lapack so this option can only work if we build openblas from source. This is not a problem for our mxnet binary distributions as we always use our own openblas build there. However, For users building mxnet from source they must be advised against using this option unless they also build openblas from source

2020-12-04 11:00:18 -08:00

    export OpenBLAS_HOME=/usr/local/openblas-clang/

CI: Test clang10 cpu & gpu builds with -WError (#17830) * Fix Wunused-variable * Fix Wreturn-std-move * Fix Wunused-const-variable * Fix Winconsistent-missing-override * Fix Wdelete-non-abstract-non-virtual-dtor * Fix Wrange-loop-construct * Disable Wpass-failed=transform-warning warning: loop not unrolled: the optimizer was unable to perform the requested transformation; the transformation might be disabled or specified as part of an unsupported transformation ordering * Fix Wimplicit-int-float-conversion 'float' changes value from 2147483647 to 2147483648 * Fix Wunused-lambda-capture * Fix Wundefined-var-template * cuda: --expt-relaxed-constexpr warning: calling a constexpr __host__ function from a __host__ __device__ function is not allowed. The experimental flag '--expt-relaxed-constexpr' can be used to allow this. * Fix Wrange-loop-construct avoiding extra copies * Fix Wunused-private-field * Fix Wwritable-strings * Enable Clang10 -WError checking on CI * Fix -WError with mkldnn -Wliteral-conversion, -Wabsolute-value, -Wunused-private-field, -Wimplicit-int-float-conversion * Fix shuffle_op.cc * Fix use of old binutils * Print traceback on exception in OpWrapperGenerator.py * USE_CPP_PACKAGE=OFF for gpu clang10 werror build

2020-03-17 21:36:50 -07:00

    # TODO(leezu) USE_OPENMP=OFF 3rdparty/dmlc-core/CMakeLists.txt:79 broken?

Update clang-tidy integration (#18815) Run clang-tidy via cmake only on the code managed by mxnet (and not 3rdparty dependencies), update to clang-tidy-10 and run clang-tidy-10 -fix to fix all the warnings that are enforced on CI. Developers can run clang-tidy by specifying the -DCMAKE_CXX_CLANG_TIDY="clang-tidy-10" to cmake, or using the python ci/build.py -R --platform ubuntu_cpu /work/runtime_functions.sh build_ubuntu_cpu_clang_tidy script.

2020-07-29 20:31:19 +00:00

    CXX=clang++-10 CC=clang-10 cmake \

Remove USE_MKL_IF_AVAILABLE flag (#20004) USE_BLAS cmake option can be set to choose a particular BLAS. For example, USE_BLAS=mkl or USE_BLAS=open. If USE_BLAS is not specified, we search if MKL is available and use MKL if it is available. Otherwise OpenBLAS is used.

2021-03-12 20:44:24 +01:00

       -DUSE_BLAS=Open \

Change inner mxnet flags nomenclature for oneDNN library (#19944) This change includes: * changing MXNET_USE_MKLDNN flag name to MXNET_USE_ONEDNN * changing USE_MKLDNN flag name to USE_ONEDNN * changing 3rdparty/mkldnn folder name to 3rdparty/onednn * changing include/mkldnn folder name to include/onednn * changing MKLDNN occurences in build and documentation files to ONEDNN * adding Bartosz Kuncer to contributors list

2021-03-15 17:32:37 +01:00

       -DUSE_ONEDNN=OFF \

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

       -DUSE_CUDA=OFF \

CI: Test clang10 cpu & gpu builds with -WError (#17830) * Fix Wunused-variable * Fix Wreturn-std-move * Fix Wunused-const-variable * Fix Winconsistent-missing-override * Fix Wdelete-non-abstract-non-virtual-dtor * Fix Wrange-loop-construct * Disable Wpass-failed=transform-warning warning: loop not unrolled: the optimizer was unable to perform the requested transformation; the transformation might be disabled or specified as part of an unsupported transformation ordering * Fix Wimplicit-int-float-conversion 'float' changes value from 2147483647 to 2147483648 * Fix Wunused-lambda-capture * Fix Wundefined-var-template * cuda: --expt-relaxed-constexpr warning: calling a constexpr __host__ function from a __host__ __device__ function is not allowed. The experimental flag '--expt-relaxed-constexpr' can be used to allow this. * Fix Wrange-loop-construct avoiding extra copies * Fix Wunused-private-field * Fix Wwritable-strings * Enable Clang10 -WError checking on CI * Fix -WError with mkldnn -Wliteral-conversion, -Wabsolute-value, -Wunused-private-field, -Wimplicit-int-float-conversion * Fix shuffle_op.cc * Fix use of old binutils * Print traceback on exception in OpWrapperGenerator.py * USE_CPP_PACKAGE=OFF for gpu clang10 werror build

2020-03-17 21:36:50 -07:00

       -DUSE_OPENMP=OFF \

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

       -DCMAKE_BUILD_TYPE=Debug \

Update clang-tidy integration (#18815) Run clang-tidy via cmake only on the code managed by mxnet (and not 3rdparty dependencies), update to clang-tidy-10 and run clang-tidy-10 -fix to fix all the warnings that are enforced on CI. Developers can run clang-tidy by specifying the -DCMAKE_CXX_CLANG_TIDY="clang-tidy-10" to cmake, or using the python ci/build.py -R --platform ubuntu_cpu /work/runtime_functions.sh build_ubuntu_cpu_clang_tidy script.

2020-07-29 20:31:19 +00:00

       -DCMAKE_CXX_CLANG_TIDY=clang-tidy-10 \

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

       -G Ninja /work/mxnet

Reduce load on CI due to excessive log flood (#17629)

2020-02-19 21:22:57 -08:00

    ninja

[MXNET-859] Add a clang-tidy stage to CI (#12282)

2018-08-27 13:18:13 +02:00
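As a rough sketch of how the clang-tidy configure step above could be reproduced outside CI, the helper below only prints the configure command so it can be inspected before anything is run. `clang_tidy_configure_cmd` is a hypothetical name; the flags are copied from `build_ubuntu_cpu_clang_tidy`, and the default source path `/work/mxnet` is the one used inside the CI container, so it will likely differ on a developer machine.

```shell
#!/usr/bin/env bash
# Sketch: print (do not run) a cmake configure line that enables clang-tidy.
# clang_tidy_configure_cmd is a hypothetical helper, not part of runtime_functions.sh.
clang_tidy_configure_cmd() {
    local src_dir="${1:-/work/mxnet}"      # source tree; CI default, adjust locally
    local tidy_bin="${2:-clang-tidy-10}"   # clang-tidy executable handed to cmake
    echo "CXX=clang++-10 CC=clang-10 cmake" \
        "-DUSE_BLAS=Open" \
        "-DUSE_ONEDNN=OFF" \
        "-DUSE_CUDA=OFF" \
        "-DUSE_OPENMP=OFF" \
        "-DCMAKE_BUILD_TYPE=Debug" \
        "-DCMAKE_CXX_CLANG_TIDY=${tidy_bin}" \
        "-G Ninja ${src_dir}"
}

# Show the command; to execute it from a build directory:
#   eval "$(clang_tidy_configure_cmd "$HOME/mxnet")" && ninja
clang_tidy_configure_cmd
```

Printing instead of executing keeps the sketch safe to try on a machine without clang-10 or Ninja installed; the commit message for #18815 also mentions running the same stage through `ci/build.py`.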

Change *_mkldnn* test and build scenarios names to *_onednn* (#20034)

2021-03-24 15:15:32 +01:00

build_ubuntu_cpu_clang6_onednn() {

[MXNET-74]Update mkldnn to the newest & Add clang build test with mkldnn. (#9918) * add clang test with mkldnn * update mkldnn * Update mkldnn to the newest & Add clang build test with mkldnn. #9918 * update all dockerfiles in order to use the correct mkl version * fix bugs in mkldnn_base * update mklml link and modify dockerfiles * delete blank line * no changes, just retrigger. * no changes, just retriggering * put mklml in a single run file * debug mklml run file * debug mklml script * add mkldnn clang test * give each build task its own unique workspace * change lib name

2018-03-14 22:38:45 +08:00

    set -ex

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

    cd /work/build

Support static link to openblas & autodetect LP64 vs ILP64 settings (#19174) This PR forces static link to libopenblas.a on all CI pipelines to avoid the name clashing issue (except for windows where visual studio does not support static link to openblas according to this https://github.com/xianyi/OpenBLAS/wiki/How-to-use-OpenBLAS-in-Microsoft-Visual-Studio see bottom). On ubuntu and centos7 we changed from apt get/ yum openblas and lapack to building openblas(blas+lapack) from source. Also we add support LAPACKE api for non-mkl builds. The background is: In mxnet we wrap lapack functions in c_lapack_api.h so that we hide the differences in the underlying blas/lapack libraries such as mkl, openblas, atlas, and accelerate. For mkl we use/wrap the LAPACKE (https://www.netlib.org/lapack/lapacke.html) c interfaces which are the cleanest. For the rest of the libraries we wrap the old CLAPACK interfaces. Openblas is by default built with lapack support and the binary will contain a full set of LAPACKE functions. Thus this poc adds a build option with which we can use the LAPACKE apis even for openblas. This change will make ilp64 blas/lapack support easier as now we have the same wrapping logic or both mkl and openblas with USE_LAPACKE_INTERFACE = ON. Support for ilp64 mkl #19067 will automatically mean support for ilp64 openblas. Known issue: linux distros ship openblas binaries WITHOUT lapack so this option can only work if we build openblas from source. This is not a problem for our mxnet binary distributions as we always use our own openblas build there. However, For users building mxnet from source they must be advised against using this option unless they also build openblas from source

2020-12-04 11:00:18 -08:00

    export OpenBLAS_HOME=/usr/local/openblas-clang/

Switch to C++17 and modernize toolchain + CI (#17984) As per #17968, require C++17 compatible compiler. For cuda code, use C++14 mode introduced in Cuda 9. C++17 support for Cuda will be available in Cuda 11. Switching to C++17 requires modernizing the toolchain, which exposed a number of technical debt issues in the codebase. All blocking issues are fixed as part of this PR. See the full list below. This PR contains the following specific changes: Switch CI pipeline to use gcc7 on Ubuntu and CentOS Switch CD pipeline to CentOS 7 with https://www.softwarecollections.org/en/scls/rhscl/devtoolset-7/ This enables us to build with gcc7 C++17 compiler while keeping a relatively old glibc requirement for distribution. Simplify ARM Edge builds Switch to standard Ubuntu / Debian cross-compilation toolchain for ARMv7, ARMv8 Switch to https://toolchains.bootlin.com/ toolchain for ARMv6 (the Debian ARMv6 toolchain is for ARMv4 + ARMv5 + ARMv6, but we wish to only target ARMv6 and make use of ARMv6 features) Remove reliance on dockcross for cross compilation. Simplify Jetson build Use standard Ubuntu / Debian cross-compilation toolchain for ARMv8 Upgrade to Cuda 10 and Jetpack 4.3 Simplify build setup Simplify QEMU ARM virtualization test setup on CI Remove complex "Virtual Machine in Docker" logic and run a QEMU based Docker container instead based on arm32v7/ubuntu Fix out of bounds vector accesses in SoftmaxGradOpType MKLDNNFCBackward Fix use of non-standard rand_r function (which is not available on anymore on newer Android toolchains and shouldn't be use in any case). Fix reproducibility of RNN with Dropout Fix reproducibility of DGL Graph Sampling Operators Update tests for Android Edge build to NDK19. The previously used standalone toolchain is obsolete. 
Those Dockerfiles that required refactoring as part of the effort were refactored based on the following consideration Maximize the use of system dependencies provided by the distribution instead of manually installing dependencies from source or from third party vendors. This reduces the complexity of the installation process and essentially pins the dependency versions, increasing CI stability. Further, Dockerfile build speed is improved. To facilitate this, use recent distribution versions. We still ensure backwards compatibility via CentOS7 based build and test stages Minimize the number of layers in the Dockerfile. Don't have 5 different script files executed, each calling apt-get update and install, but just execute once. Speeds up the build and reduces image size. Keep each Dockerfile simple and tailored to a purpose, instead of running 20 scripts to install dependencies for every thinkable scenario, which is unmaintainable. Some more small changes: Remove outdated references to Cuda 7 and Cuda 8 in various files. Remove C++03 support in mshadow Disable broken tests NumpyBooleanAssignForwardCPU #17990 test_init.test_rsp_const_init #17988 quantized_elemwise_mul #18034 List of squashed commits * cpp standard * Remove leftover files of Cuda 7 and Cuda 8 support * thrust 1.9.8 for clang10 * compiler warnings * Disable broken test_init.test_rsp_const_init * Disable tests invoking NumpyBooleanAssignForwardCPU * Fix out of bounds access in SoftmaxGradOpType * Use CentOS 7 for staticbuilds CentOS 7 fullfills the requirements for PEP 599 manylinux-2014 and provides a C++17 toolchain. 
* Fix MKLDNNFCBackward * Update edge toolchain * Support platforms without rand_r * Cleanup random.h * Greatly simplify qemu setup * Remove unused functions in Jenkins_steps.groovy * Skip quantized_elemwise_mul due QuantizedElemwiseMulOpShape bug * Fix R package installation https://github.com/apache/incubator-mxnet/issues/18042 * Fix centos ccache * Fix GPU Makefile staticbuild on CentOS7 * CentOS7 NCCL * CentOS7 staticbuild fix link with libculibos

2020-04-14 10:29:29 -07:00

    CXX=clang++-6.0 CC=clang-6.0 cmake \

Remove USE_MKL_IF_AVAILABLE flag (#20004) USE_BLAS cmake option can be set to choose a particular BLAS. For example, USE_BLAS=mkl or USE_BLAS=open. If USE_BLAS is not specified, we search if MKL is available and use MKL if it is available. Otherwise OpenBLAS is used.

2021-03-12 20:44:24 +01:00

       -DUSE_BLAS=Open \

Change inner mxnet flags nomenclature for oneDNN library (#19944) This change includes: * changing MXNET_USE_MKLDNN flag name to MXNET_USE_ONEDNN * changing USE_MKLDNN flag name to USE_ONEDNN * changing 3rdparty/mkldnn folder name to 3rdparty/onednn * changing include/mkldnn folder name to include/onednn * changing MKLDNN occurrences in build and documentation files to ONEDNN * adding Bartosz Kuncer to contributors list

2021-03-15 17:32:37 +01:00

       -DUSE_ONEDNN=ON \

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

       -DUSE_CUDA=OFF \

[MXNET-74]Update mkldnn to the newest & Add clang build test with mkldnn. (#9918) * add clang test with mkldnn * update mkldnn * Update mkldnn to the newest & Add clang build test with mkldnn. #9918 * update all dockerfiles in order to use the correct mkl version * fix bugs in mkldnn_base * update mklml link and modify dockerfiles * delete blank line * no changes, just retrigger. * no changes, just retriggering * put mklml in a single run file * debug mklml run file * debug mklml script * add mkldnn clang test * give each build task its own unique workspace * change lib name

2018-03-14 22:38:45 +08:00

Change *_mkldnn* test and build scenarios names to *_onednn* (#20034)

2021-03-24 15:15:32 +01:00

build_ubuntu_cpu_clang100_onednn() {

[MXNET-74]Update mkldnn to the newest & Add clang build test with mkldnn. (#9918) * add clang test with mkldnn * update mkldnn * Update mkldnn to the newest & Add clang build test with mkldnn. #9918 * update all dockerfiles in order to use the correct mkl version * fix bugs in mkldnn_base * update mklml link and modify dockerfiles * delete blank line * no changes, just retrigger. * no changes, just retriggering * put mklml in a single run file * debug mklml run file * debug mklml script * add mkldnn clang test * give each build task its own unique workspace * change lib name

2018-03-14 22:38:45 +08:00

    set -ex

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

    cd /work/build

Support static link to openblas & autodetect LP64 vs ILP64 settings (#19174) This PR forces static link to libopenblas.a on all CI pipelines to avoid the name clashing issue (except for windows where visual studio does not support static link to openblas according to this https://github.com/xianyi/OpenBLAS/wiki/How-to-use-OpenBLAS-in-Microsoft-Visual-Studio see bottom). On ubuntu and centos7 we changed from apt-get/yum openblas and lapack to building openblas (blas+lapack) from source. Also we add support for the LAPACKE api for non-mkl builds. The background is: In mxnet we wrap lapack functions in c_lapack_api.h so that we hide the differences in the underlying blas/lapack libraries such as mkl, openblas, atlas, and accelerate. For mkl we use/wrap the LAPACKE (https://www.netlib.org/lapack/lapacke.html) c interfaces which are the cleanest. For the rest of the libraries we wrap the old CLAPACK interfaces. Openblas is by default built with lapack support and the binary will contain a full set of LAPACKE functions. Thus this poc adds a build option with which we can use the LAPACKE apis even for openblas. This change will make ilp64 blas/lapack support easier as now we have the same wrapping logic for both mkl and openblas with USE_LAPACKE_INTERFACE = ON. Support for ilp64 mkl #19067 will automatically mean support for ilp64 openblas. Known issue: linux distros ship openblas binaries WITHOUT lapack so this option can only work if we build openblas from source. This is not a problem for our mxnet binary distributions as we always use our own openblas build there. However, users building mxnet from source must be advised against using this option unless they also build openblas from source.

2020-12-04 11:00:18 -08:00

    export OpenBLAS_HOME=/usr/local/openblas-clang/

CI: Test clang10 cpu & gpu builds with -WError (#17830) * Fix Wunused-variable * Fix Wreturn-std-move * Fix Wunused-const-variable * Fix Winconsistent-missing-override * Fix Wdelete-non-abstract-non-virtual-dtor * Fix Wrange-loop-construct * Disable Wpass-failed=transform-warning warning: loop not unrolled: the optimizer was unable to perform the requested transformation; the transformation might be disabled or specified as part of an unsupported transformation ordering * Fix Wimplicit-int-float-conversion 'float' changes value from 2147483647 to 2147483648 * Fix Wunused-lambda-capture * Fix Wundefined-var-template * cuda: --expt-relaxed-constexpr warning: calling a constexpr __host__ function from a __host__ __device__ function is not allowed. The experimental flag '--expt-relaxed-constexpr' can be used to allow this. * Fix Wrange-loop-construct avoiding extra copies * Fix Wunused-private-field * Fix Wwritable-strings * Enable Clang10 -WError checking on CI * Fix -WError with mkldnn -Wliteral-conversion, -Wabsolute-value, -Wunused-private-field, -Wimplicit-int-float-conversion * Fix shuffle_op.cc * Fix use of old binutils * Print traceback on exception in OpWrapperGenerator.py * USE_CPP_PACKAGE=OFF for gpu clang10 werror build

2020-03-17 21:36:50 -07:00

    CXX=clang++-10 CC=clang-10 cmake \

Remove USE_MKL_IF_AVAILABLE flag (#20004) USE_BLAS cmake option can be set to choose a particular BLAS. For example, USE_BLAS=mkl or USE_BLAS=open. If USE_BLAS is not specified, we search if MKL is available and use MKL if it is available. Otherwise OpenBLAS is used.

2021-03-12 20:44:24 +01:00

       -DUSE_BLAS=Open \

Change inner mxnet flags nomenclature for oneDNN library (#19944) This change includes: * changing MXNET_USE_MKLDNN flag name to MXNET_USE_ONEDNN * changing USE_MKLDNN flag name to USE_ONEDNN * changing 3rdparty/mkldnn folder name to 3rdparty/onednn * changing include/mkldnn folder name to include/onednn * changing MKLDNN occurrences in build and documentation files to ONEDNN * adding Bartosz Kuncer to contributors list

2021-03-15 17:32:37 +01:00

       -DUSE_ONEDNN=ON \

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

       -DUSE_CUDA=OFF \

[MXNET-74]Update mkldnn to the newest & Add clang build test with mkldnn. (#9918) * add clang test with mkldnn * update mkldnn * Update mkldnn to the newest & Add clang build test with mkldnn. #9918 * update all dockerfiles in order to use the correct mkl version * fix bugs in mkldnn_base * update mklml link and modify dockerfiles * delete blank line * no changes, just retrigger. * no changes, just retriggering * put mklml in a single run file * debug mklml run file * debug mklml script * add mkldnn clang test * give each build task its own unique workspace * change lib name

2018-03-14 22:38:45 +08:00

Change *_mkldnn* test and build scenarios names to *_onednn* (#20034)

2021-03-24 15:15:32 +01:00

build_ubuntu_cpu_onednn() {

Add Intel MKL blas to Jenkins (#13607) * add mkl blas to Jenkins * add mkl install script * fix bug in mkl script * remove python2 ut and add cpu-mkl node

2018-12-12 03:57:09 +08:00

    set -ex

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

    cd /work/build

use CC=gcc-7 CXX=g++-7 for all unix CI builds (#19701) * use CC=gcc-7 CXX=g++-7 for all unix CI builds * install gcc-7 and g++-7 * remove apt install cmake in favor of existing pip3 installation

2020-12-22 17:33:36 -08:00

    CC=gcc-7 CXX=g++-7 cmake \

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

        -DCMAKE_BUILD_TYPE="RelWithDebInfo" \

CI: Re-enable code coverage for CPU builds (#17889)

2020-04-04 01:51:56 +00:00

        -DENABLE_TESTCOVERAGE=ON \

Remove USE_MKL_IF_AVAILABLE flag (#20004) USE_BLAS cmake option can be set to choose a particular BLAS. For example, USE_BLAS=mkl or USE_BLAS=open. If USE_BLAS is not specified, we search if MKL is available and use MKL if it is available. Otherwise OpenBLAS is used.

2021-03-12 20:44:24 +01:00

        -DUSE_BLAS=Open \

Change inner mxnet flags nomenclature for oneDNN library (#19944) This change includes: * changing MXNET_USE_MKLDNN flag name to MXNET_USE_ONEDNN * changing USE_MKLDNN flag name to USE_ONEDNN * changing 3rdparty/mkldnn folder name to 3rdparty/onednn * changing include/mkldnn folder name to include/onednn * changing MKLDNN occurrences in build and documentation files to ONEDNN * adding Bartosz Kuncer to contributors list

2021-03-15 17:32:37 +01:00

        -DUSE_ONEDNN=ON \

CI: Re-enable code coverage for CPU builds (#17889)

2020-04-04 01:51:56 +00:00

        -DUSE_CUDA=OFF \

External Operators 2 (#19431) * initial commit * license fix * changed path var, formatting * add test to linux stages in ci * disable test on osx stage in ci * cleaned up example CMakeLists.txt removed -shared from GPU * moved windows check Co-authored-by: Ubuntu <ubuntu@ip-172-31-6-220.us-west-2.compute.internal> Co-authored-by: Manu Seth <sethman@amazon.com>

2020-11-13 23:13:06 -08:00

        -DBUILD_EXTENSION_PATH=/work/mxnet/example/extensions/lib_external_ops \

CI: Re-enable code coverage for CPU builds (#17889)

2020-04-04 01:51:56 +00:00

        -G Ninja /work/mxnet

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

    ninja
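The #20004 commit message above describes how the BLAS library is chosen when `USE_BLAS` is or is not given. A minimal sketch of that selection order in shell follows. The function name `choose_blas` and the "MKL is available if a directory exists" probe are assumptions made for illustration; the real detection lives in the project's CMake files and is more involved.

```shell
# Hypothetical helper, not part of runtime_functions.sh.
# Mirrors the selection order described in #20004.
# $1: explicit USE_BLAS value (may be empty)
# $2: directory to probe for MKL (assumption: MKL counts as "available"
#     when this directory exists)
choose_blas() {
    if [ -n "$1" ]; then
        echo "$1"      # an explicit USE_BLAS always wins
    elif [ -n "$2" ] && [ -d "$2" ]; then
        echo "mkl"     # USE_BLAS not specified and MKL was found
    else
        echo "open"    # USE_BLAS not specified and no MKL: use OpenBLAS
    fi
}
```

Passing `-DUSE_BLAS=Open`, as the build above does, corresponds to the first branch: the explicit value is used and no MKL search result can override it.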

Add Intel MKL blas to Jenkins (#13607) * add mkl blas to Jenkins * add mkl install script * fix bug in mkl script * remove python2 ut and add cpu-mkl node

2018-12-12 03:57:09 +08:00

Change *_mkldnn* test and build scenarios names to *_onednn* (#20034)

2021-03-24 15:15:32 +01:00

build_ubuntu_cpu_onednn_mkl() {

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

    set -ex

Fix MKL static link & default to static link on unix (#17751) * Fix MKL static link & default to static link on unix Fixes https://github.com/apache/incubator-mxnet/issues/17641 * Test cmake MKL build on CI

2020-03-04 08:55:01 -08:00

    cd /work/build

use CC=gcc-7 CXX=g++-7 for all unix CI builds (#19701) * use CC=gcc-7 CXX=g++-7 for all unix CI builds * install gcc-7 and g++-7 * remove apt install cmake in favor of existing pip3 installation

2020-12-22 17:33:36 -08:00

    CC=gcc-7 CXX=g++-7 cmake \

Fix MKL static link & default to static link on unix (#17751) * Fix MKL static link & default to static link on unix Fixes https://github.com/apache/incubator-mxnet/issues/17641 * Test cmake MKL build on CI

2020-03-04 08:55:01 -08:00

        -DCMAKE_BUILD_TYPE="RelWithDebInfo" \

Disable test coverage in MKL builds (#18443) * Disable test coverage in MKL builds * Enable test parallelization * Set OMP_NUM_THREADS * Fix * Fix unpack_and_init

2020-07-15 00:57:38 +00:00

        -DENABLE_TESTCOVERAGE=OFF \

Change inner mxnet flags nomenclature for oneDNN library (#19944) This change includes: * changing MXNET_USE_MKLDNN flag name to MXNET_USE_ONEDNN * changing USE_MKLDNN flag name to USE_ONEDNN * changing 3rdparty/mkldnn folder name to 3rdparty/onednn * changing include/mkldnn folder name to include/onednn * changing MKLDNN occurrences in build and documentation files to ONEDNN * adding Bartosz Kuncer to contributors list

2021-03-15 17:32:37 +01:00

        -DUSE_ONEDNN=ON \

Fix MKL static link & default to static link on unix (#17751) * Fix MKL static link & default to static link on unix Fixes https://github.com/apache/incubator-mxnet/issues/17641 * Test cmake MKL build on CI

2020-03-04 08:55:01 -08:00

        -DUSE_CUDA=OFF \

External Operators 2 (#19431) * initial commit * license fix * changed path var, formatting * add test to linux stages in ci * disable test on osx stage in ci * cleaned up example CMakeLists.txt removed -shared from GPU * moved windows check Co-authored-by: Ubuntu <ubuntu@ip-172-31-6-220.us-west-2.compute.internal> Co-authored-by: Manu Seth <sethman@amazon.com>

2020-11-13 23:13:06 -08:00

        -DBUILD_EXTENSION_PATH=/work/mxnet/example/extensions/lib_external_ops \

Fix MKL static link & default to static link on unix (#17751) * Fix MKL static link & default to static link on unix Fixes https://github.com/apache/incubator-mxnet/issues/17641 * Test cmake MKL build on CI

2020-03-04 08:55:01 -08:00

        -GNinja /work/mxnet
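The two oneDNN CPU builds differ mainly in test coverage: it is enabled for the OpenBLAS variant and disabled for the MKL variant (#18443). The sketch below collects the flags that are visible in this listing into one helper so the difference is easy to see. The function name `onednn_cmake_flags` and its `open`/`mkl` argument are hypothetical, and a blame view may not show every line of the script, so the flag list should be read as illustrative rather than complete.

```shell
# Hypothetical helper, not part of runtime_functions.sh.
# Prints, one per line, the cmake flags visible in this listing for the
# two oneDNN CPU builds.
# $1: "open" (build_ubuntu_cpu_onednn) or "mkl" (build_ubuntu_cpu_onednn_mkl)
onednn_cmake_flags() {
    case "$1" in
        open) variant_flags='-DENABLE_TESTCOVERAGE=ON -DUSE_BLAS=Open' ;;
        # Coverage is off for MKL builds (#18443). Any BLAS flag for this
        # variant is not visible in the listing, so none is emitted here.
        mkl)  variant_flags='-DENABLE_TESTCOVERAGE=OFF' ;;
        *)    echo "unknown variant: $1" >&2; return 1 ;;
    esac
    # $variant_flags is intentionally unquoted so it splits into words.
    printf '%s\n' \
        '-DCMAKE_BUILD_TYPE=RelWithDebInfo' \
        $variant_flags \
        '-DUSE_ONEDNN=ON' \
        '-DUSE_CUDA=OFF' \
        '-DBUILD_EXTENSION_PATH=/work/mxnet/example/extensions/lib_external_ops'
}
```

A caller could then write something like `CC=gcc-7 CXX=g++-7 cmake $(onednn_cmake_flags mkl) -GNinja /work/mxnet`, relying on word splitting of the command substitution.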

Add Intel MKL blas to Jenkins (#13607) * add mkl blas to Jenkins * add mkl install script * fix bug in mkl script * remove python2 ut and add cpu-mkl node

2018-12-12 03:57:09 +08:00

[MXNET-703] TensorRT runtime integration (#11325) * [MXNET-703] TensorRT runtime integration Co-authored-by: Clement Fuji-Tsang <caenorst@hotmail.com> Co-authored-by: Kellen Sunderland <kellen.sunderland@gmail.com> * correctly assign self._optimized_symbol in executor * declare GetTrtCompatibleSubsets and ReplaceSubgraph only if MXNET_USE_TENSORRT * add comments in ReplaceSubgraph * Addressing Haibin's code review points * Check that shared_buffer is not empty when USE_TENSORRT is set * Added check that TensorRT binding is for inference only * Removed redundant decl. * WIP Refactored TRT integration and tests * Add more build guards, remove unused code * Remove ccache report * Remove redundant const in declaration * Clean Cmake TRT files * Remove TensorRT env var usage We don't want to use environment variables with TensorRT yet, the logic being that we want to try and have as much fwd compatiblity as possible when working on an experimental feature. Were we to add env vars they would have to be gaurenteed to work in the future until a major version change. Moving the functionality to a contrib call reduces this risk. * Use contrib optimize_graph instaed of bind * Clean up cycle detector * Convert lenet test to contrib optimize * Protect interface with trt build flag * Fix whitespace issues * Add another build guard to c_api * Move get_optimized_symbol to contrib area * Ignore gz files in test folder * Make trt optimization implicit * Remove unused declaration * Replace build guards with runtime errors * Change default value of TensorRT to off This is change applies to both TensorRT and non-TensorRT builds. 
* Warn user when TRT not active at runtime * Move TensorRTBind declaration, add descriptive errors * Test TensorRT graph execution, fix bugs * Fix lint and whitespace issues * Fix typo * Removed default value for set_use_tensorrt * Improved documentation and fixed spacing issues * Move static exec funcs to util files * Update comments to match util style * Apply const to loop element * Fix a few namespace issues * Make static funcs inline to avoid compiler warning * Remove unused inference code from lenet5_train * Add explicit trt contrib bind, update tests to use it * Rename trt bind call * Remove documentation that is not needed for trt * Reorder arguments, allow position calling

2018-08-10 02:38:04 -07:00

build_ubuntu_gpu_tensorrt() {

use CC=gcc-7 CXX=g++-7 for all unix CI builds (#19701) * use CC=gcc-7 CXX=g++-7 for all unix CI builds * install gcc-7 and g++-7 * remove apt install cmake in favor of existing pip3 installation

2020-12-22 17:33:36 -08:00

    export CC=gcc-7

Update the onnx-tensorrt submodule - CI to TRT7 (#18574)

2020-08-03 17:15:02 -07:00

    export ONNX_NAMESPACE=onnx

Fix CI in master (#21026) * Update minor versions of nvidia cuda containers to use that have the latest keys pre-installed. * Update the TensorRT pipeline to Cuda 11.2. * Update TensorRT pipeline to use Cuda 11.4 and update libnvinfer to 8.2.4. * Allow setting TRT version as argument in docker-compose.yml and update to TRT 8.2.4 for cuda 11.4. * Use python3 executable when building tensorrt (so we can update to ubuntu 20.04 base) and enable int64 build. * Remove unneeded line. * Support TRT 8+. * Update onnx-tensorrt to 22.02 release. * Add support for trt >= 8. * Fix lint * Remove debug line. * Don't upgrade libcudnn, use what is in the latest container from nvidia. * Set CUDNN_VERSION inside nvidia containers when NV_CUDNN_VERSION is set instead. * Go back to updating libcudnn8.

2022-05-16 08:25:38 -07:00

    export PYBIN=$(which python3)

Switch to C++17 and modernize toolchain + CI (#17984) As per #17968, require C++17 compatible compiler. For cuda code, use C++14 mode introduced in Cuda 9. C++17 support for Cuda will be available in Cuda 11. Switching to C++17 requires modernizing the toolchain, which exposed a number of technical debt issues in the codebase. All blocking issues are fixed as part of this PR. See the full list below. This PR contains the following specific changes: Switch CI pipeline to use gcc7 on Ubuntu and CentOS Switch CD pipeline to CentOS 7 with https://www.softwarecollections.org/en/scls/rhscl/devtoolset-7/ This enables us to build with gcc7 C++17 compiler while keeping a relatively old glibc requirement for distribution. Simplify ARM Edge builds Switch to standard Ubuntu / Debian cross-compilation toolchain for ARMv7, ARMv8 Switch to https://toolchains.bootlin.com/ toolchain for ARMv6 (the Debian ARMv6 toolchain is for ARMv4 + ARMv5 + ARMv6, but we wish to only target ARMv6 and make use of ARMv6 features) Remove reliance on dockcross for cross compilation. Simplify Jetson build Use standard Ubuntu / Debian cross-compilation toolchain for ARMv8 Upgrade to Cuda 10 and Jetpack 4.3 Simplify build setup Simplify QEMU ARM virtualization test setup on CI Remove complex "Virtual Machine in Docker" logic and run a QEMU based Docker container instead based on arm32v7/ubuntu Fix out of bounds vector accesses in SoftmaxGradOpType MKLDNNFCBackward Fix use of non-standard rand_r function (which is not available on anymore on newer Android toolchains and shouldn't be use in any case). Fix reproducibility of RNN with Dropout Fix reproducibility of DGL Graph Sampling Operators Update tests for Android Edge build to NDK19. The previously used standalone toolchain is obsolete. 
Those Dockerfiles that required refactoring as part of the effort were refactored based on the following consideration Maximize the use of system dependencies provided by the distribution instead of manually installing dependencies from source or from third party vendors. This reduces the complexity of the installation process and essentially pins the dependency versions, increasing CI stability. Further, Dockerfile build speed is improved. To facilitate this, use recent distribution versions. We still ensure backwards compatibility via CentOS7 based build and test stages Minimize the number of layers in the Dockerfile. Don't have 5 different script files executed, each calling apt-get update and install, but just execute once. Speeds up the build and reduces image size. Keep each Dockerfile simple and tailored to a purpose, instead of running 20 scripts to install dependencies for every thinkable scenario, which is unmaintainable. Some more small changes: Remove outdated references to Cuda 7 and Cuda 8 in various files. Remove C++03 support in mshadow Disable broken tests NumpyBooleanAssignForwardCPU #17990 test_init.test_rsp_const_init #17988 quantized_elemwise_mul #18034 List of squashed commits * cpp standard * Remove leftover files of Cuda 7 and Cuda 8 support * thrust 1.9.8 for clang10 * compiler warnings * Disable broken test_init.test_rsp_const_init * Disable tests invoking NumpyBooleanAssignForwardCPU * Fix out of bounds access in SoftmaxGradOpType * Use CentOS 7 for staticbuilds CentOS 7 fullfills the requirements for PEP 599 manylinux-2014 and provides a C++17 toolchain. 
* Fix MKLDNNFCBackward * Update edge toolchain * Support platforms without rand_r * Cleanup random.h * Greatly simplify qemu setup * Remove unused functions in Jenkins_steps.groovy * Skip quantized_elemwise_mul due QuantizedElemwiseMulOpShape bug * Fix R package installation https://github.com/apache/incubator-mxnet/issues/18042 * Fix centos ccache * Fix GPU Makefile staticbuild on CentOS7 * CentOS7 NCCL * CentOS7 staticbuild fix link with libculibos

2020-04-14 10:29:29 -07:00

[MXNET-703] TensorRT runtime integration (#11325) * [MXNET-703] TensorRT runtime integration Co-authored-by: Clement Fuji-Tsang <caenorst@hotmail.com> Co-authored-by: Kellen Sunderland <kellen.sunderland@gmail.com> * correctly assign self._optimized_symbol in executor * declare GetTrtCompatibleSubsets and ReplaceSubgraph only if MXNET_USE_TENSORRT * add comments in ReplaceSubgraph * Addressing Haibin's code review points * Check that shared_buffer is not empty when USE_TENSORRT is set * Added check that TensorRT binding is for inference only * Removed redundant decl. * WIP Refactored TRT integration and tests * Add more build guards, remove unused code * Remove ccache report * Remove redundant const in declaration * Clean Cmake TRT files * Remove TensorRT env var usage We don't want to use environment variables with TensorRT yet, the logic being that we want to try and have as much fwd compatiblity as possible when working on an experimental feature. Were we to add env vars they would have to be gaurenteed to work in the future until a major version change. Moving the functionality to a contrib call reduces this risk. * Use contrib optimize_graph instaed of bind * Clean up cycle detector * Convert lenet test to contrib optimize * Protect interface with trt build flag * Fix whitespace issues * Add another build guard to c_api * Move get_optimized_symbol to contrib area * Ignore gz files in test folder * Make trt optimization implicit * Remove unused declaration * Replace build guards with runtime errors * Change default value of TensorRT to off This is change applies to both TensorRT and non-TensorRT builds. 
* Warn user when TRT not active at runtime * Move TensorRTBind declaration, add descriptive errors * Test TensorRT graph execution, fix bugs * Fix lint and whitespace issues * Fix typo * Removed default value for set_use_tensorrt * Improved documentation and fixed spacing issues * Move static exec funcs to util files * Update comments to match util style * Apply const to loop element * Fix a few namespace issues * Make static funcs inline to avoid compiler warning * Remove unused inference code from lenet5_train * Add explicit trt contrib bind, update tests to use it * Rename trt bind call * Remove documentation that is not needed for trt * Reorder arguments, allow position calling

2018-08-10 02:38:04 -07:00

    # Build ONNX

Fix CI in master (#21026) * Update minor versions of nvidia cuda containers to use that have the latest keys pre-installed. * Update the TensorRT pipeline to Cuda 11.2. * Update TensorRT pipeline to use Cuda 11.4 and update libnvinfer to 8.2.4. * Allow setting TRT version as argument in docker-compose.yml and update to TRT 8.2.4 for cuda 11.4. * Use python3 executable when building tensorrt (so we can update to ubuntu 20.04 base) and enable int64 build. * Remove unneeded line. * Support TRT 8+. * Update onnx-tensorrt to 22.02 release. * Add support for trt >= 8. * Fix lint * Remove debug line. * Don't upgrade libcudnn, use what is in the latest container from nvidia. * Set CUDNN_VERSION inside nvidia containers when NV_CUDNN_VERSION is set instead. * Go back to updating libcudnn8.

2022-05-16 08:25:38 -07:00

    cmake -DPYTHON_EXECUTABLE=$PYBIN -DCMAKE_CXX_FLAGS=-I/usr/include/python${PYVER} -DBUILD_SHARED_LIBS=ON ..

Update the onnx-tensorrt submodule - CI to TRT7 (#18574)

2020-08-03 17:15:02 -07:00

    make -j$(nproc)

[MXNET-703] TensorRT runtime integration (#11325) * [MXNET-703] TensorRT runtime integration Co-authored-by: Clement Fuji-Tsang <caenorst@hotmail.com> Co-authored-by: Kellen Sunderland <kellen.sunderland@gmail.com> * correctly assign self._optimized_symbol in executor * declare GetTrtCompatibleSubsets and ReplaceSubgraph only if MXNET_USE_TENSORRT * add comments in ReplaceSubgraph * Addressing Haibin's code review points * Check that shared_buffer is not empty when USE_TENSORRT is set * Added check that TensorRT binding is for inference only * Removed redundant decl. * WIP Refactored TRT integration and tests * Add more build guards, remove unused code * Remove ccache report * Remove redundant const in declaration * Clean Cmake TRT files * Remove TensorRT env var usage We don't want to use environment variables with TensorRT yet, the logic being that we want to try and have as much fwd compatiblity as possible when working on an experimental feature. Were we to add env vars they would have to be gaurenteed to work in the future until a major version change. Moving the functionality to a contrib call reduces this risk. * Use contrib optimize_graph instaed of bind * Clean up cycle detector * Convert lenet test to contrib optimize * Protect interface with trt build flag * Fix whitespace issues * Add another build guard to c_api * Move get_optimized_symbol to contrib area * Ignore gz files in test folder * Make trt optimization implicit * Remove unused declaration * Replace build guards with runtime errors * Change default value of TensorRT to off This is change applies to both TensorRT and non-TensorRT builds. 
* Warn user when TRT not active at runtime * Move TensorRTBind declaration, add descriptive errors * Test TensorRT graph execution, fix bugs * Fix lint and whitespace issues * Fix typo * Removed default value for set_use_tensorrt * Improved documentation and fixed spacing issues * Move static exec funcs to util files * Update comments to match util style * Apply const to loop element * Fix a few namespace issues * Make static funcs inline to avoid compiler warning * Remove unused inference code from lenet5_train * Add explicit trt contrib bind, update tests to use it * Rename trt bind call * Remove documentation that is not needed for trt * Reorder arguments, allow position calling

2018-08-10 02:38:04 -07:00

    export LIBRARY_PATH=`pwd`:`pwd`/onnx/:$LIBRARY_PATH

Update the onnx-tensorrt submodule - CI to TRT7 (#18574)

2020-08-03 17:15:02 -07:00

    export CXXFLAGS=-I`pwd`

[MXNET-703] TensorRT runtime integration (#11325) * [MXNET-703] TensorRT runtime integration Co-authored-by: Clement Fuji-Tsang <caenorst@hotmail.com> Co-authored-by: Kellen Sunderland <kellen.sunderland@gmail.com> * correctly assign self._optimized_symbol in executor * declare GetTrtCompatibleSubsets and ReplaceSubgraph only if MXNET_USE_TENSORRT * add comments in ReplaceSubgraph * Addressing Haibin's code review points * Check that shared_buffer is not empty when USE_TENSORRT is set * Added check that TensorRT binding is for inference only * Removed redundant decl. * WIP Refactored TRT integration and tests * Add more build guards, remove unused code * Remove ccache report * Remove redundant const in declaration * Clean Cmake TRT files * Remove TensorRT env var usage We don't want to use environment variables with TensorRT yet, the logic being that we want to try and have as much fwd compatiblity as possible when working on an experimental feature. Were we to add env vars they would have to be gaurenteed to work in the future until a major version change. Moving the functionality to a contrib call reduces this risk. * Use contrib optimize_graph instaed of bind * Clean up cycle detector * Convert lenet test to contrib optimize * Protect interface with trt build flag * Fix whitespace issues * Add another build guard to c_api * Move get_optimized_symbol to contrib area * Ignore gz files in test folder * Make trt optimization implicit * Remove unused declaration * Replace build guards with runtime errors * Change default value of TensorRT to off This is change applies to both TensorRT and non-TensorRT builds. 
* Warn user when TRT not active at runtime * Move TensorRTBind declaration, add descriptive errors * Test TensorRT graph execution, fix bugs * Fix lint and whitespace issues * Fix typo * Removed default value for set_use_tensorrt * Improved documentation and fixed spacing issues * Move static exec funcs to util files * Update comments to match util style * Apply const to loop element * Fix a few namespace issues * Make static funcs inline to avoid compiler warning * Remove unused inference code from lenet5_train * Add explicit trt contrib bind, update tests to use it * Rename trt bind call * Remove documentation that is not needed for trt * Reorder arguments, allow position calling

2018-08-10 02:38:04 -07:00

    popd

CI: Migrate remaining Dockerfiles to docker-compose.yml and remove unused code (#18771) * Migrate remaining Dockerfiles to docker-compose.yml - Delete unused Dockerfiles - Delete unused install/*.sh scripts - Consolidate ubuntu_gpu_tensorrt and ubuntu_gpu - Remove deprecated logic in ci/build.py (no longer needed with docker-compose) - Remove ci/docker_cache.py (no longer needed with docker-compose) * Fix * Fix * Fix ubuntu_cpu_jekyll

2020-07-23 18:09:10 +00:00

    export LD_LIBRARY_PATH=${LD_LIBRARY_PATH}:/usr/local/lib

Fix CI in master (#21026) * Update minor versions of nvidia cuda containers to use that have the latest keys pre-installed. * Update the TensorRT pipeline to Cuda 11.2. * Update TensorRT pipeline to use Cuda 11.4 and update libnvinfer to 8.2.4. * Allow setting TRT version as argument in docker-compose.yml and update to TRT 8.2.4 for cuda 11.4. * Use python3 executable when building tensorrt (so we can update to ubuntu 20.04 base) and enable int64 build. * Remove unneeded line. * Support TRT 8+. * Update onnx-tensorrt to 22.02 release. * Add support for trt >= 8. * Fix lint * Remove debug line. * Don't upgrade libcudnn, use what is in the latest container from nvidia. * Set CUDNN_VERSION inside nvidia containers when NV_CUDNN_VERSION is set instead. * Go back to updating libcudnn8.

2022-05-16 08:25:38 -07:00

    export CPLUS_INCLUDE_PATH=${CPLUS_INCLUDE_PATH}:/usr/local/cuda/targets/x86_64-linux/include/

[MXNET-703] TensorRT runtime integration (#11325) * [MXNET-703] TensorRT runtime integration Co-authored-by: Clement Fuji-Tsang <caenorst@hotmail.com> Co-authored-by: Kellen Sunderland <kellen.sunderland@gmail.com> * correctly assign self._optimized_symbol in executor * declare GetTrtCompatibleSubsets and ReplaceSubgraph only if MXNET_USE_TENSORRT * add comments in ReplaceSubgraph * Addressing Haibin's code review points * Check that shared_buffer is not empty when USE_TENSORRT is set * Added check that TensorRT binding is for inference only * Removed redundant decl. * WIP Refactored TRT integration and tests * Add more build guards, remove unused code * Remove ccache report * Remove redundant const in declaration * Clean Cmake TRT files * Remove TensorRT env var usage We don't want to use environment variables with TensorRT yet, the logic being that we want to try and have as much fwd compatiblity as possible when working on an experimental feature. Were we to add env vars they would have to be gaurenteed to work in the future until a major version change. Moving the functionality to a contrib call reduces this risk. * Use contrib optimize_graph instaed of bind * Clean up cycle detector * Convert lenet test to contrib optimize * Protect interface with trt build flag * Fix whitespace issues * Add another build guard to c_api * Move get_optimized_symbol to contrib area * Ignore gz files in test folder * Make trt optimization implicit * Remove unused declaration * Replace build guards with runtime errors * Change default value of TensorRT to off This is change applies to both TensorRT and non-TensorRT builds. 
* Warn user when TRT not active at runtime * Move TensorRTBind declaration, add descriptive errors * Test TensorRT graph execution, fix bugs * Fix lint and whitespace issues * Fix typo * Removed default value for set_use_tensorrt * Improved documentation and fixed spacing issues * Move static exec funcs to util files * Update comments to match util style * Apply const to loop element * Fix a few namespace issues * Make static funcs inline to avoid compiler warning * Remove unused inference code from lenet5_train * Add explicit trt contrib bind, update tests to use it * Rename trt bind call * Remove documentation that is not needed for trt * Reorder arguments, allow position calling

2018-08-10 02:38:04 -07:00

    pushd .

Fix CI in master (#21026) * Update minor versions of nvidia cuda containers to use that have the latest keys pre-installed. * Update the TensorRT pipeline to Cuda 11.2. * Update TensorRT pipeline to use Cuda 11.4 and update libnvinfer to 8.2.4. * Allow setting TRT version as argument in docker-compose.yml and update to TRT 8.2.4 for cuda 11.4. * Use python3 executable when building tensorrt (so we can update to ubuntu 20.04 base) and enable int64 build. * Remove unneeded line. * Support TRT 8+. * Update onnx-tensorrt to 22.02 release. * Add support for trt >= 8. * Fix lint * Remove debug line. * Don't upgrade libcudnn, use what is in the latest container from nvidia. * Set CUDNN_VERSION inside nvidia containers when NV_CUDNN_VERSION is set instead. * Go back to updating libcudnn8.

2022-05-16 08:25:38 -07:00

    cmake -DPYTHON_EXECUTABLE=$PYBIN -DONNX_NAMESPACE=$ONNX_NAMESPACE ..

[MXNET-703] TensorRT runtime integration (#11325) * [MXNET-703] TensorRT runtime integration Co-authored-by: Clement Fuji-Tsang <caenorst@hotmail.com> Co-authored-by: Kellen Sunderland <kellen.sunderland@gmail.com> * correctly assign self._optimized_symbol in executor * declare GetTrtCompatibleSubsets and ReplaceSubgraph only if MXNET_USE_TENSORRT * add comments in ReplaceSubgraph * Addressing Haibin's code review points * Check that shared_buffer is not empty when USE_TENSORRT is set * Added check that TensorRT binding is for inference only * Removed redundant decl. * WIP Refactored TRT integration and tests * Add more build guards, remove unused code * Remove ccache report * Remove redundant const in declaration * Clean Cmake TRT files * Remove TensorRT env var usage We don't want to use environment variables with TensorRT yet, the logic being that we want to try and have as much fwd compatiblity as possible when working on an experimental feature. Were we to add env vars they would have to be gaurenteed to work in the future until a major version change. Moving the functionality to a contrib call reduces this risk. * Use contrib optimize_graph instaed of bind * Clean up cycle detector * Convert lenet test to contrib optimize * Protect interface with trt build flag * Fix whitespace issues * Add another build guard to c_api * Move get_optimized_symbol to contrib area * Ignore gz files in test folder * Make trt optimization implicit * Remove unused declaration * Replace build guards with runtime errors * Change default value of TensorRT to off This is change applies to both TensorRT and non-TensorRT builds. 
* Warn user when TRT not active at runtime * Move TensorRTBind declaration, add descriptive errors * Test TensorRT graph execution, fix bugs * Fix lint and whitespace issues * Fix typo * Removed default value for set_use_tensorrt * Improved documentation and fixed spacing issues * Move static exec funcs to util files * Update comments to match util style * Apply const to loop element * Fix a few namespace issues * Make static funcs inline to avoid compiler warning * Remove unused inference code from lenet5_train * Add explicit trt contrib bind, update tests to use it * Rename trt bind call * Remove documentation that is not needed for trt * Reorder arguments, allow position calling

2018-08-10 02:38:04 -07:00

    make -j$(nproc)

Update the onnx-tensorrt submodule - CI to TRT7 (#18574)

2020-08-03 17:15:02 -07:00

    cp -L 3rdparty/onnx-tensorrt/build/libnvonnxparser.so /work/mxnet/lib/

[MXNET-703] TensorRT runtime integration (#11325) * [MXNET-703] TensorRT runtime integration Co-authored-by: Clement Fuji-Tsang <caenorst@hotmail.com> Co-authored-by: Kellen Sunderland <kellen.sunderland@gmail.com> * correctly assign self._optimized_symbol in executor * declare GetTrtCompatibleSubsets and ReplaceSubgraph only if MXNET_USE_TENSORRT * add comments in ReplaceSubgraph * Addressing Haibin's code review points * Check that shared_buffer is not empty when USE_TENSORRT is set * Added check that TensorRT binding is for inference only * Removed redundant decl. * WIP Refactored TRT integration and tests * Add more build guards, remove unused code * Remove ccache report * Remove redundant const in declaration * Clean Cmake TRT files * Remove TensorRT env var usage We don't want to use environment variables with TensorRT yet, the logic being that we want to try and have as much fwd compatiblity as possible when working on an experimental feature. Were we to add env vars they would have to be gaurenteed to work in the future until a major version change. Moving the functionality to a contrib call reduces this risk. * Use contrib optimize_graph instaed of bind * Clean up cycle detector * Convert lenet test to contrib optimize * Protect interface with trt build flag * Fix whitespace issues * Add another build guard to c_api * Move get_optimized_symbol to contrib area * Ignore gz files in test folder * Make trt optimization implicit * Remove unused declaration * Replace build guards with runtime errors * Change default value of TensorRT to off This is change applies to both TensorRT and non-TensorRT builds. 
* Warn user when TRT not active at runtime * Move TensorRTBind declaration, add descriptive errors * Test TensorRT graph execution, fix bugs * Fix lint and whitespace issues * Fix typo * Removed default value for set_use_tensorrt * Improved documentation and fixed spacing issues * Move static exec funcs to util files * Update comments to match util style * Apply const to loop element * Fix a few namespace issues * Make static funcs inline to avoid compiler warning * Remove unused inference code from lenet5_train * Add explicit trt contrib bind, update tests to use it * Rename trt bind call * Remove documentation that is not needed for trt * Reorder arguments, allow position calling

2018-08-10 02:38:04 -07:00

[MXNET-703] Update to TensorRT 5, ONNX IR 3. Fix inference bugs. (#13310) * [MXNET-703] Install CUDA 10 compatible cmake This works around a CUDA 10 cmake issue documented here: https://github.com/clab/dynet/issues/1457 This fix is temporary; once an updated cmake package is published to Ubuntu's package repo it may be reverted. * [MXNET-703] Update to TensorRT 5 ONNX IR 3. Fix inference bugs. * [MXNET-703] Describe onnx opsets and major version

2019-01-15 22:14:46 -08:00

    cd /work/build

Fix CI in master (#21026) * Update minor versions of nvidia cuda containers to use that have the latest keys pre-installed. * Update the TensorRT pipeline to Cuda 11.2. * Update TensorRT pipeline to use Cuda 11.4 and update libnvinfer to 8.2.4. * Allow setting TRT version as argument in docker-compose.yml and update to TRT 8.2.4 for cuda 11.4. * Use python3 executable when building tensorrt (so we can update to ubuntu 20.04 base) and enable int64 build. * Remove unneeded line. * Support TRT 8+. * Update onnx-tensorrt to 22.02 release. * Add support for trt >= 8. * Fix lint * Remove debug line. * Don't upgrade libcudnn, use what is in the latest container from nvidia. * Set CUDNN_VERSION inside nvidia containers when NV_CUDNN_VERSION is set instead. * Go back to updating libcudnn8.

2022-05-16 08:25:38 -07:00

          -DUSE_INT64_TENSOR_SIZE=1               \

[MXNET-703] Update to TensorRT 5, ONNX IR 3. Fix inference bugs. (#13310) * [MXNET-703] Install CUDA 10 compatible cmake This works around a CUDA 10 cmake issue documented here: https://github.com/clab/dynet/issues/1457 This fix is temporary; once an updated cmake package is published to Ubuntu's package repo it may be reverted. * [MXNET-703] Update to TensorRT 5 ONNX IR 3. Fix inference bugs. * [MXNET-703] Describe onnx opsets and major version

2019-01-15 22:14:46 -08:00

          -DUSE_OPENMP=0                          \

Remove USE_MKL_IF_AVAILABLE flag (#20004) USE_BLAS cmake option can be set to choose a particular BLAS. For example, USE_BLAS=mkl or USE_BLAS=open. If USE_BLAS is not specified, we search if MKL is available and use MKL if it is available. Otherwise OpenBLAS is used.

2021-03-12 20:44:24 +01:00

          -DUSE_BLAS=Open                         \

Change inner mxnet flags nomenclature for oneDNN library (#19944) This change includes: * changing MXNET_USE_MKLDNN flag name to MXNET_USE_ONEDNN * changing USE_MKLDNN flag name to USE_ONEDNN * changing 3rdparty/mkldnn folder name to 3rdparty/onednn * changing include/mkldnn folder name to include/onednn * changing MKLDNN occurences in build and documentation files to ONEDNN * adding Bartosz Kuncer to contributors list

2021-03-15 17:32:37 +01:00

          -DUSE_ONEDNN=0                          \

Update Ubuntu images used on CI to 20.04 (#19588) * Update Ubuntu images used on CI to 20.04 This helps ensure MXNet to work well on recent Linux distributions (while ensuring it continues to work well on ancient distributions based on the CentOS7 CI pipeline) * Preserve Ubuntu 18.04 images for TensorRT pipeline as NVidia failed to make TensorRT available for Ubuntu 20.04 * Temporarily disable NVML on CI [2020-12-03T18:33:10.380Z] OSError: /work/mxnet/python/mxnet/../../build/libmxnet.so: undefined symbol: nvmlDeviceGetComputeRunningProcesses_v2

2020-12-03 19:58:14 -07:00

          -DUSE_NVML=OFF                          \

Switch to modern CMake CUDA handling (#17031) Introduce unified MXNET_CUDA_ARCH option to specify cuda architectures. Previously cuda architecture setting was partially broken and different options were applied to different parts of the build (CUDA_ARCH_NAME CUDA_ARCH_BIN CUDA_ARCH_PTX and CUDA_ARCH_LIST). Include FindCUDAToolkit from CMake 3.17, which replaces the deprecated FindCUDA functionality for finding the cuda toolkit include directories and libraries.

2019-12-30 09:37:43 +00:00

          -DMXNET_CUDA_ARCH="$CI_CMAKE_CUDA_ARCH" \

[MXNET-703] Update to TensorRT 5, ONNX IR 3. Fix inference bugs. (#13310) * [MXNET-703] Install CUDA 10 compatible cmake This works around a CUDA 10 cmake issue documented here: https://github.com/clab/dynet/issues/1457 This fix is temporary; once an updated cmake package is published to Ubuntu's package repo it may be reverted. * [MXNET-703] Update to TensorRT 5 ONNX IR 3. Fix inference bugs. * [MXNET-703] Describe onnx opsets and major version

2019-01-15 22:14:46 -08:00

          -G Ninja                                \

Reduce load on CI due to excessive log flood (#17629)

2020-02-19 21:22:57 -08:00

    ninja

[MXNET-703] TensorRT runtime integration (#11325) * [MXNET-703] TensorRT runtime integration Co-authored-by: Clement Fuji-Tsang <caenorst@hotmail.com> Co-authored-by: Kellen Sunderland <kellen.sunderland@gmail.com> * correctly assign self._optimized_symbol in executor * declare GetTrtCompatibleSubsets and ReplaceSubgraph only if MXNET_USE_TENSORRT * add comments in ReplaceSubgraph * Addressing Haibin's code review points * Check that shared_buffer is not empty when USE_TENSORRT is set * Added check that TensorRT binding is for inference only * Removed redundant decl. * WIP Refactored TRT integration and tests * Add more build guards, remove unused code * Remove ccache report * Remove redundant const in declaration * Clean Cmake TRT files * Remove TensorRT env var usage We don't want to use environment variables with TensorRT yet, the logic being that we want to try and have as much fwd compatiblity as possible when working on an experimental feature. Were we to add env vars they would have to be gaurenteed to work in the future until a major version change. Moving the functionality to a contrib call reduces this risk. * Use contrib optimize_graph instaed of bind * Clean up cycle detector * Convert lenet test to contrib optimize * Protect interface with trt build flag * Fix whitespace issues * Add another build guard to c_api * Move get_optimized_symbol to contrib area * Ignore gz files in test folder * Make trt optimization implicit * Remove unused declaration * Replace build guards with runtime errors * Change default value of TensorRT to off This is change applies to both TensorRT and non-TensorRT builds. 
* Warn user when TRT not active at runtime * Move TensorRTBind declaration, add descriptive errors * Test TensorRT graph execution, fix bugs * Fix lint and whitespace issues * Fix typo * Removed default value for set_use_tensorrt * Improved documentation and fixed spacing issues * Move static exec funcs to util files * Update comments to match util style * Apply const to loop element * Fix a few namespace issues * Make static funcs inline to avoid compiler warning * Remove unused inference code from lenet5_train * Add explicit trt contrib bind, update tests to use it * Rename trt bind call * Remove documentation that is not needed for trt * Reorder arguments, allow position calling

2018-08-10 02:38:04 -07:00

Change *_mkldnn* test and build scenarios names to *_onednn* (#20034)

2021-03-24 15:15:32 +01:00

build_ubuntu_gpu_onednn() {

CI docker revamp; Add Jetson, Raspberry and CentOS 7 build [MXNET-42][MXNET-43] (#9995) * Start dockerfile revamp * Fix lint * Fix lint * Fix lint * Add python unit test * Add new dependency * Add user creation * Fix file permission * Determine USER_ID automatically * REmove ENV command in dockerfile * Remove python nose timer, improve useradd * ENable nvidia docker * Add remaining unittests * Add CentOS 7 unittests * Add integration tests * Add TVM and LLVM to dockerfile * Add ARMv7 and ARMv8 to build * Fix amalgamation build * Improvements and android_arm64 fixes, missing removing pthread * Jetson fix (install unzip) * Build jetson with make until issue with libomp.so is resolved #10011 * Fix Amalgamation builds * Fix Jetson build by switching to cmake * Assign CentOS gpu test to gpu instance * Fix R builds * Assign jobs to right docker containers * Fix missing file permissions inside docker * Create homedir on centos * Enable lapack * Fix Lapack on Cent OS 7 * Disable MXNET_MKLDNN_DEBUG * Delete Dockerfiles * Last general refinements before finish * Remove docker_multiarch folder, superseeded by the new ci scripts * Address review comments * Fix Caffe Integrationstest * Fix deploy stage * Address review comments * Address review comments * Enable script to run on Mac

2018-03-09 23:33:10 +01:00

    set -ex

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

    cd /work/build

use CC=gcc-7 CXX=g++-7 for all unix CI builds (#19701) * use CC=gcc-7 CXX=g++-7 for all unix CI builds * install gcc-7 and g++-7 * remove apt install cmake in favor of existing pip3 installation

2020-12-22 17:33:36 -08:00

    CC=gcc-7 CXX=g++-7 cmake \

CI: Test clang10 cpu & gpu builds with -WError (#17830) * Fix Wunused-variable * Fix Wreturn-std-move * Fix Wunused-const-variable * Fix Winconsistent-missing-override * Fix Wdelete-non-abstract-non-virtual-dtor * Fix Wrange-loop-construct * Disable Wpass-failed=transform-warning warning: loop not unrolled: the optimizer was unable to perform the requested transformation; the transformation might be disabled or specified as part of an unsupported transformation ordering * Fix Wimplicit-int-float-conversion 'float' changes value from 2147483647 to 2147483648 * Fix Wunused-lambda-capture * Fix Wundefined-var-template * cuda: --expt-relaxed-constexpr warning: calling a constexpr __host__ function from a __host__ __device__ function is not allowed. The experimental flag '--expt-relaxed-constexpr' can be used to allow this. * Fix Wrange-loop-construct avoiding extra copies * Fix Wunused-private-field * Fix Wwritable-strings * Enable Clang10 -WError checking on CI * Fix -WError with mkldnn -Wliteral-conversion, -Wabsolute-value, -Wunused-private-field, -Wimplicit-int-float-conversion * Fix shuffle_op.cc * Fix use of old binutils * Print traceback on exception in OpWrapperGenerator.py * USE_CPP_PACKAGE=OFF for gpu clang10 werror build

2020-03-17 21:36:50 -07:00

        -DCMAKE_BUILD_TYPE="RelWithDebInfo" \

Remove USE_MKL_IF_AVAILABLE flag (#20004) USE_BLAS cmake option can be set to choose a particular BLAS. For example, USE_BLAS=mkl or USE_BLAS=open. If USE_BLAS is not specified, we search if MKL is available and use MKL if it is available. Otherwise OpenBLAS is used.

2021-03-12 20:44:24 +01:00

        -DUSE_BLAS=Open \

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

        -DUSE_CUDA=ON \

Update Ubuntu images used on CI to 20.04 (#19588) * Update Ubuntu images used on CI to 20.04 This helps ensure MXNet to work well on recent Linux distributions (while ensuring it continues to work well on ancient distributions based on the CentOS7 CI pipeline) * Preserve Ubuntu 18.04 images for TensorRT pipeline as NVidia failed to make TensorRT available for Ubuntu 20.04 * Temporarily disable NVML on CI [2020-12-03T18:33:10.380Z] OSError: /work/mxnet/python/mxnet/../../build/libmxnet.so: undefined symbol: nvmlDeviceGetComputeRunningProcesses_v2

2020-12-03 19:58:14 -07:00

        -DUSE_NVML=OFF \

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

        -DMXNET_CUDA_ARCH="$CI_CMAKE_CUDA_ARCH" \

External Operators 2 (#19431) * initial commit * license fix * changed path var, formatting * add test to linux stages in ci * disable test on osx stage in ci * cleaned up example CMakeLists.txt removed -shared from GPU * moved windows check Co-authored-by: Ubuntu <ubuntu@ip-172-31-6-220.us-west-2.compute.internal> Co-authored-by: Manu Seth <sethman@amazon.com>

2020-11-13 23:13:06 -08:00

        -DBUILD_EXTENSION_PATH=/work/mxnet/example/extensions/lib_external_ops \

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

        -G Ninja /work/mxnet

CI docker revamp; Add Jetson, Raspberry and CentOS 7 build [MXNET-42][MXNET-43] (#9995) * Start dockerfile revamp * Fix lint * Fix lint * Fix lint * Add python unit test * Add new dependency * Add user creation * Fix file permission * Determine USER_ID automatically * REmove ENV command in dockerfile * Remove python nose timer, improve useradd * ENable nvidia docker * Add remaining unittests * Add CentOS 7 unittests * Add integration tests * Add TVM and LLVM to dockerfile * Add ARMv7 and ARMv8 to build * Fix amalgamation build * Improvements and android_arm64 fixes, missing removing pthread * Jetson fix (install unzip) * Build jetson with make until issue with libomp.so is resolved #10011 * Fix Amalgamation builds * Fix Jetson build by switching to cmake * Assign CentOS gpu test to gpu instance * Fix R builds * Assign jobs to right docker containers * Fix missing file permissions inside docker * Create homedir on centos * Enable lapack * Fix Lapack on Cent OS 7 * Disable MXNET_MKLDNN_DEBUG * Delete Dockerfiles * Last general refinements before finish * Remove docker_multiarch folder, superseeded by the new ci scripts * Address review comments * Fix Caffe Integrationstest * Fix deploy stage * Address review comments * Address review comments * Enable script to run on Mac

2018-03-09 23:33:10 +01:00

Change *_mkldnn* test and build scenarios names to *_onednn* (#20034)

2021-03-24 15:15:32 +01:00

build_ubuntu_gpu_onednn_nocudnn() {

Fix build issue with USE_CUDNN=0 (#11470) * Fix build issue with CUDNN=0 * Fix nocudnn func name * Remove python2 tests * Remove CPP package test * Check assert raises when cudnn disabled for op tests on gpu * Add line * Remove whitespace * add decorator for other ops * Add and remove assert * Fix op and common * Fix merge issue * Remove C API * Fix * Fix lint * Add init git * Rename CUDNN_DISABLED env variable * Add a runtime function for nocudnn * Remove MXCudnnIsenabled * Add comment for disabled test * Add full link in comment

2018-07-12 16:40:24 -07:00

    set -ex

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

    cd /work/build

use CC=gcc-7 CXX=g++-7 for all unix CI builds (#19701) * use CC=gcc-7 CXX=g++-7 for all unix CI builds * install gcc-7 and g++-7 * remove apt install cmake in favor of existing pip3 installation

2020-12-22 17:33:36 -08:00

    CC=gcc-7 CXX=g++-7 cmake \

CI: Test clang10 cpu & gpu builds with -WError (#17830) * Fix Wunused-variable * Fix Wreturn-std-move * Fix Wunused-const-variable * Fix Winconsistent-missing-override * Fix Wdelete-non-abstract-non-virtual-dtor * Fix Wrange-loop-construct * Disable Wpass-failed=transform-warning warning: loop not unrolled: the optimizer was unable to perform the requested transformation; the transformation might be disabled or specified as part of an unsupported transformation ordering * Fix Wimplicit-int-float-conversion 'float' changes value from 2147483647 to 2147483648 * Fix Wunused-lambda-capture * Fix Wundefined-var-template * cuda: --expt-relaxed-constexpr warning: calling a constexpr __host__ function from a __host__ __device__ function is not allowed. The experimental flag '--expt-relaxed-constexpr' can be used to allow this. * Fix Wrange-loop-construct avoiding extra copies * Fix Wunused-private-field * Fix Wwritable-strings * Enable Clang10 -WError checking on CI * Fix -WError with mkldnn -Wliteral-conversion, -Wabsolute-value, -Wunused-private-field, -Wimplicit-int-float-conversion * Fix shuffle_op.cc * Fix use of old binutils * Print traceback on exception in OpWrapperGenerator.py * USE_CPP_PACKAGE=OFF for gpu clang10 werror build

2020-03-17 21:36:50 -07:00

        -DCMAKE_BUILD_TYPE="RelWithDebInfo" \

Remove USE_MKL_IF_AVAILABLE flag (#20004) USE_BLAS cmake option can be set to choose a particular BLAS. For example, USE_BLAS=mkl or USE_BLAS=open. If USE_BLAS is not specified, we search if MKL is available and use MKL if it is available. Otherwise OpenBLAS is used.

2021-03-12 20:44:24 +01:00

        -DUSE_BLAS=Open \

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

        -DUSE_CUDA=ON \

Update Ubuntu images used on CI to 20.04 (#19588) * Update Ubuntu images used on CI to 20.04 This helps ensure MXNet to work well on recent Linux distributions (while ensuring it continues to work well on ancient distributions based on the CentOS7 CI pipeline) * Preserve Ubuntu 18.04 images for TensorRT pipeline as NVidia failed to make TensorRT available for Ubuntu 20.04 * Temporarily disable NVML on CI [2020-12-03T18:33:10.380Z] OSError: /work/mxnet/python/mxnet/../../build/libmxnet.so: undefined symbol: nvmlDeviceGetComputeRunningProcesses_v2

2020-12-03 19:58:14 -07:00

        -DUSE_NVML=OFF \

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

        -DMXNET_CUDA_ARCH="$CI_CMAKE_CUDA_ARCH" \

External Operators 2 (#19431) * initial commit * license fix * changed path var, formatting * add test to linux stages in ci * disable test on osx stage in ci * cleaned up example CMakeLists.txt removed -shared from GPU * moved windows check Co-authored-by: Ubuntu <ubuntu@ip-172-31-6-220.us-west-2.compute.internal> Co-authored-by: Manu Seth <sethman@amazon.com>

2020-11-13 23:13:06 -08:00

        -DBUILD_EXTENSION_PATH=/work/mxnet/example/extensions/lib_external_ops \

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

        -G Ninja /work/mxnet

Fix build issue with USE_CUDNN=0 (#11470) * Fix build issue with CUDNN=0 * Fix nocudnn func name * Remove python2 tests * Remove CPP package test * Check assert raises when cudnn disabled for op tests on gpu * Add line * Remove whitespace * add decorator for other ops * Add and remove assert * Fix op and common * Fix merge issue * Remove C API * Fix * Fix lint * Add init git * Rename CUDNN_DISABLED env variable * Add a runtime function for nocudnn * Remove MXCudnnIsenabled * Add comment for disabled test * Add full link in comment

2018-07-12 16:40:24 -07:00
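
The commit message for #20004 above describes how the BLAS library is chosen: an explicit `USE_BLAS` value wins, otherwise MKL is used if it is found, otherwise OpenBLAS. The rule can be sketched in shell as follows. This is only an illustration of the described precedence; `choose_blas` and its two arguments are hypothetical names, and the actual logic lives in MXNet's CMake files, which are not shown here.

```shell
# Hypothetical sketch of the selection rule from the #20004 commit message.
# This is NOT MXNet's CMake code.
#   $1: requested USE_BLAS value (may be empty)
#   $2: "yes" if an MKL installation was found, anything else otherwise
choose_blas() {
    requested="$1"
    mkl_found="$2"
    if [ -n "$requested" ]; then
        # An explicit choice always takes precedence.
        echo "$requested"
    elif [ "$mkl_found" = "yes" ]; then
        echo "mkl"
    else
        echo "open"
    fi
}

choose_blas "Open" "yes"
choose_blas "" "yes"
choose_blas "" "no"
```

Under this reading, passing `-DUSE_BLAS=Open` as the CI functions do pins OpenBLAS even on images where MKL happens to be installed.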

Update Ubuntu images used on CI to 20.04 (#19588) * Update Ubuntu images used on CI to 20.04 This helps ensure MXNet to work well on recent Linux distributions (while ensuring it continues to work well on ancient distributions based on the CentOS7 CI pipeline) * Preserve Ubuntu 18.04 images for TensorRT pipeline as NVidia failed to make TensorRT available for Ubuntu 20.04 * Temporarily disable NVML on CI [2020-12-03T18:33:10.380Z] OSError: /work/mxnet/python/mxnet/../../build/libmxnet.so: undefined symbol: nvmlDeviceGetComputeRunningProcesses_v2

2020-12-03 19:58:14 -07:00

build_ubuntu_gpu() {

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

    set -ex

Add back cpp-package (#20131) This updates and adds back the cpp-package removed in https://github.com/apache/incubator-mxnet/commit/97d4ba5a133f93ff6075dcde3ef842b23d498a12

2021-05-24 13:44:39 -07:00

    # Work around to link libcuda to libmxnet

use CC=gcc-7 CXX=g++-7 for all unix CI builds (#19701) * use CC=gcc-7 CXX=g++-7 for all unix CI builds * install gcc-7 and g++-7 * remove apt install cmake in favor of existing pip3 installation

2020-12-22 17:33:36 -08:00

    CC=gcc-7 CXX=g++-7 cmake \

CI: Test clang10 cpu & gpu builds with -WError (#17830) * Fix Wunused-variable * Fix Wreturn-std-move * Fix Wunused-const-variable * Fix Winconsistent-missing-override * Fix Wdelete-non-abstract-non-virtual-dtor * Fix Wrange-loop-construct * Disable Wpass-failed=transform-warning warning: loop not unrolled: the optimizer was unable to perform the requested transformation; the transformation might be disabled or specified as part of an unsupported transformation ordering * Fix Wimplicit-int-float-conversion 'float' changes value from 2147483647 to 2147483648 * Fix Wunused-lambda-capture * Fix Wundefined-var-template * cuda: --expt-relaxed-constexpr warning: calling a constexpr __host__ function from a __host__ __device__ function is not allowed. The experimental flag '--expt-relaxed-constexpr' can be used to allow this. * Fix Wrange-loop-construct avoiding extra copies * Fix Wunused-private-field * Fix Wwritable-strings * Enable Clang10 -WError checking on CI * Fix -WError with mkldnn -Wliteral-conversion, -Wabsolute-value, -Wunused-private-field, -Wimplicit-int-float-conversion * Fix shuffle_op.cc * Fix use of old binutils * Print traceback on exception in OpWrapperGenerator.py * USE_CPP_PACKAGE=OFF for gpu clang10 werror build

2020-03-17 21:36:50 -07:00

        -DCMAKE_BUILD_TYPE="RelWithDebInfo" \

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

        -DUSE_CUDA=ON \

Update Ubuntu images used on CI to 20.04 (#19588) * Update Ubuntu images used on CI to 20.04 This helps ensure MXNet to work well on recent Linux distributions (while ensuring it continues to work well on ancient distributions based on the CentOS7 CI pipeline) * Preserve Ubuntu 18.04 images for TensorRT pipeline as NVidia failed to make TensorRT available for Ubuntu 20.04 * Temporarily disable NVML on CI [2020-12-03T18:33:10.380Z] OSError: /work/mxnet/python/mxnet/../../build/libmxnet.so: undefined symbol: nvmlDeviceGetComputeRunningProcesses_v2

2020-12-03 19:58:14 -07:00

        -DUSE_NVML=OFF \

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

        -DMXNET_CUDA_ARCH="$CI_CMAKE_CUDA_ARCH" \

Add back cpp-package (#20131) This updates and adds back the cpp-package removed in https://github.com/apache/incubator-mxnet/commit/97d4ba5a133f93ff6075dcde3ef842b23d498a12

2021-05-24 13:44:39 -07:00

        -DUSE_CPP_PACKAGE=ON \

Remove USE_MKL_IF_AVAILABLE flag (#20004) USE_BLAS cmake option can be set to choose a particular BLAS. For example, USE_BLAS=mkl or USE_BLAS=open. If USE_BLAS is not specified, we search if MKL is available and use MKL if it is available. Otherwise OpenBLAS is used.

2021-03-12 20:44:24 +01:00

        -DUSE_BLAS=Open \

Change inner mxnet flags nomenclature for oneDNN library (#19944) This change includes: * changing MXNET_USE_MKLDNN flag name to MXNET_USE_ONEDNN * changing USE_MKLDNN flag name to USE_ONEDNN * changing 3rdparty/mkldnn folder name to 3rdparty/onednn * changing include/mkldnn folder name to include/onednn * changing MKLDNN occurences in build and documentation files to ONEDNN * adding Bartosz Kuncer to contributors list

2021-03-15 17:32:37 +01:00

        -DUSE_ONEDNN=OFF \

Add USE_DIST_KVSTORE=ON to GPU build (#17911) * Add USE_DIST_KVSTORE=ON to GPU build * Fix indent * Add check for error * Fix path error * Add license header * Remove unnecessary output * Fix path error * Fix error in test script

2020-04-06 12:49:41 -07:00

        -DUSE_DIST_KVSTORE=ON \

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

        -DBUILD_CYTHON_MODULES=ON \

External Operators 2 (#19431) * initial commit * license fix * changed path var, formatting * add test to linux stages in ci * disable test on osx stage in ci * cleaned up example CMakeLists.txt removed -shared from GPU * moved windows check Co-authored-by: Ubuntu <ubuntu@ip-172-31-6-220.us-west-2.compute.internal> Co-authored-by: Manu Seth <sethman@amazon.com>

2020-11-13 23:13:06 -08:00

        -DBUILD_EXTENSION_PATH=/work/mxnet/example/extensions/lib_external_ops \

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00

        -G Ninja /work/mxnet

Attempt to fix website build pipeline (#20634) * set nproc for build_ubuntu_cpu_openblas * set nproc for build_ubuntu_gpu Co-authored-by: Wei Chu <weichu@amazon.com>

2021-10-05 18:08:58 -07:00

    ninja -j$(($(nproc)/2))

CI: Switch to cmake builds for majority of tests (#17645) The following Makefile based builds are preserved 1) staticbuild scripts 2) Docs builds. Language binding specific build logic requires further changes 3) Jetson build. Jetpack 3.3 toolchain based on Cuda 9.0 causes 'Internal Compiler Error (codegen): "there was an error in verifying the lgenfe output!"' errors with cmake. This seems to be a known issue in Cuda 9.0 and we need to update Jetpack toolchain to work around it. 4) MKL builds. Waiting for fix of #17641 All Makefile based builds are marked with a "Makefile" postfix in the title. Improvements to CMake build - Enable -Werror for RelWithDebugInfo build in analogy to "make DEV=1" build - Add USE_LIBJPEG_TURBO to CMake build - Improve finding Python 3 executable Changes to CI setup - Install protobuf and zmq where missing - Install up-to-date CMake on Centos 7 - Don't use RelWithDebInfo on Android builds, as gcc 4.9 throws -Wdelete-non-virtual-dtor Code changes - Disable warnings introduced by GCC7 at via #pragma GCC diagnostic

2020-02-28 11:59:22 -08:00
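
The `ninja -j$(($(nproc)/2))` call in `build_ubuntu_gpu` limits the build to half of the available cores. Shell integer division rounds down, so on a single-core machine the expression evaluates to 0, and ninja's help text describes `-j 0` as meaning "infinity" rather than one job. A minimal sketch of a guarded variant is shown below; `half_jobs` is a hypothetical helper and is not part of `runtime_functions.sh`.

```shell
# Hypothetical helper, not part of runtime_functions.sh.
# Prints half of the given core count (integer division), but never less than 1.
half_jobs() {
    n=$(( $1 / 2 ))
    if [ "$n" -lt 1 ]; then
        n=1
    fi
    echo "$n"
}

half_jobs 8
half_jobs 3
half_jobs 1

# Possible use inside a build function (not executed here):
#   ninja -j"$(half_jobs "$(nproc)")"
```

Whether the guard is needed depends on the CI hosts; on multi-core build machines the original expression already yields a positive job count.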

Update Ubuntu images used on CI to 20.04 (#19588) * Update Ubuntu images used on CI to 20.04 This helps ensure MXNet to work well on recent Linux distributions (while ensuring it continues to work well on ancient distributions based on the CentOS7 CI pipeline) * Preserve Ubuntu 18.04 images for TensorRT pipeline as NVidia failed to make TensorRT available for Ubuntu 20.04 * Temporarily disable NVML on CI [2020-12-03T18:33:10.380Z] OSError: /work/mxnet/python/mxnet/../../build/libmxnet.so: undefined symbol: nvmlDeviceGetComputeRunningProcesses_v2

2020-12-03 19:58:14 -07:00

build_ubuntu_gpu_debug() {

[CI] fix debug build (#18240) * reverse control in jenkins steps, add debug gpu build test * try enabling gluon rnn cell tests on gpu * add faulthandler timeout

2020-05-06 10:45:34 -07:00

    set -ex

use CC=gcc-7 CXX=g++-7 for all unix CI builds (#19701) * use CC=gcc-7 CXX=g++-7 for all unix CI builds * install gcc-7 and g++-7 * remove apt install cmake in favor of existing pip3 installation

2020-12-22 17:33:36 -08:00

    CC=gcc-7 CXX=g++-7 cmake \

[CI] fix debug build (#18240) * reverse control in jenkins steps, add debug gpu build test * try enabling gluon rnn cell tests on gpu * add faulthandler timeout

2020-05-06 10:45:34 -07:00

        -DCMAKE_BUILD_TYPE=Debug \

Update Ubuntu images used on CI to 20.04 (#19588) * Update Ubuntu images used on CI to 20.04 This helps ensure MXNet to work well on recent Linux distributions (while ensuring it continues to work well on ancient distributions based on the CentOS7 CI pipeline) * Preserve Ubuntu 18.04 images for TensorRT pipeline as NVidia failed to make TensorRT available for Ubuntu 20.04 * Temporarily disable NVML on CI [2020-12-03T18:33:10.380Z] OSError: /work/mxnet/python/mxnet/../../build/libmxnet.so: undefined symbol: nvmlDeviceGetComputeRunningProcesses_v2

2020-12-03 19:58:14 -07:00

        -DUSE_NVML=OFF \

[CI] fix debug build (#18240) * reverse control in jenkins steps, add debug gpu build test * try enabling gluon rnn cell tests on gpu * add faulthandler timeout

2020-05-06 10:45:34 -07:00

        -DMXNET_CUDA_ARCH="$CI_CMAKE_CUDA_ARCH" \

Remove USE_MKL_IF_AVAILABLE flag (#20004) USE_BLAS cmake option can be set to choose a particular BLAS. For example, USE_BLAS=mkl or USE_BLAS=open. If USE_BLAS is not specified, we search if MKL is available and use MKL if it is available. Otherwise OpenBLAS is used.

2021-03-12 20:44:24 +01:00

        -DUSE_BLAS=Open \

Change inner mxnet flags nomenclature for oneDNN library (#19944) This change includes: * changing MXNET_USE_MKLDNN flag name to MXNET_USE_ONEDNN * changing USE_MKLDNN flag name to USE_ONEDNN * changing 3rdparty/mkldnn folder name to 3rdparty/onednn * changing include/mkldnn folder name to include/onednn * changing MKLDNN occurences in build and documentation files to ONEDNN * adding Bartosz Kuncer to contributors list

2021-03-15 17:32:37 +01:00

        -DUSE_ONEDNN=OFF \

[CI] fix debug build (#18240) * reverse control in jenkins steps, add debug gpu build test * try enabling gluon rnn cell tests on gpu * add faulthandler timeout

2020-05-06 10:45:34 -07:00

        -DUSE_DIST_KVSTORE=ON \

2019-04-23 14:47:10 -07:00

build_ubuntu_cpu_large_tensor() {

use CC=gcc-7 CXX=g++-7 for all unix CI builds (#19701) * use CC=gcc-7 CXX=g++-7 for all unix CI builds * install gcc-7 and g++-7 * remove apt install cmake in favor of existing pip3 installation

2020-12-22 17:33:36 -08:00

    CC=gcc-7 CXX=g++-7 cmake \

2019-04-23 14:47:10 -07:00

        -DUSE_SIGNAL_HANDLER=ON                 \

Remove USE_MKL_IF_AVAILABLE flag (#20004) USE_BLAS cmake option can be set to choose a particular BLAS. For example, USE_BLAS=mkl or USE_BLAS=open. If USE_BLAS is not specified, we search if MKL is available and use MKL if it is available. Otherwise OpenBLAS is used.

2021-03-12 20:44:24 +01:00

        -DUSE_BLAS=Open                         \

Change inner mxnet flags nomenclature for oneDNN library (#19944) This change includes: * changing MXNET_USE_MKLDNN flag name to MXNET_USE_ONEDNN * changing USE_MKLDNN flag name to USE_ONEDNN * changing 3rdparty/mkldnn folder name to 3rdparty/onednn * changing include/mkldnn folder name to include/onednn * changing MKLDNN occurences in build and documentation files to ONEDNN * adding Bartosz Kuncer to contributors list

2021-03-15 17:32:37 +01:00

        -DUSE_ONEDNN=ON                         \

2019-04-23 14:47:10 -07:00

        -G Ninja                                \

Reduce load on CI due to excessive log flood (#17629)

2020-02-19 21:22:57 -08:00

    ninja

2019-04-23 14:47:10 -07:00

use CC=gcc-7 CXX=g++-7 for all unix CI builds (#19701) * use CC=gcc-7 CXX=g++-7 for all unix CI builds * install gcc-7 and g++-7 * remove apt install cmake in favor of existing pip3 installation

2020-12-22 17:33:36 -08:00

    CC=gcc-7 CXX=g++-7 cmake \

2019-04-23 14:47:10 -07:00

        -DUSE_SIGNAL_HANDLER=ON                 \

Remove USE_MKL_IF_AVAILABLE flag (#20004) USE_BLAS cmake option can be set to choose a particular BLAS. For example, USE_BLAS=mkl or USE_BLAS=open. If USE_BLAS is not specified, we search if MKL is available and use MKL if it is available. Otherwise OpenBLAS is used.

2021-03-12 20:44:24 +01:00

        -DUSE_NVML=OFF                          \

Change inner mxnet flags nomenclature for oneDNN library (#19944) This change includes: * changing MXNET_USE_MKLDNN flag name to MXNET_USE_ONEDNN * changing USE_MKLDNN flag name to USE_ONEDNN * changing 3rdparty/mkldnn folder name to 3rdparty/onednn * changing include/mkldnn folder name to include/onednn * changing MKLDNN occurences in build and documentation files to ONEDNN * adding Bartosz Kuncer to contributors list

2021-03-15 17:32:37 +01:00

        -DUSE_ONEDNN=ON                         \

2019-04-23 14:47:10 -07:00

        -DUSE_DIST_KVSTORE=ON                   \

Switch to modern CMake CUDA handling (#17031) Introduce unified MXNET_CUDA_ARCH option to specify cuda architectures. Previously cuda architecture setting was partially broken and different options were applied to different parts of the build (CUDA_ARCH_NAME CUDA_ARCH_BIN CUDA_ARCH_PTX and CUDA_ARCH_LIST). Include FindCUDAToolkit from CMake 3.17, which replaces the deprecated FindCUDA functionality for finding the cuda toolkit include directories and libraries.

2019-12-30 09:37:43 +00:00

        -DMXNET_CUDA_ARCH="$CI_CMAKE_CUDA_ARCH" \

2019-04-23 14:47:10 -07:00

        -G Ninja                                \

Reduce load on CI due to excessive log flood (#17629)

2020-02-19 21:22:57 -08:00

    ninja

2019-04-23 14:47:10 -07:00

CI docker revamp; Add Jetson, Raspberry and CentOS 7 build [MXNET-42][MXNET-43] (#9995) * Start dockerfile revamp * Fix lint * Fix lint * Fix lint * Add python unit test * Add new dependency * Add user creation * Fix file permission * Determine USER_ID automatically * REmove ENV command in dockerfile * Remove python nose timer, improve useradd * ENable nvidia docker * Add remaining unittests * Add CentOS 7 unittests * Add integration tests * Add TVM and LLVM to dockerfile * Add ARMv7 and ARMv8 to build * Fix amalgamation build * Improvements and android_arm64 fixes, missing removing pthread * Jetson fix (install unzip) * Build jetson with make until issue with libomp.so is resolved #10011 * Fix Amalgamation builds * Fix Jetson build by switching to cmake * Assign CentOS gpu test to gpu instance * Fix R builds * Assign jobs to right docker containers * Fix missing file permissions inside docker * Create homedir on centos * Enable lapack * Fix Lapack on Cent OS 7 * Disable MXNET_MKLDNN_DEBUG * Delete Dockerfiles * Last general refinements before finish * Remove docker_multiarch folder, superseeded by the new ci scripts * Address review comments * Fix Caffe Integrationstest * Fix deploy stage * Address review comments * Address review comments * Enable script to run on Mac

2018-03-09 23:33:10 +01:00

# Testing

Split up CI sanity test functions to enable fine-grained trigger (#18786) Developers can now trigger fine grained checks: python ci/build.py -R --platform ubuntu_cpu /work/runtime_functions.sh sanity_python python ci/build.py -R --platform ubuntu_cpu /work/runtime_functions.sh sanity_license etc

2020-07-25 02:48:30 +00:00

    set -ex

[Feature][Master] Clang-format tool to perform additional formatting and semantic checking of code. (#20433) * Clang-format hook * Added: workflow file * tools/lint/clang_format_ci.sh was added * Permision was set on +x * Jenkins clang-format runner * Update runtime_funciton.sh file * Master last commit sha * Set BASE_SHA in greetings * GITHUB_BASE_REF and GITHUB_RUN_ID: set varaibles * Runtime function, os_x_static_build stores configuration * Greetings contains env variables * Check env params * White space between breackets * Show all refs * Correct refs to master * End up if [] * Git clang format rat-excludes * LICENCE file was modifed to exclude clang-format-13 file * Greetings, update env * Remove unnecessary condition * Update LICENSE * Fix licence checker * Clang-format file update * Error message is shifted Co-authored-by: Sheng Zha <szha@users.noreply.github.com>

2021-10-10 00:02:42 +02:00

    sanity_clang

Split up CI sanity test functions to enable fine-grained trigger (#18786) Developers can now trigger fine grained checks: python ci/build.py -R --platform ubuntu_cpu /work/runtime_functions.sh sanity_python python ci/build.py -R --platform ubuntu_cpu /work/runtime_functions.sh sanity_license etc

2020-07-25 02:48:30 +00:00

    sanity_license

[master][ci][feature] Static code checker for CMake files (#20706) * CmakeLint initial commit * Skip upstream, and module dir * Rollback

2021-11-21 04:05:47 +01:00

    sanity_cmakelint

Prospector checker initial commit (#20684)

2021-10-30 16:54:39 +02:00

    sanity_tutorial

Split up CI sanity test functions to enable fine-grained trigger (#18786) Developers can now trigger fine grained checks: python ci/build.py -R --platform ubuntu_cpu /work/runtime_functions.sh sanity_python python ci/build.py -R --platform ubuntu_cpu /work/runtime_functions.sh sanity_license etc

2020-07-25 02:48:30 +00:00

    sanity_cpp
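
The commit message for #18786 above notes that each sanity check can be triggered on its own through `ci/build.py`. The sketch below only prints the command lines for a few of the checks named in this file, using the invocation pattern quoted in that commit message. `sanity_cmd` is a hypothetical helper introduced for illustration; it does not start Docker or run any check.

```shell
# Hypothetical helper for illustration; it only prints the command line
# quoted in the #18786 commit message and does not execute it.
sanity_cmd() {
    echo "python ci/build.py -R --platform ubuntu_cpu /work/runtime_functions.sh $1"
}

# Function names taken from this file; the list is not exhaustive.
for check in sanity_license sanity_cpp sanity_cmakelint; do
    sanity_cmd "$check"
done
```

Running a single check this way is usually quicker than running the full set of sanity functions when only one kind of file has changed.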

[master][ci][feature] Static code checker for CMake files (#20706) * CmakeLint initial commit * Skip upstream, and module dir * Rollback

2021-11-21 04:05:47 +01:00

sanity_cmakelint() {

Prospector checker initial commit (#20684)

2021-10-30 16:54:39 +02:00

sanity_tutorial() {

Split up CI sanity test functions to enable fine-grained trigger (#18786) Developers can now trigger fine grained checks: python ci/build.py -R --platform ubuntu_cpu /work/runtime_functions.sh sanity_python python ci/build.py -R --platform ubuntu_cpu /work/runtime_functions.sh sanity_license etc

2020-07-25 02:48:30 +00:00

sanity_license() {

CI docker revamp; Add Jetson, Raspberry and CentOS 7 build [MXNET-42][MXNET-43] (#9995) * Start dockerfile revamp * Fix lint * Fix lint * Fix lint * Add python unit test * Add new dependency * Add user creation * Fix file permission * Determine USER_ID automatically * REmove ENV command in dockerfile * Remove python nose timer, improve useradd * ENable nvidia docker * Add remaining unittests * Add CentOS 7 unittests * Add integration tests * Add TVM and LLVM to dockerfile * Add ARMv7 and ARMv8 to build * Fix amalgamation build * Improvements and android_arm64 fixes, missing removing pthread * Jetson fix (install unzip) * Build jetson with make until issue with libomp.so is resolved #10011 * Fix Amalgamation builds * Fix Jetson build by switching to cmake * Assign CentOS gpu test to gpu instance * Fix R builds * Assign jobs to right docker containers * Fix missing file permissions inside docker * Create homedir on centos * Enable lapack * Fix Lapack on Cent OS 7 * Disable MXNET_MKLDNN_DEBUG * Delete Dockerfiles * Last general refinements before finish * Remove docker_multiarch folder, superseeded by the new ci scripts * Address review comments * Fix Caffe Integrationstest * Fix deploy stage * Address review comments * Address review comments * Enable script to run on Mac

2018-03-09 23:33:10 +01:00

    set -ex

Split up CI sanity test functions to enable fine-grained trigger (#18786) Developers can now trigger fine grained checks: python ci/build.py -R --platform ubuntu_cpu /work/runtime_functions.sh sanity_python python ci/build.py -R --platform ubuntu_cpu /work/runtime_functions.sh sanity_license etc

2020-07-25 02:48:30 +00:00

Fix naming in runtime_functions.sh (#18795)

2020-07-28 22:11:20 +00:00

sanity_cpp() {

Split up CI sanity test functions to enable fine-grained trigger (#18786) Developers can now trigger fine grained checks: python ci/build.py -R --platform ubuntu_cpu /work/runtime_functions.sh sanity_python python ci/build.py -R --platform ubuntu_cpu /work/runtime_functions.sh sanity_license etc

2020-07-25 02:48:30 +00:00

    set -ex

Add back cpp-package (#20131) This updates and adds back the cpp-package removed in https://github.com/apache/incubator-mxnet/commit/97d4ba5a133f93ff6075dcde3ef842b23d498a12

2021-05-24 13:44:39 -07:00

    3rdparty/dmlc-core/scripts/lint.py mxnet cpp include src plugin cpp-package tests --exclude_path src/operator/contrib/ctc_include include/onednn

Split up CI sanity test functions to enable fine-grained trigger (#18786) Developers can now trigger fine grained checks: python ci/build.py -R --platform ubuntu_cpu /work/runtime_functions.sh sanity_python python ci/build.py -R --platform ubuntu_cpu /work/runtime_functions.sh sanity_license etc

2020-07-25 02:48:30 +00:00

Prospector checker initial commit (#20684)

2021-10-30 16:54:39 +02:00

sanity_python_prospector() {

[Feature][Master] Clang-format tool to perform additional formatting and semantic checking of code. (#20433) * Clang-format hook * Added: workflow file * tools/lint/clang_format_ci.sh was added * Permision was set on +x * Jenkins clang-format runner * Update runtime_funciton.sh file * Master last commit sha * Set BASE_SHA in greetings * GITHUB_BASE_REF and GITHUB_RUN_ID: set varaibles * Runtime function, os_x_static_build stores configuration * Greetings contains env variables * Check env params * White space between breackets * Show all refs * Correct refs to master * End up if [] * Git clang format rat-excludes * LICENCE file was modifed to exclude clang-format-13 file * Greetings, update env * Remove unnecessary condition * Update LICENSE * Fix licence checker * Clang-format file update * Error message is shifted Co-authored-by: Sheng Zha <szha@users.noreply.github.com>

2021-10-10 00:02:42 +02:00

sanity_clang() {

[Feature][Master] Clang-format tool to perform additional formatting and semantic checking of code. (#20433) * Clang-format hook * Added: workflow file * tools/lint/clang_format_ci.sh was added * Permision was set on +x * Jenkins clang-format runner * Update runtime_funciton.sh file * Master last commit sha * Set BASE_SHA in greetings * GITHUB_BASE_REF and GITHUB_RUN_ID: set varaibles * Runtime function, os_x_static_build stores configuration * Greetings contains env variables * Check env params * White space between breackets * Show all refs * Correct refs to master * End up if [] * Git clang format rat-excludes * LICENCE file was modifed to exclude clang-format-13 file * Greetings, update env * Remove unnecessary condition * Update LICENSE * Fix licence checker * Clang-format file update * Error message is shifted Co-authored-by: Sheng Zha <szha@users.noreply.github.com>

2021-10-18 12:44:32 +02:00

    set -e

2021-10-10 00:02:42 +02:00

    # .github/workflows/greetings.yml passes BASE_SHA, GITHUB_RUN_ID, GITHUB_BASE_REF for pull requests.

[master][bugfix] Remove exit 0 to avoid blocking in CI pipeline (#20683) * Remove exit 0 to avoid stop CIs * Return instead of exit 0, to avoid exit code * Exit 1 is keet

2021-10-21 22:27:13 +02:00

        return

[Feature][Master] Clang-format tool to perform additional formatting and semantic checking of code. (#20433) * Clang-format hook * Added: workflow file * tools/lint/clang_format_ci.sh was added * Permision was set on +x * Jenkins clang-format runner * Update runtime_funciton.sh file * Master last commit sha * Set BASE_SHA in greetings * GITHUB_BASE_REF and GITHUB_RUN_ID: set varaibles * Runtime function, os_x_static_build stores configuration * Greetings contains env variables * Check env params * White space between breackets * Show all refs * Correct refs to master * End up if [] * Git clang format rat-excludes * LICENCE file was modifed to exclude clang-format-13 file * Greetings, update env * Remove unnecessary condition * Update LICENSE * Fix licence checker * Clang-format file update * Error message is shifted Co-authored-by: Sheng Zha <szha@users.noreply.github.com>

2021-10-10 00:02:42 +02:00

fi

[Feature][Master] Clang-format tool to perform additional formatting and semantic checking of code. (#20433) * Clang-format hook * Added: workflow file * tools/lint/clang_format_ci.sh was added * Permision was set on +x * Jenkins clang-format runner * Update runtime_funciton.sh file * Master last commit sha * Set BASE_SHA in greetings * GITHUB_BASE_REF and GITHUB_RUN_ID: set varaibles * Runtime function, os_x_static_build stores configuration * Greetings contains env variables * Check env params * White space between breackets * Show all refs * Correct refs to master * End up if [] * Git clang format rat-excludes * LICENCE file was modifed to exclude clang-format-13 file * Greetings, update env * Remove unnecessary condition * Update LICENSE * Fix licence checker * Clang-format file update * Error message is shifted Co-authored-by: Sheng Zha <szha@users.noreply.github.com>

2021-10-18 12:44:32 +02:00

2021-10-10 00:02:42 +02:00

    echo "~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~"

[Feature][Master] Clang-format tool to perform additional formatting and semantic checking of code. (#20433) * Clang-format hook * Added: workflow file * tools/lint/clang_format_ci.sh was added * Permision was set on +x * Jenkins clang-format runner * Update runtime_funciton.sh file * Master last commit sha * Set BASE_SHA in greetings * GITHUB_BASE_REF and GITHUB_RUN_ID: set varaibles * Runtime function, os_x_static_build stores configuration * Greetings contains env variables * Check env params * White space between breackets * Show all refs * Correct refs to master * End up if [] * Git clang format rat-excludes * LICENCE file was modifed to exclude clang-format-13 file * Greetings, update env * Remove unnecessary condition * Update LICENSE * Fix licence checker * Clang-format file update * Error message is shifted Co-authored-by: Sheng Zha <szha@users.noreply.github.com>

2021-10-18 12:44:32 +02:00

    echo "| Clang-format failures found! Run: "

2021-10-10 00:02:42 +02:00

    echo "| to fix this error. "

[Feature][Master] Clang-format tool to perform additional formatting and semantic checking of code. (#20433) * Clang-format hook * Added: workflow file * tools/lint/clang_format_ci.sh was added * Permision was set on +x * Jenkins clang-format runner * Update runtime_funciton.sh file * Master last commit sha * Set BASE_SHA in greetings * GITHUB_BASE_REF and GITHUB_RUN_ID: set varaibles * Runtime function, os_x_static_build stores configuration * Greetings contains env variables * Check env params * White space between breackets * Show all refs * Correct refs to master * End up if [] * Git clang format rat-excludes * LICENCE file was modifed to exclude clang-format-13 file * Greetings, update env * Remove unnecessary condition * Update LICENSE * Fix licence checker * Clang-format file update * Error message is shifted Co-authored-by: Sheng Zha <szha@users.noreply.github.com>

2021-10-18 12:44:32 +02:00

    echo "| For more info, see: https://mxnet.apache.org/versions/master/community/clang_format_guide"

2021-10-10 00:02:42 +02:00

    echo "~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~"

[Feature][Master] Clang-format tool to perform additional formatting and semantic checking of code. (#20433) * Clang-format hook * Added: workflow file * tools/lint/clang_format_ci.sh was added * Permision was set on +x * Jenkins clang-format runner * Update runtime_funciton.sh file * Master last commit sha * Set BASE_SHA in greetings * GITHUB_BASE_REF and GITHUB_RUN_ID: set varaibles * Runtime function, os_x_static_build stores configuration * Greetings contains env variables * Check env params * White space between breackets * Show all refs * Correct refs to master * End up if [] * Git clang format rat-excludes * LICENCE file was modifed to exclude clang-format-13 file * Greetings, update env * Remove unnecessary condition * Update LICENSE * Fix licence checker * Clang-format file update * Error message is shifted Co-authored-by: Sheng Zha <szha@users.noreply.github.com>

2021-10-18 12:44:32 +02:00

2021-10-10 00:02:42 +02:00

    echo "$GIT_DIFFERENCE"

2019-05-23 12:48:44 +02:00

# Tests libmxnet

[2.0] Bump Python to >= 3.8 (#20593) * Bump Python to >= 3.8, NumPy to >= 1.21.0 * remove pillow in requirements * update python * fix some tests

2021-10-19 22:21:04 -07:00

    source /opt/rh/rh-python38/enable

2019-05-23 12:48:44 +02:00

    export PYTHONPATH=./python/

Change MXNET_MKLDNN_DEBUG define name to MXNET_ONEDNN_DEBUG (#20031)

2021-03-16 20:16:53 +01:00

    export MXNET_ONEDNN_DEBUG=0  # Ignored if not present

CD Fixes (#16127) * Ignore load lib test in CD jobs * Removes cu80 and adds cu101 support to CD builds * Disable cython in CD python tests * Updates CD documentation to reflect variant changes

2019-05-23 12:48:44 +02:00

    export MXNET_STORAGE_FALLBACK_LOG_VERBOSE=0

Surpress subgraph log in CI (#16607) Change-Id: Ia2ed6fdbb1d2cb5cc607a8856ca13ee338e27eac

2019-10-24 04:47:01 -05:00

    export MXNET_SUBGRAPH_VERBOSE=0

2019-09-12 23:00:46 +02:00

    export MXNET_ENABLE_CYTHON=0

[DOC] Fix warnings in tutorials and turn on -W (#19624)

2020-12-08 11:13:09 -05:00

    export DMLC_LOG_STACK_TRACE_DEPTH=100

[CI] run pytest in parallel (#18146) * run pytest in parallel * disable memory pool * address flaky ftrl/fm test and layernorm timeout * mark tests as serial * use parametrize in numpy op tests * fix io bugs * fix gluon rnn cell test and doc * replace xfail with raises scope * fix flaky numpy, mkldnn quantize, and rnn tests * fix tempfile/dir usage

2019-05-23 12:48:44 +02:00

Disable test coverage in MKL builds (#18443) * Disable test coverage in MKL builds * Enable test parallelization * Set OMP_NUM_THREADS * Fix * Fix unpack_and_init

2020-07-15 00:57:38 +00:00

    OMP_NUM_THREADS=$(expr $(nproc) / 4) pytest -m 'not serial' -n 4 --durations=50 --verbose tests/python/unittest

2020-05-04 16:44:27 -07:00

    pytest -m 'serial' --durations=50 --verbose tests/python/unittest

2019-05-23 12:48:44 +02:00

Port top-level-project updates from v1.x branch (#21162)

2023-01-04 04:09:23 -08:00

    # https://github.com/apache/mxnet/issues/11801

[CI] run operator tests with naive engine (#18252) * run operator tests with naive engine * fix take tests * update skip mark * fix cuda error reset * adjust tests * disable parallel testing and naive engine for mkl/mkldnn #18244

2019-05-23 12:48:44 +02:00

    # if [[ ${mxnet_variant} = "cpu" ]] || [[ ${mxnet_variant} = "mkl" ]]; then

2020-05-16 19:04:44 -07:00

        MXNET_GPU_MEM_POOL_TYPE=Unpooled \

Disable test coverage in MKL builds (#18443) * Disable test coverage in MKL builds * Enable test parallelization * Set OMP_NUM_THREADS * Fix * Fix unpack_and_init

2020-07-15 00:57:38 +00:00

            OMP_NUM_THREADS=$(expr $(nproc) / 4) pytest -m 'not serial' -k 'test_operator' -n 4 --durations=50 --verbose tests/python/gpu

[CI] run operator tests with naive engine (#18252) * run operator tests with naive engine * fix take tests * update skip mark * fix cuda error reset * adjust tests * disable parallel testing and naive engine for mkl/mkldnn #18244

2020-05-16 19:04:44 -07:00

        MXNET_GPU_MEM_POOL_TYPE=Unpooled \

Add AMP patching of npi ops in _api_internal module (#19488)

2020-11-19 15:58:32 -08:00

            OMP_NUM_THREADS=$(expr $(nproc) / 4) pytest -m 'not serial' -k 'not test_operator and not test_amp_init.py' -n 4 --durations=50 --verbose tests/python/gpu

[CI] run pytest in parallel (#18146) * run pytest in parallel * disable memory pool * address flaky ftrl/fm test and layernorm timeout * mark tests as serial * use parametrize in numpy op tests * fix io bugs * fix gluon rnn cell test and doc * replace xfail with raises scope * fix flaky numpy, mkldnn quantize, and rnn tests * fix tempfile/dir usage

2020-05-04 16:44:27 -07:00

        pytest -m 'serial' --durations=50 --verbose tests/python/gpu

Add AMP patching of npi ops in _api_internal module (#19488)

2020-11-19 15:58:32 -08:00

        pytest --durations=50 --verbose tests/python/gpu/test_amp_init.py

Fix Nightly CD for GPU builds and switch CD to use cmake builds (#18205) * use cmake for cd static build, skip running kvstore tests * update dnnl headers stash location * remove unnecessary platform condition * remove 7.5 arch for cu100, cu101, cu102 Co-authored-by: Ubuntu <ubuntu@ip-172-31-3-62.us-west-2.compute.internal>

2019-05-23 12:48:44 +02:00

2020-04-30 16:02:55 -07:00

        # TODO(szha): fix and reenable the hanging issue. tracked in #18098

revert changes causing cd failures (#18533) Reverting the following changes to cd_unittest_ubuntu causing CD pipeline failures: The first change was using Naive Engine for operator tests, which causes timeout failures in CD Added here: 10b6b48 Second change was running integrationtest_ubuntu_gpu_byteps as part of cu* CD tests, added here: e28e9fe

2020-06-11 09:17:44 -07:00

        # TODO(eric-haibin-lin): fix and reenable

[submodule] Remove soon to be obsolete dnnl nomenclature from mxnet (#20606) * Tests directory * Docs * subgraph * src/operator/subgraph/dnnl/dnnl_subgraph_property.cc * src/operator/nn/mkldnn -> src/operator/nn/dnnl * src/operator/nn * src/operator/quantization * src/operator/tensor * Other files * mkl_mem -> dnnl_mem * Fix sanity * Fix miscellaneous * Apply clang * Fix license * Fix linkcheck * Remove unnecessary onednn header files * review changes * fix sanity * dnnl -> oneDNN/ONEDNN/onednn for functions/variables visible from outside * fix quantization.py * Fix contributors * Apply clang-format

2019-05-23 12:48:44 +02:00

fi

[DEV] switch nose with pytest (#18025) * switch nose with pytest * switch centos python to 3.6 * disable dist kvstore tests * skip hanging test

2020-04-22 23:53:12 -07:00

    if [[ ${mxnet_variant} = *mkl ]]; then

2021-10-13 22:48:10 +02:00

        OMP_NUM_THREADS=$(expr $(nproc) / 4) pytest -n 4 --durations=50 --verbose tests/python/dnnl

[ONNX] Foward port new mx2onnx into master (#20355) * initial: forward port mx2onnx and remove onnx2mx * fix sanity * add onnx operator unit tests * add test file * add model test * fix license & doc * fix * marching toward 2.0 * fix typo * add more ops * more ops * more ops * more ops * fix softmax and sanity * more ops * more ops * more ops * naming * more ops * more ops * more ops and bug fix * more ops and skip unvisited tests * fix sanity * fix for onnx18 * more ops * fix * fix onnx 18 * more ops * skip model test * update read me * more ops * more ops * more ops * more ops * more ops * Update test_models.py

2019-05-23 12:48:44 +02:00

fi

2021-07-16 14:30:24 -07:00

unittest_ubuntu_python3_cpu_onnx() {

CI docker revamp; Add Jetson, Raspberry and CentOS 7 build [MXNET-42][MXNET-43] (#9995) * Start dockerfile revamp * Fix lint * Fix lint * Fix lint * Add python unit test * Add new dependency * Add user creation * Fix file permission * Determine USER_ID automatically * REmove ENV command in dockerfile * Remove python nose timer, improve useradd * ENable nvidia docker * Add remaining unittests * Add CentOS 7 unittests * Add integration tests * Add TVM and LLVM to dockerfile * Add ARMv7 and ARMv8 to build * Fix amalgamation build * Improvements and android_arm64 fixes, missing removing pthread * Jetson fix (install unzip) * Build jetson with make until issue with libomp.so is resolved #10011 * Fix Amalgamation builds * Fix Jetson build by switching to cmake * Assign CentOS gpu test to gpu instance * Fix R builds * Assign jobs to right docker containers * Fix missing file permissions inside docker * Create homedir on centos * Enable lapack * Fix Lapack on Cent OS 7 * Disable MXNET_MKLDNN_DEBUG * Delete Dockerfiles * Last general refinements before finish * Remove docker_multiarch folder, superseeded by the new ci scripts * Address review comments * Fix Caffe Integrationstest * Fix deploy stage * Address review comments * Address review comments * Enable script to run on Mac

2018-03-09 23:33:10 +01:00

unittest_ubuntu_python3_cpu() {

[MXNET-247] Always build profiler (#10308) * Always build profiler * Update naive_engine.cc * remove PROFILE_MESSAGE macro * Remove USE_PROFILER=1 from CI runs

2018-04-01 16:19:45 -07:00

    export PYTHONPATH=./python/

Change MXNET_MKLDNN_DEBUG define name to MXNET_ONEDNN_DEBUG (#20031)

2021-03-16 20:16:53 +01:00

    export MXNET_ONEDNN_DEBUG=0  # Ignored if not present

CI docker revamp; Add Jetson, Raspberry and CentOS 7 build [MXNET-42][MXNET-43] (#9995) * Start dockerfile revamp * Fix lint * Fix lint * Fix lint * Add python unit test * Add new dependency * Add user creation * Fix file permission * Determine USER_ID automatically * REmove ENV command in dockerfile * Remove python nose timer, improve useradd * ENable nvidia docker * Add remaining unittests * Add CentOS 7 unittests * Add integration tests * Add TVM and LLVM to dockerfile * Add ARMv7 and ARMv8 to build * Fix amalgamation build * Improvements and android_arm64 fixes, missing removing pthread * Jetson fix (install unzip) * Build jetson with make until issue with libomp.so is resolved #10011 * Fix Amalgamation builds * Fix Jetson build by switching to cmake * Assign CentOS gpu test to gpu instance * Fix R builds * Assign jobs to right docker containers * Fix missing file permissions inside docker * Create homedir on centos * Enable lapack * Fix Lapack on Cent OS 7 * Disable MXNET_MKLDNN_DEBUG * Delete Dockerfiles * Last general refinements before finish * Remove docker_multiarch folder, superseeded by the new ci scripts * Address review comments * Fix Caffe Integrationstest * Fix deploy stage * Address review comments * Address review comments * Enable script to run on Mac

2018-03-09 23:33:10 +01:00

    export MXNET_STORAGE_FALLBACK_LOG_VERBOSE=0

Surpress subgraph log in CI (#16607) Change-Id: Ia2ed6fdbb1d2cb5cc607a8856ca13ee338e27eac

2019-10-24 04:47:01 -05:00

    export MXNET_SUBGRAPH_VERBOSE=0

[MXNET-545] Fix broken cython build (#10951) * Fix broken build with cython 0.28 * Fix setup.py to be compatible with cython 0.28 * Fix broken cython ndarray module * Revised comments * Replace hard coded library path with one obtained by find_lib_path * Add documentation for MXNET_ENABLE_CYTHON and MXNET_ENFORCE_CYTHON * Add cython build to CI * Fix for cython CI * Adjust python environment for cython CI * Add make variables to set python executable * Fix typo * Fix nnvm include path * Does not use ccache for cython * Fix issues with the wildcards in the library list in Jenkinsfile * Fix issues with the wildcards in the library list in Jenkinsfile (continued) * Fix issues with the wildcards in the library list in Jenkinsfile (continued) * Intentionally introduce a bug to check that the tests actually runwith cython * Remove the intentionally introduced bug * Update installation doc * Retrigger CI * Run cython CI in ubuntu environment instead of CentOS environment * Commit a missed file * Fix a bug in check_cython * Refine environments for cython CI * Restore unrelated changes * Fix a problem occurring when the cython modules for python 2 and 3 are built successively * Trigger CI * Pin the cython version in the CI * Catch up #11320 * Remove optional arguments unused after #11320 * Trigger CI * Trigger CI * Trigger CI * Remove unnecessary stype argument from the NDArrayBase constructor * Revise confusing initialization of `_ndarray_cls` * Add cython build for python3 in CI * Fix misplaced cython build in CI * Adjust CI environments for cython * Fix invalid path for cython generated .so files in cmake build * Revert invalid fix * Revise docs * Revise check_cython * Temporaily use make instead of cmake for debugging * Temporal changes for debugging * Temporal changes for debugging * Temporaily use ctypes instead of cython modules for debugging * Temporaily disable ccache for debugging * Temporaily use make (DEV = 0) instead of cmake for debugging * Temporaily disable 
cudnn for debugging * Restore temporal changes * Temporarily disable coverage report * Adapt to Jenkinsfile_utils * Adapt to Jenkinsfile_utils (cont.) * Restore unrelated changes * Restore temporal changes * Resolving conflict * Test with the cmake build is removed * Add MXNET_ENABLE_CYTHON=0 to tensorrt test * Fix typo * Trigger CI * Trigger CI * Adapt to Jenkinsfile refactoring * Adapt to Jenkinsfile refactoring (cont.) * Trigger CI * Trigger CI * Stash missing cython modules * Trigger CI * CMake build of cython modules without unit tests * Fix typo * Trigger CI * Fix a mistake introduced in merging process * trigger test * Update Jenkinsfile_utils.groovy * Trigger CI * Trigger CI * Trigger CI * Trigger tests * Trigger tests * Trigger tests * Trigger tests * Trigger tests * Trigger tests * Trigger tests * Trigger tests

2019-05-25 07:36:30 +09:00

    export MXNET_ENABLE_CYTHON=0

[DOC] Fix warnings in tutorials and turn on -W (#19624)

2020-12-08 11:13:09 -05:00

    export DMLC_LOG_STACK_TRACE_DEPTH=100

Disable test coverage in MKL builds (#18443) * Disable test coverage in MKL builds * Enable test parallelization * Set OMP_NUM_THREADS * Fix * Fix unpack_and_init

2020-07-15 00:57:38 +00:00

    OMP_NUM_THREADS=$(expr $(nproc) / 4) pytest -m 'not serial' -k 'not test_operator' -n 4 --durations=50 --cov-report xml:tests_unittest.xml --verbose tests/python/unittest

[CI] run operator tests with naive engine (#18252) * run operator tests with naive engine * fix take tests * update skip mark * fix cuda error reset * adjust tests * disable parallel testing and naive engine for mkl/mkldnn #18244

2020-05-16 19:04:44 -07:00

    MXNET_ENGINE_TYPE=NaiveEngine \

Disable test coverage in MKL builds (#18443) * Disable test coverage in MKL builds * Enable test parallelization * Set OMP_NUM_THREADS * Fix * Fix unpack_and_init

2020-07-15 00:57:38 +00:00

        OMP_NUM_THREADS=$(expr $(nproc) / 4) pytest -m 'not serial' -k 'test_operator' -n 4 --durations=50 --cov-report xml:tests_unittest.xml --cov-append --verbose tests/python/unittest

[CI] run pytest in parallel (#18146) * run pytest in parallel * disable memory pool * address flaky ftrl/fm test and layernorm timeout * mark tests as serial * use parametrize in numpy op tests * fix io bugs * fix gluon rnn cell test and doc * replace xfail with raises scope * fix flaky numpy, mkldnn quantize, and rnn tests * fix tempfile/dir usage

2020-05-04 16:44:27 -07:00

    pytest -m 'serial' --durations=50 --cov-report xml:tests_unittest.xml --cov-append --verbose tests/python/unittest

CI docker revamp; Add Jetson, Raspberry and CentOS 7 build [MXNET-42][MXNET-43] (#9995) * Start dockerfile revamp * Fix lint * Fix lint * Fix lint * Add python unit test * Add new dependency * Add user creation * Fix file permission * Determine USER_ID automatically * REmove ENV command in dockerfile * Remove python nose timer, improve useradd * ENable nvidia docker * Add remaining unittests * Add CentOS 7 unittests * Add integration tests * Add TVM and LLVM to dockerfile * Add ARMv7 and ARMv8 to build * Fix amalgamation build * Improvements and android_arm64 fixes, missing removing pthread * Jetson fix (install unzip) * Build jetson with make until issue with libomp.so is resolved #10011 * Fix Amalgamation builds * Fix Jetson build by switching to cmake * Assign CentOS gpu test to gpu instance * Fix R builds * Assign jobs to right docker containers * Fix missing file permissions inside docker * Create homedir on centos * Enable lapack * Fix Lapack on Cent OS 7 * Disable MXNET_MKLDNN_DEBUG * Delete Dockerfiles * Last general refinements before finish * Remove docker_multiarch folder, superseeded by the new ci scripts * Address review comments * Fix Caffe Integrationstest * Fix deploy stage * Address review comments * Address review comments * Enable script to run on Mac

2018-03-09 23:33:10 +01:00

Change *_mkldnn* test and build scenarios names to *_onednn* (#20034)

2021-03-24 15:15:32 +01:00

unittest_ubuntu_python3_cpu_onednn() {

[MXNET-33] SSD example not working with mkl-dnn (#10021) * use mkl-dnn for 'valid' pooling_convention only * pooling convention full not supported by current mkl-dnn impl * disable unreachable code * add sample model test for mkldnn * fix review feedback * add jira link to comment * fix lint issue * rename python test for mkl * enable python tests for mkldnn in CI * use vgg16 with convention full * fix unittest

2018-04-24 10:48:01 -07:00

    set -ex

[MXNET-290] MKLDNN support for model quantization (#10433) * mkldnn support for quantization * fix output number in graph * update licsence * modify Jenkinsfile * modify Jenkinsfile * mkldnn has no int8 fc api, excluded_sym_names includes fc for cpu * add mkldnn uint8 pass for quantization graph * update ut * retrig ic * remove no mkldnn quantization test temp * seperate mkldnn quantization ut from gpu quantization ut * rm dev_id check for cpu * add mkl tests dictionary * resolve review comments * simplify DequantizeStorageType() logic * simplify quantize/quantized_conv storage type logic * Add mkldnn_OIhw4i16o4i type case (needed by int8) * INT8 conv/pooling: share with FP32 convolution/pooling class/function * minor indent changes * Remove unnecessary mkldnn_quantized_pooling-inl.h * Fix minor issue * Fix lint * delete duplicated data type * fix bugs and convert requantize data to NDArray * fix lint * fix requantize storgetype * fix requantize storge type * Fix coding style comments * Fix compile issue * Change to use quantized_dtype option to support uint8/int8 scenarios * fix gpu test quantization failure * Fix indent * fix quantized pooling param parser * Fix imagenet_gen_qsym.py option style * retrigger jenkins * retrigger again * trigger jenkins * Resolve further comments * share test code * remove unnecessary test code * add test_quantize_model for cpu * add comments in quantize_graph_pass.cc * jenkins * jenkins * improve coding style * improve coding style * Add naive CPU quantization test back and share quantization code between naive-CPU/MKLDNN/GPU * rename test_quantization_cpu.py to test_quantization_mkldnn.py * code style * trigger * Adjust variable naming for test quantization * add qdtype for quantized op test case to test/bypass all cases explicitly * change expressions to be consistent * revert unnecessary change

2018-06-14 12:58:33 +08:00

    export PYTHONPATH=./python/

Change MXNET_MKLDNN_DEBUG define name to MXNET_ONEDNN_DEBUG (#20031)

2021-03-16 20:16:53 +01:00

    export MXNET_ONEDNN_DEBUG=0  # Ignored if not present

[MXNET-33] SSD example not working with mkl-dnn (#10021) * use mkl-dnn for 'valid' pooling_convention only * pooling convention full not supported by current mkl-dnn impl * disable unreachable code * add sample model test for mkldnn * fix review feedback * add jira link to comment * fix lint issue * rename python test for mkl * enable python tests for mkldnn in CI * use vgg16 with convention full * fix unittest

2018-04-24 10:48:01 -07:00

    export MXNET_STORAGE_FALLBACK_LOG_VERBOSE=0

Surpress subgraph log in CI (#16607) Change-Id: Ia2ed6fdbb1d2cb5cc607a8856ca13ee338e27eac

2019-10-24 04:47:01 -05:00

    export MXNET_SUBGRAPH_VERBOSE=0

[MXNET-545] Fix broken cython build (#10951) * Fix broken build with cython 0.28 * Fix setup.py to be compatible with cython 0.28 * Fix broken cython ndarray module * Revised comments * Replace hard coded library path with one obtained by find_lib_path * Add documentation for MXNET_ENABLE_CYTHON and MXNET_ENFORCE_CYTHON * Add cython build to CI * Fix for cython CI * Adjust python environment for cython CI * Add make variables to set python executable * Fix typo * Fix nnvm include path * Does not use ccache for cython * Fix issues with the wildcards in the library list in Jenkinsfile * Fix issues with the wildcards in the library list in Jenkinsfile (continued) * Fix issues with the wildcards in the library list in Jenkinsfile (continued) * Intentionally introduce a bug to check that the tests actually runwith cython * Remove the intentionally introduced bug * Update installation doc * Retrigger CI * Run cython CI in ubuntu environment instead of CentOS environment * Commit a missed file * Fix a bug in check_cython * Refine environments for cython CI * Restore unrelated changes * Fix a problem occurring when the cython modules for python 2 and 3 are built successively * Trigger CI * Pin the cython version in the CI * Catch up #11320 * Remove optional arguments unused after #11320 * Trigger CI * Trigger CI * Trigger CI * Remove unnecessary stype argument from the NDArrayBase constructor * Revise confusing initialization of `_ndarray_cls` * Add cython build for python3 in CI * Fix misplaced cython build in CI * Adjust CI environments for cython * Fix invalid path for cython generated .so files in cmake build * Revert invalid fix * Revise docs * Revise check_cython * Temporaily use make instead of cmake for debugging * Temporal changes for debugging * Temporal changes for debugging * Temporaily use ctypes instead of cython modules for debugging * Temporaily disable ccache for debugging * Temporaily use make (DEV = 0) instead of cmake for debugging * Temporaily disable 
cudnn for debugging * Restore temporal changes * Temporarily disable coverage report * Adapt to Jenkinsfile_utils * Adapt to Jenkinsfile_utils (cont.) * Restore unrelated changes * Restore temporal changes * Resolving conflict * Test with the cmake build is removed * Add MXNET_ENABLE_CYTHON=0 to tensorrt test * Fix typo * Trigger CI * Trigger CI * Adapt to Jenkinsfile refactoring * Adapt to Jenkinsfile refactoring (cont.) * Trigger CI * Trigger CI * Stash missing cython modules * Trigger CI * CMake build of cython modules without unit tests * Fix typo * Trigger CI * Fix a mistake introduced in merging process * trigger test * Update Jenkinsfile_utils.groovy * Trigger CI * Trigger CI * Trigger CI * Trigger tests * Trigger tests * Trigger tests * Trigger tests * Trigger tests * Trigger tests * Trigger tests * Trigger tests

2019-05-25 07:36:30 +09:00

    export MXNET_ENABLE_CYTHON=0

[DOC] Fix warnings in tutorials and turn on -W (#19624)

2020-12-08 11:13:09 -05:00

    export DMLC_LOG_STACK_TRACE_DEPTH=100

Disable test coverage in MKL builds (#18443) * Disable test coverage in MKL builds * Enable test parallelization * Set OMP_NUM_THREADS * Fix * Fix unpack_and_init

2020-07-15 00:57:38 +00:00

    OMP_NUM_THREADS=$(expr $(nproc) / 4) pytest -m 'not serial' -k 'not test_operator' -n 4 --durations=50 --cov-report xml:tests_unittest.xml --verbose tests/python/unittest

[submodule] Remove soon to be obsolete dnnl nomenclature from mxnet (#20606) * Tests directory * Docs * subgraph * src/operator/subgraph/dnnl/dnnl_subgraph_property.cc * src/operator/nn/mkldnn -> src/operator/nn/dnnl * src/operator/nn * src/operator/quantization * src/operator/tensor * Other files * mkl_mem -> dnnl_mem * Fix sanity * Fix miscellaneous * Apply clang * Fix license * Fix linkcheck * Remove unnecessary onednn header files * review changes * fix sanity * dnnl -> oneDNN/ONEDNN/onednn for functions/variables visible from outside * fix quantization.py * Fix contributors * Apply clang-format

2021-10-13 22:48:10 +02:00

								    pytest --durations=50 --cov-report xml:tests_mkl.xml --verbose tests/python/dnnl

							

[MXNET-33] SSD example not working with mkl-dnn (#10021) * use mkl-dnn for 'valid' pooling_convention only * pooling convention full not supported by current mkl-dnn impl * disable unreachable code * add sample model test for mkldnn * fix review feedback * add jira link to comment * fix lint issue * rename python test for mkl * enable python tests for mkldnn in CI * use vgg16 with convention full * fix unittest

2018-04-24 10:48:01 -07:00

[API] Extend NumPy Array dtypes with int16, uint16, uint32, uint64 (#20478) * extend dtypes with int16, uint16, uint32, uint64 * update operator_tune.cc * add test suite * fix sanity * update test * update ci * fix * fix * fix Uint32 * extend dtypes support to tvmop

2021-09-09 18:17:58 -07:00

								unittest_array_api_standardization() {

							

[v2.0.0.beta0] License Update: **/*.md **/*.ipynb (#20628) * [v2.0.0.beta0] License Update: **/*.md **/*.ipynb * update * update config * fix array-api-test version

2021-10-04 08:49:55 -07:00

    git checkout c1dba80a196a03f880d2e0a998a272fb3867b720

[API] Extend NumPy Array dtypes with int16, uint16, uint32, uint64 (#20478) * extend dtypes with int16, uint16, uint32, uint64 * update operator_tune.cc * add test suite * fix sanity * update test * update ci * fix * fix * fix Uint32 * extend dtypes support to tvmop

2021-09-09 18:17:58 -07:00

								    export ARRAY_API_TESTS_MODULE=mxnet.numpy

							

[BUGFIX] Fix mxnet.numpy.eye() handling of extreme k parameter values (#20965) * Fix mxnet.numpy.eye() handling of extreme k parameter values * Fix broken web link

2022-03-17 16:15:53 -07:00

								    export MXNET_ENABLE_CYTHON=1

							

[API] Extend NumPy Array dtypes with int16, uint16, uint32, uint64 (#20478) * extend dtypes with int16, uint16, uint32, uint64 * update operator_tune.cc * add test suite * fix sanity * update test * update ci * fix * fix * fix Uint32 * extend dtypes support to tvmop

2021-09-09 18:17:58 -07:00

								    export DMLC_LOG_STACK_TRACE_DEPTH=100

							

[API TESTS] Standardization and add more array api tests (#20725) * [API] Standardize and add more array api tests * fix lint * fix lint * fix * fix build * fix lint * update * fix * fix lint * fix * update remainder * fix lint * switch to no tvmop * fix tests * fix elemwise binary * update asarray * Revert "update asarray" This reverts commit 3a11d157d007da1ae8057f9a867bc9f0fb221ef2. * fix precision * fix precision * fix * fix floating point exception * fix floor_divide * fix dtype_from_number * fix asarray * fix asarray docstring * merge data type functions * add un-func standard tests * support multiple dtypes in gpu copy * add type_result tests * add binary tests * fix lint * update * update rtol, atol * update rtc types * fix floor,ceil,trunc * update rtc type promotion * update tests * update mod * fix lint

2021-11-20 18:58:56 -08:00

								    python3 -m pytest --reruns 3 --durations=50 --cov-report xml:tests_api.xml --verbose array_api_tests/test_creation_functions.py

							

[API] Extend NumPy Array dtypes with int16, uint16, uint32, uint64 (#20478) * extend dtypes with int16, uint16, uint32, uint64 * update operator_tune.cc * add test suite * fix sanity * update test * update ci * fix * fix * fix Uint32 * extend dtypes support to tvmop

2021-09-09 18:17:58 -07:00

								        array_api_tests/test_type_promotion.py::test_elementwise_function_two_arg_bool_type_promotion

							

[API TESTS] Standardization and add more array api tests (#20725) * [API] Standardize and add more array api tests * fix lint * fix lint * fix * fix build * fix lint * update * fix * fix lint * fix * update remainder * fix lint * switch to no tvmop * fix tests * fix elemwise binary * update asarray * Revert "update asarray" This reverts commit 3a11d157d007da1ae8057f9a867bc9f0fb221ef2. * fix precision * fix precision * fix * fix floating point exception * fix floor_divide * fix dtype_from_number * fix asarray * fix asarray docstring * merge data type functions * add un-func standard tests * support multiple dtypes in gpu copy * add type_result tests * add binary tests * fix lint * update * update rtol, atol * update rtc types * fix floor,ceil,trunc * update rtc type promotion * update tests * update mod * fix lint

2021-11-20 18:58:56 -08:00

								    python3 -m pytest --reruns 3 --durations=50 --cov-report xml:tests_api.xml --verbose \

							

[API] Extend NumPy Array dtypes with int16, uint16, uint32, uint64 (#20478) * extend dtypes with int16, uint16, uint32, uint64 * update operator_tune.cc * add test suite * fix sanity * update test * update ci * fix * fix * fix Uint32 * extend dtypes support to tvmop

2021-09-09 18:17:58 -07:00

    popd

CI docker revamp; Add Jetson, Raspberry and CentOS 7 build [MXNET-42][MXNET-43] (#9995) * Start dockerfile revamp * Fix lint * Fix lint * Fix lint * Add python unit test * Add new dependency * Add user creation * Fix file permission * Determine USER_ID automatically * REmove ENV command in dockerfile * Remove python nose timer, improve useradd * ENable nvidia docker * Add remaining unittests * Add CentOS 7 unittests * Add integration tests * Add TVM and LLVM to dockerfile * Add ARMv7 and ARMv8 to build * Fix amalgamation build * Improvements and android_arm64 fixes, missing removing pthread * Jetson fix (install unzip) * Build jetson with make until issue with libomp.so is resolved #10011 * Fix Amalgamation builds * Fix Jetson build by switching to cmake * Assign CentOS gpu test to gpu instance * Fix R builds * Assign jobs to right docker containers * Fix missing file permissions inside docker * Create homedir on centos * Enable lapack * Fix Lapack on Cent OS 7 * Disable MXNET_MKLDNN_DEBUG * Delete Dockerfiles * Last general refinements before finish * Remove docker_multiarch folder, superseeded by the new ci scripts * Address review comments * Fix Caffe Integrationstest * Fix deploy stage * Address review comments * Address review comments * Enable script to run on Mac

2018-03-09 23:33:10 +01:00

								unittest_ubuntu_python3_gpu() {

							

[MXNET-247] Always build profiler (#10308) * Always build profiler * Update naive_engine.cc * remove PROFILE_MESSAGE macro * Remove USE_PROFILER=1 from CI runs

2018-04-01 16:19:45 -07:00

								    export PYTHONPATH=./python/

							

Change MXNET_MKLDNN_DEBUG define name to MXNET_ONEDNN_DEBUG (#20031)

2021-03-16 20:16:53 +01:00

								    export MXNET_ONEDNN_DEBUG=0 # Ignored if not present

							

CI docker revamp; Add Jetson, Raspberry and CentOS 7 build [MXNET-42][MXNET-43] (#9995) * Start dockerfile revamp * Fix lint * Fix lint * Fix lint * Add python unit test * Add new dependency * Add user creation * Fix file permission * Determine USER_ID automatically * REmove ENV command in dockerfile * Remove python nose timer, improve useradd * ENable nvidia docker * Add remaining unittests * Add CentOS 7 unittests * Add integration tests * Add TVM and LLVM to dockerfile * Add ARMv7 and ARMv8 to build * Fix amalgamation build * Improvements and android_arm64 fixes, missing removing pthread * Jetson fix (install unzip) * Build jetson with make until issue with libomp.so is resolved #10011 * Fix Amalgamation builds * Fix Jetson build by switching to cmake * Assign CentOS gpu test to gpu instance * Fix R builds * Assign jobs to right docker containers * Fix missing file permissions inside docker * Create homedir on centos * Enable lapack * Fix Lapack on Cent OS 7 * Disable MXNET_MKLDNN_DEBUG * Delete Dockerfiles * Last general refinements before finish * Remove docker_multiarch folder, superseeded by the new ci scripts * Address review comments * Fix Caffe Integrationstest * Fix deploy stage * Address review comments * Address review comments * Enable script to run on Mac

2018-03-09 23:33:10 +01:00

								    export MXNET_STORAGE_FALLBACK_LOG_VERBOSE=0

							

Surpress subgraph log in CI (#16607) Change-Id: Ia2ed6fdbb1d2cb5cc607a8856ca13ee338e27eac

2019-10-24 04:47:01 -05:00

								    export MXNET_SUBGRAPH_VERBOSE=0

							

Updates gpu tests to use CUDNN_VERSION supplied by the environment but default to 7.0.3 if not set (#14595)

2019-04-03 02:49:36 +02:00

								    export CUDNN_VERSION=${CUDNN_VERSION:-7.0.3}
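The `${CUDNN_VERSION:-7.0.3}` form on the line above is ordinary shell parameter expansion: a value supplied by the environment is kept, and `7.0.3` is substituted only when the variable is unset or empty. The following is a minimal sketch of that behaviour. The helper name `resolve_cudnn_version` and the override value `8.2.0` are hypothetical and chosen only for illustration; neither appears in the script.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of runtime_functions.sh): prints the cuDNN
# version the tests would see, falling back to 7.0.3 when CUDNN_VERSION is
# unset or empty.
resolve_cudnn_version() {
    echo "${CUDNN_VERSION:-7.0.3}"
}

# Case 1: nothing supplied by the environment, so the fallback applies.
unset CUDNN_VERSION
default_version=$(resolve_cudnn_version)

# Case 2: a value is supplied (illustrative number), so it wins.
CUDNN_VERSION=8.2.0
override_version=$(resolve_cudnn_version)

echo "default=${default_version} override=${override_version}"
```

Because the default is applied at the point of `export`, a CI job can pin a different version simply by setting `CUDNN_VERSION` before the function runs.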

							

[MXNET-545] Fix broken cython build (#10951) * Fix broken build with cython 0.28 * Fix setup.py to be compatible with cython 0.28 * Fix broken cython ndarray module * Revised comments * Replace hard coded library path with one obtained by find_lib_path * Add documentation for MXNET_ENABLE_CYTHON and MXNET_ENFORCE_CYTHON * Add cython build to CI * Fix for cython CI * Adjust python environment for cython CI * Add make variables to set python executable * Fix typo * Fix nnvm include path * Does not use ccache for cython * Fix issues with the wildcards in the library list in Jenkinsfile * Fix issues with the wildcards in the library list in Jenkinsfile (continued) * Fix issues with the wildcards in the library list in Jenkinsfile (continued) * Intentionally introduce a bug to check that the tests actually runwith cython * Remove the intentionally introduced bug * Update installation doc * Retrigger CI * Run cython CI in ubuntu environment instead of CentOS environment * Commit a missed file * Fix a bug in check_cython * Refine environments for cython CI * Restore unrelated changes * Fix a problem occurring when the cython modules for python 2 and 3 are built successively * Trigger CI * Pin the cython version in the CI * Catch up #11320 * Remove optional arguments unused after #11320 * Trigger CI * Trigger CI * Trigger CI * Remove unnecessary stype argument from the NDArrayBase constructor * Revise confusing initialization of `_ndarray_cls` * Add cython build for python3 in CI * Fix misplaced cython build in CI * Adjust CI environments for cython * Fix invalid path for cython generated .so files in cmake build * Revert invalid fix * Revise docs * Revise check_cython * Temporaily use make instead of cmake for debugging * Temporal changes for debugging * Temporal changes for debugging * Temporaily use ctypes instead of cython modules for debugging * Temporaily disable ccache for debugging * Temporaily use make (DEV = 0) instead of cmake for debugging * Temporaily disable 
cudnn for debugging * Restore temporal changes * Temporarily disable coverage report * Adapt to Jenkinsfile_utils * Adapt to Jenkinsfile_utils (cont.) * Restore unrelated changes * Restore temporal changes * Resolving conflict * Test with the cmake build is removed * Add MXNET_ENABLE_CYTHON=0 to tensorrt test * Fix typo * Trigger CI * Trigger CI * Adapt to Jenkinsfile refactoring * Adapt to Jenkinsfile refactoring (cont.) * Trigger CI * Trigger CI * Stash missing cython modules * Trigger CI * CMake build of cython modules without unit tests * Fix typo * Trigger CI * Fix a mistake introduced in merging process * trigger test * Update Jenkinsfile_utils.groovy * Trigger CI * Trigger CI * Trigger CI * Trigger tests * Trigger tests * Trigger tests * Trigger tests * Trigger tests * Trigger tests * Trigger tests * Trigger tests

2019-05-25 07:36:30 +09:00

								    export MXNET_ENABLE_CYTHON=0

							

[DOC] Fix warnings in tutorials and turn on -W (#19624)

2020-12-08 11:13:09 -05:00

								    export DMLC_LOG_STACK_TRACE_DEPTH=100

							

[CI] run operator tests with naive engine (#18252) * run operator tests with naive engine * fix take tests * update skip mark * fix cuda error reset * adjust tests * disable parallel testing and naive engine for mkl/mkldnn #18244

2020-05-16 19:04:44 -07:00

								    MXNET_GPU_MEM_POOL_TYPE=Unpooled \

							

Add AMP patching of npi ops in _api_internal module (#19488)

2020-11-19 15:58:32 -08:00

								        OMP_NUM_THREADS=$(expr $(nproc) / 4) pytest -m 'not serial' -k 'not test_operator and not test_amp_init.py' -n 4 --durations=50 --cov-report xml:tests_gpu.xml --verbose tests/python/gpu

							

[CI] run operator tests with naive engine (#18252) * run operator tests with naive engine * fix take tests * update skip mark * fix cuda error reset * adjust tests * disable parallel testing and naive engine for mkl/mkldnn #18244

2020-05-16 19:04:44 -07:00

								    MXNET_GPU_MEM_POOL_TYPE=Unpooled \

							

Disable test coverage in MKL builds (#18443) * Disable test coverage in MKL builds * Enable test parallelization * Set OMP_NUM_THREADS * Fix * Fix unpack_and_init

2020-07-15 00:57:38 +00:00

								        OMP_NUM_THREADS=$(expr $(nproc) / 4) pytest -m 'not serial' -k 'test_operator' -n 4 --durations=50 --cov-report xml:tests_gpu.xml --cov-append --verbose tests/python/gpu

							

[CI] run pytest in parallel (#18146) * run pytest in parallel * disable memory pool * address flaky ftrl/fm test and layernorm timeout * mark tests as serial * use parametrize in numpy op tests * fix io bugs * fix gluon rnn cell test and doc * replace xfail with raises scope * fix flaky numpy, mkldnn quantize, and rnn tests * fix tempfile/dir usage

2020-05-04 16:44:27 -07:00

								    pytest -m 'serial' --durations=50 --cov-report xml:tests_gpu.xml --cov-append --verbose tests/python/gpu

							

Add AMP patching of npi ops in _api_internal module (#19488)

2020-11-19 15:58:32 -08:00

								    pytest --durations=50 --cov-report xml:tests_gpu.xml --cov-append --verbose tests/python/gpu/test_amp_init.py
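The parallel invocations above combine `-n 4` (four pytest-xdist workers) with `OMP_NUM_THREADS=$(expr $(nproc) / 4)`, which appears intended to split the available cores evenly among the workers. One thing to watch: `expr` performs integer division, so on a machine with fewer than four cores the result is 0. The sketch below shows one way to clamp the value to at least 1. The helper `omp_threads_per_worker` is hypothetical and is not part of the script.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of runtime_functions.sh): divide the available
# cores among a number of test workers, never returning less than 1.
omp_threads_per_worker() {
    local cores=$1
    local workers=$2
    local threads=$(( cores / workers ))   # integer division, as with expr
    if [ "$threads" -lt 1 ]; then
        threads=1
    fi
    echo "$threads"
}

# Illustrative inputs: a 16-core host and a 2-core host, four workers each.
echo "16 cores, 4 workers: $(omp_threads_per_worker 16 4)"
echo "2 cores, 4 workers: $(omp_threads_per_worker 2 4)"
```

In the script's own style this could be used as `OMP_NUM_THREADS=$(omp_threads_per_worker "$(nproc)" 4) pytest -n 4 ...`, assuming the helper is defined earlier in the same file.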

							

[MXNET-545] Fix broken cython build (#10951) * Fix broken build with cython 0.28 * Fix setup.py to be compatible with cython 0.28 * Fix broken cython ndarray module * Revised comments * Replace hard coded library path with one obtained by find_lib_path * Add documentation for MXNET_ENABLE_CYTHON and MXNET_ENFORCE_CYTHON * Add cython build to CI * Fix for cython CI * Adjust python environment for cython CI * Add make variables to set python executable * Fix typo * Fix nnvm include path * Does not use ccache for cython * Fix issues with the wildcards in the library list in Jenkinsfile * Fix issues with the wildcards in the library list in Jenkinsfile (continued) * Fix issues with the wildcards in the library list in Jenkinsfile (continued) * Intentionally introduce a bug to check that the tests actually runwith cython * Remove the intentionally introduced bug * Update installation doc * Retrigger CI * Run cython CI in ubuntu environment instead of CentOS environment * Commit a missed file * Fix a bug in check_cython * Refine environments for cython CI * Restore unrelated changes * Fix a problem occurring when the cython modules for python 2 and 3 are built successively * Trigger CI * Pin the cython version in the CI * Catch up #11320 * Remove optional arguments unused after #11320 * Trigger CI * Trigger CI * Trigger CI * Remove unnecessary stype argument from the NDArrayBase constructor * Revise confusing initialization of `_ndarray_cls` * Add cython build for python3 in CI * Fix misplaced cython build in CI * Adjust CI environments for cython * Fix invalid path for cython generated .so files in cmake build * Revert invalid fix * Revise docs * Revise check_cython * Temporaily use make instead of cmake for debugging * Temporal changes for debugging * Temporal changes for debugging * Temporaily use ctypes instead of cython modules for debugging * Temporaily disable ccache for debugging * Temporaily use make (DEV = 0) instead of cmake for debugging * Temporaily disable 
cudnn for debugging * Restore temporal changes * Temporarily disable coverage report * Adapt to Jenkinsfile_utils * Adapt to Jenkinsfile_utils (cont.) * Restore unrelated changes * Restore temporal changes * Resolving conflict * Test with the cmake build is removed * Add MXNET_ENABLE_CYTHON=0 to tensorrt test * Fix typo * Trigger CI * Trigger CI * Adapt to Jenkinsfile refactoring * Adapt to Jenkinsfile refactoring (cont.) * Trigger CI * Trigger CI * Stash missing cython modules * Trigger CI * CMake build of cython modules without unit tests * Fix typo * Trigger CI * Fix a mistake introduced in merging process * trigger test * Update Jenkinsfile_utils.groovy * Trigger CI * Trigger CI * Trigger CI * Trigger tests * Trigger tests * Trigger tests * Trigger tests * Trigger tests * Trigger tests * Trigger tests * Trigger tests

2019-05-25 07:36:30 +09:00

Change MXNET_MKLDNN_DEBUG define name to MXNET_ONEDNN_DEBUG (#20031)

2021-03-16 20:16:53 +01:00

								    export MXNET_ONEDNN_DEBUG=1 # Ignored if not present

							

[MXNET-545] Fix broken cython build (#10951) * Fix broken build with cython 0.28 * Fix setup.py to be compatible with cython 0.28 * Fix broken cython ndarray module * Revised comments * Replace hard coded library path with one obtained by find_lib_path * Add documentation for MXNET_ENABLE_CYTHON and MXNET_ENFORCE_CYTHON * Add cython build to CI * Fix for cython CI * Adjust python environment for cython CI * Add make variables to set python executable * Fix typo * Fix nnvm include path * Does not use ccache for cython * Fix issues with the wildcards in the library list in Jenkinsfile * Fix issues with the wildcards in the library list in Jenkinsfile (continued) * Fix issues with the wildcards in the library list in Jenkinsfile (continued) * Intentionally introduce a bug to check that the tests actually runwith cython * Remove the intentionally introduced bug * Update installation doc * Retrigger CI * Run cython CI in ubuntu environment instead of CentOS environment * Commit a missed file * Fix a bug in check_cython * Refine environments for cython CI * Restore unrelated changes * Fix a problem occurring when the cython modules for python 2 and 3 are built successively * Trigger CI * Pin the cython version in the CI * Catch up #11320 * Remove optional arguments unused after #11320 * Trigger CI * Trigger CI * Trigger CI * Remove unnecessary stype argument from the NDArrayBase constructor * Revise confusing initialization of `_ndarray_cls` * Add cython build for python3 in CI * Fix misplaced cython build in CI * Adjust CI environments for cython * Fix invalid path for cython generated .so files in cmake build * Revert invalid fix * Revise docs * Revise check_cython * Temporaily use make instead of cmake for debugging * Temporal changes for debugging * Temporal changes for debugging * Temporaily use ctypes instead of cython modules for debugging * Temporaily disable ccache for debugging * Temporaily use make (DEV = 0) instead of cmake for debugging * Temporaily disable 
cudnn for debugging * Restore temporal changes * Temporarily disable coverage report * Adapt to Jenkinsfile_utils * Adapt to Jenkinsfile_utils (cont.) * Restore unrelated changes * Restore temporal changes * Resolving conflict * Test with the cmake build is removed * Add MXNET_ENABLE_CYTHON=0 to tensorrt test * Fix typo * Trigger CI * Trigger CI * Adapt to Jenkinsfile refactoring * Adapt to Jenkinsfile refactoring (cont.) * Trigger CI * Trigger CI * Stash missing cython modules * Trigger CI * CMake build of cython modules without unit tests * Fix typo * Trigger CI * Fix a mistake introduced in merging process * trigger test * Update Jenkinsfile_utils.groovy * Trigger CI * Trigger CI * Trigger CI * Trigger tests * Trigger tests * Trigger tests * Trigger tests * Trigger tests * Trigger tests * Trigger tests * Trigger tests

2019-05-25 07:36:30 +09:00

								    export MXNET_STORAGE_FALLBACK_LOG_VERBOSE=0

							

Surpress subgraph log in CI (#16607) Change-Id: Ia2ed6fdbb1d2cb5cc607a8856ca13ee338e27eac

2019-10-24 04:47:01 -05:00

								    export MXNET_SUBGRAPH_VERBOSE=0

							

[MXNET-545] Fix broken cython build (#10951) * Fix broken build with cython 0.28 * Fix setup.py to be compatible with cython 0.28 * Fix broken cython ndarray module * Revised comments * Replace hard coded library path with one obtained by find_lib_path * Add documentation for MXNET_ENABLE_CYTHON and MXNET_ENFORCE_CYTHON * Add cython build to CI * Fix for cython CI * Adjust python environment for cython CI * Add make variables to set python executable * Fix typo * Fix nnvm include path * Does not use ccache for cython * Fix issues with the wildcards in the library list in Jenkinsfile * Fix issues with the wildcards in the library list in Jenkinsfile (continued) * Fix issues with the wildcards in the library list in Jenkinsfile (continued) * Intentionally introduce a bug to check that the tests actually runwith cython * Remove the intentionally introduced bug * Update installation doc * Retrigger CI * Run cython CI in ubuntu environment instead of CentOS environment * Commit a missed file * Fix a bug in check_cython * Refine environments for cython CI * Restore unrelated changes * Fix a problem occurring when the cython modules for python 2 and 3 are built successively * Trigger CI * Pin the cython version in the CI * Catch up #11320 * Remove optional arguments unused after #11320 * Trigger CI * Trigger CI * Trigger CI * Remove unnecessary stype argument from the NDArrayBase constructor * Revise confusing initialization of `_ndarray_cls` * Add cython build for python3 in CI * Fix misplaced cython build in CI * Adjust CI environments for cython * Fix invalid path for cython generated .so files in cmake build * Revert invalid fix * Revise docs * Revise check_cython * Temporaily use make instead of cmake for debugging * Temporal changes for debugging * Temporal changes for debugging * Temporaily use ctypes instead of cython modules for debugging * Temporaily disable ccache for debugging * Temporaily use make (DEV = 0) instead of cmake for debugging * Temporaily disable 
cudnn for debugging * Restore temporal changes * Temporarily disable coverage report * Adapt to Jenkinsfile_utils * Adapt to Jenkinsfile_utils (cont.) * Restore unrelated changes * Restore temporal changes * Resolving conflict * Test with the cmake build is removed * Add MXNET_ENABLE_CYTHON=0 to tensorrt test * Fix typo * Trigger CI * Trigger CI * Adapt to Jenkinsfile refactoring * Adapt to Jenkinsfile refactoring (cont.) * Trigger CI * Trigger CI * Stash missing cython modules * Trigger CI * CMake build of cython modules without unit tests * Fix typo * Trigger CI * Fix a mistake introduced in merging process * trigger test * Update Jenkinsfile_utils.groovy * Trigger CI * Trigger CI * Trigger CI * Trigger tests * Trigger tests * Trigger tests * Trigger tests * Trigger tests * Trigger tests * Trigger tests * Trigger tests

2019-05-25 07:36:30 +09:00

								    export CUDNN_VERSION=${CUDNN_VERSION:-7.0.3}

							

[DOC] Fix warnings in tutorials and turn on -W (#19624)

2020-12-08 11:13:09 -05:00

								    export DMLC_LOG_STACK_TRACE_DEPTH=100

							

Python 2 cleanup (#17583) * Drop _cy2 * Drop Python 2 specific code in mxnet and tests * Replace io.open with open * Drop from __future__ imports * Fix lint

2020-02-15 01:31:53 +00:00

    check_cython

[CI] run operator tests with naive engine (#18252) * run operator tests with naive engine * fix take tests * update skip mark * fix cuda error reset * adjust tests * disable parallel testing and naive engine for mkl/mkldnn #18244

2020-05-16 19:04:44 -07:00

								    MXNET_GPU_MEM_POOL_TYPE=Unpooled \

							

Add AMP patching of npi ops in _api_internal module (#19488)

2020-11-19 15:58:32 -08:00

								        OMP_NUM_THREADS=$(expr $(nproc) / 4) pytest -m 'not serial' -k 'not test_operator and not test_amp_init.py' -n 4 --durations=50 --cov-report xml:tests_gpu.xml --verbose tests/python/gpu

							

[CI] run operator tests with naive engine (#18252) * run operator tests with naive engine * fix take tests * update skip mark * fix cuda error reset * adjust tests * disable parallel testing and naive engine for mkl/mkldnn #18244

2020-05-16 19:04:44 -07:00

								    MXNET_GPU_MEM_POOL_TYPE=Unpooled \

							

Disable test coverage in MKL builds (#18443) * Disable test coverage in MKL builds * Enable test parallelization * Set OMP_NUM_THREADS * Fix * Fix unpack_and_init

2020-07-15 00:57:38 +00:00

								        OMP_NUM_THREADS=$(expr $(nproc) / 4) pytest -m 'not serial' -k 'test_operator' -n 4 --durations=50 --cov-report xml:tests_gpu.xml --cov-append --verbose tests/python/gpu

							

[CI] run pytest in parallel (#18146) * run pytest in parallel * disable memory pool * address flaky ftrl/fm test and layernorm timeout * mark tests as serial * use parametrize in numpy op tests * fix io bugs * fix gluon rnn cell test and doc * replace xfail with raises scope * fix flaky numpy, mkldnn quantize, and rnn tests * fix tempfile/dir usage

2020-05-04 16:44:27 -07:00

								    pytest -m 'serial' --durations=50 --cov-report xml:tests_gpu.xml --cov-append --verbose tests/python/gpu

							

Add AMP patching of npi ops in _api_internal module (#19488)

2020-11-19 15:58:32 -08:00

								    pytest --durations=50 --cov-report xml:tests_gpu.xml --cov-append --verbose tests/python/gpu/test_amp_init.py

							

CI docker revamp; Add Jetson, Raspberry and CentOS 7 build [MXNET-42][MXNET-43] (#9995) * Start dockerfile revamp * Fix lint * Fix lint * Fix lint * Add python unit test * Add new dependency * Add user creation * Fix file permission * Determine USER_ID automatically * REmove ENV command in dockerfile * Remove python nose timer, improve useradd * ENable nvidia docker * Add remaining unittests * Add CentOS 7 unittests * Add integration tests * Add TVM and LLVM to dockerfile * Add ARMv7 and ARMv8 to build * Fix amalgamation build * Improvements and android_arm64 fixes, missing removing pthread * Jetson fix (install unzip) * Build jetson with make until issue with libomp.so is resolved #10011 * Fix Amalgamation builds * Fix Jetson build by switching to cmake * Assign CentOS gpu test to gpu instance * Fix R builds * Assign jobs to right docker containers * Fix missing file permissions inside docker * Create homedir on centos * Enable lapack * Fix Lapack on Cent OS 7 * Disable MXNET_MKLDNN_DEBUG * Delete Dockerfiles * Last general refinements before finish * Remove docker_multiarch folder, superseeded by the new ci scripts * Address review comments * Fix Caffe Integrationstest * Fix deploy stage * Address review comments * Address review comments * Enable script to run on Mac

2018-03-09 23:33:10 +01:00

Fix build issue with USE_CUDNN=0 (#11470) * Fix build issue with CUDNN=0 * Fix nocudnn func name * Remove python2 tests * Remove CPP package test * Check assert raises when cudnn disabled for op tests on gpu * Add line * Remove whitespace * add decorator for other ops * Add and remove assert * Fix op and common * Fix merge issue * Remove C API * Fix * Fix lint * Add init git * Rename CUDNN_DISABLED env variable * Add a runtime function for nocudnn * Remove MXCudnnIsenabled * Add comment for disabled test * Add full link in comment

2018-07-12 16:40:24 -07:00

								unittest_ubuntu_python3_gpu_nocudnn() {

							

Surpress subgraph log in CI (#16607) Change-Id: Ia2ed6fdbb1d2cb5cc607a8856ca13ee338e27eac

2019-10-24 04:47:01 -05:00

								    export MXNET_SUBGRAPH_VERBOSE=0

							

Fix build issue with USE_CUDNN=0 (#11470) * Fix build issue with CUDNN=0 * Fix nocudnn func name * Remove python2 tests * Remove CPP package test * Check assert raises when cudnn disabled for op tests on gpu * Add line * Remove whitespace * add decorator for other ops * Add and remove assert * Fix op and common * Fix merge issue * Remove C API * Fix * Fix lint * Add init git * Rename CUDNN_DISABLED env variable * Add a runtime function for nocudnn * Remove MXCudnnIsenabled * Add comment for disabled test * Add full link in comment

2018-07-12 16:40:24 -07:00

								    export CUDNN_OFF_TEST_ONLY=true

							

[MXNET-545] Fix broken cython build (#10951) * Fix broken build with cython 0.28 * Fix setup.py to be compatible with cython 0.28 * Fix broken cython ndarray module * Revised comments * Replace hard coded library path with one obtained by find_lib_path * Add documentation for MXNET_ENABLE_CYTHON and MXNET_ENFORCE_CYTHON * Add cython build to CI * Fix for cython CI * Adjust python environment for cython CI * Add make variables to set python executable * Fix typo * Fix nnvm include path * Does not use ccache for cython * Fix issues with the wildcards in the library list in Jenkinsfile * Fix issues with the wildcards in the library list in Jenkinsfile (continued) * Fix issues with the wildcards in the library list in Jenkinsfile (continued) * Intentionally introduce a bug to check that the tests actually runwith cython * Remove the intentionally introduced bug * Update installation doc * Retrigger CI * Run cython CI in ubuntu environment instead of CentOS environment * Commit a missed file * Fix a bug in check_cython * Refine environments for cython CI * Restore unrelated changes * Fix a problem occurring when the cython modules for python 2 and 3 are built successively * Trigger CI * Pin the cython version in the CI * Catch up #11320 * Remove optional arguments unused after #11320 * Trigger CI * Trigger CI * Trigger CI * Remove unnecessary stype argument from the NDArrayBase constructor * Revise confusing initialization of `_ndarray_cls` * Add cython build for python3 in CI * Fix misplaced cython build in CI * Adjust CI environments for cython * Fix invalid path for cython generated .so files in cmake build * Revert invalid fix * Revise docs * Revise check_cython * Temporaily use make instead of cmake for debugging * Temporal changes for debugging * Temporal changes for debugging * Temporaily use ctypes instead of cython modules for debugging * Temporaily disable ccache for debugging * Temporaily use make (DEV = 0) instead of cmake for debugging * Temporaily disable 
cudnn for debugging * Restore temporal changes * Temporarily disable coverage report * Adapt to Jenkinsfile_utils * Adapt to Jenkinsfile_utils (cont.) * Restore unrelated changes * Restore temporal changes * Resolving conflict * Test with the cmake build is removed * Add MXNET_ENABLE_CYTHON=0 to tensorrt test * Fix typo * Trigger CI * Trigger CI * Adapt to Jenkinsfile refactoring * Adapt to Jenkinsfile refactoring (cont.) * Trigger CI * Trigger CI * Stash missing cython modules * Trigger CI * CMake build of cython modules without unit tests * Fix typo * Trigger CI * Fix a mistake introduced in merging process * trigger test * Update Jenkinsfile_utils.groovy * Trigger CI * Trigger CI * Trigger CI * Trigger tests * Trigger tests * Trigger tests * Trigger tests * Trigger tests * Trigger tests * Trigger tests * Trigger tests

2019-05-25 07:36:30 +09:00

								    export MXNET_ENABLE_CYTHON=0

							

[DOC] Fix warnings in tutorials and turn on -W (#19624)

2020-12-08 11:13:09 -05:00

								    export DMLC_LOG_STACK_TRACE_DEPTH=100

							

[CI] run operator tests with naive engine (#18252) * run operator tests with naive engine * fix take tests * update skip mark * fix cuda error reset * adjust tests * disable parallel testing and naive engine for mkl/mkldnn #18244

2020-05-16 19:04:44 -07:00

								    MXNET_GPU_MEM_POOL_TYPE=Unpooled \

							

Add AMP patching of npi ops in _api_internal module (#19488)

2020-11-19 15:58:32 -08:00

								        OMP_NUM_THREADS=$(expr $(nproc) / 4) pytest -m 'not serial' -k 'not test_operator and not test_amp_init.py' -n 4 --durations=50 --cov-report xml:tests_gpu.xml --verbose tests/python/gpu

							

[CI] run operator tests with naive engine (#18252) * run operator tests with naive engine * fix take tests * update skip mark * fix cuda error reset * adjust tests * disable parallel testing and naive engine for mkl/mkldnn #18244

2020-05-16 19:04:44 -07:00

								    MXNET_GPU_MEM_POOL_TYPE=Unpooled \

							

Disable test coverage in MKL builds (#18443) * Disable test coverage in MKL builds * Enable test parallelization * Set OMP_NUM_THREADS * Fix * Fix unpack_and_init

2020-07-15 00:57:38 +00:00

        OMP_NUM_THREADS=$(expr $(nproc) / 4) pytest -m 'not serial' -k 'test_operator' -n 4 --durations=50 --cov-report xml:tests_gpu.xml --cov-append --verbose tests/python/gpu

[CI] run pytest in parallel (#18146) * run pytest in parallel * disable memory pool * address flaky ftrl/fm test and layernorm timeout * mark tests as serial * use parametrize in numpy op tests * fix io bugs * fix gluon rnn cell test and doc * replace xfail with raises scope * fix flaky numpy, mkldnn quantize, and rnn tests * fix tempfile/dir usage

2020-05-04 16:44:27 -07:00

    pytest -m 'serial' --durations=50 --cov-report xml:tests_gpu.xml --cov-append --verbose tests/python/gpu

Add AMP patching of npi ops in _api_internal module (#19488)

2020-11-19 15:58:32 -08:00

    pytest --durations=50 --cov-report xml:tests_gpu.xml --cov-append --verbose tests/python/gpu/test_amp_init.py

Fix build issue with USE_CUDNN=0 (#11470) * Fix build issue with CUDNN=0 * Fix nocudnn func name * Remove python2 tests * Remove CPP package test * Check assert raises when cudnn disabled for op tests on gpu * Add line * Remove whitespace * add decorator for other ops * Add and remove assert * Fix op and common * Fix merge issue * Remove C API * Fix * Fix lint * Add init git * Rename CUDNN_DISABLED env variable * Add a runtime function for nocudnn * Remove MXCudnnIsenabled * Add comment for disabled test * Add full link in comment

2018-07-12 16:40:24 -07:00

Add CPU test coverage and refine cmake builds (#13338)

2019-01-08 14:01:44 +01:00

unittest_cpp() {

CI docker revamp; Add Jetson, Raspberry and CentOS 7 build [MXNET-42][MXNET-43] (#9995) * Start dockerfile revamp * Fix lint * Fix lint * Fix lint * Add python unit test * Add new dependency * Add user creation * Fix file permission * Determine USER_ID automatically * REmove ENV command in dockerfile * Remove python nose timer, improve useradd * ENable nvidia docker * Add remaining unittests * Add CentOS 7 unittests * Add integration tests * Add TVM and LLVM to dockerfile * Add ARMv7 and ARMv8 to build * Fix amalgamation build * Improvements and android_arm64 fixes, missing removing pthread * Jetson fix (install unzip) * Build jetson with make until issue with libomp.so is resolved #10011 * Fix Amalgamation builds * Fix Jetson build by switching to cmake * Assign CentOS gpu test to gpu instance * Fix R builds * Assign jobs to right docker containers * Fix missing file permissions inside docker * Create homedir on centos * Enable lapack * Fix Lapack on Cent OS 7 * Disable MXNET_MKLDNN_DEBUG * Delete Dockerfiles * Last general refinements before finish * Remove docker_multiarch folder, superseeded by the new ci scripts * Address review comments * Fix Caffe Integrationstest * Fix deploy stage * Address review comments * Address review comments * Enable script to run on Mac

2018-03-09 23:33:10 +01:00

    set -ex

[DOC] Fix warnings in tutorials and turn on -W (#19624)

2020-12-08 11:13:09 -05:00

    export DMLC_LOG_STACK_TRACE_DEPTH=100

CI docker revamp; Add Jetson, Raspberry and CentOS 7 build [MXNET-42][MXNET-43] (#9995) * Start dockerfile revamp * Fix lint * Fix lint * Fix lint * Add python unit test * Add new dependency * Add user creation * Fix file permission * Determine USER_ID automatically * REmove ENV command in dockerfile * Remove python nose timer, improve useradd * ENable nvidia docker * Add remaining unittests * Add CentOS 7 unittests * Add integration tests * Add TVM and LLVM to dockerfile * Add ARMv7 and ARMv8 to build * Fix amalgamation build * Improvements and android_arm64 fixes, missing removing pthread * Jetson fix (install unzip) * Build jetson with make until issue with libomp.so is resolved #10011 * Fix Amalgamation builds * Fix Jetson build by switching to cmake * Assign CentOS gpu test to gpu instance * Fix R builds * Assign jobs to right docker containers * Fix missing file permissions inside docker * Create homedir on centos * Enable lapack * Fix Lapack on Cent OS 7 * Disable MXNET_MKLDNN_DEBUG * Delete Dockerfiles * Last general refinements before finish * Remove docker_multiarch folder, superseeded by the new ci scripts * Address review comments * Fix Caffe Integrationstest * Fix deploy stage * Address review comments * Address review comments * Enable script to run on Mac

2018-03-09 23:33:10 +01:00

    build/tests/mxnet_unit_tests

[2.0] Bump Python to >= 3.8 (#20593) * Bump Python to >= 3.8, NumPy to >= 1.21.0 * remove pillow in requirements * update python * fix some tests

2021-10-19 22:21:04 -07:00

    source /opt/rh/rh-python38/enable

CI docker revamp; Add Jetson, Raspberry and CentOS 7 build [MXNET-42][MXNET-43] (#9995) * Start dockerfile revamp * Fix lint * Fix lint * Fix lint * Add python unit test * Add new dependency * Add user creation * Fix file permission * Determine USER_ID automatically * REmove ENV command in dockerfile * Remove python nose timer, improve useradd * ENable nvidia docker * Add remaining unittests * Add CentOS 7 unittests * Add integration tests * Add TVM and LLVM to dockerfile * Add ARMv7 and ARMv8 to build * Fix amalgamation build * Improvements and android_arm64 fixes, missing removing pthread * Jetson fix (install unzip) * Build jetson with make until issue with libomp.so is resolved #10011 * Fix Amalgamation builds * Fix Jetson build by switching to cmake * Assign CentOS gpu test to gpu instance * Fix R builds * Assign jobs to right docker containers * Fix missing file permissions inside docker * Create homedir on centos * Enable lapack * Fix Lapack on Cent OS 7 * Disable MXNET_MKLDNN_DEBUG * Delete Dockerfiles * Last general refinements before finish * Remove docker_multiarch folder, superseeded by the new ci scripts * Address review comments * Fix Caffe Integrationstest * Fix deploy stage * Address review comments * Address review comments * Enable script to run on Mac

2018-03-09 23:33:10 +01:00

    cd /work/mxnet

[DOC] Fix warnings in tutorials and turn on -W (#19624)

2020-12-08 11:13:09 -05:00

    export DMLC_LOG_STACK_TRACE_DEPTH=100

Disable test coverage in MKL builds (#18443) * Disable test coverage in MKL builds * Enable test parallelization * Set OMP_NUM_THREADS * Fix * Fix unpack_and_init

2020-07-15 00:57:38 +00:00

    OMP_NUM_THREADS=$(expr $(nproc) / 4) python -m pytest -m 'not serial' -k 'not test_operator' -n 4 --durations=50 --cov-report xml:tests_unittest.xml --verbose tests/python/unittest

[CI] run operator tests with naive engine (#18252) * run operator tests with naive engine * fix take tests * update skip mark * fix cuda error reset * adjust tests * disable parallel testing and naive engine for mkl/mkldnn #18244

2020-05-16 19:04:44 -07:00

    MXNET_ENGINE_TYPE=NaiveEngine \

Disable test coverage in MKL builds (#18443) * Disable test coverage in MKL builds * Enable test parallelization * Set OMP_NUM_THREADS * Fix * Fix unpack_and_init

2020-07-15 00:57:38 +00:00

        OMP_NUM_THREADS=$(expr $(nproc) / 4) python -m pytest -m 'not serial' -k 'test_operator' -n 4 --durations=50 --cov-report xml:tests_unittest.xml --cov-append --verbose tests/python/unittest
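
The `MXNET_ENGINE_TYPE=NaiveEngine \` line and the `OMP_NUM_THREADS=...` assignment above are joined by the trailing backslash into a single command, so both variables apply only to that one pytest invocation and not to the commands that follow. A minimal sketch of this per-command scoping is shown below. `DEMO_ENGINE` is a hypothetical variable used only for illustration, and `sh -c` stands in for the pytest process.

```shell
# DEMO_ENGINE is a made-up variable name; it stands in for MXNET_ENGINE_TYPE.
unset DEMO_ENGINE

# A VAR=value prefix is placed in the environment of this one child process only.
scoped=$(DEMO_ENGINE=NaiveEngine sh -c 'printf %s "${DEMO_ENGINE:-unset}"')
echo "child sees:  $scoped"                  # prints: child sees:  NaiveEngine

# The calling shell is unaffected once the command has finished.
echo "parent sees: ${DEMO_ENGINE:-unset}"    # prints: parent sees: unset
```

This is why the serial pytest run that follows is not affected by the naive engine setting, whereas a variable set with `export` stays in effect for the rest of the function.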

							

[CI] run pytest in parallel (#18146) * run pytest in parallel * disable memory pool * address flaky ftrl/fm test and layernorm timeout * mark tests as serial * use parametrize in numpy op tests * fix io bugs * fix gluon rnn cell test and doc * replace xfail with raises scope * fix flaky numpy, mkldnn quantize, and rnn tests * fix tempfile/dir usage

2020-05-04 16:44:27 -07:00

    python -m pytest -m 'serial' --durations=50 --cov-report xml:tests_unittest.xml --cov-append --verbose tests/python/unittest

Disable test coverage in MKL builds (#18443) * Disable test coverage in MKL builds * Enable test parallelization * Set OMP_NUM_THREADS * Fix * Fix unpack_and_init

2020-07-15 00:57:38 +00:00

    OMP_NUM_THREADS=$(expr $(nproc) / 4) python -m pytest -n 4 --durations=50 --cov-report xml:tests_train.xml --verbose tests/python/train

CI docker revamp; Add Jetson, Raspberry and CentOS 7 build [MXNET-42][MXNET-43] (#9995) * Start dockerfile revamp * Fix lint * Fix lint * Fix lint * Add python unit test * Add new dependency * Add user creation * Fix file permission * Determine USER_ID automatically * REmove ENV command in dockerfile * Remove python nose timer, improve useradd * ENable nvidia docker * Add remaining unittests * Add CentOS 7 unittests * Add integration tests * Add TVM and LLVM to dockerfile * Add ARMv7 and ARMv8 to build * Fix amalgamation build * Improvements and android_arm64 fixes, missing removing pthread * Jetson fix (install unzip) * Build jetson with make until issue with libomp.so is resolved #10011 * Fix Amalgamation builds * Fix Jetson build by switching to cmake * Assign CentOS gpu test to gpu instance * Fix R builds * Assign jobs to right docker containers * Fix missing file permissions inside docker * Create homedir on centos * Enable lapack * Fix Lapack on Cent OS 7 * Disable MXNET_MKLDNN_DEBUG * Delete Dockerfiles * Last general refinements before finish * Remove docker_multiarch folder, superseeded by the new ci scripts * Address review comments * Fix Caffe Integrationstest * Fix deploy stage * Address review comments * Address review comments * Enable script to run on Mac

2018-03-09 23:33:10 +01:00

[2.0] Bump Python to >= 3.8 (#20593) * Bump Python to >= 3.8, NumPy to >= 1.21.0 * remove pillow in requirements * update python * fix some tests

2021-10-19 22:21:04 -07:00

    source /opt/rh/rh-python38/enable

CI docker revamp; Add Jetson, Raspberry and CentOS 7 build [MXNET-42][MXNET-43] (#9995) * Start dockerfile revamp * Fix lint * Fix lint * Fix lint * Add python unit test * Add new dependency * Add user creation * Fix file permission * Determine USER_ID automatically * REmove ENV command in dockerfile * Remove python nose timer, improve useradd * ENable nvidia docker * Add remaining unittests * Add CentOS 7 unittests * Add integration tests * Add TVM and LLVM to dockerfile * Add ARMv7 and ARMv8 to build * Fix amalgamation build * Improvements and android_arm64 fixes, missing removing pthread * Jetson fix (install unzip) * Build jetson with make until issue with libomp.so is resolved #10011 * Fix Amalgamation builds * Fix Jetson build by switching to cmake * Assign CentOS gpu test to gpu instance * Fix R builds * Assign jobs to right docker containers * Fix missing file permissions inside docker * Create homedir on centos * Enable lapack * Fix Lapack on Cent OS 7 * Disable MXNET_MKLDNN_DEBUG * Delete Dockerfiles * Last general refinements before finish * Remove docker_multiarch folder, superseeded by the new ci scripts * Address review comments * Fix Caffe Integrationstest * Fix deploy stage * Address review comments * Address review comments * Enable script to run on Mac

2018-03-09 23:33:10 +01:00

    cd /work/mxnet

Updates gpu tests to use CUDNN_VERSION supplied by the environment but default to 7.0.3 if not set (#14595)

2019-04-03 02:49:36 +02:00

    export CUDNN_VERSION=${CUDNN_VERSION:-7.0.3}
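
The export above relies on the shell's `${VAR:-default}` expansion: a `CUDNN_VERSION` supplied by the environment wins, and `7.0.3` is used when it is unset or empty. A small sketch of that behaviour follows. The wrapper function `resolve_cudnn_version` is hypothetical and exists only so the two cases can be shown side by side.

```shell
# Hypothetical wrapper for illustration; runtime_functions.sh uses the
# expansion directly in an export statement.
resolve_cudnn_version() {
    supplied=$1
    # ":-" substitutes the default when the value is unset OR empty.
    echo "${supplied:-7.0.3}"
}

resolve_cudnn_version ""        # prints 7.0.3
resolve_cudnn_version "8.0.4"   # prints 8.0.4
```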

							

[DOC] Fix warnings in tutorials and turn on -W (#19624)

2020-12-08 11:13:09 -05:00

    export DMLC_LOG_STACK_TRACE_DEPTH=100

[CI] run operator tests with naive engine (#18252) * run operator tests with naive engine * fix take tests * update skip mark * fix cuda error reset * adjust tests * disable parallel testing and naive engine for mkl/mkldnn #18244

2020-05-16 19:04:44 -07:00

    MXNET_GPU_MEM_POOL_TYPE=Unpooled \

Add AMP patching of npi ops in _api_internal module (#19488)

2020-11-19 15:58:32 -08:00

        OMP_NUM_THREADS=$(expr $(nproc) / 4) pytest -m 'not serial' -k 'not test_operator and not test_amp_init.py' -n 4 --durations=50 --cov-report xml:tests_gpu.xml --cov-append --verbose tests/python/gpu

[CI] run operator tests with naive engine (#18252) * run operator tests with naive engine * fix take tests * update skip mark * fix cuda error reset * adjust tests * disable parallel testing and naive engine for mkl/mkldnn #18244

2020-05-16 19:04:44 -07:00

    MXNET_GPU_MEM_POOL_TYPE=Unpooled \

Disable test coverage in MKL builds (#18443) * Disable test coverage in MKL builds * Enable test parallelization * Set OMP_NUM_THREADS * Fix * Fix unpack_and_init

2020-07-15 00:57:38 +00:00

        OMP_NUM_THREADS=$(expr $(nproc) / 4) pytest -m 'not serial' -k 'test_operator' -n 4 --durations=50 --cov-report xml:tests_gpu.xml --cov-append --verbose tests/python/gpu

[CI] run pytest in parallel (#18146) * run pytest in parallel * disable memory pool * address flaky ftrl/fm test and layernorm timeout * mark tests as serial * use parametrize in numpy op tests * fix io bugs * fix gluon rnn cell test and doc * replace xfail with raises scope * fix flaky numpy, mkldnn quantize, and rnn tests * fix tempfile/dir usage

2020-05-04 16:44:27 -07:00

    pytest -m 'serial' --durations=50 --cov-report xml:tests_gpu.xml --cov-append --verbose tests/python/gpu

Add AMP patching of npi ops in _api_internal module (#19488)

2020-11-19 15:58:32 -08:00

    pytest --durations=50 --cov-report xml:tests_gpu.xml --cov-append --verbose tests/python/gpu/test_amp_init.py

CI docker revamp; Add Jetson, Raspberry and CentOS 7 build [MXNET-42][MXNET-43] (#9995) * Start dockerfile revamp * Fix lint * Fix lint * Fix lint * Add python unit test * Add new dependency * Add user creation * Fix file permission * Determine USER_ID automatically * REmove ENV command in dockerfile * Remove python nose timer, improve useradd * ENable nvidia docker * Add remaining unittests * Add CentOS 7 unittests * Add integration tests * Add TVM and LLVM to dockerfile * Add ARMv7 and ARMv8 to build * Fix amalgamation build * Improvements and android_arm64 fixes, missing removing pthread * Jetson fix (install unzip) * Build jetson with make until issue with libomp.so is resolved #10011 * Fix Amalgamation builds * Fix Jetson build by switching to cmake * Assign CentOS gpu test to gpu instance * Fix R builds * Assign jobs to right docker containers * Fix missing file permissions inside docker * Create homedir on centos * Enable lapack * Fix Lapack on Cent OS 7 * Disable MXNET_MKLDNN_DEBUG * Delete Dockerfiles * Last general refinements before finish * Remove docker_multiarch folder, superseeded by the new ci scripts * Address review comments * Fix Caffe Integrationstest * Fix deploy stage * Address review comments * Address review comments * Enable script to run on Mac

2018-03-09 23:33:10 +01:00

Add back cpp-package (#20131) This updates and adds back the cpp-package removed in https://github.com/apache/incubator-mxnet/commit/97d4ba5a133f93ff6075dcde3ef842b23d498a12

2021-05-24 13:44:39 -07:00

integrationtest_ubuntu_cpp_package_gpu() {

[API] Add new dlpack API (#20546) * Add new dlpack API * fix build * fix build * fix * fix lint * fix conflict * ctx->device * update dlpack test * fix remainder * revert * fix dlpack * Add tests for error messages * fix dlpack.py * fix dlpack * fix sanity

2021-11-29 06:22:04 -08:00

test_python3_data_interchange_gpu() {

[MXNET-247] Always build profiler (#10308) * Always build profiler * Update naive_engine.cc * remove PROFILE_MESSAGE macro * Remove USE_PROFILER=1 from CI runs

2018-04-01 16:19:45 -07:00

integrationtest_ubuntu_cpu_onnx() {

2018-03-14 13:42:59 -07:00

	set -ex

Update ONNX support to 1.7 (#19573)

2020-11-24 13:52:02 -08:00

	export MXNET_SUBGRAPH_VERBOSE=0

Disable onnx tests and pipeline until we forward-port onnx work. (#19898) Co-authored-by: Joe Evans <joeev@amazon.com>

2021-02-15 15:04:06 -08:00

	export DMLC_LOG_STACK_TRACE_DEPTH=100

[DEV] switch nose with pytest (#18025) * switch nose with pytest * switch centos python to 3.6 * disable dist kvstore tests * skip hanging test

2020-04-22 23:53:12 -07:00

	python3 tests/python/unittest/onnx/backend_test.py

Disable onnx tests and pipeline until we forward-port onnx work. (#19898) Co-authored-by: Joe Evans <joeev@amazon.com>

2021-02-15 15:04:06 -08:00

	#OMP_NUM_THREADS=$(expr $(nproc) / 4) pytest -n 4 tests/python/unittest/onnx/mxnet_export_test.py

2018-03-14 13:42:59 -07:00

Fix dist kvstore for trainer and flaky dist kvstore test (#11633) * fix dist kvstore trainer * fix test setup * enable tests on CI * update move some test to cpu * dont use nvdia-docker * rename option * trigger test * reduce workload to avvoid time out * disable operator tuning to reduce launch overhead * update test types

2018-07-13 16:54:14 -07:00

integrationtest_ubuntu_cpu_dist_kvstore() {

Fix dist kvstore for trainer and flaky dist kvstore test (#11633) * fix dist kvstore trainer * fix test setup * enable tests on CI * update move some test to cpu * dont use nvdia-docker * rename option * trigger test * reduce workload to avvoid time out * disable operator tuning to reduce launch overhead * update test types

2019-05-23 14:52:58 +02:00

    pushd .

2018-07-13 16:54:14 -07:00

    export PYTHONPATH=./python/

Surpress subgraph log in CI (#16607) Change-Id: Ia2ed6fdbb1d2cb5cc607a8856ca13ee338e27eac

2019-10-24 04:47:01 -05:00

    export MXNET_SUBGRAPH_VERBOSE=0

Fix dist kvstore for trainer and flaky dist kvstore test (#11633) * fix dist kvstore trainer * fix test setup * enable tests on CI * update move some test to cpu * dont use nvdia-docker * rename option * trigger test * reduce workload to avvoid time out * disable operator tuning to reduce launch overhead * update test types

2018-07-13 16:54:14 -07:00

    export MXNET_USE_OPERATOR_TUNING=0

[DOC] Fix warnings in tutorials and turn on -W (#19624)

2020-12-08 11:13:09 -05:00

    export DMLC_LOG_STACK_TRACE_DEPTH=100

Fix dist kvstore for trainer and flaky dist kvstore test (#11633) * fix dist kvstore trainer * fix test setup * enable tests on CI * update move some test to cpu * dont use nvdia-docker * rename option * trigger test * reduce workload to avvoid time out * disable operator tuning to reduce launch overhead * update test types

2018-07-13 16:54:14 -07:00

    cd tests/nightly/

Fix Non-ASCII character in docstring (#17600) * Fix * Try to fix...

2020-02-16 11:38:29 -08:00

    python3 ../../tools/launch.py -n 7 --launcher local python3 dist_sync_kvstore.py --type=gluon_step_cpu

1bit gradient compression implementation (#17952) Co-authored-by: shuo-ouyang <1414114532@qq.com>

2021-03-11 06:45:53 +08:00

    python3 ../../tools/launch.py -n 7 --launcher local python3 dist_sync_kvstore.py --type=compressed_cpu_1bit

Fix Non-ASCII character in docstring (#17600) * Fix * Try to fix...

2020-02-16 11:38:29 -08:00

    python3 ../../tools/launch.py -n 3 --launcher local python3 test_server_profiling.py

Fix dist kvstore for trainer and flaky dist kvstore test (#11633) * fix dist kvstore trainer * fix test setup * enable tests on CI * update move some test to cpu * dont use nvdia-docker * rename option * trigger test * reduce workload to avvoid time out * disable operator tuning to reduce launch overhead * update test types

2019-05-23 14:52:58 +02:00

    popd
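
The function above brackets its work with `pushd .` and `popd` so that the `cd tests/nightly/` does not leak into later steps. As an alternative sketch, and not what `runtime_functions.sh` does, the same effect can be had by running the commands in a subshell, which restores the caller's directory even if a command fails partway. The helper name `run_in_dir` is hypothetical.

```shell
# Hypothetical helper: run a command inside a directory without changing
# the caller's working directory. This swaps pushd/popd for a subshell.
run_in_dir() {
    dir=$1
    shift
    # The parentheses start a subshell; its cd is discarded when it exits,
    # and the subshell's exit status becomes the function's status.
    ( cd "$dir" && "$@" )
}

run_in_dir / pwd    # prints /
```

Under these assumptions a call would look like `run_in_dir tests/nightly python3 ../../tools/launch.py ...`, with no matching `popd` to forget.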

2018-07-13 16:54:14 -07:00

[MXNET-120] Float16 support for distributed training (#10183) * send as char * fix bug on pull response, and rowsparse on worker side * three modes * default to mode 0 and add support for row sparse * refactor sparse * rowsparse numbytes fixes * WIP tests * update test sync * remove prints * refactoring * Revert "refactoring" This reverts commit 05ffa1bf254057ec70ca6ec1a1deb3b072c31538. * undo refactoring to keep PR simple * add wait to stored in pull default * lint fixes * undo static cast for recvblob * lint fixes * mode 1 changes * sparse bug fix dtype * mshadow default * remove unused var * remove debug statements * clearer variables, reduced multiplication, const vars * add const for more vars, comments * comment syntax, code watcher, test default val * remove unnecessary print in test * trigger ci * multi precision mode (debugging race condition) * working rsp pushes * finish multiprecision for row sparse * rename num-bytes * fix bug due to rename of numbytes, and remove debug logs * address comments * add integration test * trigger ci * integration test * integration test * fix path of script * update mshadow * disable f16c for amalgamation * fix amalgamation build * trigger ci * disable f16c for jetson

2018-04-11 10:20:56 -07:00

integrationtest_ubuntu_gpu_dist_kvstore() {

[DOC] Fix warnings in tutorials and turn on -W (#19624)

2020-12-08 11:13:09 -05:00

    export DMLC_LOG_STACK_TRACE_DEPTH=100

Integrate Horovod training API as part of MXNet native distributed training API (#17531) * implement pushpull for horovod * add local_rank function * add tests * Remove in-place broadcast API * Add kvstore horovod example * Fix the list to singlton conversion * Add horood test to CI * Remove test horovod from unit test * Add docstring * Add horovod in test * sync with master * Fix horovod dependency in CI * Fix merge conflict with byteps * Update __init__.py * Resolve conflict * Remove openib warning message * Add log message in test * Remove tmp file * Fix lint Co-authored-by: Haibin Lin <linhaibin.eric@gmail.com>

2019-05-23 14:52:58 +02:00

    pushd .

2020-04-14 13:41:40 -07:00

    cd /work/mxnet/python

Add USE_DIST_KVSTORE=ON to GPU build (#17911) * Add USE_DIST_KVSTORE=ON to GPU build * Fix indent * Add check for error * Fix path error * Add license header * Remove unnecessary output * Fix path error * Fix error in test script

2020-04-06 12:49:41 -07:00

    ./test_distributed_training-gpu.sh

[MXNET-120] Float16 support for distributed training (#10183) * send as char * fix bug on pull response, and rowsparse on worker side * three modes * default to mode 0 and add support for row sparse * refactor sparse * rowsparse numbytes fixes * WIP tests * update test sync * remove prints * refactoring * Revert "refactoring" This reverts commit 05ffa1bf254057ec70ca6ec1a1deb3b072c31538. * undo refactoring to keep PR simple * add wait to stored in pull default * lint fixes * undo static cast for recvblob * lint fixes * mode 1 changes * sparse bug fix dtype * mshadow default * remove unused var * remove debug statements * clearer variables, reduced multiplication, const vars * add const for more vars, comments * comment syntax, code watcher, test default val * remove unnecessary print in test * trigger ci * multi precision mode (debugging race condition) * working rsp pushes * finish multiprecision for row sparse * rename num-bytes * fix bug due to rename of numbytes, and remove debug logs * address comments * add integration test * trigger ci * integration test * integration test * fix path of script * update mshadow * disable f16c for amalgamation * fix amalgamation build * trigger ci * disable f16c for jetson

2019-05-23 14:52:58 +02:00

    popd

2018-04-11 10:20:56 -07:00

CI docker revamp; Add Jetson, Raspberry and CentOS 7 build [MXNET-42][MXNET-43] (#9995) * Start dockerfile revamp * Fix lint * Fix lint * Fix lint * Add python unit test * Add new dependency * Add user creation * Fix file permission * Determine USER_ID automatically * REmove ENV command in dockerfile * Remove python nose timer, improve useradd * ENable nvidia docker * Add remaining unittests * Add CentOS 7 unittests * Add integration tests * Add TVM and LLVM to dockerfile * Add ARMv7 and ARMv8 to build * Fix amalgamation build * Improvements and android_arm64 fixes, missing removing pthread * Jetson fix (install unzip) * Build jetson with make until issue with libomp.so is resolved #10011 * Fix Amalgamation builds * Fix Jetson build by switching to cmake * Assign CentOS gpu test to gpu instance * Fix R builds * Assign jobs to right docker containers * Fix missing file permissions inside docker * Create homedir on centos * Enable lapack * Fix Lapack on Cent OS 7 * Disable MXNET_MKLDNN_DEBUG * Delete Dockerfiles * Last general refinements before finish * Remove docker_multiarch folder, superseeded by the new ci scripts * Address review comments * Fix Caffe Integrationstest * Fix deploy stage * Address review comments * Address review comments * Enable script to run on Mac

2018-03-09 23:33:10 +01:00

BytePS trainer + tests (#18032) * [MXNET-#16795] Byteps-KVStore: Intergrate Byteps into mxnet as new type of kvstore backend (#17555) * Add Byteps backend for kvstore * Add a temp launcher for byteps backend * make the test fit for byteps kvstore. * final workable test * Remove trashy print and logs * correct comment * add hostfile for ci test * add ci test for byteps kvstore * add visibile devices for byteps-kvstore ci test * add licenses for tools/byteps_launcher.py * syntax error * pylint error (remove unused import like logging) * pylint error * pylint error * enable launching without hostfile (local byteps) * 1. rename byteps_kvstore.py to byteps.py; 2. shorten the launch option to ; 3. add instruction for -H and -SH options for launch; 4. add documentation for byteps kvstore in kvstore/base.py: create(name='local') * edit documentation of KVStoreBase::is_capable(capability); reture fasle for BytePS(KVStoreBase):is_capable(any). * pylint error * remove an error of arg.byteps * use --env option to set workers' environment * error in byteps-launcher.py * remove the unpurposed editing mistake in runtime_functions.sh * disable cpu support for byteps kvstore. * 1. format the document to avoid julia doc build error; 2. little change to nightly test; 3. add byteps copy right declararation in byteps_launcher.py 4. if args.byteps == True ===> if args.byteps * remove the --scheduler_ip and --scheduler_port options in launch.py * 1. maintain the origin value of broadcast and pushpull 2. optimize when out = value or [out]=value 3. add some missing documentation to avoid doc building error. 
* Add bytePS to CI * add dependency * +integrationtest_ubuntu_gpu_byteps * add byteps pipeline * disable a few tests * remove more tests * fix permission * remove apt-get * fix python path * improve logging * fix printns * add back CI Co-authored-by: Ubuntu <ubuntu@ip-172-31-39-16.ec2.internal> Co-authored-by: Piyush Ghai <ghai.8@osu.edu> Co-authored-by: eric-haibin-lin <linhaibin.eric@gmail.com> Co-authored-by: eric-haibin-lin <--global> Co-authored-by: Lin <haibilin@a483e7be4c92.ant.amazon.com> * fix byteps logging and declare tensor * check exceptions and return -1 * print logging in CI * Update byteps.py * Update runtime_functions.sh * add numa dependency * pin dependency * Update runtime_functions.sh * Update Dockerfile.build.ubuntu * Update runtime_functions.sh * Update runtime_functions.sh * Update runtime_functions.sh * Update runtime_functions.sh * Update Jenkins_steps.groovy * remove launcher. use bpslauncher instead. Co-authored-by: Chaokun Chang <33217209+ChaokunChang@users.noreply.github.com> Co-authored-by: Ubuntu <ubuntu@ip-172-31-39-16.ec2.internal> Co-authored-by: Piyush Ghai <ghai.8@osu.edu> Co-authored-by: Lin <haibilin@a483e7be4c92.ant.amazon.com> Co-authored-by: Ubuntu <ubuntu@ip-172-31-37-108.ec2.internal> Co-authored-by: EC2 Default User <ec2-user@ip-172-31-81-80.ec2.internal> Co-authored-by: Ubuntu <ubuntu@ip-172-31-57-164.ec2.internal>

2020-06-04 14:20:52 -07:00

integrationtest_ubuntu_gpu_byteps() {

[DOC] Fix warnings in tutorials and turn on -W (#19624)

2020-12-08 11:13:09 -05:00

    export DMLC_LOG_STACK_TRACE_DEPTH=100

BytePS trainer + tests (#18032) * [MXNET-#16795] Byteps-KVStore: Intergrate Byteps into mxnet as new type of kvstore backend (#17555) * Add Byteps backend for kvstore * Add a temp launcher for byteps backend * make the test fit for byteps kvstore. * final workable test * Remove trashy print and logs * correct comment * add hostfile for ci test * add ci test for byteps kvstore * add visibile devices for byteps-kvstore ci test * add licenses for tools/byteps_launcher.py * syntax error * pylint error (remove unused import like logging) * pylint error * pylint error * enable launching without hostfile (local byteps) * 1. rename byteps_kvstore.py to byteps.py; 2. shorten the launch option to ; 3. add instruction for -H and -SH options for launch; 4. add documentation for byteps kvstore in kvstore/base.py: create(name='local') * edit documentation of KVStoreBase::is_capable(capability); reture fasle for BytePS(KVStoreBase):is_capable(any). * pylint error * remove an error of arg.byteps * use --env option to set workers' environment * error in byteps-launcher.py * remove the unpurposed editing mistake in runtime_functions.sh * disable cpu support for byteps kvstore. * 1. format the document to avoid julia doc build error; 2. little change to nightly test; 3. add byteps copy right declararation in byteps_launcher.py 4. if args.byteps == True ===> if args.byteps * remove the --scheduler_ip and --scheduler_port options in launch.py * 1. maintain the origin value of broadcast and pushpull 2. optimize when out = value or [out]=value 3. add some missing documentation to avoid doc building error. 
* Add bytePS to CI * add dependency * +integrationtest_ubuntu_gpu_byteps * add byteps pipeline * disable a few tests * remove more tests * fix permission * remove apt-get * fix python path * improve logging * fix printns * add back CI Co-authored-by: Ubuntu <ubuntu@ip-172-31-39-16.ec2.internal> Co-authored-by: Piyush Ghai <ghai.8@osu.edu> Co-authored-by: eric-haibin-lin <linhaibin.eric@gmail.com> Co-authored-by: eric-haibin-lin <--global> Co-authored-by: Lin <haibilin@a483e7be4c92.ant.amazon.com> * fix byteps logging and declare tensor * check exceptions and return -1 * print logging in CI * Update byteps.py * Update runtime_functions.sh * add numa dependency * pin dependency * Update runtime_functions.sh * Update Dockerfile.build.ubuntu * Update runtime_functions.sh * Update runtime_functions.sh * Update runtime_functions.sh * Update runtime_functions.sh * Update Jenkins_steps.groovy * remove launcher. use bpslauncher instead. Co-authored-by: Chaokun Chang <33217209+ChaokunChang@users.noreply.github.com> Co-authored-by: Ubuntu <ubuntu@ip-172-31-39-16.ec2.internal> Co-authored-by: Piyush Ghai <ghai.8@osu.edu> Co-authored-by: Lin <haibilin@a483e7be4c92.ant.amazon.com> Co-authored-by: Ubuntu <ubuntu@ip-172-31-37-108.ec2.internal> Co-authored-by: EC2 Default User <ec2-user@ip-172-31-81-80.ec2.internal> Co-authored-by: Ubuntu <ubuntu@ip-172-31-57-164.ec2.internal>

2020-06-04 14:20:52 -07:00

    cd tests/nightly/

CI docker revamp; Add Jetson, Raspberry and CentOS 7 build [MXNET-42][MXNET-43] (#9995) * Start dockerfile revamp * Fix lint * Fix lint * Fix lint * Add python unit test * Add new dependency * Add user creation * Fix file permission * Determine USER_ID automatically * REmove ENV command in dockerfile * Remove python nose timer, improve useradd * ENable nvidia docker * Add remaining unittests * Add CentOS 7 unittests * Add integration tests * Add TVM and LLVM to dockerfile * Add ARMv7 and ARMv8 to build * Fix amalgamation build * Improvements and android_arm64 fixes, missing removing pthread * Jetson fix (install unzip) * Build jetson with make until issue with libomp.so is resolved #10011 * Fix Amalgamation builds * Fix Jetson build by switching to cmake * Assign CentOS gpu test to gpu instance * Fix R builds * Assign jobs to right docker containers * Fix missing file permissions inside docker * Create homedir on centos * Enable lapack * Fix Lapack on Cent OS 7 * Disable MXNET_MKLDNN_DEBUG * Delete Dockerfiles * Last general refinements before finish * Remove docker_multiarch folder, superseeded by the new ci scripts * Address review comments * Fix Caffe Integrationstest * Fix deploy stage * Address review comments * Address review comments * Enable script to run on Mac

2018-03-09 23:33:10 +01:00

test_ubuntu_cpu_python3() {

[DOC] Fix warnings in tutorials and turn on -W (#19624)

2020-12-08 11:13:09 -05:00

    export DMLC_LOG_STACK_TRACE_DEPTH=100

CI docker revamp; Add Jetson, Raspberry and CentOS 7 build [MXNET-42][MXNET-43] (#9995) * Start dockerfile revamp * Fix lint * Fix lint * Fix lint * Add python unit test * Add new dependency * Add user creation * Fix file permission * Determine USER_ID automatically * REmove ENV command in dockerfile * Remove python nose timer, improve useradd * ENable nvidia docker * Add remaining unittests * Add CentOS 7 unittests * Add integration tests * Add TVM and LLVM to dockerfile * Add ARMv7 and ARMv8 to build * Fix amalgamation build * Improvements and android_arm64 fixes, missing removing pthread * Jetson fix (install unzip) * Build jetson with make until issue with libomp.so is resolved #10011 * Fix Amalgamation builds * Fix Jetson build by switching to cmake * Assign CentOS gpu test to gpu instance * Fix R builds * Assign jobs to right docker containers * Fix missing file permissions inside docker * Create homedir on centos * Enable lapack * Fix Lapack on Cent OS 7 * Disable MXNET_MKLDNN_DEBUG * Delete Dockerfiles * Last general refinements before finish * Remove docker_multiarch folder, superseeded by the new ci scripts * Address review comments * Fix Caffe Integrationstest * Fix deploy stage * Address review comments * Address review comments * Enable script to run on Mac

2018-03-09 23:33:10 +01:00

    VENV=mxnet_py3_venv

Disable test coverage in MKL builds (#18443) * Disable test coverage in MKL builds * Enable test parallelization * Set OMP_NUM_THREADS * Fix * Fix unpack_and_init

2020-07-15 00:57:38 +00:00

    OMP_NUM_THREADS=$(expr $(nproc) / 4) python3 -m pytest -m 'not serial' -k 'not test_operator' -n 4 --durations=50 --verbose tests/python/unittest

[CI] run operator tests with naive engine (#18252) * run operator tests with naive engine * fix take tests * update skip mark * fix cuda error reset * adjust tests * disable parallel testing and naive engine for mkl/mkldnn #18244

2020-05-16 19:04:44 -07:00

    MXNET_ENGINE_TYPE=NaiveEngine \

Disable test coverage in MKL builds (#18443) * Disable test coverage in MKL builds * Enable test parallelization * Set OMP_NUM_THREADS * Fix * Fix unpack_and_init

2020-07-15 00:57:38 +00:00

        OMP_NUM_THREADS=$(expr $(nproc) / 4) python3 -m pytest -m 'not serial' -k 'test_operator' -n 4 --durations=50 --verbose tests/python/unittest

[CI] run pytest in parallel (#18146) * run pytest in parallel * disable memory pool * address flaky ftrl/fm test and layernorm timeout * mark tests as serial * use parametrize in numpy op tests * fix io bugs * fix gluon rnn cell test and doc * replace xfail with raises scope * fix flaky numpy, mkldnn quantize, and rnn tests * fix tempfile/dir usage

2020-05-04 16:44:27 -07:00

    python3 -m pytest -m 'serial' --durations=50 --verbose tests/python/unittest
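The three pytest invocations above split the unit tests into a parallel run without operator tests, a parallel operator run on the naive engine, and a final serial run. The parallel runs give each of the 4 workers a quarter of the cores through `OMP_NUM_THREADS`. A minimal sketch of that thread budgeting follows; `threads_per_worker` is a hypothetical helper, not a function from `runtime_functions.sh`, and the lower bound of one thread is an added assumption, since plain integer division yields 0 on hosts with fewer than 4 cores.

```shell
# Hypothetical helper (not part of runtime_functions.sh): divide the available
# cores evenly across parallel test workers, never going below one thread.
threads_per_worker() {
    # $1 = core count, $2 = number of parallel test workers
    threads=$(( $1 / $2 ))
    if [ "$threads" -lt 1 ]; then
        threads=1
    fi
    echo "$threads"
}

WORKERS=4
OMP_THREADS=$(threads_per_worker "$(nproc)" "$WORKERS")
echo "Using OMP_NUM_THREADS=${OMP_THREADS} for each of ${WORKERS} workers"
```

With this helper, a 16-core host would give each worker 4 OpenMP threads, and a 2-core host would fall back to 1 thread per worker.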

							

CI docker revamp; Add Jetson, Raspberry and CentOS 7 build [MXNET-42][MXNET-43] (#9995) * Start dockerfile revamp * Fix lint * Fix lint * Fix lint * Add python unit test * Add new dependency * Add user creation * Fix file permission * Determine USER_ID automatically * REmove ENV command in dockerfile * Remove python nose timer, improve useradd * ENable nvidia docker * Add remaining unittests * Add CentOS 7 unittests * Add integration tests * Add TVM and LLVM to dockerfile * Add ARMv7 and ARMv8 to build * Fix amalgamation build * Improvements and android_arm64 fixes, missing removing pthread * Jetson fix (install unzip) * Build jetson with make until issue with libomp.so is resolved #10011 * Fix Amalgamation builds * Fix Jetson build by switching to cmake * Assign CentOS gpu test to gpu instance * Fix R builds * Assign jobs to right docker containers * Fix missing file permissions inside docker * Create homedir on centos * Enable lapack * Fix Lapack on Cent OS 7 * Disable MXNET_MKLDNN_DEBUG * Delete Dockerfiles * Last general refinements before finish * Remove docker_multiarch folder, superseeded by the new ci scripts * Address review comments * Fix Caffe Integrationstest * Fix deploy stage * Address review comments * Address review comments * Enable script to run on Mac

2018-03-09 23:33:10 +01:00

Switch to C++17 and modernize toolchain + CI (#17984) As per #17968, require C++17 compatible compiler. For cuda code, use C++14 mode introduced in Cuda 9. C++17 support for Cuda will be available in Cuda 11. Switching to C++17 requires modernizing the toolchain, which exposed a number of technical debt issues in the codebase. All blocking issues are fixed as part of this PR. See the full list below. This PR contains the following specific changes: Switch CI pipeline to use gcc7 on Ubuntu and CentOS Switch CD pipeline to CentOS 7 with https://www.softwarecollections.org/en/scls/rhscl/devtoolset-7/ This enables us to build with gcc7 C++17 compiler while keeping a relatively old glibc requirement for distribution. Simplify ARM Edge builds Switch to standard Ubuntu / Debian cross-compilation toolchain for ARMv7, ARMv8 Switch to https://toolchains.bootlin.com/ toolchain for ARMv6 (the Debian ARMv6 toolchain is for ARMv4 + ARMv5 + ARMv6, but we wish to only target ARMv6 and make use of ARMv6 features) Remove reliance on dockcross for cross compilation. Simplify Jetson build Use standard Ubuntu / Debian cross-compilation toolchain for ARMv8 Upgrade to Cuda 10 and Jetpack 4.3 Simplify build setup Simplify QEMU ARM virtualization test setup on CI Remove complex "Virtual Machine in Docker" logic and run a QEMU based Docker container instead based on arm32v7/ubuntu Fix out of bounds vector accesses in SoftmaxGradOpType MKLDNNFCBackward Fix use of non-standard rand_r function (which is not available on anymore on newer Android toolchains and shouldn't be use in any case). Fix reproducibility of RNN with Dropout Fix reproducibility of DGL Graph Sampling Operators Update tests for Android Edge build to NDK19. The previously used standalone toolchain is obsolete. 
Those Dockerfiles that required refactoring as part of the effort were refactored based on the following consideration Maximize the use of system dependencies provided by the distribution instead of manually installing dependencies from source or from third party vendors. This reduces the complexity of the installation process and essentially pins the dependency versions, increasing CI stability. Further, Dockerfile build speed is improved. To facilitate this, use recent distribution versions. We still ensure backwards compatibility via CentOS7 based build and test stages Minimize the number of layers in the Dockerfile. Don't have 5 different script files executed, each calling apt-get update and install, but just execute once. Speeds up the build and reduces image size. Keep each Dockerfile simple and tailored to a purpose, instead of running 20 scripts to install dependencies for every thinkable scenario, which is unmaintainable. Some more small changes: Remove outdated references to Cuda 7 and Cuda 8 in various files. Remove C++03 support in mshadow Disable broken tests NumpyBooleanAssignForwardCPU #17990 test_init.test_rsp_const_init #17988 quantized_elemwise_mul #18034 List of squashed commits * cpp standard * Remove leftover files of Cuda 7 and Cuda 8 support * thrust 1.9.8 for clang10 * compiler warnings * Disable broken test_init.test_rsp_const_init * Disable tests invoking NumpyBooleanAssignForwardCPU * Fix out of bounds access in SoftmaxGradOpType * Use CentOS 7 for staticbuilds CentOS 7 fullfills the requirements for PEP 599 manylinux-2014 and provides a C++17 toolchain. 
* Fix MKLDNNFCBackward * Update edge toolchain * Support platforms without rand_r * Cleanup random.h * Greatly simplify qemu setup * Remove unused functions in Jenkins_steps.groovy * Skip quantized_elemwise_mul due QuantizedElemwiseMulOpShape bug * Fix R package installation https://github.com/apache/incubator-mxnet/issues/18042 * Fix centos ccache * Fix GPU Makefile staticbuild on CentOS7 * CentOS7 NCCL * CentOS7 staticbuild fix link with libculibos

2020-04-14 10:29:29 -07:00

# QEMU based ARM tests

Change MXNET_MKLDNN_DEBUG define name to MXNET_ONEDNN_DEBUG (#20031)

2021-03-16 20:16:53 +01:00

    export MXNET_ONEDNN_DEBUG=0  # Ignored if not present

Switch to C++17 and modernize toolchain + CI (#17984) As per #17968, require C++17 compatible compiler. For cuda code, use C++14 mode introduced in Cuda 9. C++17 support for Cuda will be available in Cuda 11. Switching to C++17 requires modernizing the toolchain, which exposed a number of technical debt issues in the codebase. All blocking issues are fixed as part of this PR. See the full list below. This PR contains the following specific changes: Switch CI pipeline to use gcc7 on Ubuntu and CentOS Switch CD pipeline to CentOS 7 with https://www.softwarecollections.org/en/scls/rhscl/devtoolset-7/ This enables us to build with gcc7 C++17 compiler while keeping a relatively old glibc requirement for distribution. Simplify ARM Edge builds Switch to standard Ubuntu / Debian cross-compilation toolchain for ARMv7, ARMv8 Switch to https://toolchains.bootlin.com/ toolchain for ARMv6 (the Debian ARMv6 toolchain is for ARMv4 + ARMv5 + ARMv6, but we wish to only target ARMv6 and make use of ARMv6 features) Remove reliance on dockcross for cross compilation. Simplify Jetson build Use standard Ubuntu / Debian cross-compilation toolchain for ARMv8 Upgrade to Cuda 10 and Jetpack 4.3 Simplify build setup Simplify QEMU ARM virtualization test setup on CI Remove complex "Virtual Machine in Docker" logic and run a QEMU based Docker container instead based on arm32v7/ubuntu Fix out of bounds vector accesses in SoftmaxGradOpType MKLDNNFCBackward Fix use of non-standard rand_r function (which is not available on anymore on newer Android toolchains and shouldn't be use in any case). Fix reproducibility of RNN with Dropout Fix reproducibility of DGL Graph Sampling Operators Update tests for Android Edge build to NDK19. The previously used standalone toolchain is obsolete. 
Those Dockerfiles that required refactoring as part of the effort were refactored based on the following consideration Maximize the use of system dependencies provided by the distribution instead of manually installing dependencies from source or from third party vendors. This reduces the complexity of the installation process and essentially pins the dependency versions, increasing CI stability. Further, Dockerfile build speed is improved. To facilitate this, use recent distribution versions. We still ensure backwards compatibility via CentOS7 based build and test stages Minimize the number of layers in the Dockerfile. Don't have 5 different script files executed, each calling apt-get update and install, but just execute once. Speeds up the build and reduces image size. Keep each Dockerfile simple and tailored to a purpose, instead of running 20 scripts to install dependencies for every thinkable scenario, which is unmaintainable. Some more small changes: Remove outdated references to Cuda 7 and Cuda 8 in various files. Remove C++03 support in mshadow Disable broken tests NumpyBooleanAssignForwardCPU #17990 test_init.test_rsp_const_init #17988 quantized_elemwise_mul #18034 List of squashed commits * cpp standard * Remove leftover files of Cuda 7 and Cuda 8 support * thrust 1.9.8 for clang10 * compiler warnings * Disable broken test_init.test_rsp_const_init * Disable tests invoking NumpyBooleanAssignForwardCPU * Fix out of bounds access in SoftmaxGradOpType * Use CentOS 7 for staticbuilds CentOS 7 fullfills the requirements for PEP 599 manylinux-2014 and provides a C++17 toolchain. 
* Fix MKLDNNFCBackward * Update edge toolchain * Support platforms without rand_r * Cleanup random.h * Greatly simplify qemu setup * Remove unused functions in Jenkins_steps.groovy * Skip quantized_elemwise_mul due QuantizedElemwiseMulOpShape bug * Fix R package installation https://github.com/apache/incubator-mxnet/issues/18042 * Fix centos ccache * Fix GPU Makefile staticbuild on CentOS7 * CentOS7 NCCL * CentOS7 staticbuild fix link with libculibos

2020-04-14 10:29:29 -07:00

    export MXNET_STORAGE_FALLBACK_LOG_VERBOSE=0

[DOC] Fix warnings in tutorials and turn on -W (#19624)

2020-12-08 11:13:09 -05:00

    export DMLC_LOG_STACK_TRACE_DEPTH=100

[CI] run pytest in parallel (#18146) * run pytest in parallel * disable memory pool * address flaky ftrl/fm test and layernorm timeout * mark tests as serial * use parametrize in numpy op tests * fix io bugs * fix gluon rnn cell test and doc * replace xfail with raises scope * fix flaky numpy, mkldnn quantize, and rnn tests * fix tempfile/dir usage

2020-05-04 16:44:27 -07:00

    python3 -m pytest -n 2 --verbose tests/python/unittest/test_engine.py

Switch to C++17 and modernize toolchain + CI (#17984) As per #17968, require C++17 compatible compiler. For cuda code, use C++14 mode introduced in Cuda 9. C++17 support for Cuda will be available in Cuda 11. Switching to C++17 requires modernizing the toolchain, which exposed a number of technical debt issues in the codebase. All blocking issues are fixed as part of this PR. See the full list below. This PR contains the following specific changes: Switch CI pipeline to use gcc7 on Ubuntu and CentOS Switch CD pipeline to CentOS 7 with https://www.softwarecollections.org/en/scls/rhscl/devtoolset-7/ This enables us to build with gcc7 C++17 compiler while keeping a relatively old glibc requirement for distribution. Simplify ARM Edge builds Switch to standard Ubuntu / Debian cross-compilation toolchain for ARMv7, ARMv8 Switch to https://toolchains.bootlin.com/ toolchain for ARMv6 (the Debian ARMv6 toolchain is for ARMv4 + ARMv5 + ARMv6, but we wish to only target ARMv6 and make use of ARMv6 features) Remove reliance on dockcross for cross compilation. Simplify Jetson build Use standard Ubuntu / Debian cross-compilation toolchain for ARMv8 Upgrade to Cuda 10 and Jetpack 4.3 Simplify build setup Simplify QEMU ARM virtualization test setup on CI Remove complex "Virtual Machine in Docker" logic and run a QEMU based Docker container instead based on arm32v7/ubuntu Fix out of bounds vector accesses in SoftmaxGradOpType MKLDNNFCBackward Fix use of non-standard rand_r function (which is not available on anymore on newer Android toolchains and shouldn't be use in any case). Fix reproducibility of RNN with Dropout Fix reproducibility of DGL Graph Sampling Operators Update tests for Android Edge build to NDK19. The previously used standalone toolchain is obsolete. 
Those Dockerfiles that required refactoring as part of the effort were refactored based on the following consideration Maximize the use of system dependencies provided by the distribution instead of manually installing dependencies from source or from third party vendors. This reduces the complexity of the installation process and essentially pins the dependency versions, increasing CI stability. Further, Dockerfile build speed is improved. To facilitate this, use recent distribution versions. We still ensure backwards compatibility via CentOS7 based build and test stages Minimize the number of layers in the Dockerfile. Don't have 5 different script files executed, each calling apt-get update and install, but just execute once. Speeds up the build and reduces image size. Keep each Dockerfile simple and tailored to a purpose, instead of running 20 scripts to install dependencies for every thinkable scenario, which is unmaintainable. Some more small changes: Remove outdated references to Cuda 7 and Cuda 8 in various files. Remove C++03 support in mshadow Disable broken tests NumpyBooleanAssignForwardCPU #17990 test_init.test_rsp_const_init #17988 quantized_elemwise_mul #18034 List of squashed commits * cpp standard * Remove leftover files of Cuda 7 and Cuda 8 support * thrust 1.9.8 for clang10 * compiler warnings * Disable broken test_init.test_rsp_const_init * Disable tests invoking NumpyBooleanAssignForwardCPU * Fix out of bounds access in SoftmaxGradOpType * Use CentOS 7 for staticbuilds CentOS 7 fullfills the requirements for PEP 599 manylinux-2014 and provides a C++17 toolchain. 
* Fix MKLDNNFCBackward * Update edge toolchain * Support platforms without rand_r * Cleanup random.h * Greatly simplify qemu setup * Remove unused functions in Jenkins_steps.groovy * Skip quantized_elemwise_mul due QuantizedElemwiseMulOpShape bug * Fix R package installation https://github.com/apache/incubator-mxnet/issues/18042 * Fix centos ccache * Fix GPU Makefile staticbuild on CentOS7 * CentOS7 NCCL * CentOS7 staticbuild fix link with libculibos

2020-04-14 10:29:29 -07:00

[MXNET-405] Add 2 new pipelines to the Official CI and run nightly tests. (#10827) * Migrate Nightlies PR1 * Run all tests * Fix RAT and Comment out KVstore for now * Comment out the chmod and test * remove chmod from rat and CW * One more check * changes * run install on cpu instead of gpu * Fix failing pip * Fix Pip * typo-fix * Run Pip on cpu instance * Add a README * Remove pip-test * add pip test back * Add BLC to jenkinsfile format * add permissions * include the git add, commit, push for broken link checker * push not working, comment * Review Comments 1: remove g++-5 and parallize make * Review Comments 2: change dockerfile name, update mx_lib, remove commented line * Review Comments 3: Separate out the dockerfiles for nightly tests * Review Comments 4: Move apache rat install to nightly install script * Review Comments 4: copy the rat folder into work * correct path * Change path Again * check tests * path * Uncomment KVStore * Fix rat and js tests * Move to docker_run and comment KVstore again * Adding docker_run for binaries Jenkinsfile * Permission and other fixes * fix RAT regex, add set -e to install funcs * Delete BLC, change chmod to 755 * JS: parallel make * Add links to github issues * change the instance type to cpu * Comment out pip installs to test build from source on cpu * typo * Add sudo for pip install * Add sudo to all pip install commands coz they cause CI to fail, linux only * Run compilation warnings on cpu instance * Change file perms and some cleanup * Fix the dockerfiles * Dockerfile updates * Merge JS dockerfile into nightly and remove export command * Pin repo versions and run cpu install from souce in cpu docker * Add yes to install in js script * remove sudo from index.md * Add sudo to test script instead * add sudo to test script

2018-06-20 19:17:45 -07:00

# Functions that run the nightly Tests:

CI: Migrate remaining Dockerfiles to docker-compose.yml and remove unused code (#18771) * Migrate remaining Dockerfiles to docker-compose.yml - Delete unused Dockerfiles - Delete unused install/*.sh scripts - Consolidate ubuntu_gpu_tensorrt and ubuntu_gpu - Remove deprecated logic in ci/build.py (no longer needed with docker-compose) - Remove ci/docker_cache.py (no longer needed with docker-compose) * Fix * Fix * Fix ubuntu_cpu_jekyll

2020-07-23 18:09:10 +00:00

test_rat_check() {

[MXNET-612] Optimize RAT setup and move it into PR stage (#11492) * Move Apache RAT installation into Docker layers instead of doing it during runtime * Address comments * Move RAT from nightly to PR

2018-06-29 19:07:07 +02:00

    set -e

CI: fix test_rat_check (#19711)

2020-12-23 15:18:24 -05:00

    set -o pipefail
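Why `set -o pipefail` matters when a checker's output is piped through `tee` can be illustrated with a small bash sketch. `failing_check` is a hypothetical stand-in, not the real RAT invocation, and the exit code 3 is arbitrary.

```shell
#!/usr/bin/env bash
# Hypothetical stand-in for a checker that prints a report and then fails.
failing_check() {
    echo "report line"
    return 3
}

# Without pipefail, the pipeline's status is that of the last command (tee),
# so the failure of the first command is hidden.
set +o pipefail
failing_check | tee /dev/null > /dev/null
STATUS_DEFAULT=$?

# With pipefail, the pipeline reports the rightmost non-zero exit status.
set -o pipefail
failing_check | tee /dev/null > /dev/null
STATUS_PIPEFAIL=$?

echo "default=${STATUS_DEFAULT} pipefail=${STATUS_PIPEFAIL}"
```

Combined with `set -e`, the non-zero pipeline status would abort the function instead of letting a failed check pass silently.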

[MXNET-612] Optimize RAT setup and move it into PR stage (#11492) * Move Apache RAT installation into Docker layers instead of doing it during runtime * Address comments * Move RAT from nightly to PR

2018-06-29 19:07:07 +02:00

    pushd .

CI: Migrate remaining Dockerfiles to docker-compose.yml and remove unused code (#18771) * Migrate remaining Dockerfiles to docker-compose.yml - Delete unused Dockerfiles - Delete unused install/*.sh scripts - Consolidate ubuntu_gpu_tensorrt and ubuntu_gpu - Remove deprecated logic in ci/build.py (no longer needed with docker-compose) - Remove ci/docker_cache.py (no longer needed with docker-compose) * Fix * Fix * Fix ubuntu_cpu_jekyll

2018-07-19 17:18:04 +02:00

2020-07-23 18:09:10 +00:00

    cd /usr/local/src/apache-rat-0.13

[MXNET-612] Optimize RAT setup and move it into PR stage (#11492) * Move Apache RAT installation into Docker layers instead of doing it during runtime * Address comments * Move RAT from nightly to PR

2018-06-29 19:07:07 +02:00

[LICENSE] Reorganize rat-excludes file to ease license auditing (#19743) * Reorganize rat-excludes file to ease license auditing * move rat-excludes to top level

2021-01-15 17:49:18 -05:00

    OUTPUT=$(java -jar apache-rat-0.13.jar -E /work/mxnet/rat-excludes -d /work/mxnet | tee >(cat - >&5))

[MXNET-612] Optimize RAT setup and move it into PR stage (#11492) * Move Apache RAT installation into Docker layers instead of doing it during runtime * Address comments * Move RAT from nightly to PR

2018-06-29 19:07:07 +02:00

    ERROR_MESSAGE="Printing headers for text files without a valid license header"
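The `tee >(cat - >&5)` construct above captures the report in `$OUTPUT` while still streaming it to the console. A minimal bash sketch of the pattern follows. It assumes file descriptor 5 was opened as a copy of stdout beforehand (that step is not visible in this excerpt), `fake_rat_report` is a hypothetical stand-in for the RAT jar, and the substring check on `$OUTPUT` is only one plausible way to use `ERROR_MESSAGE`.

```shell
#!/usr/bin/env bash
# Hypothetical stand-in for the RAT jar: prints a report containing the marker.
fake_rat_report() {
    echo "Summary"
    echo "Printing headers for text files without a valid license header"
}

# Assumption: fd 5 is a copy of the current stdout, opened before the capture.
exec 5>&1

# tee writes one copy into the process substitution, which forwards it to fd 5
# (the console), and one copy to its own stdout, which $(...) captures.
OUTPUT=$(fake_rat_report | tee >(cat - >&5))

ERROR_MESSAGE="Printing headers for text files without a valid license header"
if [[ $OUTPUT == *"$ERROR_MESSAGE"* ]]; then
    RESULT="license problems found"
else
    RESULT="clean"
fi
echo "$RESULT"

# Close the extra descriptor once it is no longer needed.
exec 5>&-
```

Process substitution is a bash feature, so this sketch would not run under a plain POSIX `sh`.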

							

[MXNET-405] Add 2 new pipelines to the Official CI and run nightly tests. (#10827) * Migrate Nightlies PR1 * Run all tests * Fix RAT and Comment out KVstore for now * Comment out the chmod and test * remove chmod from rat and CW * One more check * changes * run install on cpu instead of gpu * Fix failing pip * Fix Pip * typo-fix * Run Pip on cpu instance * Add a README * Remove pip-test * add pip test back * Add BLC to jenkinsfile format * add permissions * include the git add, commit, push for broken link checker * push not working, comment * Review Comments 1: remove g++-5 and parallize make * Review Comments 2: change dockerfile name, update mx_lib, remove commented line * Review Comments 3: Separate out the dockerfiles for nightly tests * Review Comments 4: Move apache rat install to nightly install script * Review Comments 4: copy the rat folder into work * correct path * Change path Again * check tests * path * Uncomment KVStore * Fix rat and js tests * Move to docker_run and comment KVstore again * Adding docker_run for binaries Jenkinsfile * Permission and other fixes * fix RAT regex, add set -e to install funcs * Delete BLC, change chmod to 755 * JS: parallel make * Add links to github issues * change the instance type to cpu * Comment out pip installs to test build from source on cpu * typo * Add sudo for pip install * Add sudo to all pip install commands coz they cause CI to fail, linux only * Run compilation warnings on cpu instance * Change file perms and some cleanup * Fix the dockerfiles * Dockerfile updates * Merge JS dockerfile into nightly and remove export command * Pin repo versions and run cpu install from souce in cpu docker * Add yes to install in js script * remove sudo from index.md * Add sudo to test script instead * add sudo to test script

2018-06-20 19:17:45 -07:00

[DOC] Fix warnings in tutorials and turn on -W (#19624)

2020-12-08 11:13:09 -05:00

    export DMLC_LOG_STACK_TRACE_DEPTH=100

Stop testing Python 2 on CI (#15990)

2020-02-03 09:58:01 -08:00

    tests/nightly/test_kvstore.py

[MXNET-405] Add 2 new pipelines to the Official CI and run nightly tests. (#10827) * Migrate Nightlies PR1 * Run all tests * Fix RAT and Comment out KVstore for now * Comment out the chmod and test * remove chmod from rat and CW * One more check * changes * run install on cpu instead of gpu * Fix failing pip * Fix Pip * typo-fix * Run Pip on cpu instance * Add a README * Remove pip-test * add pip test back * Add BLC to jenkinsfile format * add permissions * include the git add, commit, push for broken link checker * push not working, comment * Review Comments 1: remove g++-5 and parallize make * Review Comments 2: change dockerfile name, update mx_lib, remove commented line * Review Comments 3: Separate out the dockerfiles for nightly tests * Review Comments 4: Move apache rat install to nightly install script * Review Comments 4: copy the rat folder into work * correct path * Change path Again * check tests * path * Uncomment KVStore * Fix rat and js tests * Move to docker_run and comment KVstore again * Adding docker_run for binaries Jenkinsfile * Permission and other fixes * fix RAT regex, add set -e to install funcs * Delete BLC, change chmod to 755 * JS: parallel make * Add links to github issues * change the instance type to cpu * Comment out pip installs to test build from source on cpu * typo * Add sudo for pip install * Add sudo to all pip install commands coz they cause CI to fail, linux only * Run compilation warnings on cpu instance * Change file perms and some cleanup * Fix the dockerfiles * Dockerfile updates * Merge JS dockerfile into nightly and remove export command * Pin repo versions and run cpu install from souce in cpu docker * Add yes to install in js script * remove sudo from index.md * Add sudo to test script instead * add sudo to test script

2018-06-20 19:17:45 -07:00

2019-04-23 14:47:10 -07:00

#Test Large Tensor Size

[DOC] Fix warnings in tutorials and turn on -W (#19624)

2020-12-08 11:13:09 -05:00

    export DMLC_LOG_STACK_TRACE_DEPTH=100

removing forked and adding overall timeout for 2 hrs (#19792) Co-authored-by: Rohit Kumar Srivastava <srivastava.141@buckeyemail.osu.edu>

2021-02-03 12:00:09 -08:00

    pytest -s --exitfirst --verbose --timeout=7200 tests/nightly/test_np_large_array.py

skipping randint flaky test for large vector and reordering op execution (#17388)

2020-01-21 11:02:55 -08:00

[MXNET-651] MXNet Model Backwards Compatibility Checker (#11626) * Added MNIST-MLP-Module-API models to check model save and load_checkpoint methods * Added LENET with Conv2D operator training file * Added LENET with Conv2d operator inference file * Added LanguageModelling with RNN training file * Added LamguageModelling with RNN inference file * Added hybridized LENET Gluon Model training file * Added hybridized LENET gluon model inference file * Added license headers * Refactored the model and inference files and extracted out duplicate code in a common file * Added runtime function for executing the MBCC files * Added JenkinsFile for MBCC to be run as a nightly job * Added boto3 install for s3 uploads * Added README for MBCC * Added license header * Added more common functions from lm_rnn_gluon_train and inference files into common.py to clean up code * Added scripts for training models on older versions of MXNet * Added check for preventing inference script from crashing in case no trained models are found * Fixed indentation issue * Replaced Penn Tree Bank Dataset with Sherlock Holmes Dataset * Fixed indentation issue * Removed training in models and added smaller models. Now we are simply checking a forward pass in the model with dummy data. 
* Updated README * Fixed indentation error * Fixed indentation error * Removed code duplication in the training file * Added comments for runtime_functions script for training files * Merged S3 Buckets for storing data and models into one * Automated the process to fetch MXNet versions from git tags * Added defensive checks for the case where the data might not be found * Fixed issue where we were performing inference on state model files * Replaced print statements with logging ones * Removed boto install statements and move them into ubuntu_python docker * Separated training and uploading of models into separate files so that training runs in Docker and upload runs outside Docker * Fixed pylint warnings * Updated comments and README * Removed the venv for training process * Fixed indentation in the MBCC Jenkins file and also separated out training and inference into two separate stages * Fixed indendation * Fixed erroneous single quote * Added --user flag to check for Jenkins error * Removed unused methods * Added force flag in the pip command to install mxnet * Removed the force-re-install flag * Changed exit 1 to exit 0 * Added quotes around the shell command * added packlibs and unpack libs for MXNet builds * Changed PythonPath from relative to absolute * Created dedicated bucket with correct permission * Fix for python path in training * Changed bucket name to CI bucket * Added set -ex to the upload shell script * Now raising an exception if no models are found in the S3 bucket * Added regex to train models script * Added check for performing inference only on models trained on same major versions * Added set -ex flags to shell scripts * Added multi-version regex checks in training * Fixed typo in regex * Now we will train models for all the minor versions for a given major version by traversing the tags * Added check for validating current_version

2018-07-31 02:50:13 -07:00

#Tests Model backwards compatibility on MXNet

[DOC] Fix warnings in tutorials and turn on -W (#19624)

2020-12-08 11:13:09 -05:00

    export DMLC_LOG_STACK_TRACE_DEPTH=100

[MXNET-651] MXNet Model Backwards Compatibility Checker (#11626) * Added MNIST-MLP-Module-API models to check model save and load_checkpoint methods * Added LENET with Conv2D operator training file * Added LENET with Conv2d operator inference file * Added LanguageModelling with RNN training file * Added LamguageModelling with RNN inference file * Added hybridized LENET Gluon Model training file * Added hybridized LENET gluon model inference file * Added license headers * Refactored the model and inference files and extracted out duplicate code in a common file * Added runtime function for executing the MBCC files * Added JenkinsFile for MBCC to be run as a nightly job * Added boto3 install for s3 uploads * Added README for MBCC * Added license header * Added more common functions from lm_rnn_gluon_train and inference files into common.py to clean up code * Added scripts for training models on older versions of MXNet * Added check for preventing inference script from crashing in case no trained models are found * Fixed indentation issue * Replaced Penn Tree Bank Dataset with Sherlock Holmes Dataset * Fixed indentation issue * Removed training in models and added smaller models. Now we are simply checking a forward pass in the model with dummy data. 
* Updated README * Fixed indentation error * Fixed indentation error * Removed code duplication in the training file * Added comments for runtime_functions script for training files * Merged S3 Buckets for storing data and models into one * Automated the process to fetch MXNet versions from git tags * Added defensive checks for the case where the data might not be found * Fixed issue where we were performing inference on state model files * Replaced print statements with logging ones * Removed boto install statements and move them into ubuntu_python docker * Separated training and uploading of models into separate files so that training runs in Docker and upload runs outside Docker * Fixed pylint warnings * Updated comments and README * Removed the venv for training process * Fixed indentation in the MBCC Jenkins file and also separated out training and inference into two separate stages * Fixed indendation * Fixed erroneous single quote * Added --user flag to check for Jenkins error * Removed unused methods * Added force flag in the pip command to install mxnet * Removed the force-re-install flag * Changed exit 1 to exit 0 * Added quotes around the shell command * added packlibs and unpack libs for MXNet builds * Changed PythonPath from relative to absolute * Created dedicated bucket with correct permission * Fix for python path in training * Changed bucket name to CI bucket * Added set -ex to the upload shell script * Now raising an exception if no models are found in the S3 bucket * Added regex to train models script * Added check for performing inference only on models trained on same major versions * Added set -ex flags to shell scripts * Added multi-version regex checks in training * Fixed typo in regex * Now we will train models for all the minor versions for a given major version by traversing the tags * Added check for validating current_version

2018-07-31 02:50:13 -07:00

								    ./tests/nightly/model_backwards_compatibility_check/model_backward_compat_checker.sh

							

[DOC] Fix warnings in tutorials and turn on -W (#19624)

2020-12-08 11:13:09 -05:00

								    export DMLC_LOG_STACK_TRACE_DEPTH=100

							

[MXNET-651] MXNet Model Backwards Compatibility Checker (#11626) * Added MNIST-MLP-Module-API models to check model save and load_checkpoint methods * Added LENET with Conv2D operator training file * Added LENET with Conv2d operator inference file * Added LanguageModelling with RNN training file * Added LamguageModelling with RNN inference file * Added hybridized LENET Gluon Model training file * Added hybridized LENET gluon model inference file * Added license headers * Refactored the model and inference files and extracted out duplicate code in a common file * Added runtime function for executing the MBCC files * Added JenkinsFile for MBCC to be run as a nightly job * Added boto3 install for s3 uploads * Added README for MBCC * Added license header * Added more common functions from lm_rnn_gluon_train and inference files into common.py to clean up code * Added scripts for training models on older versions of MXNet * Added check for preventing inference script from crashing in case no trained models are found * Fixed indentation issue * Replaced Penn Tree Bank Dataset with Sherlock Holmes Dataset * Fixed indentation issue * Removed training in models and added smaller models. Now we are simply checking a forward pass in the model with dummy data. 
* Updated README * Fixed indentation error * Fixed indentation error * Removed code duplication in the training file * Added comments for runtime_functions script for training files * Merged S3 Buckets for storing data and models into one * Automated the process to fetch MXNet versions from git tags * Added defensive checks for the case where the data might not be found * Fixed issue where we were performing inference on state model files * Replaced print statements with logging ones * Removed boto install statements and move them into ubuntu_python docker * Separated training and uploading of models into separate files so that training runs in Docker and upload runs outside Docker * Fixed pylint warnings * Updated comments and README * Removed the venv for training process * Fixed indentation in the MBCC Jenkins file and also separated out training and inference into two separate stages * Fixed indendation * Fixed erroneous single quote * Added --user flag to check for Jenkins error * Removed unused methods * Added force flag in the pip command to install mxnet * Removed the force-re-install flag * Changed exit 1 to exit 0 * Added quotes around the shell command * added packlibs and unpack libs for MXNet builds * Changed PythonPath from relative to absolute * Created dedicated bucket with correct permission * Fix for python path in training * Changed bucket name to CI bucket * Added set -ex to the upload shell script * Now raising an exception if no models are found in the S3 bucket * Added regex to train models script * Added check for performing inference only on models trained on same major versions * Added set -ex flags to shell scripts * Added multi-version regex checks in training * Fixed typo in regex * Now we will train models for all the minor versions for a given major version by traversing the tags * Added check for validating current_version

2018-07-31 02:50:13 -07:00

								    ./tests/nightly/model_backwards_compatibility_check/train_mxnet_legacy_models.sh

							

[MXNET-1194] Reenable nightly tutorials tests for Python2 and Python3 (#13099) * Reenable nightly tests tutorials * small fix to settings * optimize a few more tutorials * Update tests * Update runtime_functions.sh * Update fine_tuning_gluon.md * Update JenkinsfileForBinaries * Update JenkinsfileForBinaries * remove coverage

2018-11-07 15:16:40 -08:00

								nightly_tutorial_test_ubuntu_python3_gpu() {

							

turn on Sphinx warnings as errors (#13544) * turn on warnings as errors * move warnings as error logic to build_all_version * fix typo in comment * add warning as error option for docs pipeline * bump ci to test again; use this chance to add notes on this feature * fix bugs in image.py docs

2018-12-18 09:02:02 -08:00

								    export BUILD_VER=tutorial

							

[MXNET-1194] Reenable nightly tutorials tests for Python2 and Python3 (#13099) * Reenable nightly tests tutorials * small fix to settings * optimize a few more tutorials * Update tests * Update runtime_functions.sh * Update fine_tuning_gluon.md * Update JenkinsfileForBinaries * Update JenkinsfileForBinaries * remove coverage

2018-11-07 15:16:40 -08:00

								    export MXNET_DOCS_BUILD_MXNET=0

							

[DOC] Fix warnings in tutorials and turn on -W (#19624)

2020-12-08 11:13:09 -05:00

								    export DMLC_LOG_STACK_TRACE_DEPTH=100

							

[MXNET-1194] Reenable nightly tutorials tests for Python2 and Python3 (#13099) * Reenable nightly tests tutorials * small fix to settings * optimize a few more tutorials * Update tests * Update runtime_functions.sh * Update fine_tuning_gluon.md * Update JenkinsfileForBinaries * Update JenkinsfileForBinaries * remove coverage

2018-11-07 15:16:40 -08:00

    make html

Surpress subgraph log in CI (#16607) Change-Id: Ia2ed6fdbb1d2cb5cc607a8856ca13ee338e27eac

2019-10-24 04:47:01 -05:00

								    export MXNET_SUBGRAPH_VERBOSE=0

							

[MXNET-1194] Reenable nightly tutorials tests for Python2 and Python3 (#13099) * Reenable nightly tests tutorials * small fix to settings * optimize a few more tutorials * Update tests * Update runtime_functions.sh * Update fine_tuning_gluon.md * Update JenkinsfileForBinaries * Update JenkinsfileForBinaries * remove coverage

2018-11-07 15:16:40 -08:00

								    export PYTHONPATH=/work/mxnet/python/

							

[DEV] switch nose with pytest (#18025) * switch nose with pytest * switch centos python to 3.6 * disable dist kvstore tests * skip hanging test

2020-04-22 23:53:12 -07:00

								    pytest --durations=50 --cov-report xml:tests_tutorials.xml --capture=no test_tutorials.py

							

[MXNET-1194] Reenable nightly tutorials tests for Python2 and Python3 (#13099) * Reenable nightly tests tutorials * small fix to settings * optimize a few more tutorials * Update tests * Update runtime_functions.sh * Update fine_tuning_gluon.md * Update JenkinsfileForBinaries * Update JenkinsfileForBinaries * remove coverage

2018-11-07 15:16:40 -08:00

[MXNET-1396][Fit-API] Update default handler logic (#14765) * move to nightly for binaries * update default handler * fix pylint * trigger ci * trigger ci

2019-04-23 20:33:03 -07:00

								nightly_estimator() {

							

[MXNet-1343][Fit API]Add CNN integration test for fit() API (#14405) * added cnn intg tests for fit api * updated cnn intg tests * added functions for nightly test * updated runtime_function * updated intg tests * updated init, datapath, refs * added validation data * update cpu test * refactor code * updated context

2019-04-03 15:12:56 -07:00

    set -ex

[DOC] Fix warnings in tutorials and turn on -W (#19624)

2020-12-08 11:13:09 -05:00

								    export DMLC_LOG_STACK_TRACE_DEPTH=100

							

[MXNet-1343][Fit API]Add CNN integration test for fit() API (#14405) * added cnn intg tests for fit api * updated cnn intg tests * added functions for nightly test * updated runtime_function * updated intg tests * updated init, datapath, refs * added validation data * update cpu test * refactor code * updated context

2019-04-03 15:12:56 -07:00

    cd /work/mxnet/tests/nightly/estimator

[DEV] switch nose with pytest (#18025) * switch nose with pytest * switch centos python to 3.6 * disable dist kvstore tests * skip hanging test

2020-04-22 23:53:12 -07:00

    pytest test_estimator_cnn.py

[MXNet-1375][Fit API]Added RNN integration test for fit() API (#14547) * Added RNN integration test for fit() API * Addressed review comments: change in JenkinFile, tmp directory, ctx with condense if/else, renamed imports * CPU test doesn't require nvidiadocker container * Modified the structure by removing the redundant code

2019-04-03 14:28:08 -07:00

New Website: New Pipeline [3/3] (#15883) Merging new website pipelines and fix to content

2019-09-19 22:17:04 -07:00

# For testing PRs

CI docker revamp; Add Jetson, Raspberry and CentOS 7 build [MXNET-42][MXNET-43] (#9995) * Start dockerfile revamp * Fix lint * Fix lint * Fix lint * Add python unit test * Add new dependency * Add user creation * Fix file permission * Determine USER_ID automatically * REmove ENV command in dockerfile * Remove python nose timer, improve useradd * ENable nvidia docker * Add remaining unittests * Add CentOS 7 unittests * Add integration tests * Add TVM and LLVM to dockerfile * Add ARMv7 and ARMv8 to build * Fix amalgamation build * Improvements and android_arm64 fixes, missing removing pthread * Jetson fix (install unzip) * Build jetson with make until issue with libomp.so is resolved #10011 * Fix Amalgamation builds * Fix Jetson build by switching to cmake * Assign CentOS gpu test to gpu instance * Fix R builds * Assign jobs to right docker containers * Fix missing file permissions inside docker * Create homedir on centos * Enable lapack * Fix Lapack on Cent OS 7 * Disable MXNET_MKLDNN_DEBUG * Delete Dockerfiles * Last general refinements before finish * Remove docker_multiarch folder, superseeded by the new ci scripts * Address review comments * Fix Caffe Integrationstest * Fix deploy stage * Address review comments * Address review comments * Enable script to run on Mac

2018-03-09 23:33:10 +01:00

								deploy_docs() {

							

[MXNET-247] Always build profiler (#10308) * Always build profiler * Update naive_engine.cc * remove PROFILE_MESSAGE macro * Remove USE_PROFILER=1 from CI runs

2018-04-01 16:19:45 -07:00

Julia docs (#15454) * add julia docs generation option * add julia docs to website build * update links to point to local julia site * fix mxnet build setting bug * fix link for site * update ubuntu guide for julia * turn building mxnet on by default * readd env var since CI uses it * fold julia docs into main website jenkins routine * fix ubuntu julia setup steps * cleanup mentions of the old julia docs pipeline

2019-07-11 14:23:20 -07:00

								    export CC="ccache gcc"

							

import Julia binding - enable Jenkins CI build for Julia - add license headers to Julia source code - update links for Julia README

2018-09-29 02:48:43 +08:00

New Website: New Pipeline [3/3] (#15883) Merging new website pipelines and fix to content

2019-09-19 22:17:04 -07:00

    build_python_docs

Julia docs (#15454) * add julia docs generation option * add julia docs to website build * update links to point to local julia site * fix mxnet build setting bug * fix link for site * update ubuntu guide for julia * turn building mxnet on by default * readd env var since CI uses it * fold julia docs into main website jenkins routine * fix ubuntu julia setup steps * cleanup mentions of the old julia docs pipeline

2019-07-11 14:23:20 -07:00

    popd

New Website: New Pipeline [3/3] (#15883) Merging new website pipelines and fix to content

2019-09-19 22:17:04 -07:00

[WEBSITE] publish master website to /versions/master (#19190) * publish master website to /versions/master * update config

2020-09-26 22:28:29 -07:00

    set -ex

New Website: New Pipeline [3/3] (#15883) Merging new website pipelines and fix to content

2019-09-19 22:17:04 -07:00

[WEBSITE] publish master website to /versions/master (#19190) * publish master website to /versions/master * update config

2020-09-26 22:28:29 -07:00

    build_docs_setup

New Website: New Pipeline [3/3] (#15883) Merging new website pipelines and fix to content

2019-09-19 22:17:04 -07:00

[WEBSITE] publish master website to /versions/master (#19190) * publish master website to /versions/master * update config

2020-09-26 22:28:29 -07:00

    pushd docs/python_docs

Fix Python docs (#18924) * Fix Python docs * Fix * Fix

2020-08-18 16:57:35 +00:00

[WEBSITE] publish master website to /versions/master (#19190) * publish master website to /versions/master * update config

2020-09-26 22:28:29 -07:00

								    export PATH=/home/jenkins_slave/.local/bin:$PATH

							

New Website: New Pipeline [3/3] (#15883) Merging new website pipelines and fix to content

2019-09-19 22:17:04 -07:00

[WEBSITE] publish master website to /versions/master (#19190) * publish master website to /versions/master * update config

2020-09-26 22:28:29 -07:00

    pushd python

[DOC][v2.0] Part3: Evaluate Notebooks (#20490) * evaluate notebooks * run notebooks on g4 instance * update jenkins * enable eval * use cpu instances * update skip list * update * skip info_gan.md * use gpu instance * use gpu * update jenkins * fix * fix * fix * fix * fix * add train_dataset * use net instead of model * fix notebook * fix * fix * update test * fix * update notebooks * fix doc * remove irrelevant comment

2021-08-17 09:26:14 -07:00

    cp tutorials/getting-started/crash-course/prepare_dataset.py .

[WEBSITE] publish master website to /versions/master (#19190) * publish master website to /versions/master * update config

2020-09-26 22:28:29 -07:00

    make clean

[DOC][v2.0] Part3: Evaluate Notebooks (#20490) * evaluate notebooks * run notebooks on g4 instance * update jenkins * enable eval * use cpu instances * update skip list * update * skip info_gan.md * use gpu instance * use gpu * update jenkins * fix * fix * fix * fix * fix * add train_dataset * use net instead of model * fix notebook * fix * fix * update test * fix * update notebooks * fix doc * remove irrelevant comment

2021-08-17 09:26:14 -07:00

								    make html EVAL=1

							

New Website: New Pipeline [3/3] (#15883) Merging new website pipelines and fix to content

2019-09-19 22:17:04 -07:00

[WEBSITE] publish master website to /versions/master (#19190) * publish master website to /versions/master * update config

2020-09-26 22:28:29 -07:00

    GZIP=-9 tar zcvf python-artifacts.tgz -C build/_build/html .

New Website: New Pipeline [3/3] (#15883) Merging new website pipelines and fix to content

2019-09-19 22:17:04 -07:00

[WEBSITE] publish master website to /versions/master (#19190) * publish master website to /versions/master * update config

2020-09-26 22:28:29 -07:00

    mv python/python-artifacts.tgz /work/mxnet/docs/_build/

New Website: New Pipeline [3/3] (#15883) Merging new website pipelines and fix to content

2019-09-19 22:17:04 -07:00

[WEBSITE] publish master website to /versions/master (#19190) * publish master website to /versions/master * update config

2020-09-26 22:28:29 -07:00

    popd

New Website: New Pipeline [3/3] (#15883) Merging new website pipelines and fix to content

2019-09-19 22:17:04 -07:00

[WEBSITE] publish master website to /versions/master (#19190) * publish master website to /versions/master * update config

2020-09-26 22:28:29 -07:00

								    python_doc_folder='html/api/python/docs'

							

Add back cpp-package (#20131) This updates and adds back the cpp-package removed in https://github.com/apache/incubator-mxnet/commit/97d4ba5a133f93ff6075dcde3ef842b23d498a12

2021-05-24 13:44:39 -07:00

								    api_folder='html/api'

							

[WEBSITE] publish master website to /versions/master (#19190) * publish master website to /versions/master * update config

2020-09-26 22:28:29 -07:00

New Website: New Pipeline [3/3] (#15883) Merging new website pipelines and fix to content

2019-09-19 22:17:04 -07:00

    # Python has its own landing page/site so we don't put it in /docs/api

[WEBSITE] publish master website to /versions/master (#19190) * publish master website to /versions/master * update config

2020-09-26 22:28:29 -07:00

								    mkdir -p $python_doc_folder && tar -xzf python-artifacts.tgz --directory $python_doc_folder

							

Add back cpp-package (#20131) This updates and adds back the cpp-package removed in https://github.com/apache/incubator-mxnet/commit/97d4ba5a133f93ff6075dcde3ef842b23d498a12

2021-05-24 13:44:39 -07:00

								    mkdir -p $api_folder/cpp/docs/api && tar -xzf c-artifacts.tgz --directory $api_folder/cpp/docs/api

							

[WEBSITE] publish master website to /versions/master (#19190) * publish master website to /versions/master * update config

2020-09-26 22:28:29 -07:00

[Website] Fix website publish (#20573) * fix website publish * update * remove .asf.yaml from version/master * force include .asf.yaml * include .htaccess * add .asf.yaml check in CI

2021-09-13 22:29:34 -07:00

    # check if .asf.yaml file exists

Check for version artifacts in website pipeline (#19210) * [WIP] check for ver artifacts in website pipeline * Update runtime_functions.sh * move the checks down * Update runtime_functions.sh * change directory

2020-09-23 22:08:32 -07:00

								    if [ ! -f "html/.htaccess" ]; then

							

[WEBSITE] publish master website to /versions/master (#19190) * publish master website to /versions/master * update config

2020-09-26 22:28:29 -07:00

								    echo "detected version is $version"

							

Check for version artifacts in website pipeline (#19210) * [WIP] check for ver artifacts in website pipeline * Update runtime_functions.sh * move the checks down * Update runtime_functions.sh * change directory

2020-09-23 22:08:32 -07:00

    # check if the artifacts for this version exist

[WEBSITE] publish master website to /versions/master (#19190) * publish master website to /versions/master * update config

2020-09-26 22:28:29 -07:00

								        echo "html/versions/$version/api directory exists"

							

Check for version artifacts in website pipeline (#19210) * [WIP] check for ver artifacts in website pipeline * Update runtime_functions.sh * move the checks down * Update runtime_functions.sh * change directory

2020-09-23 22:08:32 -07:00

    else

[WEBSITE] publish master website to /versions/master (#19190) * publish master website to /versions/master * update config

2020-09-26 22:28:29 -07:00

								        echo "html/versions/$version/api directory does not exist! Exiting 1"

							

Check for version artifacts in website pipeline (#19210) * [WIP] check for ver artifacts in website pipeline * Update runtime_functions.sh * move the checks down * Update runtime_functions.sh * change directory

2020-09-23 22:08:32 -07:00

        exit 1

[WEBSITE] publish master website to /versions/master (#19190) * publish master website to /versions/master * update config

2020-09-26 22:28:29 -07:00

fi

fix master website version null foler (#19237)

2020-09-27 15:48:55 -07:00

    mkdir -p html/versions/master

Fix building of master website due to removed blog page. (#21083)

2022-06-28 08:53:03 -07:00

								    for f in 404.html api assets community ecosystem features trusted_by feed.xml get_started index.html; do

							

fix master website version null foler (#19237)

2020-09-27 15:48:55 -07:00

        cp -r html/$f html/versions/master/

[WEBSITE] publish master website to /versions/master (#19190) * publish master website to /versions/master * update config

2020-09-26 22:28:29 -07:00

    done

New Website: New Pipeline [3/3] (#15883) Merging new website pipelines and fix to content

2019-09-19 22:17:04 -07:00

    GZIP=-9 tar -zcvf full_website.tgz -C html .
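
The mirroring loop and the final archive step above can be tried in isolation. The following is a minimal sketch under stated assumptions, not the pipeline's actual behavior: `work_dir` and the two sample files are hypothetical stand-ins for `docs/_build` and the generated site, and only two of the top-level entries are copied.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical scratch directory standing in for docs/_build (assumption, not from the script).
work_dir="$(mktemp -d)"
mkdir -p "$work_dir/html/api"
echo "home" > "$work_dir/html/index.html"
echo "api"  > "$work_dir/html/api/index.html"

# Mirror selected top-level entries into html/versions/master.
mkdir -p "$work_dir/html/versions/master"
for f in index.html api; do
    cp -r "$work_dir/html/$f" "$work_dir/html/versions/master/"
done

# Pack the whole tree; -C makes the archived paths relative to html/.
tar -czf "$work_dir/full_website.tgz" -C "$work_dir/html" .

# List the mirrored entries that ended up in the archive.
tar -tzf "$work_dir/full_website.tgz" | grep 'versions/master' | sort
```

Because the archive is written outside `html/`, it does not end up inside itself, which matches how the original writes `full_website.tgz` next to the `html` directory.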

Beta build (#16411) * remove unused jenkinsfile * add a pipeline that publishes to staging beta site * add staging site tar file generation; fix outputs

2019-10-10 16:33:34 -07:00

								build_docs_beta() {

							

New Website: New Pipeline [3/3] (#15883) Merging new website pipelines and fix to content

2019-09-19 22:17:04 -07:00

    pushd docs/_build

[WEBSITE] publish master website to /versions/master (#19190) * publish master website to /versions/master * update config

2020-09-26 22:28:29 -07:00

								    python_doc_folder="html/versions/$BRANCH/api/python/docs"

							

Add back cpp-package (#20131) This updates and adds back the cpp-package removed in https://github.com/apache/incubator-mxnet/commit/97d4ba5a133f93ff6075dcde3ef842b23d498a12

2021-05-24 13:44:39 -07:00

								    cpp_doc_folder="html/versions/$BRANCH/api/cpp/docs"

							

[WEBSITE] publish master website to /versions/master (#19190) * publish master website to /versions/master * update config

2020-09-26 22:28:29 -07:00

								    mkdir -p $python_doc_folder && tar -xzf python-artifacts.tgz --directory $python_doc_folder

							

Add back cpp-package (#20131) This updates and adds back the cpp-package removed in https://github.com/apache/incubator-mxnet/commit/97d4ba5a133f93ff6075dcde3ef842b23d498a12

2021-05-24 13:44:39 -07:00

								    mkdir -p $cpp_doc_folder && tar -xzf c-artifacts.tgz --directory $cpp_doc_folder

							

Beta build (#16411) * remove unused jenkinsfile * add a pipeline that publishes to staging beta site * add staging site tar file generation; fix outputs

2019-10-10 16:33:34 -07:00

    GZIP=-9 tar -zcvf beta_website.tgz -C html .

New Website: New Pipeline [3/3] (#15883) Merging new website pipelines and fix to content

2019-09-19 22:17:04 -07:00

    popd

[website] Automate website artifacts uploading (#19955) * website version update * back up versions * rename file * remove test * add backup version * remove space * remove test * cp between s3 buckets Co-authored-by: Wei Chu <weichu@amazon.com>

2021-03-11 15:36:05 -08:00

								push_docs() {

							

New Website: New Pipeline [3/3] (#15883) Merging new website pipelines and fix to content

2019-09-19 22:17:04 -07:00

								create_repo() {

							

Port top-level-project updates from v1.x branch (#21162)

2023-01-04 04:09:23 -08:00

   git remote add upstream https://github.com/apache/mxnet

New Website: New Pipeline [3/3] (#15883) Merging new website pipelines and fix to content

2019-09-19 22:17:04 -07:00

   cd ..

import Julia binding - enable Jenkins CI build for Julia - add license headers to Julia source code - update links for Julia README

2018-09-29 02:48:43 +08:00

Opt in to newer GCC C++ ABI on RedHat Developer Toolset (#19182) Target version 11, which first appeared in G++7 and which is supported by default by any G++ version up until 10 (the most recent version at time of writing) due to their respective default value of -fabi-compat-version=11. Generate aliases for ABI version 7, which first appeared in G++ 4.8 and is the G++ version shipped by default on a non-EOL system (RHEL7 based systems such as Amazon Linux 1).

2019-05-23 12:48:44 +02:00

								build_static_libmxnet() {

							

Switch to GCC 8 for distribution build (#19185) * Switch to GCC 8 for distribution build * Include oneDNN gemm fix * Update to oneDNN v1.6.4 * Update mkldnn_format_tag_last

2020-10-05 11:34:20 -07:00

    source /opt/rh/devtoolset-8/enable

[2.0] Bump Python to >= 3.8 (#20593) * Bump Python to >= 3.8, NumPy to >= 1.21.0 * remove pillow in requirements * update python * fix some tests

2021-10-19 22:21:04 -07:00

    source /opt/rh/rh-python38/enable

2020-09-19 18:42:34 -07:00

    # Opt in to newer GCC C++ ABI. devtoolset defaults to ABI Version 2.

2019-05-23 12:48:44 +02:00

								    local mxnet_variant=${1:?"This function requires a mxnet variant as the first argument"}

							

Merge make/pip and make/maven configurations (#17027)

2019-12-09 20:23:47 -06:00

								    source tools/staticbuild/build.sh ${mxnet_variant}
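
The `${1:?...}` guard used for the first argument above aborts when the argument is missing or empty. A small sketch of that behavior follows; `build_variant` is a hypothetical helper written for illustration and is not part of `runtime_functions.sh`.

```shell
#!/usr/bin/env bash

# Hypothetical helper (assumption) showing the ${1:?message} guard.
build_variant() {
    local variant=${1:?"This function requires a variant name as the first argument"}
    echo "building variant: ${variant}"
}

# With an argument the function runs normally.
ok_output="$(build_variant cpu)"
echo "$ok_output"

# Without one, the expansion prints the message to stderr and the
# subshell exits with a non-zero status, which is captured here.
if ( build_variant ) 2>/dev/null; then
    missing_status=0
else
    missing_status=$?
fi
echo "status without argument: ${missing_status}"
```

Running the failing call in a subshell keeps the calling script alive; in a non-interactive shell the same expansion failure outside a subshell would end the script.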