# Apache MXNet -- lightweight, portable, flexible distributed/mobile deep
# learning with a dynamic, mutation-aware dataflow dependency scheduler;
# for Python, R, Julia, Scala, Go, JavaScript and more.
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing,
# software distributed under the License is distributed on an
# "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
# KIND, either express or implied. See the License for the
# specific language governing permissions and limitations
# under the License.
# Root of the source tree and the bundled third-party dependencies.
# Simply expanded (:=) so the values are fixed once at parse time.
ROOTDIR := $(CURDIR)
TPARTYDIR := $(ROOTDIR)/3rdparty

# Host OS/arch detection: Windows sets $(OS) to Windows_NT itself;
# everywhere else fall back to uname.  UNAME_P (processor) is only
# needed on non-Windows hosts (used below for the MKL-DNN default).
ifeq ($(OS),Windows_NT)
UNAME_S := Windows
else
UNAME_S := $(shell uname -s)
UNAME_P := $(shell uname -p)
endif
# Pick the build configuration file, in order of preference:
#   1. an explicit `config=...` on the make command line (left untouched)
#   2. CXXNET_CONFIG, for legacy cxxnet-style setups
#   3. a user-edited ./config.mk in the source root, if present
#   4. the default template shipped in make/config.mk
ifndef config
ifdef CXXNET_CONFIG
config = $(CXXNET_CONFIG)
else ifneq ("$(wildcard ./config.mk)","")
config = config.mk
else
config = make/config.mk
endif
endif
# Locations of the bundled submodules; each may be overridden by the
# caller (environment or command line) before reaching this point.
ifndef DMLC_CORE
DMLC_CORE = $(TPARTYDIR)/dmlc-core
endif
# Simply expanded so the glob runs once at parse time instead of on
# every reference.
CORE_INC := $(wildcard $(DMLC_CORE)/include/*/*.h)

ifndef NNVM_PATH
NNVM_PATH = $(TPARTYDIR)/tvm/nnvm
endif
ifndef DLPACK_PATH
DLPACK_PATH = $(ROOTDIR)/3rdparty/dlpack
endif
ifndef AMALGAMATION_PATH
AMALGAMATION_PATH = $(ROOTDIR)/amalgamation
endif
# Propagate the OpenMP choice to sub-makes: any value other than
# USE_OPENMP=1 disables OpenMP in dependent builds.
ifneq ($(USE_OPENMP), 1)
export NO_OPENMP = 1
endif

# use customized config file
include $(config)
# Default MKL-DNN on: x86_64 hosts only, and not on macOS or Windows
# (see apache/mxnet#13681).  An explicit USE_MKLDNN from the user or
# config file always wins.
ifndef USE_MKLDNN
ifneq ($(UNAME_S), Darwin)
ifneq ($(UNAME_S), Windows)
ifeq ($(UNAME_P), x86_64)
USE_MKLDNN=1
endif
endif
endif
endif
# USE_MKL2017 is the deprecated name for this option; honour it for
# backward compatibility but warn and map it onto USE_MKLDNN.
ifeq ($(USE_MKL2017), 1)
$(warning "USE_MKL2017 is deprecated. We will switch to USE_MKLDNN.")
USE_MKLDNN=1
endif
# MKL-DNN is built statically into 3rdparty; both the MKL-DNN headers/
# libs and the bundled MKLML runtime share the same install prefix.
# NOTE(review): the matching `endif` for this conditional lies beyond
# this chunk of the file.
ifeq ($(USE_MKLDNN), 1)
MKLDNNROOT = $(ROOTDIR)/3rdparty/mkldnn/build/install
MKLROOT = $(ROOTDIR)/3rdparty/mkldnn/build/install
* Fix MKLDNN pooling for global pooling. * Fix Jenkins. * Fix a bug in Jenkins. * Fix Jenkins
2018-02-15 14:44:34 -08:00
# NOTE(review): the condition matching this endif begins above this chunk
# (platform detection via $(OS)/$(UNAME_S)) - confirm against the full file.
# export makes USE_MKLML visible to recursively invoked sub-makes.
export USE_MKLML = 1
endif
# Pull in mshadow's build settings (MSHADOW_CFLAGS / MSHADOW_LDFLAGS /
# MSHADOW_NVCCFLAGS referenced below).
include $(TPARTYDIR)/mshadow/make/mshadow.mk
2015-06-14 17:36:33 -07:00
# dmlc-core build helpers (DMLC_LDFLAGS is referenced below).
# NOTE(review): DMLC_CORE must already be set at this point - confirm it is
# defined earlier in the file.
include $(DMLC_CORE)/make/dmlc.mk
# all the possible warning flags
WARNFLAGS= -Wall -Wsign-compare
2015-06-14 17:36:33 -07:00
# Base compile flags; WARNFLAGS folded in here so later `CFLAGS +=` lines
# only append. NOTE(review): -DMSHADOW_FORCE_STREAM configures mshadow's
# stream handling - see mshadow for the exact semantics.
CFLAGS = -DMSHADOW_FORCE_STREAM $(WARNFLAGS)
2015-08-23 21:26:16 +08:00
# Developer build: keep debug info and promote warnings to errors for both
# host and device compilation.
ifeq ($(DEV), 1)
CFLAGS += -g -Werror
# nvcc's -Werror takes the warning kind as its argument; this makes
# cross-execution-space-call warnings fatal.
NVCCFLAGS += -Werror cross-execution-space-call
endif
2015-08-23 21:26:16 +08:00
# CFLAGS for debug
ifeq ($(DEBUG), 1)
# Debug build: no optimization, full debug info.
CFLAGS += -g -O0
else
Batch Norm rewrite without mshadow, 1D, 2D, 3D, float16, float32, float64 as well as operator gtest framework (#5936) * Batch Norm rewrite without mshadow as well as operator gtest framework * performance testing * lint fixes * use CUDNN for this test * remove superfluous omp define * Fix file names in comments * build, run, clean gtest works (although a test is failing) * CR comments * Adjust timing tests for more strenuous sample * Remove temp resource allocation * DeviceTensor3 added, forEachFast not yet converted * DeviceTensor3 version working * DeviceTensor3 working * . * Fix for use_global_stats * fixed bug with testing suite for double (Float64) * python unit tests working for batchnorm * python unit tests * Update documentation for mxnet.initializer.Mixed (#5937) * Update documentation for SVMOutput. (#5931) * Update documentation for SVMOutput. * Update doc for SVMOutput - fix formatting. * Adding install instruction for Ubuntu-CPU-Python (#5885) * edit ndarray API docs (#5806) * edit docs in broadcast_reduce_op * edit docs in broadcast_reduce_op * minor change * lint fix * fix * mx.nd.ones * mx.nd.repeat * mx.nd.reverse * add example in repeat * optimizer update * fix nanprod * fix optimizer_op api doc * fix reduce_op api doc * fix nd.ones api doc * mx.nd.repeat doc change * Update broadcast_reduce_op.h * Symbol docs fixes (#5930) * symbol docs minor formatting changes * deepcopy, infer_shape, infer_shape_partial docs modified * Few more small fixes * arithmetic functions fixes * some more modifications * changes after review * small change * grad function note added * More API Doc Edits (#5886) * edit activation doc * doc l2_normalization * edit MakeLoss doc * edit blockgrad doc * blockgrad fileline fix * edit MakeLoss doc cont. 
* doc change 'tensor' to 'multidimensional array' * l2normalization doc improve * makeloss doc improve, blockgrad doc improve * fix doc in activation, l2_normalization, make_loss * fix minor grammar * use .describe to avoid build failure. * Update documentation for mxnet.image.imdecode (#5957) * Update documentation for mxnet.image.imdecode * Update documentation for mxnet.image.imdecode (clarify that we need OpenCV and not the CV2 Python library) * Fix script by adding path to Dockerfile (#5958) * Clean install script * Add test for pip installations * Remove debug statements & comments * Make test runnable as script and from framework * Fix path to Dockerfiles * Putting failing cases at the end * Update doc for Custom operator. (#5875) * Update doc for Custom operator. * Update doc for Custom operator. * Fix formating in doc for Custom operator. * Fix formating in doc for Custom operator. * Minor change to ndarray.Custom documentation. * Minor edit in doc for Custom operator. * Minor change to doc for Custom operator. Data is 'NDArray-or-Symbol'. * Minor formatting change for Custom operator documentation. * For Custom operator doc, move example into ndarray_doc.py. 
* Minor change in Custom operator documentation * Improve the doc of pick + Update dmlc-core (#5946) * Add PickParam to fix the docstring and the initial value for axis * Update dmlc-core * Update dmlc-core * Image docs modified (#5973) * imageIter doc modified * edited imageiter * ADD missing Libri_sample.json, FIX minor bugs in speech_recognition example (#5962) * [KVStore] Add support for other data types (#5818) * Fix kvstore type * Fix lint * Parse inputs to DataDesc * Make module support dtype * Fix lint * Add default dtype in Comm * Fix lint * Revert rename * [cpp-package] Add C++ basic tutorial and build instruction (#5971) * Add C++ basic tutorial and build instruction * Remove binaries * Fix lint * Avoid sign-compare * Update documentation for mxnet.metric.np (#5977) * Getting rid of identity (#5935) * Activation ops (#5938) * [Ops] Add op: 'relu' * Add op: 'sigmoid' * Introduce 'kernel_launch_op' * Add tests and describe; move it to elemwise_unary_op * Fix GPU version * Convert caffe AbsVal to mx.symbol.abs in caffe converter (#5984) * Correction to LSTMCell docstring (#5986) * [Module] fix input_grads order (#5980) * fix input_grads order + update dmlc-core * set label to be optional * update env_var doc (#5964) * Adjusting make, Callback removed * batch norm gpu testing * Batch Norm rewrite without mshadow as well as operator gtest framework * performance testing * lint fixes * use CUDNN for this test * remove superfluous omp define * Fix file names in comments * build, run, clean gtest works (although a test is failing) * CR comments * Adjust timing tests for more strenuous sample * Remove temp resource allocation * rearrange source into cc and cu files * lint fixes * Trigger build * Use latest mshadow * temporarily revert channel position parameter field * Add more tests for batchnorm * Add more tests for batchnorm * test_operator_gpu working for all types * Compiles after AccReal * Compiles after AccReal * All tests working * All tests working * 
build, run, clean gtest works (although a test is failing) * vc++ requires explicit int type for omp for loop * Repair cpp-package * signed/unsigned fixed in cuda file * lint fixes in tests and cpp-package directories * more lint * use IsWriting() helper * Fall-through for unsupported MKL shapes/types * Fall-through for unsupported MKL shapes/types * cleaner mkl_off approach * Warning only whem MKL is requested * Warning only whem MKL is requested * lint * .. * python problem fixed * python problem fixed * Merge branch 'batchnorm' into batchnorm_pr # Conflicts: # src/operator/batch_norm.cc # src/operator/batch_norm.cu # tests/cpp/operator/batchnorm_test.cc * lint fix * lint fix * lint fix * lint fix * lint fix * Fix visual c++ compile problem * . * . * All unit tests pass again * lint fix * fix strange compile errors in CUDNN batchnorm header * FInish using flags instead of bools * lint * Fix timing pass count for forward pass * Fix R script install roxygen problem * code formatting, addition of doc strings is causing IDE to add spaces before the calls * removed commented * cr comments * Change back to compilable code * For CPU mode, store as invstd * move testing code around a little * lint fix * Use AccReal in some places to avoid fp16 problems * Fix minor invstd problem in cuda version * remove unused scale param * add permutation unit test, handle cudnn doesn't like 3D * . * lint * . * Remove mkl_off * lint fix and time cudnn when enabled
2017-05-15 20:27:28 -07:00
# Release build: optimize; NDEBUG disables assert().
CFLAGS += -O3 -DNDEBUG=1
2015-08-23 21:26:16 +08:00
endif
2018-06-02 17:58:25 -07:00
# Include paths for the bundled 3rdparty dependencies (mshadow, dmlc-core,
# nnvm, dlpack, tvm) plus MXNet's own include/; -fPIC because the objects
# are linked into a shared library.
CFLAGS += -I$(TPARTYDIR)/mshadow/ -I$(TPARTYDIR)/dmlc-core/include -fPIC -I$(NNVM_PATH)/include -I$(DLPACK_PATH)/include -I$(TPARTYDIR)/tvm/include -Iinclude $(MSHADOW_CFLAGS)
2015-06-14 17:36:33 -07:00
# Base link flags, extended with mshadow's and dmlc-core's link requirements;
# later sections append with `LDFLAGS +=`.
LDFLAGS = -pthread $(MSHADOW_LDFLAGS) $(DMLC_LDFLAGS)
[MXNET-703] TensorRT runtime integration (#11325) * [MXNET-703] TensorRT runtime integration Co-authored-by: Clement Fuji-Tsang <caenorst@hotmail.com> Co-authored-by: Kellen Sunderland <kellen.sunderland@gmail.com> * correctly assign self._optimized_symbol in executor * declare GetTrtCompatibleSubsets and ReplaceSubgraph only if MXNET_USE_TENSORRT * add comments in ReplaceSubgraph * Addressing Haibin's code review points * Check that shared_buffer is not empty when USE_TENSORRT is set * Added check that TensorRT binding is for inference only * Removed redundant decl. * WIP Refactored TRT integration and tests * Add more build guards, remove unused code * Remove ccache report * Remove redundant const in declaration * Clean Cmake TRT files * Remove TensorRT env var usage We don't want to use environment variables with TensorRT yet, the logic being that we want to try and have as much fwd compatiblity as possible when working on an experimental feature. Were we to add env vars they would have to be gaurenteed to work in the future until a major version change. Moving the functionality to a contrib call reduces this risk. * Use contrib optimize_graph instaed of bind * Clean up cycle detector * Convert lenet test to contrib optimize * Protect interface with trt build flag * Fix whitespace issues * Add another build guard to c_api * Move get_optimized_symbol to contrib area * Ignore gz files in test folder * Make trt optimization implicit * Remove unused declaration * Replace build guards with runtime errors * Change default value of TensorRT to off This is change applies to both TensorRT and non-TensorRT builds. 
* Warn user when TRT not active at runtime * Move TensorRTBind declaration, add descriptive errors * Test TensorRT graph execution, fix bugs * Fix lint and whitespace issues * Fix typo * Removed default value for set_use_tensorrt * Improved documentation and fixed spacing issues * Move static exec funcs to util files * Update comments to match util style * Apply const to loop element * Fix a few namespace issues * Make static funcs inline to avoid compiler warning * Remove unused inference code from lenet5_train * Add explicit trt contrib bind, update tests to use it * Rename trt bind call * Remove documentation that is not needed for trt * Reorder arguments, allow position calling
2018-08-10 02:38:04 -07:00
# Instrument the build for gcov-based test coverage.
ifeq ($(ENABLE_TESTCOVERAGE), 1)
# --coverage affects both compilation (profile instrumentation) and linking
# (pulls in libgcov), so it must go into both flag sets.
CFLAGS += --coverage
LDFLAGS += --coverage
endif
[MXNET-703] TensorRT runtime integration (#11325) * [MXNET-703] TensorRT runtime integration Co-authored-by: Clement Fuji-Tsang <caenorst@hotmail.com> Co-authored-by: Kellen Sunderland <kellen.sunderland@gmail.com> * correctly assign self._optimized_symbol in executor * declare GetTrtCompatibleSubsets and ReplaceSubgraph only if MXNET_USE_TENSORRT * add comments in ReplaceSubgraph * Addressing Haibin's code review points * Check that shared_buffer is not empty when USE_TENSORRT is set * Added check that TensorRT binding is for inference only * Removed redundant decl. * WIP Refactored TRT integration and tests * Add more build guards, remove unused code * Remove ccache report * Remove redundant const in declaration * Clean Cmake TRT files * Remove TensorRT env var usage We don't want to use environment variables with TensorRT yet, the logic being that we want to try and have as much fwd compatiblity as possible when working on an experimental feature. Were we to add env vars they would have to be gaurenteed to work in the future until a major version change. Moving the functionality to a contrib call reduces this risk. * Use contrib optimize_graph instaed of bind * Clean up cycle detector * Convert lenet test to contrib optimize * Protect interface with trt build flag * Fix whitespace issues * Add another build guard to c_api * Move get_optimized_symbol to contrib area * Ignore gz files in test folder * Make trt optimization implicit * Remove unused declaration * Replace build guards with runtime errors * Change default value of TensorRT to off This is change applies to both TensorRT and non-TensorRT builds. 
* Warn user when TRT not active at runtime * Move TensorRTBind declaration, add descriptive errors * Test TensorRT graph execution, fix bugs * Fix lint and whitespace issues * Fix typo * Removed default value for set_use_tensorrt * Improved documentation and fixed spacing issues * Move static exec funcs to util files * Update comments to match util style * Apply const to loop element * Fix a few namespace issues * Make static funcs inline to avoid compiler warning * Remove unused inference code from lenet5_train * Add explicit trt contrib bind, update tests to use it * Rename trt bind call * Remove documentation that is not needed for trt * Reorder arguments, allow position calling
2018-08-10 02:38:04 -07:00
# TensorRT integration: graphs are converted via ONNX (protobuf-based),
# hence the onnx/protobuf libs next to the nvinfer runtime libraries.
ifeq ($(USE_TENSORRT), 1)
CFLAGS += -I$(ROOTDIR) -I$(TPARTYDIR) -DONNX_NAMESPACE=$(ONNX_NAMESPACE) -DMXNET_USE_TENSORRT=1
LDFLAGS += -lprotobuf -pthread -lonnx -lonnx_proto -lnvonnxparser -lnvonnxparser_runtime -lnvinfer -lnvinfer_plugin
endif
# -L/usr/local/lib
2015-10-24 15:53:35 -07:00
# NVCC flags: debug builds add host (-g) and device (-G) debug info at -O0;
# release builds compile at -O3. Both pin the host compiler via
# -ccbin $(CXX) and inherit mshadow's NVCC settings.
ifeq ($(DEBUG), 1)
NVCCFLAGS += -std=c++11 -Xcompiler -D_FORCE_INLINES -g -G -O0 -ccbin $(CXX) $(MSHADOW_NVCCFLAGS)
2015-10-24 15:53:35 -07:00
else
NVCCFLAGS += -std=c++11 -Xcompiler -D_FORCE_INLINES -O3 -ccbin $(CXX) $(MSHADOW_NVCCFLAGS)
2015-10-24 15:53:35 -07:00
endif
2015-06-14 17:36:33 -07:00
# CFLAGS for segfault logger
ifeq ($(USE_SIGNAL_HANDLER), 1)
CFLAGS += -DMXNET_USE_SIGNAL_HANDLER=1
endif
# Caffe Plugin
# Only the preprocessor define is set here; presumably the plugin's own
# include/lib flags are added elsewhere - TODO confirm.
ifdef CAFFE_PATH
CFLAGS += -DMXNET_USE_CAFFE=1
endif
2015-07-03 17:06:23 -06:00
# Default language selection for the lint target ("all" = every language).
# The quotes are part of the value and are stripped by the shell when the
# variable is used in a recipe.
ifndef LINT_LANG
LINT_LANG = "all"
2015-07-03 17:06:23 -06:00
endif
# Intel MKL-DNN acceleration for CPU operators.
ifeq ($(USE_MKLDNN), 1)
CFLAGS += -DMXNET_USE_MKLDNN=1
CFLAGS += -DUSE_MKL=1
# Headers for the MKL-DNN operator implementations.
CFLAGS += -I$(ROOTDIR)/src/operator/nn/mkldnn/
# When MKL-DNN and MKL live in different prefixes, MKL needs its own -I/-L.
ifneq ($(MKLDNNROOT), $(MKLROOT))
CFLAGS += -I$(MKLROOT)/include
LDFLAGS += -L$(MKLROOT)/lib
endif
CFLAGS += -I$(MKLDNNROOT)/include
# $$ escapes the dollar for the shell; the single quotes keep ${ORIGIN}
# literal so the dynamic linker expands it at load time (search path
# relative to the loading library).
LDFLAGS += -L$(MKLDNNROOT)/lib -lmkldnn -Wl,-rpath,'$${ORIGIN}'
endif
2015-06-14 17:36:33 -07:00
# setup opencv
2015-10-02 11:52:44 +08:00
ifeq ($(USE_OPENCV), 1)
CFLAGS += -DMXNET_USE_OPENCV=1
2019-03-14 16:00:36 +08:00
# Explicit OpenCV location: when USE_OPENCV_INC_PATH is set (not NONE),
# USE_OPENCV_LIB_PATH must be set too; this overrides pkg-config discovery.
ifneq ($(filter-out NONE, $(USE_OPENCV_INC_PATH)),)
CFLAGS += -I$(USE_OPENCV_INC_PATH)/include
2019-03-14 16:00:36 +08:00
ifeq ($(filter-out NONE, $(USE_OPENCV_LIB_PATH)),)
$(error Please add the path of OpenCV shared library path into `USE_OPENCV_LIB_PATH`, when `USE_OPENCV_INC_PATH` is not NONE)
endif
LDFLAGS += -L$(USE_OPENCV_LIB_PATH)
# imgcodecs/highgui are linked only if the shared libs actually exist in
# the given lib path (their presence varies across OpenCV versions).
ifneq ($(wildcard $(USE_OPENCV_LIB_PATH)/libopencv_imgcodecs.*),)
LDFLAGS += -lopencv_imgcodecs
endif
ifneq ($(wildcard $(USE_OPENCV_LIB_PATH)/libopencv_highgui.*),)
LDFLAGS += -lopencv_highgui
endif
else
# Fall back to pkg-config; OpenCV 4 registers itself as "opencv4".
ifeq ("$(shell pkg-config --exists opencv4; echo $$?)", "0")
OPENCV_LIB = opencv4
else
OPENCV_LIB = opencv
endif
CFLAGS += $(shell pkg-config --cflags $(OPENCV_LIB))
LDFLAGS += $(shell pkg-config --libs-only-L $(OPENCV_LIB))
# Keep only the optional imgcodecs/highgui entries from pkg-config's -l list.
LDFLAGS += $(filter -lopencv_imgcodecs -lopencv_highgui, $(shell pkg-config --libs-only-l $(OPENCV_LIB)))
endif
LDFLAGS += -lopencv_imgproc -lopencv_core
# The im2rec tool is only built when OpenCV is available.
BIN += bin/im2rec
2015-06-14 17:36:33 -07:00
else
CFLAGS += -DMXNET_USE_OPENCV=0
2015-09-04 01:15:31 +08:00
endif
2015-09-13 10:41:34 -07:00
# Enable OpenMP parallelism in the C++ sources.
ifeq ($(USE_OPENMP), 1)
CFLAGS += -fopenmp
2015-06-14 17:36:33 -07:00
endif
# NNPACK-accelerated CPU kernels.
ifeq ($(USE_NNPACK), 1)
CFLAGS += -DMXNET_USE_NNPACK=1
LDFLAGS += -lnnpack
endif
# Compile in per-operator runtime tuning support.
ifeq ($(USE_OPERATOR_TUNING), 1)
CFLAGS += -DMXNET_USE_OPERATOR_TUNING=1
endif
# verify existence of separate lapack library when using blas/openblas/atlas
# switch off lapack support in case it can't be found
# issues covered by this check:
# - for Ubuntu 14.04 or lower, lapack is not automatically installed with openblas
# - for Ubuntu, installing atlas will not automatically install the atlas-provided lapack library
# - for rhel7.2, installing the package `lapack-static` via yum dismisses this warning
# lapack is silently switched off instead of failing the build, for backward compatibility
ifeq ($(USE_LAPACK), 1)
Refactor operators and add MKLDNN (#9677) * Remove MKL code. * Integrate MKLDNN. Update MXNet for MKLDNN. Enable MKLDNN Relu. Fix a compilation error. Change Makefile for MKLDNN. Remove infer storage in convolution. Update MXNet for MKLDNN. Support MKLDNN storage type in python. Update activation. Add MKLDNN base classes. Implement MKLDNN fully connected. Add MKLDNN convolution. Update MKLDNN interface in NDArray. MKLDNN convolution handle CreateMKLDNNData failure. Add another GetMKLDNNData in NDArray. Have mkldnn to define the data format. Create output MKLDNN memory explicitly for FC. Fix a bug in NDArray. Fix a bug in GetWeightDesc. Convert data layout if necessary in FC. remove unnecessary print in MKLDNN convolution. Add MKLDNN deconvolution. Add MKLDNNStream to manage primitives and memories. Use MKLDNNStream to register memory in NDArray. Use MKLDNNStream to manage resources in operators. Handle kAddTo in MKLDNN operators. Fix a bug in deconvolution. Fix bugs in NDArray. Revert "Fix bugs in NDArray." This reverts commit f5624a4aa9f9b9f9fe31f5e6cfa7a9752838fc4e. Fix a bug in NDArray. Fix a bug in NDArray. Reorder MKLDNN memory to default format in SetTBlob. Disable MKLDNN correctly. Fix a bug in activation. Reshape of NDArray supports MKLDNN. Fix a memory ref bug in NDArray. Reshape NDArray in MKLDNN FullyConnected. Fix data format conversion. Create MKLDNN NDArray in python. Support Slice for MKLDNN NDArray. Reduce the overhead of summing the result to the output array. Avoid unnecessary memory copy in NDArray. Fix a bug in data reordering. Fix a bug in NDArray. Don't hard code MKLDNN type. Support dilation in MKLDNN convolution. Fix a bug in sum results. Rewrite GetMKLDNNData. Add prepare_mkldnn.sh Enable MKLDNN activation. Fix a bug on FullyConnected. Handle 3 dims for MKLDNN NDArray. Fix a bug in MKLDNN FC. Support MKLDNN storage in KV store. Fix a bug in executor for non-default NDArray. Fix a link error in cast_storage.cc. 
Remove unnecessary function def Fall back to def storage if the type isn't supported by MKLDNN. Use NDArray for MKLDNN in python. Reshape output of MKLDNN convolution. Fix a bug in NDArray. Support more operations in MKLDNN NDArray. Fix a bug in deconvolution. Fix bugs in MKLDNN deconvolution. We still need to compute bias correctly. Have elemwise binary ops to fall to default for MKLDNN. Limit the cases that MKLDNN operations are called. Force the layout of mkldnn::memory from NDArray. Add MKLDNN softmax. Fix output storage type of MKLDNN softmax. Add MKLDNN sum. Fix a bug in elemwise sum. Fix a bug in MKLDNN softmax. Fix a bug in imperative. Clean up dispatch modes. Remove redundant code. MKLDNN Pooling Op integration MKLDNN Pooling Op integration add missing file fix mkldnn pooling op workspace issue handle workspace in MKLDNN pooling correctly. Use a non-MKLDNN op for testing. Allow to share arguments and their gradients between executors. Avoid using MKLDNN pooling when it's not supported. Support MKLDNN properly. Choose MKLDNN softmax more carefully. Fix a bug in MKLDNN pooling. Fall back if MKLDNN pooling isn't supported. Fix a bug in Slice of NDArray. Use int32 for workspace memory. Exclude MKLDNN act with tanh. Have two Reshape functions in NDArray. Copy data for NDArray with diff shapes. Add MKLDNN copy. Add MKLDNN version of elemwise_add. Add MKLDNN version of Flatten. add mkldnn surport for concat simplify MKLDNN Flatten. Enalbe MKLDNN deconvolution with bias. Fix a bug in CuDNN deconvolution. avoid using MKLDNNStorage when it's not defined. Remove ./cudnn_lrn-inl.h Fix for make lint. add mkldnn surport for concat fix the coding style for pr of mkldnn concat Only add input data for MKLDNN concat backward Remove unnecessary TODO. remove unnecessary __repr__ in MKLNDArray. better condition check for readability. Use macro when including mkldnn.hpp. Revert "Use CoreOpRunner for refactored Ops." This reverts commit a28586fc25950cc006cb317e26e0d17541ef0586. 
Fix a bug in test core. Limit MKLDNN ops being used. Fix complains from "make pylint" Move ContainStorage to common/utils.h Limit MKLDNN concat being used. Add license. Fix amalgamation Fix compilation error in mkldnn_ops-inl.h Fix a bug in deconvolution. Fix a bug in pooling. MKLDNN ops allocates temp mem. Fix a bug in pooling. Allocate align memory from temp space. Have parameter gradients stored in the default storage. Handle all cases in CopyFrom. Ensure NDArray returns memory with right memory descriptors. use auto to define memory in the operator. Use raw pointer for mkldnn memory. Move more code to mkldnn_base.cc Fix a compilation error. Address review comments. fix a bug in activation backward. Miss a macro in mkldnn_base.cc Fix a bug in data iterator in examples. Avoid memory allocation in ReshapeMKLDNN. Avoid memory allocation in storage cast. Fix a bug in cast storage. Handle sliced MKLDNN NDArray. Use memcpy if NDArray uses default format. Revert "Limit MKLDNN ops being used." This reverts commit 75e2ae570d03483868ec4ed8ed46015c7fa6c6fb. Enable mkldnn act backward has the same input layout. Fix a bug in mkldnn activation. Use MKLDNN sum in more cases. Improve perf of reorder. Avoid memory reorder in conv and deconv. Avoid unnecessary storage cast in fallback path. Revert "Use MKLDNN sum in more cases." This reverts commit 7a21ebca8bbe17fde49c3b1ca3f31b835a33afb8. Handle sliced ndarray in more cases. Fix a complain from make lint. Update Jenkins to test MKLDNN. debug compiling mkldnn. Use MKLDNN sum in more cases. Add mkldnn as a submodule. Compile with mkldnn in 3rdparty. Fix some coding styles. write the path to mkldnn lib in libmxnet.so. use rpath with $ORIGIN. Pack all lib files in Jenkins. pack and unpack mxnet with MKLDNN. Update Jenkinsfile Update Jenkinsfile Add mkldnn batch normalization Fix bugs in BN. Avoid memory allocation in MKLDNNCopy. only use MKLDNN BatchNorm for special cases. MKLDNN BatchNorm doesn't work well on the default layout. 
Add MKL-DNN based LRN Code Style Changes Fix a bug in BN. Fix a bug in LRN. Handle non-default storage in memory plan. Fix coding style. Fix a compilation error without mkldnn. Fix some coding styles for batch norm Improve forward of convolution. Add openmp and simd support to BN operator Retrieve MKLDNN Conv primitive based on signature. Retrieve Act primitive based on its signature. Fix a bug in pooling. Diable some MKLDNN activation and pooling. Cast MKLDNN storage with diff data type. Check if it's a view of NDArray. Reshaped and sliced arrays share the same chunks. Implement caching MKLDNN Act correctly. Fix a bug in check_consistency. Fix a potential bug when destroying NDArray. Fix bugs when allocating mem in NDArray. Fix coding style. Add micro when using mkldnn in ndarray. Fix a compilation error. Fix a bug in concat. Remove MKLDNNStorage. handle diff layouts in CopyFromToDnsImpl. Fallback correctly. Force weight grad to use default layout. Reorder weight arrays in (de)conv for faster inference. Avoid caching TBlob from NDArray. This commit may add some overhead of managing NDArray for each fallback. Fix a bug in Flatten. handle ndarray with def layout in mkldnn BN correctly. Align to page when mkldnn is enabled. Use default mem alloc for mkldnn. Reuse NDArrays. Support WriteInplace for sum. fix complains from "make lint". Avoid reallocation in NDArray. Handle weight arrays with special MKLDNN layouts. Remove unnecessary GetWeights. Fix compilation error without MKLDNN. Fix a bug in (de)conv for weight arrays. Fix a minor bug in MKLDNN conv. Fix a bug in MKLDNNOpSignature. Reimplement fallback for MKLDNN ops. Fix a bug in FallbackExecutor. Add params in hashcode. Invalidate data in outputs to accelerate. Fix a minor bug. Update mkldnn_base-inl.h Add primitive caching for Pooling forward computation Add hashcode in pooling parameters. Support NDArray copy with types unsupported by MKLDNN. Avoid using MKLDNN concat for negative dimension. 
Fix make lint complain. Disable mkldnn avg pooling for now. Fix a compile warning. Fix compile error when MKLDNN is disabled. OP primitive cache: use memory as signature for MKLDNN storage type Remove MKLDNN array in python. Disable Clang tests in Jenkins. Use mklml dockers to test mkldnn. Update MKLDNN repo to zhengda's mkldnn repo. Update MKLDNN repo to ashok's. Fix a bug in fallback. Change avg pooling algorithm to pooling_avg_include_padding Fix a code style in mkldnn pooling. Temp fix a bug in FC. Revert "Disable Clang tests in Jenkins." This reverts commit b4efa8f89592d30a27f9c30e2237e9420ac6749a. Rebase and Refactor deconv (#20) * rebase to Da,Zheng refactor branch Jan.14, add signature for mkldnn Deconv and modify classMKLDNNDeconvForward * fix make lint complains A simple way of caching BN inference. cache BN forward for both training and inference. Fix some minor problems in BN. Fix a bug in caching BN. force to build with avx2 in Jenkins. Remove the remaining MKLDNNStorageType Some minor updates in NDArray. a lot of updates to address comments. minor changes. * Use NNVM interface. Use NNVM interface for upsampling. Use NNVM interface for convolution. Use NNVM interface for deconvolution. Use NNVM interface for FullyConnected. Move NNVM interface to batch norm. Use NNVM interface for depthwise convolution. Use NNVM interface for softmax activation. Use NNVM interface for pooling. use NNVM interface for dropout. Use NNVM interface for activation. Use NNVM interface for CuDNN batch norm. Use NNVM interface for CuDNN pooling. Use NNVM interface for CuDNN softmax activation. Use NNVM interface for CuDNN activation. Use NNVM interface for CuDNN convolution. Use NNVM interface for CuDNN deconvolution. Move concat to nn/ Use NNVM interface for concat. Fix headers in concat. Move lrn to nn/. Use NNVM interface for LRN. Fix a compilation error in convolution. Fix a compilation error in activation. Fix coding style. Fix coding style for make lint. 
use enums in batch norm. Use CoreOpRunner for refactored Ops. Make FullyConnected stateless. Make upsampling stateless. Make pooling stateless. Make batchnorm stateless. Make SoftmaxActivation stateless. Fix a code style problem. pass amalgamation test for batch norm. pass amalgamation test for dropout. Get convolution ops from a function. Fix compilation errors for GPU. Fix thread local in diff platforms. Avoid using thread_local for non-CuDNN conv/deconv. Remove TODO in deconv. Fix a bug in batch norm. Fix a bug in fully connected. Don't set #inputs for backward convolution. Revert "Make pooling stateless." * revert modification in test_executor. * Fix a bug in FlattenStorageType. * Remove BN debug. * Remove remaining MXNET_USE_MKL2017 * Remove unused code in pooling. * Fixing bugs in gtests. * Fix lint errors. * a lot of minor updates to address comments. * Fix coding style in MKLDNN Pooling (#22) * revert the code change in the previous code refactor. * Fix a bug in pooling. * LRN coding style changes (#21) * LRN coding style change * Add const for local variables * Add req for LRN forward * rebase code * align API interface * revert modification in test_executor. * cast storage with MKLDNN properly. * Minor updates to address comments. * some minor updates. * Switch to the master branch of MKLDNN. * Minor updates to address comments. * Update activation.cc * Fix a bug in convert NDArray. * Add gluon model zoo tests. * Update GPU tests on model zoo. * Avoid using mobilenet for GPU tests with gluon models. mobilenet can't pass the test even without MKLDNN. * Update GPU tests on gluon. * change cmake to compile MKLDNN. * update cmake for MKLDNN. * Implement align myself. * Switch to intel/mkl-dnn. * Fix errors in align unittest. * Add unit test for LRN. * fix a compilation error. * use storage_type_assign to determine storage type. * avoid global pooling in mkldnn. There is a bug in global pooling in mkldnn. * compare all MKLDNN ops with native impls. 
add MXNET_MKLDNN_DEBUG to control the test. * Fix a bug in testing correctness. * print the name of buggy operator. * undo some modifications. * Fix a bug on reshaped array. * avoid testing outputs with NullOp. * turn on MKLDNN tests in Jenkins. * print each operator in MKLDNN tests. * rename test_gluon_model_zoo.py * Create hashcode for operator parameters properly. * Add USE_MKL2017 back. * Print warning messages. * move batchnorm tests to nnvm interface. * Delete batchnorm v1 tests. * Get inputs and outputs in batchnorm tests. * disable batchnorm tests for now. * Fix GPU tests on gluon model zoo. * Fix lint complains in tests. * Remove simd from openmp instructions in BatchNorm (#24) * Remove warnings. * Fix MKLDNN 1st compile failure issue (#23) * Fix compilation errors. * Remove ARCH_OPT in Jenkins. * Revert "avoid global pooling in mkldnn." This reverts commit f6efd342e64968cb848c9193d80e929968b8052c. * Move to the latest MKLDNN. This fixes the bug in global pooling. * WIP unit tests (#25) * WIP unit tests * some backward items initialized * Make more C++ unit tests work for batch norm (#28) * WIP unit tests * some backward items initialized * some backward items initialized * some backward items initialized * first unit test working * Working on types * backward types working for fp16 on first unit test * backward types working for fp16 on first unit test * backward types working for fp16 on first unit test * . * . * some tests working * fix input data * hangle gpu<->cpu for setting values * gpu working * gpu working * CAccessAsCPU class * Fix varying type in AccessAsCPU * starting to add channel axis tests * TestChannelAxisSimple * TestChannelAxisSimple * run bidirectional * run bidirectional * run bidirectional * CLEANUP * CLEANUP * .. * noaxis * .. * lint * revert * revert * Fix lint complains. * Fix a minor problem in Makefile. * fix GPU pooling. * Disable modelzoo inference tests. * update accuracy checks for MKLDNN. 
* Fix MKLDNN pooling for global pooling. * Fix Jenkins. * Fix a bug in Jenkins. * Fix Jenkins
2018-02-15 14:44:34 -08:00
# Auto-detect LAPACK: only meaningful for BLAS flavors that can provide a
# separate liblapack (blas/openblas/atlas/mkl).  Probe the user-supplied
# USE_LAPACK_PATH first, then the common system library directories, for
# either a static (.a) or shared (.so) liblapack.  If no candidate exists
# anywhere, LAPACK support is switched off with a warning.
ifeq ($(USE_BLAS),$(filter $(USE_BLAS),blas openblas atlas mkl))
ifeq (,$(wildcard $(USE_LAPACK_PATH)/liblapack.a))
ifeq (,$(wildcard $(USE_LAPACK_PATH)/liblapack.so))
ifeq (,$(wildcard /lib/liblapack.a))
ifeq (,$(wildcard /lib/liblapack.so))
ifeq (,$(wildcard /usr/lib/liblapack.a))
ifeq (,$(wildcard /usr/lib/liblapack.so))
ifeq (,$(wildcard /usr/lib64/liblapack.a))
ifeq (,$(wildcard /usr/lib64/liblapack.so))
# reached only when every probe above came back empty
USE_LAPACK = 0
$(warning "USE_LAPACK disabled because libraries were not found")
endif
endif
endif
endif
endif
endif
endif
endif
endif
# NOTE(review): nine conditionals are opened above but ten "endif"s follow;
# the last one appears to close a conditional opened before this section
# (presumably an "ifeq ($(USE_LAPACK), 1)") — confirm against the full file.
endif
# lapack settings.
# When LAPACK is enabled and the user pointed USE_LAPACK_PATH at a custom
# install, add that directory to the linker search path.  NOTE: the
# "ifeq ($(USE_LAPACK), 1)" opened here is closed further below, after the
# -llapack / CFLAGS handling.
ifeq ($(USE_LAPACK), 1)
ifneq ($(USE_LAPACK_PATH), )
LDFLAGS += -L$(USE_LAPACK_PATH)
endif
Refactor operators and add MKLDNN (#9677) * Remove MKL code. * Integrate MKLDNN. Update MXNet for MKLDNN. Enable MKLDNN Relu. Fix a compilation error. Change Makefile for MKLDNN. Remove infer storage in convolution. Update MXNet for MKLDNN. Support MKLDNN storage type in python. Update activation. Add MKLDNN base classes. Implement MKLDNN fully connected. Add MKLDNN convolution. Update MKLDNN interface in NDArray. MKLDNN convolution handle CreateMKLDNNData failure. Add another GetMKLDNNData in NDArray. Have mkldnn to define the data format. Create output MKLDNN memory explicitly for FC. Fix a bug in NDArray. Fix a bug in GetWeightDesc. Convert data layout if necessary in FC. remove unnecessary print in MKLDNN convolution. Add MKLDNN deconvolution. Add MKLDNNStream to manage primitives and memories. Use MKLDNNStream to register memory in NDArray. Use MKLDNNStream to manage resources in operators. Handle kAddTo in MKLDNN operators. Fix a bug in deconvolution. Fix bugs in NDArray. Revert "Fix bugs in NDArray." This reverts commit f5624a4aa9f9b9f9fe31f5e6cfa7a9752838fc4e. Fix a bug in NDArray. Fix a bug in NDArray. Reorder MKLDNN memory to default format in SetTBlob. Disable MKLDNN correctly. Fix a bug in activation. Reshape of NDArray supports MKLDNN. Fix a memory ref bug in NDArray. Reshape NDArray in MKLDNN FullyConnected. Fix data format conversion. Create MKLDNN NDArray in python. Support Slice for MKLDNN NDArray. Reduce the overhead of summing the result to the output array. Avoid unnecessary memory copy in NDArray. Fix a bug in data reordering. Fix a bug in NDArray. Don't hard code MKLDNN type. Support dilation in MKLDNN convolution. Fix a bug in sum results. Rewrite GetMKLDNNData. Add prepare_mkldnn.sh Enable MKLDNN activation. Fix a bug on FullyConnected. Handle 3 dims for MKLDNN NDArray. Fix a bug in MKLDNN FC. Support MKLDNN storage in KV store. Fix a bug in executor for non-default NDArray. Fix a link error in cast_storage.cc. 
Remove unnecessary function def Fall back to def storage if the type isn't supported by MKLDNN. Use NDArray for MKLDNN in python. Reshape output of MKLDNN convolution. Fix a bug in NDArray. Support more operations in MKLDNN NDArray. Fix a bug in deconvolution. Fix bugs in MKLDNN deconvolution. We still need to compute bias correctly. Have elemwise binary ops to fall to default for MKLDNN. Limit the cases that MKLDNN operations are called. Force the layout of mkldnn::memory from NDArray. Add MKLDNN softmax. Fix output storage type of MKLDNN softmax. Add MKLDNN sum. Fix a bug in elemwise sum. Fix a bug in MKLDNN softmax. Fix a bug in imperative. Clean up dispatch modes. Remove redundant code. MKLDNN Pooling Op integration MKLDNN Pooling Op integration add missing file fix mkldnn pooling op workspace issue handle workspace in MKLDNN pooling correctly. Use a non-MKLDNN op for testing. Allow to share arguments and their gradients between executors. Avoid using MKLDNN pooling when it's not supported. Support MKLDNN properly. Choose MKLDNN softmax more carefully. Fix a bug in MKLDNN pooling. Fall back if MKLDNN pooling isn't supported. Fix a bug in Slice of NDArray. Use int32 for workspace memory. Exclude MKLDNN act with tanh. Have two Reshape functions in NDArray. Copy data for NDArray with diff shapes. Add MKLDNN copy. Add MKLDNN version of elemwise_add. Add MKLDNN version of Flatten. add mkldnn surport for concat simplify MKLDNN Flatten. Enalbe MKLDNN deconvolution with bias. Fix a bug in CuDNN deconvolution. avoid using MKLDNNStorage when it's not defined. Remove ./cudnn_lrn-inl.h Fix for make lint. add mkldnn surport for concat fix the coding style for pr of mkldnn concat Only add input data for MKLDNN concat backward Remove unnecessary TODO. remove unnecessary __repr__ in MKLNDArray. better condition check for readability. Use macro when including mkldnn.hpp. Revert "Use CoreOpRunner for refactored Ops." This reverts commit a28586fc25950cc006cb317e26e0d17541ef0586. 
Fix a bug in test core. Limit MKLDNN ops being used. Fix complains from "make pylint" Move ContainStorage to common/utils.h Limit MKLDNN concat being used. Add license. Fix amalgamation Fix compilation error in mkldnn_ops-inl.h Fix a bug in deconvolution. Fix a bug in pooling. MKLDNN ops allocates temp mem. Fix a bug in pooling. Allocate align memory from temp space. Have parameter gradients stored in the default storage. Handle all cases in CopyFrom. Ensure NDArray returns memory with right memory descriptors. use auto to define memory in the operator. Use raw pointer for mkldnn memory. Move more code to mkldnn_base.cc Fix a compilation error. Address review comments. fix a bug in activation backward. Miss a macro in mkldnn_base.cc Fix a bug in data iterator in examples. Avoid memory allocation in ReshapeMKLDNN. Avoid memory allocation in storage cast. Fix a bug in cast storage. Handle sliced MKLDNN NDArray. Use memcpy if NDArray uses default format. Revert "Limit MKLDNN ops being used." This reverts commit 75e2ae570d03483868ec4ed8ed46015c7fa6c6fb. Enable mkldnn act backward has the same input layout. Fix a bug in mkldnn activation. Use MKLDNN sum in more cases. Improve perf of reorder. Avoid memory reorder in conv and deconv. Avoid unnecessary storage cast in fallback path. Revert "Use MKLDNN sum in more cases." This reverts commit 7a21ebca8bbe17fde49c3b1ca3f31b835a33afb8. Handle sliced ndarray in more cases. Fix a complain from make lint. Update Jenkins to test MKLDNN. debug compiling mkldnn. Use MKLDNN sum in more cases. Add mkldnn as a submodule. Compile with mkldnn in 3rdparty. Fix some coding styles. write the path to mkldnn lib in libmxnet.so. use rpath with $ORIGIN. Pack all lib files in Jenkins. pack and unpack mxnet with MKLDNN. Update Jenkinsfile Update Jenkinsfile Add mkldnn batch normalization Fix bugs in BN. Avoid memory allocation in MKLDNNCopy. only use MKLDNN BatchNorm for special cases. MKLDNN BatchNorm doesn't work well on the default layout. 
Add MKL-DNN based LRN Code Style Changes Fix a bug in BN. Fix a bug in LRN. Handle non-default storage in memory plan. Fix coding style. Fix a compilation error without mkldnn. Fix some coding styles for batch norm Improve forward of convolution. Add openmp and simd support to BN operator Retrieve MKLDNN Conv primitive based on signature. Retrieve Act primitive based on its signature. Fix a bug in pooling. Diable some MKLDNN activation and pooling. Cast MKLDNN storage with diff data type. Check if it's a view of NDArray. Reshaped and sliced arrays share the same chunks. Implement caching MKLDNN Act correctly. Fix a bug in check_consistency. Fix a potential bug when destroying NDArray. Fix bugs when allocating mem in NDArray. Fix coding style. Add micro when using mkldnn in ndarray. Fix a compilation error. Fix a bug in concat. Remove MKLDNNStorage. handle diff layouts in CopyFromToDnsImpl. Fallback correctly. Force weight grad to use default layout. Reorder weight arrays in (de)conv for faster inference. Avoid caching TBlob from NDArray. This commit may add some overhead of managing NDArray for each fallback. Fix a bug in Flatten. handle ndarray with def layout in mkldnn BN correctly. Align to page when mkldnn is enabled. Use default mem alloc for mkldnn. Reuse NDArrays. Support WriteInplace for sum. fix complains from "make lint". Avoid reallocation in NDArray. Handle weight arrays with special MKLDNN layouts. Remove unnecessary GetWeights. Fix compilation error without MKLDNN. Fix a bug in (de)conv for weight arrays. Fix a minor bug in MKLDNN conv. Fix a bug in MKLDNNOpSignature. Reimplement fallback for MKLDNN ops. Fix a bug in FallbackExecutor. Add params in hashcode. Invalidate data in outputs to accelerate. Fix a minor bug. Update mkldnn_base-inl.h Add primitive caching for Pooling forward computation Add hashcode in pooling parameters. Support NDArray copy with types unsupported by MKLDNN. Avoid using MKLDNN concat for negative dimension. 
Fix make lint complain. Disable mkldnn avg pooling for now. Fix a compile warning. Fix compile error when MKLDNN is disabled. OP primitive cache: use memory as signature for MKLDNN storage type Remove MKLDNN array in python. Disable Clang tests in Jenkins. Use mklml dockers to test mkldnn. Update MKLDNN repo to zhengda's mkldnn repo. Update MKLDNN repo to ashok's. Fix a bug in fallback. Change avg pooling algorithm to pooling_avg_include_padding Fix a code style in mkldnn pooling. Temp fix a bug in FC. Revert "Disable Clang tests in Jenkins." This reverts commit b4efa8f89592d30a27f9c30e2237e9420ac6749a. Rebase and Refactor deconv (#20) * rebase to Da,Zheng refactor branch Jan.14, add signature for mkldnn Deconv and modify classMKLDNNDeconvForward * fix make lint complains A simple way of caching BN inference. cache BN forward for both training and inference. Fix some minor problems in BN. Fix a bug in caching BN. force to build with avx2 in Jenkins. Remove the remaining MKLDNNStorageType Some minor updates in NDArray. a lot of updates to address comments. minor changes. * Use NNVM interface. Use NNVM interface for upsampling. Use NNVM interface for convolution. Use NNVM interface for deconvolution. Use NNVM interface for FullyConnected. Move NNVM interface to batch norm. Use NNVM interface for depthwise convolution. Use NNVM interface for softmax activation. Use NNVM interface for pooling. use NNVM interface for dropout. Use NNVM interface for activation. Use NNVM interface for CuDNN batch norm. Use NNVM interface for CuDNN pooling. Use NNVM interface for CuDNN softmax activation. Use NNVM interface for CuDNN activation. Use NNVM interface for CuDNN convolution. Use NNVM interface for CuDNN deconvolution. Move concat to nn/ Use NNVM interface for concat. Fix headers in concat. Move lrn to nn/. Use NNVM interface for LRN. Fix a compilation error in convolution. Fix a compilation error in activation. Fix coding style. Fix coding style for make lint. 
use enums in batch norm. Use CoreOpRunner for refactored Ops. Make FullyConnected stateless. Make upsampling stateless. Make pooling stateless. Make batchnorm stateless. Make SoftmaxActivation stateless. Fix a code style problem. pass amalgamation test for batch norm. pass amalgamation test for dropout. Get convolution ops from a function. Fix compilation errors for GPU. Fix thread local in diff platforms. Avoid using thread_local for non-CuDNN conv/deconv. Remove TODO in deconv. Fix a bug in batch norm. Fix a bug in fully connected. Don't set #inputs for backward convolution. Revert "Make pooling stateless." * revert modification in test_executor. * Fix a bug in FlattenStorageType. * Remove BN debug. * Remove remaining MXNET_USE_MKL2017 * Remove unused code in pooling. * Fixing bugs in gtests. * Fix lint errors. * a lot of minor updates to address comments. * Fix coding style in MKLDNN Pooling (#22) * revert the code change in the previous code refactor. * Fix a bug in pooling. * LRN coding style changes (#21) * LRN coding style change * Add const for local variables * Add req for LRN forward * rebase code * align API interface * revert modification in test_executor. * cast storage with MKLDNN properly. * Minor updates to address comments. * some minor updates. * Switch to the master branch of MKLDNN. * Minor updates to address comments. * Update activation.cc * Fix a bug in convert NDArray. * Add gluon model zoo tests. * Update GPU tests on model zoo. * Avoid using mobilenet for GPU tests with gluon models. mobilenet can't pass the test even without MKLDNN. * Update GPU tests on gluon. * change cmake to compile MKLDNN. * update cmake for MKLDNN. * Implement align myself. * Switch to intel/mkl-dnn. * Fix errors in align unittest. * Add unit test for LRN. * fix a compilation error. * use storage_type_assign to determine storage type. * avoid global pooling in mkldnn. There is a bug in global pooling in mkldnn. * compare all MKLDNN ops with native impls. 
add MXNET_MKLDNN_DEBUG to control the test. * Fix a bug in testing correctness. * print the name of buggy operator. * undo some modifications. * Fix a bug on reshaped array. * avoid testing outputs with NullOp. * turn on MKLDNN tests in Jenkins. * print each operator in MKLDNN tests. * rename test_gluon_model_zoo.py * Create hashcode for operator parameters properly. * Add USE_MKL2017 back. * Print warning messages. * move batchnorm tests to nnvm interface. * Delete batchnorm v1 tests. * Get inputs and outputs in batchnorm tests. * disable batchnorm tests for now. * Fix GPU tests on gluon model zoo. * Fix lint complains in tests. * Remove simd from openmp instructions in BatchNorm (#24) * Remove warnings. * Fix MKLDNN 1st compile failure issue (#23) * Fix compilation errors. * Remove ARCH_OPT in Jenkins. * Revert "avoid global pooling in mkldnn." This reverts commit f6efd342e64968cb848c9193d80e929968b8052c. * Move to the latest MKLDNN. This fixes the bug in global pooling. * WIP unit tests (#25) * WIP unit tests * some backward items initialized * Make more C++ unit tests work for batch norm (#28) * WIP unit tests * some backward items initialized * some backward items initialized * some backward items initialized * first unit test working * Working on types * backward types working for fp16 on first unit test * backward types working for fp16 on first unit test * backward types working for fp16 on first unit test * . * . * some tests working * fix input data * hangle gpu<->cpu for setting values * gpu working * gpu working * CAccessAsCPU class * Fix varying type in AccessAsCPU * starting to add channel axis tests * TestChannelAxisSimple * TestChannelAxisSimple * run bidirectional * run bidirectional * run bidirectional * CLEANUP * CLEANUP * .. * noaxis * .. * lint * revert * revert * Fix lint complains. * Fix a minor problem in Makefile. * fix GPU pooling. * Disable modelzoo inference tests. * update accuracy checks for MKLDNN. 
* Fix MKLDNN pooling for global pooling. * Fix Jenkins. * Fix a bug in Jenkins. * Fix Jenkins
2018-02-15 14:44:34 -08:00
# Explicitly link liblapack only for the BLAS flavors listed in the filter;
# other USE_BLAS values skip the -llapack link.
ifeq ($(USE_BLAS),$(filter $(USE_BLAS),blas openblas atlas mkl))
LDFLAGS += -llapack
endif
# Compile-time switch telling the C++ sources that LAPACK routines exist.
CFLAGS += -DMXNET_USE_LAPACK
# closes the "ifeq ($(USE_LAPACK), 1)" opened above the lapack path handling
endif
2015-06-14 17:36:33 -07:00
# CuDNN support: define the mshadow compile-time switch and link libcudnn.
# (Two bare timestamp lines from the blame/extraction view sat inside this
# conditional; they are not Make syntax — "missing separator" — and are
# removed here.)
ifeq ($(USE_CUDNN), 1)
CFLAGS += -DMSHADOW_USE_CUDNN=1
LDFLAGS += -lcudnn
endif
# Map the selected BLAS backend onto its MXNET_USE_BLAS_* macro.
# FIX: Make variable names are case-sensitive and the rest of this file
# tests $(USE_BLAS) (see the liblapack probes above), so the previous
# lowercase $(use_blas) could never match and no backend macro was ever
# defined.  Also, the accepted OpenBLAS value is "openblas" (matching the
# "blas openblas atlas mkl" filter list used elsewhere), not "open".
ifeq ($(USE_BLAS), openblas)
CFLAGS += -DMXNET_USE_BLAS_OPEN=1
else ifeq ($(USE_BLAS), atlas)
CFLAGS += -DMXNET_USE_BLAS_ATLAS=1
else ifeq ($(USE_BLAS), mkl)
CFLAGS += -DMXNET_USE_BLAS_MKL=1
else ifeq ($(USE_BLAS), apple)
CFLAGS += -DMXNET_USE_BLAS_APPLE=1
endif
# whether to use F16C instruction set extension for fast fp16 compute on CPU
# if cross compiling you may want to explicitly turn it off if target system does not support it
ifndef USE_F16C
ifneq ($(OS),Windows_NT)
detected_OS := $(shell uname -s)
ifeq ($(detected_OS),Darwin)
# Non-empty when the CPU feature list advertises F16C.  ":=" so the (slow)
# sysctl probe runs exactly once at parse time; the previous recursive "="
# re-executed the shell command on every expansion of F16C_SUPP.
F16C_SUPP := $(shell sysctl -a | grep machdep.cpu.features | grep F16C)
endif
ifeq ($(detected_OS),Linux)
# grep /proc/cpuinfo directly instead of "cat ... | grep"; non-empty when a
# "flags" line contains f16c.
F16C_SUPP := $(shell grep flags /proc/cpuinfo | grep f16c)
endif
ifneq ($(strip $(F16C_SUPP)),)
USE_F16C=1
else
USE_F16C=0
endif
endif
# if OS is Windows, check if your processor and compiler support F16C architecture.
# One way to check if processor supports it is to download the tool
# https://docs.microsoft.com/en-us/sysinternals/downloads/coreinfo.
# If coreinfo -c shows F16C and compiler supports it,
# then you can set USE_F16C=1 explicitly to leverage that capability"
endif
GPerftools update, also include include/mxnet/*.h as sources for CLion (#8232) * GPROF update, also include include/mxnet/*.h as sources for CLionwq * Added FindGperftools.cmake * Add option USE_GPERFTOOLS * Add option USE_GPERFTOOLS * Add option USE_GPERFTOOLS * USE_GPERFTOOLS off by default for now * Add Apache license to FindGperftools.cmake * Update CMakeLists.txt Try to use GPerftools or JEmalloc by default * Update CMakeLists.txt Off by default for now * internal labeling * gperftools and jemalloc * gperftools and jemalloc on by default * Fixing the Caught error (#8199) * Temporarily disable some unit tests to fix the build (#8253) * Temporarily disable the following unit tests that have been causing build failures: test_rms: This can be re-enabled once https://github.com/apache/incubator-mxnet/issues/8230 is fixed. test_autograd_save_memory: This can be re-enabled once https://github.com/apache/incubator-mxnet/issues/8211 is fixed. * OMP num threads 0->1 * remove check * Update documentation links to point to mxnet.incubator.apache.org Update documentation links to point to mxnet.incubator.apache.org * add export to gluon (#8212) * add export * fix * add test * fix nnvm * fix * ReleaseFeedback: License Files (#8247) * Updating license Headers * License changes * Sequential aug (#8243) * add sequentialAug * add type for castaug * modify docs * Basic CPU Kernel OMP selection based upon whether GPU has been used (#7854) * Basic CPU Kernel OMP selection based upon whether GPU has been used * lint * Disabling the test_CSVIter for now (#7829) * Disabling the test_CSVIter for now This test causing random failure while running on windows. Disabling it for now till we fix it. An git hub issue has been created to track it. 
* Update test_io.py * Update test_io.py * Use OMP thread count as test in Kernel, set count for Kernel loop * lint * removed * Remove assert * Adjust DefaultOMPThreadsPerWorker * remove -1 from omp_cores * Trigger build * It is not clear why pylint claims that this is re-imported. It is not. This is not changed from master branch. Trying a different format. * lint * lint * Change getter/setter naming style * allow env override * check environment directly, since OMP_NUM_THREADS mnay have odd formatting (i.e. 3, 2"). * CR comments * Squashed commit of the following: commit ec704f1bf7709e1cd8a73ad2d4fa18dc62922012 Author: Olivier <coolivie@amazon.com> Date: Mon Sep 25 12:29:25 2017 -0700 Fix formatting commit 0218c49f37dbe787767936a22279764b0f219800 Author: Olivier <coolivie@amazon.com> Date: Mon Sep 25 12:21:48 2017 -0700 Splitting unary ops commit 9abbba14715088d41076397980cd5d3d49df68df Author: Olivier <coolivie@amazon.com> Date: Mon Sep 25 11:38:04 2017 -0700 split unary * Update mxnet_predict0.cc * Update mxnet_predict0.cc * fix oversight with bracket * Binary scatter working on CPU and GPU * return unchanged * This test case is BS. I can't even tell what's wrong on the CI build because so many errors coming from this test. * inconsequential cleanup * Update test_kvstore.py * Update CMakeLists.txt * Update CMakeLists.txt trigger build * force fail * remove forced error * test clean every make * Test * Copy Jenkinsfile from upstream/master to fix the build. 
* logic was reversed * Update threaded_engine.h Trigger build * Trigger rebuild * Trigger build * Trigger build * Multiplatform docker based builds (#7792) * Add dockerized multi-architecture build files * Add android arm64 build * Operators for sum(csr, axis=0) and sum(csr, axis=1) (#8174) * Add Infer storage for sparse slice operator * Remove unused files * Indentation fix and add gpu test for fallback * Change sum builtin to py_sum * Add sum_axis(csr,axis=0)=dense and sum(csr,axis=1)=dense operator * Documentation changes for sparse * Add fallback unittest for keepdims and exclude * PR review based changes : * Fix CHECK_NE * Change in_stype to int * Using const int instead of int * Initialize mid with the start * Generalizing * OMP num threads 0->1 * remove check
2017-10-13 19:24:38 -07:00
# gperftools malloc library (tcmalloc)
# Probe the user-supplied USE_GPERFTOOLS_PATH first, then the common system
# library directories, for libtcmalloc; if nothing is found anywhere the
# feature is silently disabled.  NOTE: the "ifeq ($(USE_GPERFTOOLS), 1)"
# opened here stays open — its else/endif appear further below.
ifeq ($(USE_GPERFTOOLS), 1)
# pick the static (.a) or shared (.so) archive extension
FIND_LIBFILEEXT := so
ifeq ($(USE_GPERFTOOLS_STATIC), 1)
FIND_LIBFILEEXT := a
endif
# ":=" so each $(wildcard) glob runs exactly once at parse time; the
# previous recursive "=" re-globbed on every later expansion of FIND_LIBFILE.
FIND_LIBFILE := $(wildcard $(USE_GPERFTOOLS_PATH)/libtcmalloc.$(FIND_LIBFILEEXT))
ifeq (,$(FIND_LIBFILE))
FIND_LIBFILE := $(wildcard /lib/libtcmalloc.$(FIND_LIBFILEEXT))
ifeq (,$(FIND_LIBFILE))
FIND_LIBFILE := $(wildcard /usr/lib/libtcmalloc.$(FIND_LIBFILEEXT))
ifeq (,$(FIND_LIBFILE))
FIND_LIBFILE := $(wildcard /usr/local/lib/libtcmalloc.$(FIND_LIBFILEEXT))
ifeq (,$(FIND_LIBFILE))
FIND_LIBFILE := $(wildcard /usr/lib64/libtcmalloc.$(FIND_LIBFILEEXT))
ifeq (,$(FIND_LIBFILE))
# no libtcmalloc anywhere: fall back to the system allocator
USE_GPERFTOOLS=0
endif
endif
endif
endif
endif
# Re-test USE_GPERFTOOLS: the probe above may have reset it to 0.
# (Several commit-log/timestamp lines from the blame/extraction view sat
# inside this conditional; they are not Make syntax and are removed here.)
ifeq ($(USE_GPERFTOOLS), 1)
# tcmalloc replaces malloc/free, so disable the compiler builtins for them
CFLAGS += -fno-builtin-malloc -fno-builtin-calloc -fno-builtin-realloc -fno-builtin-free
LDFLAGS += $(FIND_LIBFILE)
endif
# jemalloc malloc library (if not using gperftools)
# This "else" pairs with the "ifeq ($(USE_GPERFTOOLS), 1)" opened above, so
# jemalloc is only considered when gperftools was not requested.
else
ifeq ($(USE_JEMALLOC), 1)
# pick the static (.a) or shared (.so) archive extension
FIND_LIBFILEEXT := so
ifeq ($(USE_JEMALLOC_STATIC), 1)
FIND_LIBFILEEXT := a
endif
# ":=" so each $(wildcard) glob runs exactly once at parse time; the
# previous recursive "=" re-globbed on every later expansion of FIND_LIBFILE.
FIND_LIBFILE := $(wildcard $(USE_JEMALLOC_PATH)/libjemalloc.$(FIND_LIBFILEEXT))
ifeq (,$(FIND_LIBFILE))
FIND_LIBFILE := $(wildcard /lib/libjemalloc.$(FIND_LIBFILEEXT))
ifeq (,$(FIND_LIBFILE))
FIND_LIBFILE := $(wildcard /usr/lib/libjemalloc.$(FIND_LIBFILEEXT))
ifeq (,$(FIND_LIBFILE))
FIND_LIBFILE := $(wildcard /usr/local/lib/libjemalloc.$(FIND_LIBFILEEXT))
ifeq (,$(FIND_LIBFILE))
FIND_LIBFILE := $(wildcard /usr/lib/x86_64-linux-gnu/libjemalloc.$(FIND_LIBFILEEXT))
ifeq (,$(FIND_LIBFILE))
FIND_LIBFILE := $(wildcard /usr/lib64/libjemalloc.$(FIND_LIBFILEEXT))
ifeq (,$(FIND_LIBFILE))
# no libjemalloc anywhere: fall back to the system allocator
USE_JEMALLOC=0
endif
endif
endif
endif
endif
endif
# Re-test USE_JEMALLOC: the probe above may have reset it to 0.
ifeq ($(USE_JEMALLOC), 1)
# jemalloc replaces malloc/free, so disable the compiler builtins for them
CFLAGS += -fno-builtin-malloc -fno-builtin-calloc -fno-builtin-realloc \
	  -fno-builtin-free -DUSE_JEMALLOC
LDFLAGS += $(FIND_LIBFILE)
endif
endif
GPerftools update, also include include/mxnet/*.h as sources for CLion (#8232) * GPROF update, also include include/mxnet/*.h as sources for CLionwq * Added FindGperftools.cmake * Add option USE_GPERFTOOLS * Add option USE_GPERFTOOLS * Add option USE_GPERFTOOLS * USE_GPERFTOOLS off by default for now * Add Apache license to FindGperftools.cmake * Update CMakeLists.txt Try to use GPerftools or JEmalloc by default * Update CMakeLists.txt Off by default for now * internal labeling * gperftools and jemalloc * gperftools and jemalloc on by default * Fixing the Caught error (#8199) * Temporarily disable some unit tests to fix the build (#8253) * Temporarily disable the following unit tests that have been causing build failures: test_rms: This can be re-enabled once https://github.com/apache/incubator-mxnet/issues/8230 is fixed. test_autograd_save_memory: This can be re-enabled once https://github.com/apache/incubator-mxnet/issues/8211 is fixed. * OMP num threads 0->1 * remove check * Update documentation links to point to mxnet.incubator.apache.org Update documentation links to point to mxnet.incubator.apache.org * add export to gluon (#8212) * add export * fix * add test * fix nnvm * fix * ReleaseFeedback: License Files (#8247) * Updating license Headers * License changes * Sequential aug (#8243) * add sequentialAug * add type for castaug * modify docs * Basic CPU Kernel OMP selection based upon whether GPU has been used (#7854) * Basic CPU Kernel OMP selection based upon whether GPU has been used * lint * Disabling the test_CSVIter for now (#7829) * Disabling the test_CSVIter for now This test causing random failure while running on windows. Disabling it for now till we fix it. An git hub issue has been created to track it. 
* Update test_io.py * Update test_io.py * Use OMP thread count as test in Kernel, set count for Kernel loop * lint * removed * Remove assert * Adjust DefaultOMPThreadsPerWorker * remove -1 from omp_cores * Trigger build * It is not clear why pylint claims that this is re-imported. It is not. This is not changed from master branch. Trying a different format. * lint * lint * Change getter/setter naming style * allow env override * check environment directly, since OMP_NUM_THREADS mnay have odd formatting (i.e. 3, 2"). * CR comments * Squashed commit of the following: commit ec704f1bf7709e1cd8a73ad2d4fa18dc62922012 Author: Olivier <coolivie@amazon.com> Date: Mon Sep 25 12:29:25 2017 -0700 Fix formatting commit 0218c49f37dbe787767936a22279764b0f219800 Author: Olivier <coolivie@amazon.com> Date: Mon Sep 25 12:21:48 2017 -0700 Splitting unary ops commit 9abbba14715088d41076397980cd5d3d49df68df Author: Olivier <coolivie@amazon.com> Date: Mon Sep 25 11:38:04 2017 -0700 split unary * Update mxnet_predict0.cc * Update mxnet_predict0.cc * fix oversight with bracket * Binary scatter working on CPU and GPU * return unchanged * This test case is BS. I can't even tell what's wrong on the CI build because so many errors coming from this test. * inconsequential cleanup * Update test_kvstore.py * Update CMakeLists.txt * Update CMakeLists.txt trigger build * force fail * remove forced error * test clean every make * Test * Copy Jenkinsfile from upstream/master to fix the build. 
* logic was reversed * Update threaded_engine.h Trigger build * Trigger rebuild * Trigger build * Trigger build * Multiplatform docker based builds (#7792) * Add dockerized multi-architecture build files * Add android arm64 build * Operators for sum(csr, axis=0) and sum(csr, axis=1) (#8174) * Add Infer storage for sparse slice operator * Remove unused files * Indentation fix and add gpu test for fallback * Change sum builtin to py_sum * Add sum_axis(csr,axis=0)=dense and sum(csr,axis=1)=dense operator * Documentation changes for sparse * Add fallback unittest for keepdims and exclude * PR review based changes : * Fix CHECK_NE * Change in_stype to int * Using const int instead of int * Initialize mid with the start * Generalizing * OMP num threads 0->1 * remove check
2017-10-13 19:24:38 -07:00
endif
GPerftools update, also include include/mxnet/*.h as sources for CLion (#8232) * GPROF update, also include include/mxnet/*.h as sources for CLionwq * Added FindGperftools.cmake * Add option USE_GPERFTOOLS * Add option USE_GPERFTOOLS * Add option USE_GPERFTOOLS * USE_GPERFTOOLS off by default for now * Add Apache license to FindGperftools.cmake * Update CMakeLists.txt Try to use GPerftools or JEmalloc by default * Update CMakeLists.txt Off by default for now * internal labeling * gperftools and jemalloc * gperftools and jemalloc on by default * Fixing the Caught error (#8199) * Temporarily disable some unit tests to fix the build (#8253) * Temporarily disable the following unit tests that have been causing build failures: test_rms: This can be re-enabled once https://github.com/apache/incubator-mxnet/issues/8230 is fixed. test_autograd_save_memory: This can be re-enabled once https://github.com/apache/incubator-mxnet/issues/8211 is fixed. * OMP num threads 0->1 * remove check * Update documentation links to point to mxnet.incubator.apache.org Update documentation links to point to mxnet.incubator.apache.org * add export to gluon (#8212) * add export * fix * add test * fix nnvm * fix * ReleaseFeedback: License Files (#8247) * Updating license Headers * License changes * Sequential aug (#8243) * add sequentialAug * add type for castaug * modify docs * Basic CPU Kernel OMP selection based upon whether GPU has been used (#7854) * Basic CPU Kernel OMP selection based upon whether GPU has been used * lint * Disabling the test_CSVIter for now (#7829) * Disabling the test_CSVIter for now This test causing random failure while running on windows. Disabling it for now till we fix it. An git hub issue has been created to track it. 
* Update test_io.py * Update test_io.py * Use OMP thread count as test in Kernel, set count for Kernel loop * lint * removed * Remove assert * Adjust DefaultOMPThreadsPerWorker * remove -1 from omp_cores * Trigger build * It is not clear why pylint claims that this is re-imported. It is not. This is not changed from master branch. Trying a different format. * lint * lint * Change getter/setter naming style * allow env override * check environment directly, since OMP_NUM_THREADS mnay have odd formatting (i.e. 3, 2"). * CR comments * Squashed commit of the following: commit ec704f1bf7709e1cd8a73ad2d4fa18dc62922012 Author: Olivier <coolivie@amazon.com> Date: Mon Sep 25 12:29:25 2017 -0700 Fix formatting commit 0218c49f37dbe787767936a22279764b0f219800 Author: Olivier <coolivie@amazon.com> Date: Mon Sep 25 12:21:48 2017 -0700 Splitting unary ops commit 9abbba14715088d41076397980cd5d3d49df68df Author: Olivier <coolivie@amazon.com> Date: Mon Sep 25 11:38:04 2017 -0700 split unary * Update mxnet_predict0.cc * Update mxnet_predict0.cc * fix oversight with bracket * Binary scatter working on CPU and GPU * return unchanged * This test case is BS. I can't even tell what's wrong on the CI build because so many errors coming from this test. * inconsequential cleanup * Update test_kvstore.py * Update CMakeLists.txt * Update CMakeLists.txt trigger build * force fail * remove forced error * test clean every make * Test * Copy Jenkinsfile from upstream/master to fix the build. 
* logic was reversed * Update threaded_engine.h Trigger build * Trigger rebuild * Trigger build * Trigger build * Multiplatform docker based builds (#7792) * Add dockerized multi-architecture build files * Add android arm64 build * Operators for sum(csr, axis=0) and sum(csr, axis=1) (#8174) * Add Infer storage for sparse slice operator * Remove unused files * Indentation fix and add gpu test for fallback * Change sum builtin to py_sum * Add sum_axis(csr,axis=0)=dense and sum(csr,axis=1)=dense operator * Documentation changes for sparse * Add fallback unittest for keepdims and exclude * PR review based changes : * Fix CHECK_NE * Change in_stype to int * Using const int instead of int * Initialize mid with the start * Generalizing * OMP num threads 0->1 * remove check
2017-10-13 19:24:38 -07:00
# If neither tcmalloc (gperftools) nor jemalloc is enabled, remind the user
# that installing one of them can speed up allocation-heavy workloads.
# $(filter) yields a non-empty string when either knob is set to 1.
ifeq (,$(filter 1,$(USE_GPERFTOOLS) $(USE_JEMALLOC)))
$(warning WARNING: Significant performance increases can be achieved by installing and \
enabling gperftools or jemalloc development packages)
endif
# Enable the multi-threaded execution engine when requested in config.mk.
ifeq ($(USE_THREADED_ENGINE), 1)
CFLAGS += -DMXNET_USE_THREADED_ENGINE
endif
2015-06-14 17:36:33 -07:00
# Fold user-supplied extra compile/link flags into the build
# ("NONE" is the config.mk sentinel for "not set").
ifneq ($(ADD_CFLAGS), NONE)
CFLAGS += $(ADD_CFLAGS)
endif
ifneq ($(ADD_LDFLAGS), NONE)
LDFLAGS += $(ADD_LDFLAGS)
endif
# Derive a default nvcc location from USE_CUDA_PATH when the user has not
# pointed NVCC at a compiler explicitly.
ifeq ($(NVCC), NONE)
ifneq ($(USE_CUDA_PATH), NONE)
NVCC=$(USE_CUDA_PATH)/bin/nvcc
endif
endif
# Guard against displaying nvcc info messages to users not using CUDA.
ifeq ($(USE_CUDA), 1)
# If NVCC is not at the location specified, use CUDA_PATH instead.
ifeq ("$(wildcard $(NVCC))","")
ifneq ($(USE_CUDA_PATH), NONE)
NVCC=$(USE_CUDA_PATH)/bin/nvcc
# Tell the user which compiler was chosen so a surprising pick is debuggable.
$(info INFO: nvcc was not found on your path)
$(info INFO: Using $(NVCC) as nvcc path)
else
# No fallback available: warn about the bad path and let later rules fail.
$(warning WARNING: could not find nvcc compiler, the specified path was: $(NVCC))
endif
endif
endif
# Sets 'CUDA_ARCH', which determines the GPU architectures supported
# by the compiled kernels. Users can edit the KNOWN_CUDA_ARCHS list below
# to remove archs they don't wish to support to speed compilation, or they can
# pre-set the CUDA_ARCH args in config.mk to a non-null value for full control.
#
# For archs in this list, nvcc will create a fat-binary that will include
# the binaries (SASS) for all architectures supported by the installed version
# of the cuda toolkit, plus the assembly (PTX) for the most recent such architecture.
# If these kernels are then run on a newer-architecture GPU, the binary will
# be JIT-compiled by the updated driver from the included PTX.
ifeq ($(USE_CUDA), 1)
ifeq ($(CUDA_ARCH),)
KNOWN_CUDA_ARCHS := 30 35 50 52 60 61 70 75
# Probe each arch by compiling an empty translation unit; keep only the
# -gencode args that this toolkit's nvcc actually accepts.
CUDA_ARCH := $(foreach arch,$(KNOWN_CUDA_ARCHS), \
$(shell $(NVCC) -arch=sm_$(arch) -E --x cu /dev/null >/dev/null 2>&1 && \
echo -gencode arch=compute_$(arch),code=sm_$(arch)))
# Rewrite the trailing "code=sm_NN" to "code=[sm_NN,compute_NN]" so the fat
# binary also carries PTX for the newest arch (forward compatibility with
# newer GPUs via driver JIT).
CUDA_ARCH := $(shell echo $(CUDA_ARCH) | sed 's/sm_\([0-9]*\)$$/[sm_\1,compute_\1]/')
# Append fat-binary compression, but only if this nvcc supports the flag.
COMPRESS := --fatbin-options -compress-all
CUDA_ARCH += $(shell $(NVCC) -cuda $(COMPRESS) --x cu /dev/null -o /dev/null >/dev/null 2>&1 && \
echo $(COMPRESS))
endif
$(info Running CUDA_ARCH: $(CUDA_ARCH))
endif
# ps-lite: parameter-server library used by the distributed key-value store.
# Use ':=' so the paths are computed once at parse time; '$(CURDIR)' avoids
# forking a shell ('$(shell pwd)') on every expansion of DEPS_PATH.
PS_PATH := $(ROOTDIR)/3rdparty/ps-lite
DEPS_PATH := $(CURDIR)/deps
include $(PS_PATH)/make/ps.mk
ifeq ($(USE_DIST_KVSTORE), 1)
CFLAGS += -DMXNET_USE_DIST_KVSTORE -I$(PS_PATH)/include -I$(DEPS_PATH)/include
LIB_DEP += $(PS_PATH)/build/libps.a
LDFLAGS += $(PS_LDFLAGS_A)
endif
# Command-style targets are phony: always out of date, never files on disk.
.PHONY: clean all extra-packages test lint docs clean_all rcpplint rcppexport roxygen\
	cython2 cython3 cython cyclean

# Default goal: static and shared library, extra binaries, language packages.
all: lib/libmxnet.a lib/libmxnet.so $(BIN) extra-packages
2015-06-14 17:36:33 -07:00
# Enumerate C++ and CUDA sources up to three directory levels under src/,
# and map each to its object file under build/.
# ':=' expands the wildcards once at parse time; the original '=' (recursive)
# re-ran the glob on every reference to these variables.
SRC := $(wildcard src/*/*/*/*.cc src/*/*/*.cc src/*/*.cc src/*.cc)
OBJ := $(patsubst %.cc, build/%.o, $(SRC))
CUSRC := $(wildcard src/*/*/*/*.cu src/*/*/*.cu src/*/*.cu src/*.cu)
CUOBJ := $(patsubst %.cu, build/%_gpu.o, $(CUSRC))
2015-06-14 17:36:33 -07:00
2016-01-12 21:22:09 -08:00
# extra operators: out-of-tree operator directories listed in EXTRA_OPERATORS.
# ':=' so the globs run once at parse time rather than on every expansion.
ifneq ($(EXTRA_OPERATORS),)
EXTRA_SRC := $(wildcard $(patsubst %, %/*.cc, $(EXTRA_OPERATORS)) $(patsubst %, %/*/*.cc, $(EXTRA_OPERATORS)))
EXTRA_OBJ := $(patsubst %.cc, %.o, $(EXTRA_SRC))
EXTRA_CUSRC := $(wildcard $(patsubst %, %/*.cu, $(EXTRA_OPERATORS)) $(patsubst %, %/*/*.cu, $(EXTRA_OPERATORS)))
EXTRA_CUOBJ := $(patsubst %.cu, %_gpu.o, $(EXTRA_CUSRC))
else
EXTRA_SRC :=
EXTRA_OBJ :=
EXTRA_CUSRC :=
EXTRA_CUOBJ :=
endif
2016-01-07 22:52:33 -08:00
# plugin: fragments listed in MXNET_PLUGINS append their own objects
# to these lists, so they start empty here.
PLUGIN_OBJ =
PLUGIN_CUOBJ =
include $(MXNET_PLUGINS)
2016-01-07 22:52:33 -08:00
# Whole-archive linker flags differ between the macOS linker (ld64) and
# GNU ld; Windows linking does not use these at all.
ifneq ($(UNAME_S), Windows)
ifeq ($(UNAME_S), Darwin)
WHOLE_ARCH= -all_load
NO_WHOLE_ARCH= -noall_load
else
WHOLE_ARCH= --whole-archive
NO_WHOLE_ARCH= --no-whole-archive
endif
endif
2016-01-12 21:22:09 -08:00
# all dep: every object and static-library dependency that feeds libmxnet.
LIB_DEP += $(DMLC_CORE)/libdmlc.a $(NNVM_PATH)/lib/libnnvm.a
ALL_DEP = $(OBJ) $(EXTRA_OBJ) $(PLUGIN_OBJ) $(LIB_DEP)
2015-09-05 19:23:50 -06:00
# CUDA build flags: CUB headers, GPU objects, and optional RTC / NCCL support.
ifeq ($(USE_CUDA), 1)
CFLAGS += -I$(ROOTDIR)/3rdparty/nvidia_cub
ALL_DEP += $(CUOBJ) $(EXTRA_CUOBJ) $(PLUGIN_CUOBJ)
LDFLAGS += -lcufft
ifeq ($(ENABLE_CUDA_RTC), 1)
LDFLAGS += -lcuda -lnvrtc
CFLAGS += -DMXNET_ENABLE_CUDA_RTC=1
endif
# Make sure to add stubs as fallback in order to be able to build
# without full CUDA install (especially if run without nvidia-docker)
LDFLAGS += -L/usr/local/cuda/lib64/stubs
ifeq ($(USE_NCCL), 1)
ifneq ($(USE_NCCL_PATH), NONE)
CFLAGS += -I$(USE_NCCL_PATH)/include
LDFLAGS += -L$(USE_NCCL_PATH)/lib
endif
LDFLAGS += -lnccl
CFLAGS += -DMXNET_USE_NCCL=1
else
CFLAGS += -DMXNET_USE_NCCL=0
endif
else
# NCCL is meaningless without CUDA; define the macro off explicitly.
CFLAGS += -DMXNET_USE_NCCL=0
endif
# Optional libjpeg-turbo support.
ifeq ($(USE_LIBJPEG_TURBO), 1)
# "NONE" means rely on the system default header/library search paths.
ifneq ($(USE_LIBJPEG_TURBO_PATH), NONE)
CFLAGS += -I$(USE_LIBJPEG_TURBO_PATH)/include
LDFLAGS += -L$(USE_LIBJPEG_TURBO_PATH)/lib
endif
LDFLAGS += -lturbojpeg
CFLAGS += -DMXNET_USE_LIBJPEG_TURBO=1
else
CFLAGS += -DMXNET_USE_LIBJPEG_TURBO=0
endif
# On CI, run Maven in batch (non-interactive) mode.
ifeq ($(CI), 1)
MAVEN_ARGS := -B
endif
# For quick compile test, used smaller subset
# (defaults to the full dependency list; may be overridden with a subset).
ALLX_DEP= $(ALL_DEP)
# Compile C++ sources from src/ into build/src/.  -MMD writes a .d
# dependency file next to the object so header changes trigger rebuilds.
# mkldnn is an order-only prerequisite (after |): it must be built
# first, but its timestamp does not force recompilation.
# Fixes: removed duplicated -c flag; dropped a stray git-blame
# timestamp line that was not valid make syntax.
build/src/%.o: src/%.cc | mkldnn
	@mkdir -p $(@D)
	$(CXX) -std=c++11 -c $(CFLAGS) -MMD $< -o $@
2016-01-07 22:52:33 -08:00
# Compile CUDA sources from src/ into build/src/.  Dependencies are
# produced in a separate nvcc pass (--generate-dependencies, -MT fixes
# the target name in the generated .d file); the second pass compiles
# the object itself.
# Fix: dropped a stray git-blame timestamp line between the rule header
# and the recipe, which was not valid make syntax.
build/src/%_gpu.o: src/%.cu | mkldnn
	@mkdir -p $(@D)
	$(NVCC) $(NVCCFLAGS) $(CUDA_ARCH) -Xcompiler "$(CFLAGS)" --generate-dependencies -MT build/src/$*_gpu.o $< >build/src/$*_gpu.d
	$(NVCC) -c -o $@ $(NVCCFLAGS) $(CUDA_ARCH) -Xcompiler "$(CFLAGS)" $<
2016-01-07 22:52:33 -08:00
2016-01-12 21:22:09 -08:00
# A nvcc bug cause it to generate "generic/xxx.h" dependencies from torch headers.
# Use CXX to generate dependency instead.
2016-01-07 22:52:33 -08:00
# Plugin CUDA objects: the .d file comes from $(CXX) -MM (see the nvcc
# bug note above), while the object itself is still compiled by nvcc.
build/plugin/%_gpu.o: plugin/%.cu
@mkdir -p $(@D)
$(CXX) -std=c++11 $(CFLAGS) -MM -MT build/plugin/$*_gpu.o $< >build/plugin/$*_gpu.d
$(NVCC) -c -o $@ $(NVCCFLAGS) $(CUDA_ARCH) -Xcompiler "$(CFLAGS)" $<
2015-06-14 17:36:33 -07:00
# Compile plugin C++ sources into build/plugin/; same shape as the
# build/src/%.o rule, with mkldnn as an order-only prerequisite.
# Fix: removed duplicated -c flag from the compile line.
build/plugin/%.o: plugin/%.cc | mkldnn
	@mkdir -p $(@D)
	$(CXX) -std=c++11 -c $(CFLAGS) -MMD $< -o $@
# Generic CUDA rule for objects built next to their sources (used for
# EXTRA_OPERATORS directories); adds src/operator to the include path.
%_gpu.o: %.cu
@mkdir -p $(@D)
$(NVCC) $(NVCCFLAGS) $(CUDA_ARCH) -Xcompiler "$(CFLAGS) -Isrc/operator" --generate-dependencies -MT $*_gpu.o $< >$*_gpu.d
$(NVCC) -c -o $@ $(NVCCFLAGS) $(CUDA_ARCH) -Xcompiler "$(CFLAGS) -Isrc/operator" $<
# Generic C++ rule for objects built next to their sources (used for
# EXTRA_OPERATORS directories); adds src/operator to the include path
# and depends on the core headers so they trigger rebuilds.
# Fix: removed duplicated -c flag from the compile line.
%.o: %.cc $(CORE_INC)
	@mkdir -p $(@D)
	$(CXX) -std=c++11 -c $(CFLAGS) -MMD -Isrc/operator $< -o $@
# Set install path for libmxnet.so on Mac OS
# @rpath lets dependent binaries locate the dylib through their own
# rpath entries instead of a hard-coded absolute build path.
ifeq ($(UNAME_S), Darwin)
LDFLAGS += -Wl,-install_name,@rpath/libmxnet.so
endif
2016-01-08 08:09:11 -08:00
# NOTE: to statically link libmxnet.a we need the option
# --Wl,--whole-archive -lmxnet --Wl,--no-whole-archive
# Static library: only objects newer than the archive ($?) are
# (re-)inserted by ar.  NOTE(review): objects removed from ALL_DEP are
# never purged from an existing archive; a full `make clean` is needed
# after deleting sources.
lib/libmxnet.a: $(ALLX_DEP)
@mkdir -p $(@D)
ar crv $@ $(filter %.o, $?)
2015-06-14 17:36:33 -07:00
# Shared library: link all objects and archives, except that libnnvm.a
# must be pulled in whole (--whole-archive) so its statically-registered
# symbols are not dropped by the linker as "unused".
lib/libmxnet.so: $(ALLX_DEP)
@mkdir -p $(@D)
$(CXX) $(CFLAGS) -shared -o $@ $(filter-out %libnnvm.a, $(filter %.o %.a, $^)) $(LDFLAGS) \
-Wl,${WHOLE_ARCH} $(filter %libnnvm.a, $^) -Wl,${NO_WHOLE_ARCH}
ifeq ($(USE_MKLDNN), 1)
ifeq ($(UNAME_S), Darwin)
# On macOS, rewrite the MKL-DNN/MKLML/OpenMP install names so they are
# resolved relative to libmxnet.so itself (@loader_path) rather than
# via the build-time rpath.
install_name_tool -change '@rpath/libmklml.dylib' '@loader_path/libmklml.dylib' $@
install_name_tool -change '@rpath/libiomp5.dylib' '@loader_path/libiomp5.dylib' $@
install_name_tool -change '@rpath/libmkldnn.0.dylib' '@loader_path/libmkldnn.0.dylib' $@
endif
endif
2015-06-14 17:36:33 -07:00
# ps-lite (distributed KVStore backend): libps.a is produced by
# delegating to ps-lite's own makefile via the PSLITE driver target.
$(PS_PATH)/build/libps.a: PSLITE
PSLITE:
$(MAKE) CXX="$(CXX)" DEPS_PATH="$(DEPS_PATH)" -C $(PS_PATH) ps
2015-10-02 22:22:22 -04:00
# dmlc-core is built by its own makefile via the DMLCCORE driver target.
# The `+` prefix runs the sub-make even under `make -n` and propagates
# the jobserver for parallel builds.
# Fixes: `cd X; make` would run the sub-make in the wrong directory if
# the cd failed -- use `&&`; the trailing `cd $(ROOTDIR)` was dead code
# (each recipe line runs in its own shell) and is removed.
$(DMLC_CORE)/libdmlc.a: DMLCCORE
DMLCCORE:
	+ cd $(DMLC_CORE) && $(MAKE) libdmlc.a USE_SSE=$(USE_SSE) config=$(ROOTDIR)/$(config)
2015-06-27 14:57:11 -06:00
# Rebuild nnvm whenever any of its headers or sources change; the real
# work is delegated to nnvm's own makefile.
# Fixes: NNVM_INC/NNVM_SRC used recursive `=` with $(wildcard ...),
# re-running the glob on every expansion -- use `:=` to expand once;
# `cd X; make` would run the sub-make in the wrong directory if the cd
# failed -- use `&&` (the trailing `cd $(ROOTDIR)` was a no-op since
# each recipe line runs in its own shell).
NNVM_INC := $(wildcard $(NNVM_PATH)/include/*/*.h)
NNVM_SRC := $(wildcard $(NNVM_PATH)/src/*/*/*.cc $(NNVM_PATH)/src/*/*.cc $(NNVM_PATH)/src/*.cc)
$(NNVM_PATH)/lib/libnnvm.a: $(NNVM_INC) $(NNVM_SRC)
	+ cd $(NNVM_PATH) && $(MAKE) lib/libnnvm.a DMLC_CORE_PATH=$(DMLC_CORE)
# im2rec record-packing tool.  $(BIN) is the shared link rule for all
# standalone binaries: link every source/object/archive prerequisite
# into the target executable.
bin/im2rec: tools/im2rec.cc $(ALLX_DEP)
$(BIN) :
@mkdir -p $(@D)
$(CXX) $(CFLAGS) -std=c++11 -o $@ $(filter %.cpp %.o %.c %.a %.cc, $^) $(LDFLAGS)
# CPP Package
ifeq ($(USE_CPP_PACKAGE), 1)
include cpp-package/cpp-package.mk
endif
# MKL-DNN build rules (provides the mkldnn / mkldnn_clean targets used
# above and by `clean`) and the C++ unit-test rules.
include mkldnn.mk
include tests/cpp/unittest.mk
2015-06-14 17:36:33 -07:00
# Aggregate driver targets; EXTRA_PACKAGES and TEST are populated by the
# included .mk files and the build configuration.
extra-packages: $(EXTRA_PACKAGES)
2015-09-23 20:26:29 +00:00
test: $(TEST)
2015-06-14 17:36:33 -07:00
# Lint aggregation.  These are command targets, not files: declare them
# .PHONY so a stray file named `lint`/`cpplint`/`pylint` cannot shadow
# them (rcpplint/jnilint have their own rules later in this file).
.PHONY: lint cpplint pylint
lint: cpplint rcpplint jnilint pylint

# C++ lint via dmlc-core's cpplint wrapper; ctc_include is vendored
# third-party code and excluded.
cpplint:
	3rdparty/dmlc-core/scripts/lint.py mxnet cpp include src plugin cpp-package tests \
	    --exclude_path src/operator/contrib/ctc_include

# Python lint; `$$` passes a literal `$` to the shell for the regex
# anchors in --ignore-patterns.
pylint:
	pylint --rcfile=$(ROOTDIR)/ci/other/pylintrc --ignore-patterns=".*\.so$$,.*\.dll$$,.*\.dylib$$" python/mxnet tools/caffe_converter/*.py
2015-07-03 17:06:23 -06:00
# Documentation targets: delegate to the Sphinx makefile in docs/.
# Fix: recursive invocations must use $(MAKE), not bare `make`, so that
# -j/-n flags and the jobserver propagate to the sub-make.
doc: docs

docs:
	$(MAKE) -C docs html

clean_docs:
	$(MAKE) -C docs clean
2015-10-08 22:06:04 -04:00
2015-08-21 23:49:07 -06:00
# Generate the C++ API reference with Doxygen, configured by docs/Doxyfile.
doxygen:
2016-05-07 20:56:04 -07:00
doxygen docs/Doxyfile
2015-07-03 17:06:23 -06:00
# Cython build
2016-09-08 19:10:58 -07:00
# Build the Cython accelerator modules in-place with the default python.
# Fix: `cd python;` would run setup.py in the wrong directory if the cd
# failed -- chain with `&&` instead.
cython:
	cd python && python setup.py build_ext --inplace --with-cython
2016-09-08 19:10:58 -07:00
# Python-2 / Python-3 specific Cython builds, plus cleanup of the
# generated extension modules and intermediate .cpp files.
# Fix: chain `cd` with `&&` so setup.py never runs in the wrong
# directory when the cd fails.
cython2:
	cd python && python2 setup.py build_ext --inplace --with-cython

cython3:
	cd python && python3 setup.py build_ext --inplace --with-cython

cyclean:
	rm -rf python/mxnet/*/*.so python/mxnet/*/*.cpp
# R related shortcuts
# Lint the R package's C++ glue sources.
rcpplint:
3rdparty/dmlc-core/scripts/lint.py mxnet-rcpp ${LINT_LANG} R-package/src
2016-09-30 17:48:29 -04:00
# Build and install the R package: stage libmxnet.so (plus the MKL-DNN
# runtime libraries when this build produced them) and the public
# headers into R-package/inst, pull R build dependencies, generate the
# NAMESPACE/documentation, and install.  The package is installed twice:
# the first install is needed so mxnet.export/document can load it.
rpkg:
2015-11-04 11:29:47 -05:00
mkdir -p R-package/inst/libs
2017-08-09 06:44:58 +00:00
cp src/io/image_recordio.h R-package/src
2015-11-04 11:29:47 -05:00
cp -rf lib/libmxnet.so R-package/inst/libs
# Only stage the MKL-DNN/MKLML/OpenMP runtimes if they exist in lib/.
if [ -e "lib/libmkldnn.so.0" ]; then \
cp -rf lib/libmkldnn.so.0 R-package/inst/libs; \
cp -rf lib/libiomp5.so R-package/inst/libs; \
cp -rf lib/libmklml_intel.so R-package/inst/libs; \
fi
2015-11-04 11:29:47 -05:00
mkdir -p R-package/inst/include
cp -rl include/* R-package/inst/include
Rscript -e "if(!require(devtools)){install.packages('devtools', repo = 'https://cloud.r-project.org/')}"
Rscript -e "if(!require(roxygen2)||packageVersion('roxygen2') < '6.1.1'){install.packages('roxygen2', repo = 'https://cloud.r-project.org/')}"
Rscript -e "library(devtools); library(methods); options(repos=c(CRAN='https://cloud.r-project.org/')); install_deps(pkg='R-package', dependencies = TRUE)"
cp R-package/dummy.NAMESPACE R-package/NAMESPACE
echo "import(Rcpp)" >> R-package/NAMESPACE
2016-09-30 17:48:29 -04:00
R CMD INSTALL R-package
Rscript -e "require(mxnet); mxnet:::mxnet.export('R-package'); warnings()"
rm R-package/NAMESPACE
Rscript -e "devtools::document('R-package');warnings()"
R CMD INSTALL R-package
2015-11-04 11:29:47 -05:00
# Run the R testthat suite (failing the build on any test failure) and
# write a codecov-format coverage report with a randomized filename.
rpkgtest:
Rscript -e 'require(testthat);res<-test_dir("R-package/tests/testthat");if(!testthat:::all_passed(res)){stop("Test failures", call. = FALSE)}'
2018-11-16 21:29:07 -08:00
Rscript -e 'res<-covr:::package_coverage("R-package");fileConn<-file(paste("r-package_coverage_",toString(runif(1)),".json"));writeLines(covr:::to_codecov(res), fileConn);close(fileConn)'
# Scala package shortcuts -- all delegate to Maven in scala-package/.
scalaclean:
(cd $(ROOTDIR)/scala-package && mvn clean)
# Build and install artifacts without running tests.
scalapkg:
(cd $(ROOTDIR)/scala-package && mvn install -DskipTests)
scalainstall:
(cd $(ROOTDIR)/scala-package && mvn install)
# NOTE(review): scalaunittest is identical to scalainstall (plain
# `mvn install`, which runs tests by default -- contrast scalapkg's
# -DskipTests); confirm the duplication is intentional.
scalaunittest:
(cd $(ROOTDIR)/scala-package && mvn install)
scalaintegrationtest:
(cd $(ROOTDIR)/scala-package && mvn integration-test -DskipTests=false)
2016-03-03 23:42:18 +08:00
# Lint the JNI glue code, excluding the generated C API header.
jnilint:
3rdparty/dmlc-core/scripts/lint.py mxnet-jnicpp cpp scala-package/native/src --exclude_path scala-package/native/src/main/native/org_apache_mxnet_native_c_api.h
2016-03-03 23:42:18 +08:00
# Remove everything the rpkg target generated or staged into R-package/.
rclean:
$(RM) -r R-package/src/image_recordio.h R-package/NAMESPACE R-package/man R-package/R/mxnet_generated.R \
R-package/inst R-package/src/*.o R-package/src/*.so mxnet_*.tar.gz
# Two variants of `clean`.  With EXTRA_OPERATORS set: also removes deps/
# and scrubs the .o/.d files built inside the external operator
# directories.  Without it: keeps deps/ but additionally runs
# mkldnn_clean and testclean.  Both delegate cleanup to the sub-project
# makefiles (dmlc-core, ps-lite, nnvm, amalgamation).
ifneq ($(EXTRA_OPERATORS),)
clean: rclean cyclean $(EXTRA_PACKAGES_CLEAN)
$(RM) -r build lib bin deps *~ */*~ */*/*~ */*/*/*~
2019-03-04 13:26:21 -08:00
(cd scala-package && mvn clean) || true
cd $(DMLC_CORE); $(MAKE) clean; cd -
cd $(PS_PATH); $(MAKE) clean; cd -
cd $(NNVM_PATH); $(MAKE) clean; cd -
cd $(AMALGAMATION_PATH); $(MAKE) clean; cd -
2016-11-07 15:10:50 -06:00
$(RM) -r $(patsubst %, %/*.d, $(EXTRA_OPERATORS)) $(patsubst %, %/*/*.d, $(EXTRA_OPERATORS))
$(RM) -r $(patsubst %, %/*.o, $(EXTRA_OPERATORS)) $(patsubst %, %/*/*.o, $(EXTRA_OPERATORS))
2015-12-28 23:13:08 -08:00
else
clean: rclean mkldnn_clean cyclean testclean $(EXTRA_PACKAGES_CLEAN)
$(RM) -r build lib bin *~ */*~ */*/*~ */*/*/*~
2019-03-04 13:26:21 -08:00
(cd scala-package && mvn clean) || true
cd $(DMLC_CORE); $(MAKE) clean; cd -
cd $(PS_PATH); $(MAKE) clean; cd -
cd $(NNVM_PATH); $(MAKE) clean; cd -
cd $(AMALGAMATION_PATH); $(MAKE) clean; cd -
2015-12-28 23:13:08 -08:00
endif
# clean_all is currently just an alias for clean.
clean_all: clean
# Pull in the compiler-generated .d dependency files; the leading `-`
# silences "file not found" on a fresh checkout.
-include build/*.d
-include build/*/*.d
2016-01-09 02:11:39 -08:00
-include build/*/*/*.d
-include build/*/*/*/*.d
2015-12-23 15:07:58 -08:00
# Also include dependency files for external operator directories.
ifneq ($(EXTRA_OPERATORS),)
2016-11-07 15:10:50 -06:00
-include $(patsubst %, %/*.d, $(EXTRA_OPERATORS)) $(patsubst %, %/*/*.d, $(EXTRA_OPERATORS))
2015-12-23 15:07:58 -08:00
endif