root/MNN - MNN - Gitea: Git with a cup of tea

Commit Graph

Author	SHA1	Message	Date
xiaying	acb3bb6c62	[Sync] Sync Internal Gitlab 2.2.0	2022-10-30 08:44:24 +08:00
xiaying	db53f951e6	[Sync] Sync Internal 2.1.2	2022-09-30 10:02:52 +08:00
zhaode.wzd	4753255227	[MNN:Sync] Sync Internal 2.1.1 contain below changes. [Pymnn:Bugfix] Fix usage and small bug in pymnn. [Docs:Update] Update docs/cpp markdown [Docs:Update] Add docs check. [MNN:Update] Update VecHalf.hpp [MNN:Bugfix] Fix compile errors caused by "#define MNN_THREAD_LOCK_CPU" [Geometry:Bugfix] Fix bug for resize of broadcastto: https://github.com/alibaba/MNN/issues/2040 [Docs:Update] Update inference api usage. [Pymnn:Bugfix] Close hiai load to fix resource leak. [MNN:Update] Down gradle version for demo compile	2022-09-09 17:24:37 +08:00
xiaying	68708c5d66	Sync Internal 2.0.4	2022-08-12 10:30:48 +08:00
xiaying	8330da263a	[Sync] Sync internal 2.0.3	2022-07-22 09:59:30 +08:00
xiaying	eb51926f84	[MNN:Sync] Sync internal Gitlab to 2.0.2	2022-07-19 13:52:07 +08:00
xiaying	d3ffdf4229	[MNN:Sync] Sync internal gitlab	2022-06-24 18:30:05 +08:00
xiaying	aeaac3fde3	[MNN:Sync] Sync internal gitlab	2022-06-10 10:39:50 +08:00
Q-engineering	49a6d13399	Raspberry Pi 32-bit fix	2022-05-30 15:15:15 +02:00
Yulv-git	77cc100153	Fix some typos.	2022-05-27 23:48:09 +08:00
xiaying	0c718e552b	[Sync] Sync internal Gitlab	2022-02-18 11:30:27 +08:00
xiaying	1b626d72c3	[MNN:Sync] Sync internal gitlab	2022-01-04 10:50:40 +08:00
xiaying	69dba73dc7	[MNN:Sync] Sync internal gitlab Main Feature: 1. Add OpenCV API and Numpy API Support 2. Protobuf move into MNN 3. Add more op for torchscript convert 4. Add recompute to speed up geometry compute 5. Add ModuleBasic Test	2021-11-30 10:10:53 +08:00
xiaying	03c7b5347b	[MNN:Sync] Sync internal Gitlab	2021-09-18 15:52:30 +08:00
xiaying	d8fc15d84b	[MNN:Sync] Sync internal github Commits: 8148ae75c 弗人 bugfix 14cb8ec7f 弗人 [Converter:Bugfix] bugfix for onnx depthwise convtranspose 476fbcd90 雁行 [MNN:Feature] Open AVX cast and bugfix for contentCFG. 5e26b9fd3 雁行 [Test:Feature] Add android test. 37e147b25 雁行 [MNN:Bugfix] Bugfix for floordiv. 144c185f5 tianbu.xsw hangxing fix hiai b4fd429d6 tianbu.xsw updateCacheFile bugfix -- update cache size d4ba572a8 雁行 [MNN:Bugfix] Support int8 in AVX2 and some Bugfix. 43061f07e xiaying [MNN:Bugfix] Fix bug for module mode run part of model 398cc5ab6 tianhang.yth refactor demo 736380600 xiaying [Express:Bugfix] Fix memory leak for copy branch b8dab0a27 tianhang.yth MNNFloat2Int8 sizeQuad=0 crash fix 94b95bfed ghz [BugFix]1.Better method for fast pack valid check 6a921f85e xiaying [Converter:Bugfix] Fix bug for Fuseconsttosubgraph 5f77ae889 tianhang.yth numThread bugfix a807ef879 tianhang.yth add createSession(configs, runtimeinfo) API, add pymnn demo, pymnn logcat bugfix ad05409d3 xiaying [MNN:Bugfix] Fix bug for StaticModule's sizecompute overflow, add error print for module mode 9d81b8299 xiaying [MNN:Bugfix] Fix bug for Unique op for output size = 1 03b15e9af xiaying [Test:Feature] Add MatMulBConst Test, Fix bug for single Convert c944a76ee tianhang.yth add auto backend and getSessionInfo @tianbu 91fa7267b ghz [BugFix]1.fix the error in eP check bf0041f77 ghz [BugFix]1.Fix the logic error in eP check. 2.Fix the sp align error 693871672 雁行 [CPU:Bugfix] rm adrp instruction for clang compiler bug. 1b8f6b3d8 ghz 1.Fix the wronly use of r13 in arm32 version. 2.Fix the missing callee register save and restore process. feb7ecc4c 弗人 modify log of python offline quant 040c04811 ghz [BufFix]1.replace platform-related regs. 2.fix the same problem in arm32 version 609f37db8 弗人 add log for python quant, python convert 5511dd30a ghz [BugFix]1.Add testcases in SparseConv to check all functional code branch. 2. Fix the bug in "MNNPackC4ForMatMul_A.S" in arm64, which is caused by the missing check of eReal parameter. a93ff9280 tianhang.yth add tf.Unique op support 9729ff773 allen.lk [Bugfix] Fix one arm32 instruction syntax that clang works but gcc DOES NOT work. use index instruction instead. 297c1ad14 雁行 [Expr:Bugfix] bugfix for tensor content used by shape compute. ef8c369e3 弗人 catch exception 07c2dd670 弗人 add dependence to setup, base64 encode url, add time log 177e590c1 弗人 [Python:Feature] add aliyun log for python quant tool 40a7928cf allen.lk [Debug:Sparse] 1.Add group parameter in torchscript converter. 2. Stop split running to avoid memory corruption when check failed in TransformGroupConvolution 3. fix Op split issue in TransformGroupConvolution 3bdea84a1 allen.lk [Debug:Sparse] Fix and warning one kind of segmentfault cause by memory corruption when resize ConvolutionWinograd. Avoid to use some registers as arm restriction. c3c6fbdbd allen.lk [Debug:Sparse] Fix and warning one kind of segmentfault cause by memory corruption when resize ConvolutionWinograd. Avoid to use some registers as arm restriction. bc590eee4 雁行 [Converter:Bugfix] bugfix for onnx instancenormalization convert. d8918593f tianhang.yth add auto backend and getSessionInfo @tianbu 83a198ed7 杭行 update d0dd3e09b 杭行 update 99540202e xiaying [Converter:Optimize] Opt the tensor convert insert 333d8db82 allen.lk [Debug:Sparse] Fix All platform-register r9 / x18 issue on arm32 and arm64. db5994672 杭行 merge 6293de7b8 tianbu.xsw fix pymnn updateCacheFile 5c2e11cb1 tianbu.xsw do updateCache in createSession 6e7641ff4 tianbu.xsw do not limit cacheFile for a model 5287a65e4 tianbu.xsw bugfix 52ba53a91 tianbu.xsw revert pymnn api 60284d830 tianbu.xsw bugfix 6d8077490 tianbu.xsw rename updateCacheFile api params 3cb172710 tianhang.yth updateCacheFile API size default value is 0 c5b69aabf tianbu.xsw updateCacheFile python api fix 5d5da7aa5 tianbu.xsw reflector code 5707877a4 雁行 [MNN:Speed] Speedup for softmax in x86 and arm. 2a211825c tianbu.xsw reflector code for updateCacheFile 76db3a835 tianbu.xsw [Cache Feature]: Add updateCacheFile API for increment cache b06b0fd43 allen.lk [Debug:Sparse] Fix and warning one kind of segmentfault cause by memory corruption when resize ConvolutionWinograd. Avoid to use some registers as arm restriction. e68bfa495 雁行 [Converter:Feature] Add UUID when model convert. a9cb935dc xiaying [MNN:Speed] Support c4nhwc for more fastblit 019f40353 xiaying [Converter:Refractor] Reduce memory used by MNNConvert(bert from 5G -> 1G) d2a6d3d05 xiaying [MNN:Bugfix] Fix bug for identity output not find 604d0801b xiaying [Converter:Bugfix] Fix bug for FuseGeLu 4bada2367 xiaying [MNN:Refractor] SegmentMean rewrite as segment 82070e708 xiaying [MNN:Bugfix] Fix bug for GeometryBinary e8ea4266e xiaying Fix bug for ShapeTensorConvert compute for dim = 1 error 1f1cf1991 xiaying [Tools:Bugfix] Fix system compability for fastTestOnnx 6f422efe2 xiaying [Tools:Bugfix] Remove color for checkDir for easy to dump 968f7ec88 xiaying [MNN:Speed] Support turn broadcast binary to loop 3e7aaf46f xiaying [MNN:Refractor] Set Convolution1x1Strassen support variable input/output ptr 1f65ab163 xiaying [MNN:Bugfix] Fix bug for mini mnn can't convert model d65953d47 xiaying [MNN:Bugfix] Fix bug for armv7a - android-14 + ARM82 8b68be45c xiaying [MNN:Feature] Add segment 8a8f264f5 xiaying [Vulkan:Bugfix] Remove unuseful print 025bb0fda xiaying [Converter:Bugfix] Fix bug for oneof don't support 43900251e tianbu.xsw enable setCacheFile python API ebfb05c74 tianbu.xsw [Metal Feature] support metallib obtain from walle transfer task 9665c0a79 弗人 add check for path in json file c66fef224 xiaying [Converter:Bugfix] Fix bug for oneof don't support 42f192852 xiaying [MNN:Bugfix] Fix bug for not set output / saveTensor into origin Schedule's outputs 1b95354ff 雁行 [Feature]: Support shape compute for SetDiff1D, and null input for Prod. 83966d043 xiaying [Test:Feature] Add test for static module 42d1be933 xiaying [Converter:Bugfix] Fix bug for mnn convert and static model add more outputs for origin model 9067531c3 xiaying [Converter:Refractor] formatLicence 99558bed9 xiaying [Converter:Bugfix] Count the op for unuseful and controlflow 4f6da0fa7 allen.lk [Feature:GRUMultiOutput] fix multi output dimension type c6b219bce xiaying [Converter:Feature] Turn torch converter to object dd4e68a37 xiaying [Converter:Feature] Support dump supported ops 80b6a60a3 xiaying [Converter:Info] If has output name, print output name instead of computed 015278fc3 xiaying [MNN:Refractor] Revert IfModule's debug info 23ac967c4 xiaying Don't transform for multi-input convolution/deconvolution b02b0d4de xiaying Fix bug for multi-input for conv1d 254d8b1d4 xiaying Fix bug for Conv1dSqueezeMove for multi input convolution 1d d47d0b9ca xiaying Fix bug for CPURaster's fuse nc4hw4 357c5bd33 xiaying Fix ConvBiasAdd for conv's inputs op > 1 55b1f0c9c xiaying [Converter:Bugfix] Don't transform for multi-input convolution/deconvolution 1902a30f5 xiaying [Converter:Bugfix] Fix bug for Conv1dSqueezeMove for multi input convolution 1d c23fe617b xiaying [MNN:Bugfix] Fix bug for multi-input for conv1d 8ff018426 xiaying [MNN:Bugfix] Fix bug for CPURaster's fuse nc4hw4 d4e8cd602 xiaying [Converter:Bugfix] Fix ConvBiasAdd for conv's inputs op > 1 846266b42 tianbu.xsw return when program and tune both nullptr fd67c76a9 xiaying [Converter:Bugfix] DepthwiseConvWeightMerge only valid for tflite e77a242c4 xiaying [Converter:Feature] Support tflite's half pixel be054c377 tianbu.xsw [OpenCL Bugfix] do not rewrite cache when binary program is produced 51e65aa35 xiaying [Converter:Feature] Support tflite for fp16 and multi-input convolution 1ccdfdeb5 tianbu.xsw redefine svm macro name 31234d372 tianbu.xsw [OpenCL SVM] add macro for only use wrapper d739e35da xiaying [MNN:Bugfix] Fix compile bug for grid op 24ab13c79 Joker feat(arm82): add GridSample op support in arm82 backend, AVX(by xiaying) 7b142978e xiaying [AVX512:Speed] Optimize for e <= 8 5f6febe7b tianbu.xsw code refactor 998d91b57 xiaying [Express:Speed] Merge submodule for speed 22c89146f tianhang.yth fix alpha div by zero bug and arm server compile bug 8f829a170 tianbu.xsw [OpenCL Pad] unify conv/deconv pad computing 4a28f603e xiaying [Express:Speed] Shared Const for All Submodule c74cf28f3 xiaying [MNN:Refractor] Seperate Const init and schedule 2a1eebb7a xiaying [Tools:Bugfix] Fix bug for modelTest.py count size 72f04008c xiaying [MNN:Refractor] Delete unuseful const op 1e735d03c xiaying [Converter:Bugfix] Fix bug for static module gen 4dfadbc6e xiaying [MNN:Refractor] Rewrite const init mode 1fcf0417a xiaying [MNN:Bugfix] Fix bug for deconvolutin multi-input for multi-batch 41d429cfd xiaying [Train:Bugfix] Revert convert NCHW for mnistTrain f947a5f01 xiaying [Test:Feature] Add testTrain dad59b6f6 tianbu.xsw move realize code from Backend.hpp to Tensor.cpp cf4473ad1 xiaying [Train:Bugfix] Support pad for GeometryPoolGrad 91ab13734 xiaying [MNN:Bugfix] Fix compile bug for avx512 742e80f47 xiaying [MNN:Refractor] Opt the logic for checknan judge 12543b841 xiaying [ARM82:Bugfix] Fix compile bug for ios 3a2b0a49f xiaying [ARM82:Speed] Opt Pack / Unpack for armv8 c0f1995cd xiaying [ARM82:Speed] Opt MNNPackC8FP16 and MNNUnpackC8FP16 by asm e0fc77dcf xiaying [MNN:Speed] Fix bug for DeconvolutionWithStride for C4HW4, open it 584bec578 xiaying [MNN:Bugfix] Fix bug for format set error for onnx d5bd4148d xiaying [MNN:Bugfix] Fix bug for format set error for onnx b00265841 xiaying [MNN:Bugfix] Fix bug for SparseConvolutionTiledExecutor bb09188ac xiaying [Test:Bugfix] Fix bug for run into sparse auto 426d1babd xiaying [MNN:Refractor] Small bugfix for Group convolution and pack 7d0ea1c46 tianbu.xsw [testModel Feature] support testModel.out input resize 4169c54ce xiaying [MNN:Bugfix] Fix bug for checkNAN for origin 412a82222 xiaying [Test:Bugfix] Fix bug for CheckNAN's error of matmul 319b1d425 xiaying [MNN:Bugfix] Fix bug for multi-batch for ConvInt8 050b728a6 xiaying [Test:Bugfix] Use NCHW for ConvInt8Test 7db3423a1 xiaying [OpenCL:Bugfix] Fix bug for opencl::image,opencl::buffer for C4HW4 adcec6a7f xiaying [Vulkan:Bugfix] Fix bug for invalid tensor size limit d2a7cf4e9 xiaying [Vulkan:Bugfix] Fix bug for onCopyBuffer of nc4hw4 557bebdd3 xiaying [MNN:Bugfix] Fix bug for BF16-ARM32 bbe186649 tianbu.xsw [Update AUTO mode]: fix MNN_FORWARD_AUTO choose priority 6deb23439 xiaying [MNN:Bugfix] Fix bug for GeometryBinary don't care about NC4HW4 same size b137590e4 xiaying [MNN:Bugfix] Fix bug for GeometryBinary don't care about NC4HW4 same size 7003558ea xiaying [Converter:Bugfix] Fix bug for onnx pad for serveral case b5f8cae5a xiaying [Converter:Bugfix] Fix bug for onnx pad for serveral case 29b09e125 xiaying [MNN:Bugfix] Fix bug for arm64-bf16 42ce00770 xiaying [MNN:Bugfix] Fix bug for ARM64 - float a2d89fc18 雁行 [Converter:Feature] Support Binary Unary for Torch. 7f1c0deb1 xiaying [MNN:Bugfix] Fix bug for Raster for Int8 8335a6f18 tianbu.xsw [OpenCL Shared Memory] modify data_format method b359e031b xiaying [ARM82:Bugfix] Fix bug for arm82 and speed up pack / unpack c8 24bf3fc88 雁行 [Convert:Feature] Support LayerNormFuse without gamma beta. 3e629624b xiaying [MNN:Bugfix] Fix bug for float - armv7a 2b7908ec7 tianbu.xsw modify workItemSize 3cee0d413 xiaying [MNN:Bugfix] test wrong clear 9cbbfb998 xiaying [MNN:Bugfix] fix compile bug for c++ < 14 2d7a44484 xiaying [MNN:Bugfix] fix compile bug for c++ < 14 eb7d0cb53 xiaying [Test:Bugfix] Don't test for NC4HW4 directly 7b40ca8d1 xiaying [MNN:Bugfix] Fix bug for ConvolutionGroup 2694d8a91 xiaying [MNN:Bugfix] Fix bug for CPUGridSample f89af60f6 xiaying [MNN:Bugfix] Fix compile bug for arm a151abcdd xiaying [MNN:Bugfix] Fix bug for convert for int8 / int16 b254dbe61 雁行 [MNN:Bugfix] Bugfix for Conv onClone. d08150631 xiaying [MNN:Bugfix] Fix bug for fast rcnn e5568a0df xiaying [MNN:Bugfix] Fix bug for CPURaster treat NC4HW4 fast blit 128318933 雁行 [Raster:Bugfix] bugfix for Raster merge onResize. 03caacbea xiaying [MNN:Bugfix] fix bug for CPUDeconvolution and Convolution1x1Strassen for iw != ow e1e3c245c xiaying [MNN:Bugfix] Fix bug for ConvolutionWinograd 2524cbc6d xiaying [MNN:Bugfix] Fix bug for CPUSoftmax 44ec79b8f xiaying [MNN:Bugfix] Fix bug for CPUConvolutionDepthwise / Scale / DeconvolutionDW 21ae956ce xiaying [MNN:Bugfix] Fix bug for Multi-Batch-TiledExecutor 09a5069c7 xiaying [MNN:Speed] Add offset for src and dst 6776c6784 xiaying [MNN:Bugfix] Fix bug for trainable model cc83ae30b xiaying [MNN:Bugfix] Fix bug for trainable model	2021-07-29 11:47:13 +08:00
hush-alibaba	58545d6ca1	Synchronize internal github for version 1.2.0 (#1518 )	2021-06-11 17:17:13 +08:00
tianhang.yth	d85952d826	sync from internal repo	2021-04-28 18:02:10 +08:00
xiaying	5947b90a03	[PATCH 30/36] [MNN:Refractor] Move NN to train folder	2021-04-16 14:29:38 +08:00
xiaying	d91fc63976	[MNN:Sync] Sync internal Gitlab	2021-04-08 15:34:23 +08:00
Joker	21127cb907	improvement(HiAI): update cmake of HiAI backend to support it when MNN_SEP_BUILD=true	2021-03-11 16:36:22 +08:00
jxt1234	4baf6b1ecf	Merge pull request #1370 from WillTao-RD/master avoid build warning if MNN_SEP_BUILD is OFF	2021-02-23 11:08:35 +08:00
xiaying	5e127496fc	Sync Internal Github	2021-02-07 10:47:03 +08:00
taowei	3d1cf0c3c4	avoid build warning if MNN_SEP_BUILD is OFF	2021-02-04 10:24:34 +08:00
xiaying	aad7b7aed1	[MNN:Sync] Sync internal Gitlab	2021-01-08 14:36:59 +08:00
xiaying	2d1b129121	[MNN:Sync] Sync internal git	2021-01-06 16:29:37 +08:00
xiaying	644eadbdb0	[PATCH 338/350] [MNN:Bugfix] Remove GenVCS for serveral system can't execute	2021-01-06 15:57:22 +08:00
xiaying	0fe2b0dfee	[PATCH 278/350] [MNN:Speed] Support OneDNN for MNN Convolution	2021-01-06 15:57:17 +08:00
xiaying	acbdeaa60b	[PATCH 195/350] [CV:Bugfix] Fix bug for CMakeLists	2021-01-06 15:57:10 +08:00
xiaying	0061c6a454	[PATCH 172/350] [CV:Bugfix] Fix compile optimizer error in sse machine for cv	2021-01-06 15:57:08 +08:00
Hui Shu	ab711d484c	Synchronize internal master to Github	2020-12-15 14:12:35 +08:00
riddick	7cc4ddc585	feat(backend): add M1 chip support - add cpu backend support for M1 chip - add arm82 backend support for M1 chip	2020-11-24 22:48:48 +08:00
Hui Shu	d6795ad031	Github release 1.1.0	2020-11-05 16:49:17 +08:00
xiaying	0a95efdf66	[MNN:Bugfix] Fix cv compile bug in windows	2020-07-04 09:29:49 +08:00
xiaying	255db932eb	[MNN:Sync] Sync Internal Github	2020-07-04 01:21:30 +08:00
tianbu.xsw	d4a1814cd2	add MNN_OPENCL_LWS_TUNE macro	2020-07-04 01:06:26 +08:00
root	57ab91b9e7	Fix compile bug of use sse for cv in linux	2020-07-04 01:06:20 +08:00
xiaying	5fc7acd37e	support transpose, fix bug for not align	2020-07-04 01:06:18 +08:00
Evgeny Proydakov	5f52b9bfc8	Fixed openmp android build.	2020-05-22 22:54:27 +03:00
Evgeny Proydakov	5f673ddcae	Fixed Travis CI linux vulkan build. As I see libMNN uses dl library. I updated target_link_libraries. ./ciscripts/Linux/CL_ThreadPool_Vulkan.sh /usr/bin/ld: libMNN.so: undefined reference to `dlopen' /usr/bin/ld: libMNN.so: undefined reference to `dlclose' /usr/bin/ld: libMNN.so: undefined reference to `dlsym' After the change, everything is collected without problems. [100%] Linking CXX executable ../../runTrainDemo.out [100%] Built target runTrainDemo.out	2020-05-19 23:31:26 +03:00
玄裳	0df31a8667	MNN 1.0.0 release sync. - Added Python Express API implemented with pbind11 - Added demos for Python Express API - Performance improvements for ARM64, ARMv8.2, x86. - README update.	2020-05-07 18:22:11 +08:00
和彬	00b8d31fbf	remove unnecessary iostream, add comment in cmake	2020-04-29 10:00:52 +08:00
hebin	eff988f719	windows support evaluation and train tools	2020-04-29 10:00:50 +08:00
xiaying	74cdc8963e	Remove unuseful change	2020-04-15 15:34:57 +08:00
xiaying	d42340d6a0	Fix bug for sep_build judge before shared libs	2020-04-15 15:34:57 +08:00
xiaying	6256c3a2bf	For static library set sep_build as false	2020-04-15 15:34:57 +08:00
xiaying	be57ec2863	Fix bug for compile static library for android	2020-04-15 15:34:57 +08:00
xiaying	a76be60722	[MNN:Sync] Fix compile bug for windows, fix bug for device not support fma	2020-04-14 22:52:24 +08:00
xiaying	5e5902240c	[MNN:Sync] Add MNN-Plugin, Fix serveral bug	2020-04-14 21:43:02 +08:00
xiaying	d01b041bb0	[PATCH 12/17] [MNN:Refractor] Remove schema build from cmakelists	2020-04-13 13:08:05 +08:00
xiaying	6d6a51e52e	[PATCH 10/17] [MNN:Feature] Support not build tools	2020-04-13 13:08:05 +08:00
Just Test	fb8075fddf	[PATCH 16/28] undo some change	2020-03-31 11:12:39 +08:00
Just Test	ce7dd512b1	[PATCH 15/28] flatbuffer fix, unicode model filepath support, TensorUtils.hpp macro conflict fix, cmake cache option force update	2020-03-31 11:12:39 +08:00
xiaying	48c92a41e7	[MNN:Sync] Sync internal git for remain patch	2020-03-22 20:33:03 +08:00
誉阳	9bf8aebe93	[PATCH 159/160] fix build static library on Mac bug	2020-03-22 19:02:15 +08:00
誉阳	4fde4e5ac8	[PATCH 144/160] fix compile error	2020-03-22 19:02:13 +08:00
誉阳	e69626b08f	[PATCH 142/160] fix build static library bug	2020-03-22 19:02:13 +08:00
xiaying	46c11b3128	[PATCH 060/160] [Vulkan:Feature] Support use system lib for vulkan	2020-03-22 19:02:05 +08:00
xiaying	32aea32cd1	[PATCH 23/24] [MNN:Bugfix] Fix cmake setting bug for linux	2020-03-06 11:04:54 +08:00
海境	90e06944db	Update	2020-02-26 09:57:17 +08:00
海境	ed8b9f2a23	Sync Internal CMake changes	2020-01-17 12:06:45 +08:00
海境	4d6c19f121	Clean	2020-01-16 17:49:15 +08:00
Zhang	91b5ade49a	Sync. Fix OpenGL related building issues. Build the whole suite on Android CI (#580 ) * Sync code with latest internal version * Update CMake * Fix logging issues * Fix OpenGL Building * Bump CMakeLists version. Update Podspec * Update MetalLib Lookup logic * Fix Windows Build	2020-01-16 16:55:46 +08:00
Zhang	c95892de87	Fix OpenCL linking logic when MNN_BUILD_SHARED_LIBS=OFF	2020-01-08 11:34:27 +08:00
Zhang	2e3d5a318b	Fix Windows CI / Enhance iOS Build Script (#568 ) * Add back MNN_BUILD_HARD option * Add simulator to buildiOS.sh	2020-01-07 18:47:22 +08:00
Zhang	f59c5335ba	Fix win ci (#566 ) * Bump version of Podspec and fixing header includes * Fix Windows CI Configuration	2020-01-06 15:46:23 +08:00
海境	37f34c9d90	Fix Android Demo	2020-01-06 13:56:45 +08:00
海境	1803a4e025	Add per-platform CI status flag	2020-01-06 10:52:22 +08:00
海境	f4e28567d3	Bump version. Add issue template	2020-01-03 14:54:26 +08:00
海境	5af265939b	Wrap SSE/AVX Flags in platform detection code	2020-01-03 11:07:05 +08:00
Zhang	859c3e241d	Fix Unix Environment detection and OpenCL Linking (#554 ) * Close #551 * Fix Linux Detection	2020-01-02 12:01:24 +08:00
Zhang	002ac367e4	Update	2019-12-27 22:16:57 +08:00
liqing	e93e8dcbe8	0.2.1.5 # integration - add travis CI - fix building parameters for python # converter - add half storage option for MNN converter - fix op name lost in converter - fix converter bug for print input output, identity remove output # ops - add quantized Convolution & Deconvolution support on OpenCL - add more expression supports - add DetectionPostProcess Op for TensorFlow Lite (ssd is supported directly now) - add supports for LSTM & ELU for ONNX - add support for Convolution that weights is not constant for ONNX - fix Unary Op compile error on Linux - fix Metal backend buffer reuse after resize - fix Metal raw memory access after model releasing - fix redundant transpose in Winograd generater	2019-11-15 14:22:45 +08:00
Naville	879f993978	Provide placeholder values for both Android NATIVE_* variable	2019-11-13 16:03:26 +08:00
海境	133fddc017	Close #436 #449	2019-11-13 15:47:08 +08:00
Zhang	a005e9c46d	Add Travis config for Android/iOS/macOS/Linux (#453 )	2019-11-13 11:44:43 +08:00
liqing	d6b00d04f4	- build: - unify schema building in core and converter; - add more build script for android; - add linux build script for python; - ops impl: - add floor mod support in binary; - use eltwise impl in add/max/sub/mul binary for optimization; - remove fake double support in cast; - fix 5d support for concat; - add adjX and adjY support for batch matmul; - optimize conv2d back prop filter; - add pad mode support for conv3d; - fix bug in conv2d & conv depthwise with very small feature map; - optimize binary without broacast; - add data types support for gather; - add gather ND support; - use uint8 data type in gather v2; - add transpose support for matmul; - add matrix band part; - add dim != 4 support for padding, reshape & tensor convert; - add pad type support for pool3d; - make ops based on TensorFlow Lite quantization optional; - add all & any support for reduction; - use type in parameter as output type in reduction; - add int support for unary; - add variable weight support for conv2d; - fix conv2d depthwise weights initialization; - fix type support for transpose; - fix grad outputs count for reduce grad and reshape grad; - fix priorbox & detection output; - fix metal softmax error; - python: - add runSessionWithCallBackInfo interface; - add max nodes limit (1400) for visualization tool; - fix save error in python3; - align default dim; - convert: - add extra design for optimization; - add more post converting optimizers; - add caffe v1 weights blob support; - add cast, unary, conv transpose support for onnx model; - optimize batchnorm, conv with variable weights, prelu, reshape, slice, upsample for onnx model; - add cos/sin/atan/tan support for unary for tensorflow model; - add any/all support for reduction for tensorflow model; - add elu, conv3d, pool3d support for tensorflow model; - optimize argmax, batchnorm, concat, batch to space, conv with variable weights, prelu, slice for tensorflow model; - others: - fix size computer lock; - fix thread pool deadlock; - add express & parameters in express; - rewrite blitter chooser without static map; - add tests for expr;	2019-10-29 13:37:26 +08:00
liqing	73ad3413cc	- dynamic computation graph (beta) - add supports (/express) - add tests - add benchmarks with it (/benchmark/exprModels) - Python - MNN engine and tools were submitted to pip - available on Windows/macOS/Linux - Engine/Converter - add supports for each op benchmarking - refactor optimizer by separating steps - CPU - add supports for Conv3D, Pool3D, ELU, ReverseSequence - fix ArgMax, Permute, Scale, BinaryOp, Slice, SliceTf - OpenCL - add half transform in CPU - add broadcast supports for binary - optimize Conv2D, Reshape, Eltwise, Gemm, etc. - OpenGL - add sub, real div supports for binary - add supports for unary - optimize Conv2D, Reshape - Vulkan - add max supports for eltwise - Metal - fix metallib missing problem - Train/Quantization - use express to refactor training codes	2019-09-26 21:02:07 +08:00
liqing	487a0fbd0a	beta 0.2.0.9 - fix quantization tool compiling on Windows - fix converter compiling on Windows - fix eltwise optimization on Windows - separate sse & avx for Windows - add LeakyReLU support for TensorFlow - fix reshape, const for TensorFlow - fix dimension format error for ONNX ops - optimize winograd, ReLU for OpenCL - add fp16 availability & dimensions size check-up for OpenCL - optimize GEMM for arm32 - fix ExpandDims shape calculation when inputs size == 1	2019-09-01 19:25:26 +08:00
liqing	f085106da9	release 0.2.0.6 - fix bugs in quantization - add evaluating tool for quantization - add ADMM support in quantization - fix lock in thread pool - fix fusing for deconv - fix reshape converting from ONNX to MNN - turn off blob size checking by default	2019-08-07 16:44:09 +08:00
liqing	7bb0df92dc	beta 0.2.0.5 - CPU - add support for DepthToSpace & SpaceToDepth ops - OpenGL - add Android demo - add half / float runtime option - add support for ROIPooling, Squeeze - fix bugs in conv im2col - OpenCL - fix Concat, Eltwise, Reshape bugs - Tools - add KL threshold method in quantization tool - support optimization for graph with multiple rnn	2019-07-25 13:36:35 +08:00
如幻	732ba68b19	beta 0.2.0.4 - bug fix for quantization tool - bug fix/performance update for thread pool - bug fix for converters - tutorial/doc update - more op support	2019-07-19 17:36:12 +08:00
liqing	a367406308	beta 0.2.0.3 - add quantization tool & cpu impl & demo/exec - add thread pool - add tests - fix onnx converter tensor name mismatch - optimize cpu performance with SSE for windows	2019-07-11 13:56:52 +08:00
liqing	6a4213f7dc	beta 0.2.0.0 - replace FreeImage with stb_image - warn unicode error in Windows compiling - separate clang/gcc build script for android - add default values in fbs - optimize CPU conv / conv depthwise / deconv / deconv depthwise / lstm / sigmoid - add sub support in eltwise - add reciprocal / log1p / log in unary - add zero like / select / set diff 1d - add batch support for permute - add training codes - fix metal error in dynamic separate storage type handling	2019-06-17 20:10:35 +08:00
liqing	ff405a3078	beta 0.1.1.6 - add support for windows - fix bugs in converting dropout - fix bugs in post treat	2019-06-10 21:08:55 +08:00
liqing	28a6f1a614	beta 0.1.1.2 - fix register typo - add input count in conv model - add 5x5 winograd convolution for OpenCL - edit condition to use half in OpenCL - upgrade build.gradle	2019-05-14 19:54:21 +08:00
daquexian	4c91524112	CMAKE_SOURCE_DIR -> CMAKE_CURRENT_SOURCE_DIR	2019-05-10 13:49:20 +08:00
liqing	9bfb58a3ec	fix backend/op/sizer register	2019-05-09 19:39:33 +08:00
liqing	f2d6e8fe2d	beta 0.1.1.1 - use code generate for op/backend/sizer register - add pose demo - fix docs & script - improve cpu softmax performance 80% - improve converter ops fuse	2019-05-08 15:44:57 +08:00
daquexian	3a4a58ccf7	Use bash instead of /bin/sh	2019-05-08 11:53:55 +08:00
liqing	2a2fa36c47	add -lazy option for generate.sh, and use it in CMakeList.	2019-05-07 16:55:36 +08:00
liqing	6566533d2d	generate schema automatically before build MNN.	2019-05-07 16:12:43 +08:00
liqing	07e28c80d3	beta 0.1.1 - update resources and docs - unite tensor's width/height/channel/batch getter - optimize several ops - fix compile warnings and errors on Ubantu - some other bug fixes	2019-05-05 20:27:57 +08:00
liqing	5551108af8	beta 0.1.0	2019-04-19 20:50:09 +08:00

1 2 3

143 Commits