Commit Graph

143 Commits

Author SHA1 Message Date
xiaying acb3bb6c62 [Sync] Sync Internal Gitlab 2.2.0 2022-10-30 08:44:24 +08:00
xiaying db53f951e6 [Sync] Sync Internal 2.1.2 2022-09-30 10:02:52 +08:00
zhaode.wzd 4753255227 [MNN:Sync] Sync Internal 2.1.1 contain below changes.
    [Pymnn:Bugfix] Fix usage and small bug in pymnn.
    [Docs:Update] Update docs/cpp markdown
    [Docs:Update] Add docs check.
    [MNN:Update] Update VecHalf.hpp
    [MNN:Bugfix] Fix compile errors caused by "#define MNN_THREAD_LOCK_CPU"
    [Geometry:Bugfix] Fix bug for resize of broadcastto: https://github.com/alibaba/MNN/issues/2040
    [Docs:Update] Update inference api usage.
    [Pymnn:Bugfix] Close hiai load to fix resource leak.
    [MNN:Update] Down gradle version for demo compile
2022-09-09 17:24:37 +08:00
xiaying 68708c5d66 Sync Internal 2.0.4 2022-08-12 10:30:48 +08:00
xiaying 8330da263a [Sync] Sync internal 2.0.3 2022-07-22 09:59:30 +08:00
xiaying eb51926f84 [MNN:Sync] Sync internal Gitlab to 2.0.2 2022-07-19 13:52:07 +08:00
xiaying d3ffdf4229 [MNN:Sync] Sync internal gitlab 2022-06-24 18:30:05 +08:00
xiaying aeaac3fde3 [MNN:Sync] Sync internal gitlab 2022-06-10 10:39:50 +08:00
Q-engineering 49a6d13399 Raspberry Pi 32-bit fix 2022-05-30 15:15:15 +02:00
Yulv-git 77cc100153 Fix some typos. 2022-05-27 23:48:09 +08:00
xiaying 0c718e552b [Sync] Sync internal Gitlab 2022-02-18 11:30:27 +08:00
xiaying 1b626d72c3 [MNN:Sync] Sync internal gitlab 2022-01-04 10:50:40 +08:00
xiaying 69dba73dc7 [MNN:Sync] Sync internal gitlab
Main Feature:
1. Add OpenCV API and Numpy API Support
2. Protobuf move into MNN
3. Add more op for torchscript convert
4. Add recompute to speed up geometry compute
5. Add ModuleBasic Test
2021-11-30 10:10:53 +08:00
xiaying 03c7b5347b [MNN:Sync] Sync internal Gitlab 2021-09-18 15:52:30 +08:00
xiaying d8fc15d84b [MNN:Sync] Sync internal github
Commits:
        8148ae75c  弗人  bugfix
        14cb8ec7f  弗人  [Converter:Bugfix] bugfix for onnx depthwise convtranspose
        476fbcd90  雁行  [MNN:Feature] Open AVX cast and bugfix for contentCFG.
        5e26b9fd3  雁行  [Test:Feature] Add android test.
        37e147b25  雁行  [MNN:Bugfix] Bugfix for floordiv.
        144c185f5  tianbu.xsw  hangxing fix hiai
        b4fd429d6  tianbu.xsw  updateCacheFile bugfix -- update cache size
        d4ba572a8  雁行  [MNN:Bugfix] Support int8 in AVX2 and some Bugfix.
        43061f07e  xiaying  [MNN:Bugfix] Fix bug for module mode run part of model
        398cc5ab6  tianhang.yth  refactor demo
        736380600  xiaying  [Express:Bugfix] Fix memory leak for copy branch
        b8dab0a27  tianhang.yth  MNNFloat2Int8 sizeQuad=0 crash fix
        94b95bfed  ghz  [BugFix]1.Better method for fast pack valid check
        6a921f85e  xiaying  [Converter:Bugfix] Fix bug for Fuseconsttosubgraph
        5f77ae889  tianhang.yth  numThread bugfix
        a807ef879  tianhang.yth  add createSession(configs, runtimeinfo) API, add pymnn demo, pymnn logcat bugfix
        ad05409d3  xiaying  [MNN:Bugfix] Fix bug for StaticModule's sizecompute overflow, add error print for module mode
        9d81b8299  xiaying  [MNN:Bugfix] Fix bug for Unique op for output size = 1
        03b15e9af  xiaying  [Test:Feature] Add MatMulBConst Test, Fix bug for single Convert
        c944a76ee  tianhang.yth  add auto backend and getSessionInfo @tianbu
        91fa7267b  ghz  [BugFix]1.fix the error in eP check
        bf0041f77  ghz  [BugFix]1.Fix the logic error in eP check. 2.Fix the sp align error
        693871672  雁行  [CPU:Bugfix] rm adrp instruction for clang compiler bug.
        1b8f6b3d8  ghz  1.Fix the wrong use of r13 in arm32 version. 2.Fix the missing callee register save and restore process.
        feb7ecc4c  弗人  modify log of python offline quant
        040c04811  ghz  [BugFix]1.replace platform-related regs. 2.fix the same problem in arm32 version
        609f37db8  弗人  add log for python quant, python convert
        5511dd30a  ghz  [BugFix]1.Add testcases in SparseConv to check all functional code branch. 2. Fix the bug in "MNNPackC4ForMatMul_A.S" in arm64, which is caused by the missing check of eReal parameter.
        a93ff9280  tianhang.yth  add tf.Unique op support
        9729ff773  allen.lk  [Bugfix] Fix one arm32 instruction syntax that clang works but gcc DOES NOT work. use index instruction instead.
        297c1ad14  雁行  [Expr:Bugfix] bugfix for tensor content used by shape compute.
        ef8c369e3  弗人  catch exception
        07c2dd670  弗人  add dependence to setup, base64 encode url, add time log
        177e590c1  弗人  [Python:Feature] add aliyun log for python quant tool
        40a7928cf  allen.lk  [Debug:Sparse] 1.Add group parameter in torchscript converter. 2. Stop split running to avoid memory corruption when check failed in TransformGroupConvolution 3. fix Op split issue in TransformGroupConvolution
        3bdea84a1  allen.lk  [Debug:Sparse] Fix and warn about one kind of segmentation fault caused by memory corruption when resizing ConvolutionWinograd.  Avoid using some registers due to ARM restrictions.
        c3c6fbdbd  allen.lk  [Debug:Sparse] Fix and warn about one kind of segmentation fault caused by memory corruption when resizing ConvolutionWinograd.  Avoid using some registers due to ARM restrictions.
        bc590eee4  雁行  [Converter:Bugfix] bugfix for onnx instancenormalization convert.
        d8918593f  tianhang.yth  add auto backend and getSessionInfo @tianbu
        83a198ed7  杭行  update
        d0dd3e09b  杭行  update
        99540202e  xiaying  [Converter:Optimize] Opt the tensor convert insert
        333d8db82  allen.lk  [Debug:Sparse] Fix All platform-register r9 / x18 issue on arm32 and arm64.
        db5994672  杭行  merge
        6293de7b8  tianbu.xsw  fix pymnn updateCacheFile
        5c2e11cb1  tianbu.xsw  do updateCache in createSession
        6e7641ff4  tianbu.xsw  do not limit cacheFile for a model
        5287a65e4  tianbu.xsw  bugfix
        52ba53a91  tianbu.xsw  revert pymnn api
        60284d830  tianbu.xsw  bugfix
        6d8077490  tianbu.xsw  rename updateCacheFile api params
        3cb172710  tianhang.yth  updateCacheFile API size default value is 0
        c5b69aabf  tianbu.xsw  updateCacheFile python api fix
        5d5da7aa5  tianbu.xsw  refactor code
        5707877a4  雁行  [MNN:Speed] Speedup for softmax in x86 and arm.
        2a211825c  tianbu.xsw  refactor code for updateCacheFile
        76db3a835  tianbu.xsw  [Cache Feature]: Add updateCacheFile API for increment cache
        b06b0fd43  allen.lk  [Debug:Sparse] Fix and warn about one kind of segmentation fault caused by memory corruption when resizing ConvolutionWinograd.  Avoid using some registers due to ARM restrictions.
        e68bfa495  雁行  [Converter:Feature] Add UUID when model convert.
        a9cb935dc  xiaying  [MNN:Speed] Support c4nhwc for more fastblit
        019f40353  xiaying  [Converter:Refractor] Reduce memory used by MNNConvert(bert from 5G -> 1G)
        d2a6d3d05  xiaying  [MNN:Bugfix] Fix bug for identity output not find
        604d0801b  xiaying  [Converter:Bugfix] Fix bug for FuseGeLu
        4bada2367  xiaying  [MNN:Refractor] SegmentMean rewrite as segment
        82070e708  xiaying  [MNN:Bugfix] Fix bug for GeometryBinary
        e8ea4266e  xiaying  Fix bug for ShapeTensorConvert compute for dim = 1 error
        1f1cf1991  xiaying  [Tools:Bugfix] Fix system compatibility for fastTestOnnx
        6f422efe2  xiaying  [Tools:Bugfix] Remove color for checkDir for easy to dump
        968f7ec88  xiaying  [MNN:Speed] Support turn broadcast binary to loop
        3e7aaf46f  xiaying  [MNN:Refractor] Set Convolution1x1Strassen support variable input/output ptr
        1f65ab163  xiaying  [MNN:Bugfix] Fix bug for mini mnn can't convert model
        d65953d47  xiaying  [MNN:Bugfix] Fix bug for armv7a - android-14 + ARM82
        8b68be45c  xiaying  [MNN:Feature] Add segment
        8a8f264f5  xiaying  [Vulkan:Bugfix] Remove unuseful print
        025bb0fda  xiaying  [Converter:Bugfix] Fix bug for oneof don't support
        43900251e  tianbu.xsw  enable setCacheFile python API
        ebfb05c74  tianbu.xsw  [Metal Feature] support metallib obtain from walle transfer task
        9665c0a79  弗人  add check for path in json file
        c66fef224  xiaying  [Converter:Bugfix] Fix bug for oneof don't support
        42f192852  xiaying  [MNN:Bugfix] Fix bug for not set output / saveTensor into origin Schedule's outputs
        1b95354ff  雁行  [Feature]: Support shape compute for SetDiff1D, and null input for Prod.
        83966d043  xiaying  [Test:Feature] Add test for static module
        42d1be933  xiaying  [Converter:Bugfix] Fix bug for mnn convert and static model add more outputs for origin model
        9067531c3  xiaying  [Converter:Refractor] formatLicence
        99558bed9  xiaying  [Converter:Bugfix] Count the op for unuseful and controlflow
        4f6da0fa7  allen.lk  [Feature:GRUMultiOutput] fix multi output dimension type
        c6b219bce  xiaying  [Converter:Feature] Turn torch converter to object
        dd4e68a37  xiaying  [Converter:Feature] Support dump supported ops
        80b6a60a3  xiaying  [Converter:Info] If has output name, print output name instead of computed
        015278fc3  xiaying  [MNN:Refractor] Revert IfModule's debug info
        23ac967c4  xiaying  Don't transform for multi-input convolution/deconvolution
        b02b0d4de  xiaying  Fix bug for multi-input for conv1d
        254d8b1d4  xiaying  Fix bug for Conv1dSqueezeMove for multi input convolution 1d
        d47d0b9ca  xiaying  Fix bug for CPURaster's fuse nc4hw4
        357c5bd33  xiaying  Fix ConvBiasAdd for conv's inputs op > 1
        55b1f0c9c  xiaying  [Converter:Bugfix] Don't transform for multi-input convolution/deconvolution
        1902a30f5  xiaying  [Converter:Bugfix] Fix bug for Conv1dSqueezeMove for multi input convolution 1d
        c23fe617b  xiaying  [MNN:Bugfix] Fix bug for multi-input for conv1d
        8ff018426  xiaying  [MNN:Bugfix] Fix bug for CPURaster's fuse nc4hw4
        d4e8cd602  xiaying  [Converter:Bugfix] Fix ConvBiasAdd for conv's inputs op > 1
        846266b42  tianbu.xsw  return when program and tune both nullptr
        fd67c76a9  xiaying  [Converter:Bugfix] DepthwiseConvWeightMerge only valid for tflite
        e77a242c4  xiaying  [Converter:Feature] Support tflite's half pixel
        be054c377  tianbu.xsw  [OpenCL Bugfix] do not rewrite cache when binary program is produced
        51e65aa35  xiaying  [Converter:Feature] Support tflite for fp16 and multi-input convolution
        1ccdfdeb5  tianbu.xsw  redefine svm macro name
        31234d372  tianbu.xsw  [OpenCL SVM] add macro for only use wrapper
        d739e35da  xiaying  [MNN:Bugfix] Fix compile bug for grid op
        24ab13c79  Joker  feat(arm82): add GridSample op support in arm82 backend, AVX(by xiaying)
        7b142978e  xiaying  [AVX512:Speed] Optimize for e <= 8
        5f6febe7b  tianbu.xsw  code refactor
        998d91b57  xiaying  [Express:Speed] Merge submodule for speed
        22c89146f  tianhang.yth  fix alpha div by zero bug and arm server compile bug
        8f829a170  tianbu.xsw  [OpenCL Pad] unify conv/deconv pad computing
        4a28f603e  xiaying  [Express:Speed] Shared Const for All Submodule
        c74cf28f3  xiaying  [MNN:Refractor] Separate Const init and schedule
        2a1eebb7a  xiaying  [Tools:Bugfix] Fix bug for modelTest.py count size
        72f04008c  xiaying  [MNN:Refractor] Delete unuseful const op
        1e735d03c  xiaying  [Converter:Bugfix] Fix bug for static module gen
        4dfadbc6e  xiaying  [MNN:Refractor] Rewrite const init mode
        1fcf0417a  xiaying  [MNN:Bugfix] Fix bug for deconvolution multi-input for multi-batch
        41d429cfd  xiaying  [Train:Bugfix] Revert convert NCHW for mnistTrain
        f947a5f01  xiaying  [Test:Feature] Add testTrain
        dad59b6f6  tianbu.xsw  move realize code from Backend.hpp to Tensor.cpp
        cf4473ad1  xiaying  [Train:Bugfix] Support pad for GeometryPoolGrad
        91ab13734  xiaying  [MNN:Bugfix] Fix compile bug for avx512
        742e80f47  xiaying  [MNN:Refractor] Opt the logic for checknan judge
        12543b841  xiaying  [ARM82:Bugfix] Fix compile bug for ios
        3a2b0a49f  xiaying  [ARM82:Speed] Opt Pack / Unpack for armv8
        c0f1995cd  xiaying  [ARM82:Speed] Opt MNNPackC8FP16 and MNNUnpackC8FP16 by asm
        e0fc77dcf  xiaying  [MNN:Speed] Fix bug for DeconvolutionWithStride for C4HW4, open it
        584bec578  xiaying  [MNN:Bugfix] Fix bug for format set error for onnx
        d5bd4148d  xiaying  [MNN:Bugfix] Fix bug for format set error for onnx
        b00265841  xiaying  [MNN:Bugfix] Fix bug for SparseConvolutionTiledExecutor
        bb09188ac  xiaying  [Test:Bugfix] Fix bug for run into sparse auto
        426d1babd  xiaying  [MNN:Refractor] Small bugfix for Group convolution and pack
        7d0ea1c46  tianbu.xsw  [testModel Feature] support testModel.out input resize
        4169c54ce  xiaying  [MNN:Bugfix] Fix bug for checkNAN for origin
        412a82222  xiaying  [Test:Bugfix] Fix bug for CheckNAN's error of matmul
        319b1d425  xiaying  [MNN:Bugfix] Fix bug for multi-batch for ConvInt8
        050b728a6  xiaying  [Test:Bugfix] Use NCHW for ConvInt8Test
        7db3423a1  xiaying  [OpenCL:Bugfix] Fix bug for opencl::image,opencl::buffer for C4HW4
        adcec6a7f  xiaying  [Vulkan:Bugfix] Fix bug for invalid tensor size limit
        d2a7cf4e9  xiaying  [Vulkan:Bugfix] Fix bug for onCopyBuffer of nc4hw4
        557bebdd3  xiaying  [MNN:Bugfix] Fix bug for BF16-ARM32
        bbe186649  tianbu.xsw  [Update AUTO mode]: fix MNN_FORWARD_AUTO choose priority
        6deb23439  xiaying  [MNN:Bugfix] Fix bug for GeometryBinary don't care about NC4HW4 same size
        b137590e4  xiaying  [MNN:Bugfix] Fix bug for GeometryBinary don't care about NC4HW4 same size
        7003558ea  xiaying  [Converter:Bugfix] Fix bug for onnx pad for several cases
        b5f8cae5a  xiaying  [Converter:Bugfix] Fix bug for onnx pad for several cases
        29b09e125  xiaying  [MNN:Bugfix] Fix bug for arm64-bf16
        42ce00770  xiaying  [MNN:Bugfix] Fix bug for ARM64 - float
        a2d89fc18  雁行  [Converter:Feature] Support Binary Unary for Torch.
        7f1c0deb1  xiaying  [MNN:Bugfix] Fix bug for Raster for Int8
        8335a6f18  tianbu.xsw  [OpenCL Shared Memory] modify data_format method
        b359e031b  xiaying  [ARM82:Bugfix] Fix bug for arm82 and speed up pack / unpack c8
        24bf3fc88  雁行  [Convert:Feature] Support LayerNormFuse without gamma beta.
        3e629624b  xiaying  [MNN:Bugfix] Fix bug for float - armv7a
        2b7908ec7  tianbu.xsw  modify workItemSize
        3cee0d413  xiaying  [MNN:Bugfix] test wrong clear
        9cbbfb998  xiaying  [MNN:Bugfix] fix compile bug for c++ < 14
        2d7a44484  xiaying  [MNN:Bugfix] fix compile bug for c++ < 14
        eb7d0cb53  xiaying  [Test:Bugfix] Don't test for NC4HW4 directly
        7b40ca8d1  xiaying  [MNN:Bugfix] Fix bug for ConvolutionGroup
        2694d8a91  xiaying  [MNN:Bugfix] Fix bug for CPUGridSample
        f89af60f6  xiaying  [MNN:Bugfix] Fix compile bug for arm
        a151abcdd  xiaying  [MNN:Bugfix] Fix bug for convert for int8 / int16
        b254dbe61  雁行  [MNN:Bugfix] Bugfix for Conv onClone.
        d08150631  xiaying  [MNN:Bugfix] Fix bug for fast rcnn
        e5568a0df  xiaying  [MNN:Bugfix] Fix bug for CPURaster treat NC4HW4 fast blit
        128318933  雁行  [Raster:Bugfix] bugfix for Raster merge onResize.
        03caacbea  xiaying  [MNN:Bugfix] fix bug for CPUDeconvolution and Convolution1x1Strassen for iw != ow
        e1e3c245c  xiaying  [MNN:Bugfix] Fix bug for ConvolutionWinograd
        2524cbc6d  xiaying  [MNN:Bugfix] Fix bug for CPUSoftmax
        44ec79b8f  xiaying  [MNN:Bugfix] Fix bug for CPUConvolutionDepthwise / Scale / DeconvolutionDW
        21ae956ce  xiaying  [MNN:Bugfix] Fix bug for Multi-Batch-TiledExecutor
        09a5069c7  xiaying  [MNN:Speed] Add offset for src and dst
        6776c6784  xiaying  [MNN:Bugfix] Fix bug for trainable model
        cc83ae30b  xiaying  [MNN:Bugfix] Fix bug for trainable model
2021-07-29 11:47:13 +08:00
hush-alibaba 58545d6ca1 Synchronize internal github for version 1.2.0 (#1518) 2021-06-11 17:17:13 +08:00
tianhang.yth d85952d826 sync from internal repo 2021-04-28 18:02:10 +08:00
xiaying 5947b90a03 [PATCH 30/36] [MNN:Refractor] Move NN to train folder 2021-04-16 14:29:38 +08:00
xiaying d91fc63976 [MNN:Sync] Sync internal Gitlab 2021-04-08 15:34:23 +08:00
Joker 21127cb907 improvement(HiAI): update cmake of HiAI backend to support it when MNN_SEP_BUILD=true 2021-03-11 16:36:22 +08:00
jxt1234 4baf6b1ecf Merge pull request #1370 from WillTao-RD/master
avoid build warning if MNN_SEP_BUILD is OFF
2021-02-23 11:08:35 +08:00
xiaying 5e127496fc Sync Internal Github 2021-02-07 10:47:03 +08:00
taowei 3d1cf0c3c4 avoid build warning if MNN_SEP_BUILD is OFF 2021-02-04 10:24:34 +08:00
xiaying aad7b7aed1 [MNN:Sync] Sync internal Gitlab 2021-01-08 14:36:59 +08:00
xiaying 2d1b129121 [MNN:Sync] Sync internal git 2021-01-06 16:29:37 +08:00
xiaying 644eadbdb0 [PATCH 338/350] [MNN:Bugfix] Remove GenVCS because several systems can't execute it 2021-01-06 15:57:22 +08:00
xiaying 0fe2b0dfee [PATCH 278/350] [MNN:Speed] Support OneDNN for MNN Convolution 2021-01-06 15:57:17 +08:00
xiaying acbdeaa60b [PATCH 195/350] [CV:Bugfix] Fix bug for CMakeLists 2021-01-06 15:57:10 +08:00
xiaying 0061c6a454 [PATCH 172/350] [CV:Bugfix] Fix compile optimizer error in sse machine for cv 2021-01-06 15:57:08 +08:00
Hui Shu ab711d484c Synchronize internal master to Github 2020-12-15 14:12:35 +08:00
riddick 7cc4ddc585 feat(backend): add M1 chip support
- add cpu backend support for M1 chip
- add arm82 backend support for M1 chip
2020-11-24 22:48:48 +08:00
Hui Shu d6795ad031 Github release 1.1.0 2020-11-05 16:49:17 +08:00
xiaying 0a95efdf66 [MNN:Bugfix] Fix cv compile bug in windows 2020-07-04 09:29:49 +08:00
xiaying 255db932eb [MNN:Sync] Sync Internal Github 2020-07-04 01:21:30 +08:00
tianbu.xsw d4a1814cd2 add MNN_OPENCL_LWS_TUNE macro 2020-07-04 01:06:26 +08:00
root 57ab91b9e7 Fix compile bug of use sse for cv in linux 2020-07-04 01:06:20 +08:00
xiaying 5fc7acd37e support transpose, fix bug for not align 2020-07-04 01:06:18 +08:00
Evgeny Proydakov 5f52b9bfc8 Fixed openmp android build. 2020-05-22 22:54:27 +03:00
Evgeny Proydakov 5f673ddcae Fixed Travis CI linux vulkan build.
As I see libMNN uses dl library. I updated target_link_libraries.

./ciscripts/Linux/CL_ThreadPool_Vulkan.sh

/usr/bin/ld: libMNN.so: undefined reference to `dlopen'
/usr/bin/ld: libMNN.so: undefined reference to `dlclose'
/usr/bin/ld: libMNN.so: undefined reference to `dlsym'

After the change, everything builds without problems.

[100%] Linking CXX executable ../../runTrainDemo.out
[100%] Built target runTrainDemo.out
2020-05-19 23:31:26 +03:00
玄裳 0df31a8667 MNN 1.0.0 release sync.
- Added Python Express API implemented with pybind11
- Added demos for Python Express API
- Performance improvements for ARM64, ARMv8.2, x86.
- README update.
2020-05-07 18:22:11 +08:00
和彬 00b8d31fbf remove unnecessary iostream, add comment in cmake 2020-04-29 10:00:52 +08:00
hebin eff988f719 windows support evaluation and train tools 2020-04-29 10:00:50 +08:00
xiaying 74cdc8963e Remove unuseful change 2020-04-15 15:34:57 +08:00
xiaying d42340d6a0 Fix bug for sep_build judge before shared libs 2020-04-15 15:34:57 +08:00
xiaying 6256c3a2bf For static library set sep_build as false 2020-04-15 15:34:57 +08:00
xiaying be57ec2863 Fix bug for compile static library for android 2020-04-15 15:34:57 +08:00
xiaying a76be60722 [MNN:Sync] Fix compile bug for windows, fix bug for device not support fma
2020-04-14 22:52:24 +08:00
xiaying 5e5902240c [MNN:Sync] Add MNN-Plugin, fix several bugs 2020-04-14 21:43:02 +08:00
xiaying d01b041bb0 [PATCH 12/17] [MNN:Refractor] Remove schema build from cmakelists 2020-04-13 13:08:05 +08:00
xiaying 6d6a51e52e [PATCH 10/17] [MNN:Feature] Support not build tools 2020-04-13 13:08:05 +08:00
Just Test fb8075fddf [PATCH 16/28] undo some change 2020-03-31 11:12:39 +08:00
Just Test ce7dd512b1 [PATCH 15/28] flatbuffer fix, unicode model filepath support, TensorUtils.hpp macro conflict fix, cmake cache option force update 2020-03-31 11:12:39 +08:00
xiaying 48c92a41e7 [MNN:Sync] Sync internal git for remain patch 2020-03-22 20:33:03 +08:00
誉阳 9bf8aebe93 [PATCH 159/160] fix build static library on Mac bug 2020-03-22 19:02:15 +08:00
誉阳 4fde4e5ac8 [PATCH 144/160] fix compile error 2020-03-22 19:02:13 +08:00
誉阳 e69626b08f [PATCH 142/160] fix build static library bug 2020-03-22 19:02:13 +08:00
xiaying 46c11b3128 [PATCH 060/160] [Vulkan:Feature] Support use system lib for vulkan 2020-03-22 19:02:05 +08:00
xiaying 32aea32cd1 [PATCH 23/24] [MNN:Bugfix] Fix cmake setting bug for linux 2020-03-06 11:04:54 +08:00
海境 90e06944db Update 2020-02-26 09:57:17 +08:00
海境 ed8b9f2a23 Sync Internal CMake changes 2020-01-17 12:06:45 +08:00
海境 4d6c19f121 Clean 2020-01-16 17:49:15 +08:00
Zhang 91b5ade49a Sync. Fix OpenGL related building issues. Build the whole suite on Android CI (#580)
* Sync code with latest internal version

* Update CMake

* Fix logging issues

* Fix OpenGL Building

* Bump CMakeLists version. Update Podspec

* Update MetalLib Lookup logic

* Fix Windows Build
2020-01-16 16:55:46 +08:00
Zhang c95892de87 Fix OpenCL linking logic when MNN_BUILD_SHARED_LIBS=OFF 2020-01-08 11:34:27 +08:00
Zhang 2e3d5a318b Fix Windows CI / Enhance iOS Build Script (#568)
* Add back MNN_BUILD_HARD option

* Add simulator to buildiOS.sh
2020-01-07 18:47:22 +08:00
Zhang f59c5335ba Fix win ci (#566)
* Bump version of Podspec and fixing header includes

* Fix Windows CI Configuration
2020-01-06 15:46:23 +08:00
海境 37f34c9d90 Fix Android Demo 2020-01-06 13:56:45 +08:00
海境 1803a4e025 Add per-platform CI status flag 2020-01-06 10:52:22 +08:00
海境 f4e28567d3 Bump version. Add issue template 2020-01-03 14:54:26 +08:00
海境 5af265939b Wrap SSE/AVX Flags in platform detection code 2020-01-03 11:07:05 +08:00
Zhang 859c3e241d Fix Unix Environment detection and OpenCL Linking (#554)
* Close #551

* Fix Linux Detection
2020-01-02 12:01:24 +08:00
Zhang 002ac367e4 Update 2019-12-27 22:16:57 +08:00
liqing e93e8dcbe8 0.2.1.5
# integration
- add travis CI
- fix building parameters for python

# converter
- add half storage option for MNN converter
- fix op name lost in converter
- fix converter bug for print input output, identity remove output

# ops
- add quantized Convolution & Deconvolution support on OpenCL
- add more expression supports
- add DetectionPostProcess Op for TensorFlow Lite (ssd is supported directly now)
- add supports for LSTM & ELU for ONNX
- add support for Convolution whose weights are not constant for ONNX
- fix Unary Op compile error on Linux
- fix Metal backend buffer reuse after resize
- fix Metal raw memory access after model releasing
- fix redundant transpose in Winograd generator
2019-11-15 14:22:45 +08:00
Naville 879f993978 Provide placeholder values for both Android NATIVE_* variable 2019-11-13 16:03:26 +08:00
海境 133fddc017 Close #436 #449 2019-11-13 15:47:08 +08:00
Zhang a005e9c46d Add Travis config for Android/iOS/macOS/Linux (#453) 2019-11-13 11:44:43 +08:00
liqing d6b00d04f4 - build:
	- unify schema building in core and converter;
	- add more build script for android;
	- add linux build script for python;

- ops impl:
	- add floor mod support in binary;
	- use eltwise impl in add/max/sub/mul binary for optimization;
	- remove fake double support in cast;
	- fix 5d support for concat;
	- add adjX and adjY support for batch matmul;
	- optimize conv2d back prop filter;
	- add pad mode support for conv3d;
	- fix bug in conv2d & conv depthwise with very small feature map;
	- optimize binary without broadcast;
	- add data types support for gather;
	- add gather ND support;
	- use uint8 data type in gather v2;
	- add transpose support for matmul;
	- add matrix band part;
	- add dim != 4 support for padding, reshape & tensor convert;
	- add pad type support for pool3d;
	- make ops based on TensorFlow Lite quantization optional;
	- add all & any support for reduction;
	- use type in parameter as output type in reduction;
	- add int support for unary;
	- add variable weight support for conv2d;
	- fix conv2d depthwise weights initialization;
	- fix type support for transpose;
	- fix grad outputs count for  reduce grad and reshape grad;
	- fix priorbox & detection output;
	- fix metal softmax error;

- python:
	- add runSessionWithCallBackInfo interface;
	- add max nodes limit (1400) for visualization tool;
	- fix save error in python3;
	- align default dim;

- convert:
	- add extra design for optimization;
	- add more post converting optimizers;
	- add caffe v1 weights blob support;
	- add cast, unary, conv transpose support for onnx model;
	- optimize batchnorm, conv with variable weights, prelu, reshape, slice, upsample for onnx model;
	- add cos/sin/atan/tan support for unary for tensorflow model;
	- add any/all support for reduction for tensorflow model;
	- add elu, conv3d, pool3d support for tensorflow model;
	- optimize argmax, batchnorm, concat, batch to space, conv with variable weights, prelu, slice for tensorflow model;

- others:
	- fix size computer lock;
	- fix thread pool deadlock;
	- add express & parameters in express;
	- rewrite blitter chooser without static map;
	- add tests for expr;
2019-10-29 13:37:26 +08:00
liqing 73ad3413cc - dynamic computation graph (beta)
	- add supports (/express)
	- add tests
	- add benchmarks with it (/benchmark/exprModels)
- Python
	- MNN engine and tools were submitted to pip
	- available on Windows/macOS/Linux
- Engine/Converter
	- add supports for each op benchmarking
	- refactor optimizer by separating steps
- CPU
	- add supports for Conv3D, Pool3D, ELU, ReverseSequence
	- fix ArgMax, Permute, Scale, BinaryOp, Slice, SliceTf
- OpenCL
	- add half transform in CPU
	- add broadcast supports for binary
	- optimize Conv2D, Reshape, Eltwise, Gemm, etc.
- OpenGL
	- add sub, real div supports for binary
	- add supports for unary
	- optimize Conv2D, Reshape
- Vulkan
	- add max supports for eltwise
- Metal
	- fix metallib missing problem
- Train/Quantization
	- use express to refactor training codes
2019-09-26 21:02:07 +08:00
liqing 487a0fbd0a beta 0.2.0.9
- fix quantization tool compiling on Windows
- fix converter compiling on Windows
- fix eltwise optimization on Windows
- separate sse & avx for Windows
- add LeakyReLU support for TensorFlow
- fix reshape, const for TensorFlow
- fix dimension format error for ONNX ops
- optimize winograd, ReLU for OpenCL
- add fp16 availability & dimensions size check-up for OpenCL
- optimize GEMM for arm32
- fix ExpandDims shape calculation when inputs size == 1
2019-09-01 19:25:26 +08:00
liqing f085106da9 release 0.2.0.6
- fix bugs in quantization
- add evaluating tool for quantization
- add ADMM support in quantization
- fix lock in thread pool
- fix fusing for deconv
- fix reshape converting from ONNX to MNN
- turn off blob size checking by default
2019-08-07 16:44:09 +08:00
liqing 7bb0df92dc beta 0.2.0.5
- CPU
	- add support for DepthToSpace & SpaceToDepth ops
- OpenGL
	- add Android demo
	- add half / float runtime option
	- add support for ROIPooling, Squeeze
	- fix bugs in conv im2col
- OpenCL
	- fix Concat, Eltwise, Reshape bugs
- Tools
	- add KL threshold method in quantization tool
	- support optimization for graph with multiple rnn
2019-07-25 13:36:35 +08:00
如幻 732ba68b19 beta 0.2.0.4
- bug fix for quantization tool
- bug fix/performance update for thread pool
- bug fix for converters
- tutorial/doc update
- more op support
2019-07-19 17:36:12 +08:00
liqing a367406308 beta 0.2.0.3
- add quantization tool & cpu impl & demo/exec
- add thread pool
- add tests
- fix onnx converter tensor name mismatch
- optimize cpu performance with SSE for windows
2019-07-11 13:56:52 +08:00
liqing 6a4213f7dc beta 0.2.0.0
- replace FreeImage with stb_image
- warn unicode error in Windows compiling
- separate clang/gcc build script for android
- add default values in fbs
- optimize CPU conv / conv depthwise / deconv / deconv depthwise / lstm / sigmoid
- add sub support in eltwise
- add reciprocal / log1p / log in unary
- add zero like / select / set diff 1d
- add batch support for permute
- add training codes
- fix metal error in dynamic separate storage type handling
2019-06-17 20:10:35 +08:00
liqing ff405a3078 beta 0.1.1.6
- add support for windows
- fix bugs in converting dropout
- fix bugs in post treat
2019-06-10 21:08:55 +08:00
liqing 28a6f1a614 beta 0.1.1.2
- fix register typo
- add input count in conv model
- add 5x5 winograd convolution for OpenCL
- edit condition to use half in OpenCL
- upgrade build.gradle
2019-05-14 19:54:21 +08:00
daquexian 4c91524112 CMAKE_SOURCE_DIR -> CMAKE_CURRENT_SOURCE_DIR 2019-05-10 13:49:20 +08:00
liqing 9bfb58a3ec fix backend/op/sizer register 2019-05-09 19:39:33 +08:00
liqing f2d6e8fe2d beta 0.1.1.1
- use code generate for op/backend/sizer register
- add pose demo
- fix docs & script
- improve cpu softmax performance 80%
- improve converter ops fuse
2019-05-08 15:44:57 +08:00
daquexian 3a4a58ccf7 Use bash instead of /bin/sh 2019-05-08 11:53:55 +08:00
liqing 2a2fa36c47 add -lazy option for generate.sh, and use it in CMakeList. 2019-05-07 16:55:36 +08:00
liqing 6566533d2d generate schema automatically before build MNN. 2019-05-07 16:12:43 +08:00
liqing 07e28c80d3 beta 0.1.1
- update resources and docs
- unite tensor's width/height/channel/batch getter
- optimize several ops
- fix compile warnings and errors on Ubuntu
- some other bug fixes
2019-05-05 20:27:57 +08:00
liqing 5551108af8 beta 0.1.0 2019-04-19 20:50:09 +08:00