Commit Graph

28 Commits

Author SHA1 Message Date
hebin c0cb82d9ab [PATCH 22/28] [MNN:Speed] 8x8 Gemm and cache prefetch optimize 2020-03-31 11:12:40 +08:00
xiaying f1cc216846 [PATCH 13/28] [MNN:Speed] Fix bug for winograd cost compute 2020-03-31 11:12:39 +08:00
xiaying 48c92a41e7 [MNN:Sync] Sync internal git for remain patch 2020-03-22 20:33:03 +08:00
xiaying 010913aaaa [PATCH 070/160] [MNN:Refract] Seperate the load of quan and half to ConvolutionCommon 2020-03-22 19:02:06 +08:00
海境 90e06944db
Update 2020-02-26 09:57:17 +08:00
Zhang 91b5ade49a
Sync. Fix OpenGL related building issues. Build the whole suite on Android CI (#580)
* Sync code with latest internal version

* Update CMake

* Fix logging issues

* Fix OpenGL Building

* Bump CMakeLists version. Update Podspec

* Update MetalLib Lookup logic

* Fix Windows Build
2020-01-16 16:55:46 +08:00
Zhang 002ac367e4
Update 2019-12-27 22:16:57 +08:00
liqing e93e8dcbe8 0.2.1.5
# integration
- add travis CI
- fix building parameters for python

# converter
- add half storage option for MNN converter
- fix op name lost in converter
- fix converter bug for print input output, identity remove output

# ops
- add quantized Convolution & Deconvolution support on OpenCL
- add more expression supports
- add DetectionPostProcess Op for TensorFlow Lite (ssd is supported directly now)
- add supports for LSTM & ELU for ONNX
- add support for Convolution that weights is not constant for ONNX
- fix Unary Op compile error on Linux
- fix Metal backend buffer reuse after resize
- fix Metal raw memory access after model releasing
- fix redundant transpose in Winograd generater
2019-11-15 14:22:45 +08:00
liqing d6b00d04f4 - build:
- unify schema building in core and converter;
	- add more build script for android;
	- add linux build script for python;

- ops impl:
	- add floor mod support in binary;
	- use eltwise impl in add/max/sub/mul binary for optimization;
	- remove fake double support in cast;
	- fix 5d support for concat;
	- add adjX and adjY support for batch matmul;
	- optimize conv2d back prop filter;
	- add pad mode support for conv3d;
	- fix bug in conv2d & conv depthwise with very small feature map;
	- optimize binary without broacast;
	- add data types support for gather;
	- add gather ND support;
	- use uint8 data type in gather v2;
	- add transpose support for matmul;
	- add matrix band part;
	- add dim != 4 support for padding, reshape & tensor convert;
	- add pad type support for pool3d;
	- make ops based on TensorFlow Lite quantization optional;
	- add all & any support for reduction;
	- use type in parameter as output type in reduction;
	- add int support for unary;
	- add variable weight support for conv2d;
	- fix conv2d depthwise weights initialization;
	- fix type support for transpose;
	- fix grad outputs count for  reduce grad and reshape grad;
	- fix priorbox & detection output;
	- fix metal softmax error;

- python:
	- add runSessionWithCallBackInfo interface;
	- add max nodes limit (1400) for visualization tool;
	- fix save error in python3;
	- align default dim;

- convert:
	- add extra design for optimization;
	- add more post converting optimizers;
	- add caffe v1 weights blob support;
	- add cast, unary, conv transpose support for onnx model;
	- optimize batchnorm, conv with variable weights, prelu, reshape, slice, upsample for onnx model;
	- add cos/sin/atan/tan support for unary for tensorflow model;
	- add any/all support for reduction for tensorflow model;
	- add elu, conv3d, pool3d support for tensorflow model;
	- optimize argmax, batchnorm, concat, batch to space, conv with variable weights, prelu, slice for tensorflow model;

- others:
	- fix size computer lock;
	- fix thread pool deadlock;
	- add express & parameters in express;
	- rewrite blitter chooser without static map;
	- add tests for expr;
2019-10-29 13:37:26 +08:00
liqing 73ad3413cc - dynamic computation graph (beta)
- add supports (/express)
	- add tests
	- add benchmarks with it (/benchmark/exprModels)
- Python
	- MNN engine and tools were submitted to pip
	- available on Windows/macOS/Linux
- Engine/Converter
	- add supports for each op benchmarking
	- refactor optimizer by separating steps
- CPU
	- add supports for Conv3D, Pool3D, ELU, ReverseSequence
	- fix ArgMax, Permute, Scale, BinaryOp, Slice, SliceTf
- OpenCL
	- add half transform in CPU
	- add broadcast supports for binary
	- optimize Conv2D, Reshape, Eltwise, Gemm, etc.
- OpenGL
	- add sub, real div supports for binary
	- add supports for unary
	- optimize Conv2D, Reshape
- Vulkan
	- add max supports for eltwise
- Metal
	- fix metallib missing problem
- Train/Quantization
	- use express to refactor training codes
2019-09-26 21:02:07 +08:00
liqing b995b25db9 beta 0.2.0.8
- add NaN check-up
- add quantification support for ScaleAdd Op
- add binary to eltwise optimization
- add console logs for quantization tool
- better document for quantization tool
- replace redundant dimension flags with dimension format
- optimize performance of TensorFlow Lite Quantized Convolution
- fix axis support for ONNX softmax
- fix get performance compile error on Windows
2019-08-22 20:13:46 +08:00
liqing 1005c138e1 beta 0.2.0.7
- move docs to http://www.yuque.com/mnn
- fix bugs for CPU ops TopKV2 and quantized convolution
- add enqueue map buffer error handle for OpenCL
- add nullptr protection for extra tensor desc
- add failure protection for memory acquirement
- fix slice shape calculation
- refactor binary shape calculation
2019-08-15 17:30:39 +08:00
liqing 88396124ff add nullptr protection for quantized conv 2019-08-08 14:41:38 +08:00
如幻 732ba68b19 beta 0.2.0.4
- bug fix for quantization tool
    - bug fix/performance update for thread pool
    - bug fix for converters
    - tutorial/doc update
    - more op support
2019-07-19 17:36:12 +08:00
jxt1234 94cf471eb8
Merge pull request #155 from yisongsong/dev/yisongsong/test_quantize
Fix: bug in int8 convolution when dilateX is not equal to dilateY
2019-07-15 14:29:29 +08:00
yisongsong d3d12cd475 Fix: bug in int8 convolution when dilateX is not equal to dilateY 2019-07-11 14:09:06 +08:00
liqing a367406308 beta 0.2.0.3
- add quantization tool & cpu impl & demo/exec
- add thread pool
- add tests
- fix onnx converter tensor name mismatch
- optimize cpu performance with SSE for windows
2019-07-11 13:56:52 +08:00
liqing db155b4d1d beta 0.2.0.2
- CPU
  - add padding support
  - fix bug in permute when channel % 4 != 0
  - fix bug in exp with extreme value
- OpenCL
  - add protecting logics
- OpenGL
  - add protecting logics
  - support NCHW format in Squeeze and Reshape
- Converter
  - add ShuffleChannel support for Caffe
  - add Clip/Transpose/Unary/Pad supports for ONNX
2019-07-02 18:01:08 +08:00
liqing ad759ebfae beta 0.2.0.1
- support both armv7/arm64 in podspec (pod version >= 1.5.0 required)
- refactor neg axis support
- fix memory overlap in de-conv
- fix CONVOLUTION_TILED_NUMBER spell error
- fix few warnings
- add binary / interp / permute / relu / reshape / softmax support and optimize conv for OpenGL backend
- add clean in nmake build script
2019-06-24 11:32:41 +08:00
liqing 6a4213f7dc beta 0.2.0.0
- replace FreeImage with stb_image
- warn unicode error in Windows compiling
- separate clang/gcc build script for android
- add default values in fbs
- optimize CPU conv / conv depthwise / deconv / deconv depthwise / lstm / sigmoid
- add sub support in eltwise
- add reciprocal / log1p / log in unary
- add zero like / select / set diff 1d
- add batch support for permute
- add training codes
- fix metal error in dynamic separate storage type handling
2019-06-17 20:10:35 +08:00
liqing ff405a3078 beta 0.1.1.6
- add support for windows
- fix bugs in converting dropout
- fix bugs in post treat
2019-06-10 21:08:55 +08:00
liqing 0b085d6ffd beta 0.1.1.5
- cpu & gpu
	- add ceil mode in pool
	- fix softmax with neg axis
- cpu
	- add unsqueeze op
	- optimize lstm
- gpu
	- add 5x5 winograd in metal
	- add batch support for winograd in opencl
- onnx
	- add concat / gather / shape / squeeze / unsqueeze
	- fix data type support in constant
2019-06-05 10:45:59 +08:00
liqing 934d9bae07 beta 0.1.1.4
- refactor OpenGL, test relative codes
- optimize LRN
- fix doc link error
2019-05-24 11:26:54 +08:00
liqing 2fc8a201de beta 0.1.1.3
- fix benchmark script for older version adb
- add FAQ.md
- add environment requirement in Install.md
- add coeff in Eltwise Op
- fix bugs in strassen 1x1 data preparation
- add download failure process in get_model.sh
2019-05-17 14:59:57 +08:00
liqing 3cec8aa4d9 fix CPU pool & Metal scale op bugs 2019-05-07 16:16:34 +08:00
liqing 0bbdb6ac56 remove usages of GNU unused attribute 2019-05-07 16:15:16 +08:00
liqing 07e28c80d3 beta 0.1.1
- update resources and docs
  - unite tensor's width/height/channel/batch getter
  - optimize several ops
  - fix compile warnings and errors on Ubantu
  - some other bug fixes
2019-05-05 20:27:57 +08:00
liqing 5551108af8 beta 0.1.0 2019-04-19 20:50:09 +08:00