Commit Graph

14 Commits

Author SHA1 Message Date
tianbu.xsw cf97427595 set queue_count to zero for every forward 2020-07-04 01:06:26 +08:00
tianbu.xsw 2f93946c79 disable CL_QUEUE_PROFILING_ENABLE onExecute 2020-07-04 01:06:25 +08:00
tianbu.xsw eb5bef714b Performance OPT OPENCL 2020-07-04 01:06:24 +08:00
xiaying 46c11b3128 [PATCH 060/160] [Vulkan:Feature] Support using system lib for Vulkan 2020-03-22 19:02:05 +08:00
xiaying d4aa6687eb [PATCH 047/160] [OpenCL:Feature] Add a few unary ops 2020-03-22 19:02:04 +08:00
xiaying 499bdc67a0 [PATCH 040/160] [Test:Feature] Support test for opencl backend 2020-03-22 19:02:03 +08:00
xiaying 7831c4516b [PATCH 039/160] [OpenCL:Bugfix] Fix crash when op->name is empty 2020-03-22 19:02:03 +08:00
Zhang 91b5ade49a Sync. Fix OpenGL related building issues. Build the whole suite on Android CI (#580)
* Sync code with latest internal version
* Update CMake
* Fix logging issues
* Fix OpenGL Building
* Bump CMakeLists version. Update Podspec
* Update MetalLib Lookup logic
* Fix Windows Build
2020-01-16 16:55:46 +08:00
Zhang 002ac367e4 Update 2019-12-27 22:16:57 +08:00
liqing e93e8dcbe8 0.2.1.5
# integration
- add travis CI
- fix building parameters for python

# converter
- add half storage option for MNN converter
- fix op name lost in converter
- fix converter bug for print input output, identity remove output

# ops
- add quantized Convolution & Deconvolution support on OpenCL
- add more expression support
- add DetectionPostProcess Op for TensorFlow Lite (ssd is supported directly now)
- add support for LSTM & ELU for ONNX
- add support for Convolution whose weights are not constant for ONNX
- fix Unary Op compile error on Linux
- fix Metal backend buffer reuse after resize
- fix Metal raw memory access after model releasing
- fix redundant transpose in Winograd generator
2019-11-15 14:22:45 +08:00
liqing 73ad3413cc - dynamic computation graph (beta)
- add supports (/express)
	- add tests
	- add benchmarks with it (/benchmark/exprModels)
- Python
	- MNN engine and tools were submitted to pip
	- available on Windows/macOS/Linux
- Engine/Converter
	- add support for benchmarking each op
	- refactor optimizer by separating steps
- CPU
	- add support for Conv3D, Pool3D, ELU, ReverseSequence
	- fix ArgMax, Permute, Scale, BinaryOp, Slice, SliceTf
- OpenCL
	- add half transform in CPU
	- add broadcast support for binary
	- optimize Conv2D, Reshape, Eltwise, Gemm, etc.
- OpenGL
	- add sub, real div support for binary
	- add support for unary
	- optimize Conv2D, Reshape
- Vulkan
	- add max support for eltwise
- Metal
	- fix metallib missing problem
- Train/Quantization
	- use express to refactor training codes
2019-09-26 21:02:07 +08:00
liqing 1005c138e1 beta 0.2.0.7
- move docs to http://www.yuque.com/mnn
- fix bugs for CPU ops TopKV2 and quantized convolution
- add error handling for enqueue map buffer on OpenCL
- add nullptr protection for extra tensor desc
- add failure protection for memory acquisition
- fix slice shape calculation
- refactor binary shape calculation
2019-08-15 17:30:39 +08:00
liqing db155b4d1d beta 0.2.0.2
- CPU
  - add padding support
  - fix bug in permute when channel % 4 != 0
  - fix bug in exp with extreme value
- OpenCL
  - add protective logic
- OpenGL
  - add protective logic
  - support NCHW format in Squeeze and Reshape
- Converter
  - add ShuffleChannel support for Caffe
  - add Clip/Transpose/Unary/Pad support for ONNX
2019-07-02 18:01:08 +08:00
liqing 5551108af8 beta 0.1.0 2019-04-19 20:50:09 +08:00