Commit Graph

31 Commits

Author SHA1 Message Date
zhaode.wzd 55a59a7ebc [MNN:Sync] Sync Internal reranker, gpt-oss. 2025-08-08 12:24:23 +08:00
xiaying db0f559f9d MNN:Sync: Sync Internal 3.2.2 2025-07-23 14:33:57 +08:00
zhaode.wzd bd36a3f749 [MNN:Sync] Sync internal:
1. SmolVLM, FastVLM support.
2. QNN backend init.
3. Qwen3 MoE support.
4. Speculative decoding init.
5. Some bugfix.
2025-05-23 15:24:18 +08:00
xiaying 766815282f MNN:Sync: Sync Internal 3.0.4 2025-01-22 16:28:36 +08:00
xiaying 65ec0ea406 MNN:Sync: Fix bug for llama2/llama3 attention fuse, refactor llm usage 2024-06-15 15:39:59 +08:00
xiaying 5895243607 [MNN:Sync] Sync Internal 2.8.4 2024-04-19 11:58:21 +08:00
zhaode.wzd 2972fe71dc [MNN:Sync] Sync Internal 2.8.3 2024-03-13 14:55:54 +08:00
zhaode.wzd 387775be2a [MNN:Sync] Sync Internal 2.8.0 2023-12-04 11:19:10 +08:00
xiaying 3ff49cbf4a [MNN:Sync] Sync Internal 2.7.2 2023-10-18 10:31:02 +08:00
xiaying ea4f13d3cf [MNN:Sync] Sync Internal 2.7.0 2023-09-04 10:42:11 +08:00
zhaode.wzd 67eceb8abb [MNN:Sync] Sync Internal code, support low_memory for conv. 2023-06-27 10:33:16 +08:00
xiaying 69dba73dc7 [MNN:Sync] Sync internal gitlab
Main Feature:
1. Add OpenCV API and Numpy API Support
2. Protobuf move into MNN
3. Add more op for torchscript convert
4. Add recompute to speed up geometry compute
5. Add ModuleBasic Test
2021-11-30 10:10:53 +08:00
hush-alibaba 58545d6ca1 Synchronize internal github for version 1.2.0 (#1518) 2021-06-11 17:17:13 +08:00
xiaying d91fc63976 [MNN:Sync] Sync internal Gitlab 2021-04-08 15:34:23 +08:00
xiaying aedc8f6a68 [PATCH 341/350] Add avx512 patch 2021-01-06 15:57:22 +08:00
Hui Shu d6795ad031 Github release 1.1.0 2020-11-05 16:49:17 +08:00
xiaying 255db932eb [MNN:Sync] Sync Internal Github 2020-07-04 01:21:30 +08:00
xiaying a750fe0956 Rename _AVX_MNNGemm16x6 as _AVX_MNNPackedMatMul 2020-07-04 01:06:18 +08:00
xiaying d13f1bc0b6 Optimize Strassen Merge C Function for x86 2020-07-04 01:06:18 +08:00
xiaying ae91cab1b8 Support Strassen for new matmul 2020-07-04 01:06:18 +08:00
和彬 1821f9bd46 gemm common and onr optimize 2020-04-15 15:34:57 +08:00
和彬 067c3e35ca [PATCH 24/28] rename MNNGemmFloatUnit -> MNNGemmFloatUnit_4 2020-03-31 11:12:40 +08:00
hebin c0cb82d9ab [PATCH 22/28] [MNN:Speed] 8x8 Gemm and cache prefetch optimize 2020-03-31 11:12:40 +08:00
海境 90e06944db Update 2020-02-26 09:57:17 +08:00
Zhang 002ac367e4 Update 2019-12-27 22:16:57 +08:00
liqing d6b00d04f4 - build:
	- unify schema building in core and converter;
	- add more build script for android;
	- add linux build script for python;

- ops impl:
	- add floor mod support in binary;
	- use eltwise impl in add/max/sub/mul binary for optimization;
	- remove fake double support in cast;
	- fix 5d support for concat;
	- add adjX and adjY support for batch matmul;
	- optimize conv2d back prop filter;
	- add pad mode support for conv3d;
	- fix bug in conv2d & conv depthwise with very small feature map;
	- optimize binary without broadcast;
	- add data types support for gather;
	- add gather ND support;
	- use uint8 data type in gather v2;
	- add transpose support for matmul;
	- add matrix band part;
	- add dim != 4 support for padding, reshape & tensor convert;
	- add pad type support for pool3d;
	- make ops based on TensorFlow Lite quantization optional;
	- add all & any support for reduction;
	- use type in parameter as output type in reduction;
	- add int support for unary;
	- add variable weight support for conv2d;
	- fix conv2d depthwise weights initialization;
	- fix type support for transpose;
	- fix grad outputs count for reduce grad and reshape grad;
	- fix priorbox & detection output;
	- fix metal softmax error;

- python:
	- add runSessionWithCallBackInfo interface;
	- add max nodes limit (1400) for visualization tool;
	- fix save error in python3;
	- align default dim;

- convert:
	- add extra design for optimization;
	- add more post converting optimizers;
	- add caffe v1 weights blob support;
	- add cast, unary, conv transpose support for onnx model;
	- optimize batchnorm, conv with variable weights, prelu, reshape, slice, upsample for onnx model;
	- add cos/sin/atan/tan support for unary for tensorflow model;
	- add any/all support for reduction for tensorflow model;
	- add elu, conv3d, pool3d support for tensorflow model;
	- optimize argmax, batchnorm, concat, batch to space, conv with variable weights, prelu, slice for tensorflow model;

- others:
	- fix size computer lock;
	- fix thread pool deadlock;
	- add express & parameters in express;
	- rewrite blitter chooser without static map;
	- add tests for expr;
2019-10-29 13:37:26 +08:00
liqing 73ad3413cc - dynamic computation graph (beta)
	- add support (/express)
	- add tests
	- add benchmarks with it (/benchmark/exprModels)
- Python
	- MNN engine and tools were submitted to pip
	- available on Windows/macOS/Linux
- Engine/Converter
	- add support for benchmarking each op
	- refactor optimizer by separating steps
- CPU
	- add support for Conv3D, Pool3D, ELU, ReverseSequence
	- fix ArgMax, Permute, Scale, BinaryOp, Slice, SliceTf
- OpenCL
	- add half transform in CPU
	- add broadcast support for binary
	- optimize Conv2D, Reshape, Eltwise, Gemm, etc.
- OpenGL
	- add sub, real div support for binary
	- add support for unary
	- optimize Conv2D, Reshape
- Vulkan
	- add max support for eltwise
- Metal
	- fix missing metallib problem
- Train/Quantization
	- use express to refactor training codes
2019-09-26 21:02:07 +08:00
liqing db155b4d1d beta 0.2.0.2
- CPU
  - add padding support
  - fix bug in permute when channel % 4 != 0
  - fix bug in exp with extreme value
- OpenCL
  - add protection logic
- OpenGL
  - add protection logic
  - support NCHW format in Squeeze and Reshape
- Converter
  - add ShuffleChannel support for Caffe
  - add Clip/Transpose/Unary/Pad support for ONNX
2019-07-02 18:01:08 +08:00
liqing ad759ebfae beta 0.2.0.1
- support both armv7/arm64 in podspec (pod version >= 1.5.0 required)
- refactor neg axis support
- fix memory overlap in de-conv
- fix CONVOLUTION_TILED_NUMBER spelling error
- fix a few warnings
- add binary / interp / permute / relu / reshape / softmax support and optimize conv for OpenGL backend
- add clean in nmake build script
2019-06-24 11:32:41 +08:00
liqing 6a4213f7dc beta 0.2.0.0
- replace FreeImage with stb_image
- warn unicode error in Windows compiling
- separate clang/gcc build script for android
- add default values in fbs
- optimize CPU conv / conv depthwise / deconv / deconv depthwise / lstm / sigmoid
- add sub support in eltwise
- add reciprocal / log1p / log in unary
- add zero like / select / set diff 1d
- add batch support for permute
- add training codes
- fix metal error in dynamic separate storage type handling
2019-06-17 20:10:35 +08:00
liqing 5551108af8 beta 0.1.0 2019-04-19 20:50:09 +08:00