root/MNN - MNN - Gitea: Git with a cup of tea

Commit Graph

Author	SHA1	Message	Date
zhaode.wzd	55a59a7ebc	[MNN:Sync] Sync Internal reranker, gpt-oss.	2025-08-08 12:24:23 +08:00
xiaying	db0f559f9d	MNN:Sync: Sync Internal 3.2.2	2025-07-23 14:33:57 +08:00
zhaode.wzd	bd36a3f749	[MNN:Sync] Sync internal: 1. SmolVLM, FastVLM support. 2. QNN backend init. 3. Qwen3 MoE support. 4. Speculative decodeing init. 5. Some bugfix.	2025-05-23 15:24:18 +08:00
xiaying	766815282f	MNN:Sync: Sync Internal 3.0.4	2025-01-22 16:28:36 +08:00
xiaying	65ec0ea406	MNN:Sync: Fix bug for llama2/llama3 attention fuse, refract llm usage	2024-06-15 15:39:59 +08:00
xiaying	5895243607	[MNN:Sync] Sync Internal 2.8.4	2024-04-19 11:58:21 +08:00
zhaode.wzd	2972fe71dc	[MNN:Sync] Sync Internal 2.8.3	2024-03-13 14:55:54 +08:00
zhaode.wzd	387775be2a	[MNN:Sync] Sync Internal 2.8.0	2023-12-04 11:19:10 +08:00
xiaying	3ff49cbf4a	[MNN:Sync] Sync Internal 2.7.2	2023-10-18 10:31:02 +08:00
xiaying	ea4f13d3cf	[MNN:Sync] Sync Internal 2.7.0	2023-09-04 10:42:11 +08:00
zhaode.wzd	67eceb8abb	[MNN:Sync] Sync Internal code, support low_memory for conv.	2023-06-27 10:33:16 +08:00
xiaying	69dba73dc7	[MNN:Sync] Sync internal gitlab Main Feature: 1. Add OpenCV API and Numpy API Support 2. Protobuf move into MNN 3. Add more op for torchscript convert 4. Add recompute to speed up geometry compute 5. Add ModuleBasic Test	2021-11-30 10:10:53 +08:00
hush-alibaba	58545d6ca1	Synchronize internal github for version 1.2.0 (#1518 )	2021-06-11 17:17:13 +08:00
xiaying	d91fc63976	[MNN:Sync] Sync internal Gitlab	2021-04-08 15:34:23 +08:00
xiaying	aedc8f6a68	[PATCH 341/350] Add avx512 patch	2021-01-06 15:57:22 +08:00
Hui Shu	d6795ad031	Github release 1.1.0	2020-11-05 16:49:17 +08:00
xiaying	255db932eb	[MNN:Sync] Sync Internal Github	2020-07-04 01:21:30 +08:00
xiaying	a750fe0956	Rename _AVX_MNNGemm16x6 as _AVX_MNNPackedMatMul	2020-07-04 01:06:18 +08:00
xiaying	d13f1bc0b6	Optimize Strassen Merge C Function for x86	2020-07-04 01:06:18 +08:00
xiaying	ae91cab1b8	Support Strassen for new matmul	2020-07-04 01:06:18 +08:00
和彬	1821f9bd46	gemm common and onr optimize	2020-04-15 15:34:57 +08:00
和彬	067c3e35ca	[PATCH 24/28] rename MNNGemmFloatUnit -> MNNGemmFloatUnit_4	2020-03-31 11:12:40 +08:00
hebin	c0cb82d9ab	[PATCH 22/28] [MNN:Speed] 8x8 Gemm and cache prefetch optimize	2020-03-31 11:12:40 +08:00
海境	90e06944db	Update	2020-02-26 09:57:17 +08:00
Zhang	002ac367e4	Update	2019-12-27 22:16:57 +08:00
liqing	d6b00d04f4	- build: - unify schema building in core and converter; - add more build script for android; - add linux build script for python; - ops impl: - add floor mod support in binary; - use eltwise impl in add/max/sub/mul binary for optimization; - remove fake double support in cast; - fix 5d support for concat; - add adjX and adjY support for batch matmul; - optimize conv2d back prop filter; - add pad mode support for conv3d; - fix bug in conv2d & conv depthwise with very small feature map; - optimize binary without broacast; - add data types support for gather; - add gather ND support; - use uint8 data type in gather v2; - add transpose support for matmul; - add matrix band part; - add dim != 4 support for padding, reshape & tensor convert; - add pad type support for pool3d; - make ops based on TensorFlow Lite quantization optional; - add all & any support for reduction; - use type in parameter as output type in reduction; - add int support for unary; - add variable weight support for conv2d; - fix conv2d depthwise weights initialization; - fix type support for transpose; - fix grad outputs count for reduce grad and reshape grad; - fix priorbox & detection output; - fix metal softmax error; - python: - add runSessionWithCallBackInfo interface; - add max nodes limit (1400) for visualization tool; - fix save error in python3; - align default dim; - convert: - add extra design for optimization; - add more post converting optimizers; - add caffe v1 weights blob support; - add cast, unary, conv transpose support for onnx model; - optimize batchnorm, conv with variable weights, prelu, reshape, slice, upsample for onnx model; - add cos/sin/atan/tan support for unary for tensorflow model; - add any/all support for reduction for tensorflow model; - add elu, conv3d, pool3d support for tensorflow model; - optimize argmax, batchnorm, concat, batch to space, conv with variable weights, prelu, slice for tensorflow model; - others: - fix size computer lock; - fix thread pool deadlock; - add express & parameters in express; - rewrite blitter chooser without static map; - add tests for expr;	2019-10-29 13:37:26 +08:00
liqing	73ad3413cc	- dynamic computation graph (beta) - add supports (/express) - add tests - add benchmarks with it (/benchmark/exprModels) - Python - MNN engine and tools were submitted to pip - available on Windows/macOS/Linux - Engine/Converter - add supports for each op benchmarking - refactor optimizer by separating steps - CPU - add supports for Conv3D, Pool3D, ELU, ReverseSequence - fix ArgMax, Permute, Scale, BinaryOp, Slice, SliceTf - OpenCL - add half transform in CPU - add broadcast supports for binary - optimize Conv2D, Reshape, Eltwise, Gemm, etc. - OpenGL - add sub, real div supports for binary - add supports for unary - optimize Conv2D, Reshape - Vulkan - add max supports for eltwise - Metal - fix metallib missing problem - Train/Quantization - use express to refactor training codes	2019-09-26 21:02:07 +08:00
liqing	db155b4d1d	beta 0.2.0.2 - CPU - add padding support - fix bug in permute when channel % 4 != 0 - fix bug in exp with extreme value - OpenCL - add protecting logics - OpenGL - add protecting logics - support NCHW format in Squeeze and Reshape - Converter - add ShuffleChannel support for Caffe - add Clip/Transpose/Unary/Pad supports for ONNX	2019-07-02 18:01:08 +08:00
liqing	ad759ebfae	beta 0.2.0.1 - support both armv7/arm64 in podspec (pod version >= 1.5.0 required) - refactor neg axis support - fix memory overlap in de-conv - fix CONVOLUTION_TILED_NUMBER spell error - fix few warnings - add binary / interp / permute / relu / reshape / softmax support and optimize conv for OpenGL backend - add clean in nmake build script	2019-06-24 11:32:41 +08:00
liqing	6a4213f7dc	beta 0.2.0.0 - replace FreeImage with stb_image - warn unicode error in Windows compiling - separate clang/gcc build script for android - add default values in fbs - optimize CPU conv / conv depthwise / deconv / deconv depthwise / lstm / sigmoid - add sub support in eltwise - add reciprocal / log1p / log in unary - add zero like / select / set diff 1d - add batch support for permute - add training codes - fix metal error in dynamic separate storage type handling	2019-06-17 20:10:35 +08:00
liqing	5551108af8	beta 0.1.0	2019-04-19 20:50:09 +08:00

31 Commits