MNNLLM iOS Application

View the Chinese documentation

Introduction

This project is an iOS application built on the MNN engine that supports local, multimodal conversations with large language models.

It runs fully offline for strong privacy: once models are downloaded to the device, all conversations happen locally, with no network uploads or remote processing.

Features

  1. Local Models
    • Display locally downloaded models
    • Support custom pinning
  2. Model Market
    • Get list of models supported by MNN
    • Model management: download and delete models
      • Support switching between Hugging Face, ModelScope, and Modeler download sources
    • Model search: support keyword search and tag search
  3. Benchmark Testing
    • Support automated benchmark runs that report prefill speed, decode speed, and memory usage
  4. Multimodal Chat: Supports full Markdown format output
    • Text-to-text
    • Audio-to-text
    • Image-to-text: images can be captured or selected from gallery
  5. Model Configuration
    • Support configuring mmap
    • Support configuring sampling strategy
    • Support configuring diffusion settings
  6. Chat History
    • Support a per-model conversation history list and restoring previous conversations
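
The benchmark metrics listed above reduce to simple throughput ratios. The sketch below is illustrative only; the type and property names are assumptions, not the app's actual API:

```swift
// Illustrative sketch of the benchmark metrics (names are assumptions,
// not the app's actual API): speeds are tokens processed per second.
struct BenchmarkResult {
    let promptTokens: Int      // tokens in the input prompt
    let generatedTokens: Int   // tokens produced by the model
    let prefillSeconds: Double // time spent processing the prompt
    let decodeSeconds: Double  // time spent generating the output

    var prefillSpeed: Double { Double(promptTokens) / prefillSeconds }   // tokens/s
    var decodeSpeed: Double { Double(generatedTokens) / decodeSeconds }  // tokens/s
}
```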

Video Introduction


Click here to download the original resolution introduction video

Application Preview:

[Screenshots: Text To Text, Image To Text, Audio To Text, Model Filter, Local Model, Model Market, Benchmark, Model List, History]

Additionally, the app supports on-device use of DeepSeek with its Think mode.

How to Build and Use

  1. Clone the repository:

    git clone https://github.com/alibaba/MNN.git
    
  2. Build the MNN.framework:

    sh package_scripts/ios/buildiOS.sh "
    -DMNN_ARM82=ON
    -DMNN_LOW_MEMORY=ON
    -DMNN_SUPPORT_TRANSFORMER_FUSE=ON
    -DMNN_BUILD_LLM=ON
    -DMNN_CPU_WEIGHT_DEQUANT_GEMM=ON
    -DMNN_METAL=ON
    -DMNN_BUILD_DIFFUSION=ON
    -DMNN_OPENCL=OFF
    -DMNN_SEP_BUILD=OFF
    -DLLM_SUPPORT_AUDIO=ON
    -DMNN_BUILD_AUDIO=ON
    -DLLM_SUPPORT_VISION=ON
    -DMNN_BUILD_OPENCV=ON
    -DMNN_IMGCODECS=ON
    "
    
  3. Copy the framework to the iOS project:

    mv MNN-iOS-CPU-GPU/Static/MNN.framework apps/iOS/MNNLLMChat
    

    Ensure the Link Binary With Libraries build phase includes MNN.framework; if it is missing, add it manually.
  4. Update iOS signing and build the project:

    cd apps/iOS/MNNLLMChat
    open MNNLLMiOS.xcodeproj
    

    In Xcode, go to Signing & Capabilities > Team, sign in with your Apple ID, and set your Bundle Identifier.

    Wait for the Swift Package dependencies to finish downloading before building.

Notes

Due to memory limitations on iPhones, it is recommended to use models with 7B parameters or fewer to avoid memory-related crashes.
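
As a rough guide, you can estimate whether a model is likely to fit before loading it. The sketch below assumes about 0.5 bytes per parameter for 4-bit quantized weights plus roughly 1 GiB of runtime overhead, and treats 70% of physical RAM as usable; all of these constants are illustrative assumptions, not values measured from the app:

```swift
import Foundation

// Rough, illustrative fit check (assumed constants, not measured values):
// ~0.5 bytes/parameter for 4-bit quantized weights, ~1 GiB of runtime
// overhead for the KV cache and engine, and 70% of physical RAM usable.
func isLikelyToFit(parameterCount: Double) -> Bool {
    let bytesPerParam = 0.5
    let overheadBytes: Double = 1_073_741_824 // ~1 GiB
    let neededBytes = parameterCount * bytesPerParam + overheadBytes
    let usableBytes = Double(ProcessInfo.processInfo.physicalMemory) * 0.7
    return neededBytes < usableBytes
}
```

By this estimate, a 7B-parameter model needs roughly 4.5 GiB, which is near the limit on many iPhones; larger models risk being terminated by iOS.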


Local Debugging

For local debugging, simply drag the model files to the LocalModel folder and run the project:

  1. First, download an MNN model from Hugging Face or ModelScope.
  2. After downloading, drag all files from the model folder into the project's LocalModel folder.
  3. Ensure the files above are included in the Copy Bundle Resources build phase.
  4. Configure the model: go to ModelListViewModel.swift and adjust the configuration, for example whether the model supports thinking mode:

    // MARK: Configure the local model here
    let modelName = "Qwen3-0.6B-MNN-Inside" // Model name
    let localModel = ModelInfo(
        modelName: modelName,
        tags: [
            // MARK: If the model supports thinking mode, uncomment the next line
            // NSLocalizedString("tag.deepThinking", comment: "Deep thinking tag for local model"),
            NSLocalizedString("tag.localModel", comment: "Local model inside the app")
        ],
        categories: ["Local Models"],
        vendor: "Local",
        sources: ["local": "bundle_root/\(modelName)"],
        isDownloaded: true
    )
    localModels.append(localModel)
    ModelStorageManager.shared.markModelAsDownloaded(modelName)
  5. Run the project, open the chat page, and interact with the model for debugging.

The app will automatically detect and load models from the LocalModel folder without requiring additional configuration.
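
The automatic detection described above amounts to a simple directory scan. The sketch below is illustrative; the function name and folder layout are assumptions, not the app's actual implementation:

```swift
import Foundation

// Illustrative sketch (assumed names, not the app's actual code): list the
// files in a "LocalModel" directory so bundled models can be shown without
// extra configuration. Returns an empty list if the directory is missing.
func localModelFiles(in directory: String) -> [String] {
    let files = (try? FileManager.default.contentsOfDirectory(atPath: directory)) ?? []
    return files.sorted()
}

// Example: scan the app bundle's LocalModel folder.
let localModelDir = (Bundle.main.resourcePath ?? ".") + "/LocalModel"
print(localModelFiles(in: localModelDir))
```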

Release Notes

Version 0.4

  • Added three major project modules: Local Models, Model Market, and Benchmark Testing
  • Added benchmark testing to compare the performance of different models
  • Added settings page, accessible from the history sidebar
  • Added Ali CDN support for fetching the model list
  • Added model market filtering functionality

Version 0.3.1

  • Add support for model parameter configuration


Version 0.3

New Features:

  • Add support for downloading from the Modeler source
  • Add support for Stable Diffusion text-to-image generation


Version 0.2

New Features:

  • Add support for mmap configuration and manual cache clearing
  • Add support for downloading models from the ModelScope source


References