1 模型常驻显存

curl http://localhost:11434/api/generate -d '{"model": "gemma3:27b", "keep_alive": -1}' 

示例输出:

{"model":"gemma3:27b","created_at":"2025-11-26T00:50:25.894745547Z","response":"","done":true,"done_reason":"load"}%  

2 从魔搭加载GGUF文件

ollama run modelscope.cn/Qwen/Qwen2.5-3B-Instruct-GGUF
ollama run modelscope.cn/Qwen/Qwen2.5-3B-Instruct-GGUF:Q3_K_M
ollama run modelscope.cn/Qwen/Qwen2.5-3B-Instruct-GGUF:qwen2.5-3b-instruct-q3_k_m.gguf

3 删除本地模型

标签: none

评论已关闭