[新教程] ollama常用操作
1 模型常驻显存
curl http://localhost:11434/api/generate -d '{"model": "gemma3:27b", "keep_alive": -1}' 示例输出:
{"model":"gemma3:27b","created_at":"2025-11-26T00:50:25.894745547Z","response":"","done":true,"done_reason":"load"}% 2 从魔搭加载GGUF文件
ollama run modelscope.cn/Qwen/Qwen2.5-3B-Instruct-GGUF
ollama run modelscope.cn/Qwen/Qwen2.5-3B-Instruct-GGUF:Q3_K_M
ollama run modelscope.cn/Qwen/Qwen2.5-3B-Instruct-GGUF:qwen2.5-3b-instruct-q3_k_m.gguf