CPU上安装ollama和部署开源Llama2-chinese模型

即使没有GPU显卡，在CPU上借用ollama服务工具，也可以一键体验对话式语言模型。

ishanshui

2943人浏览 · 2024-03-01 00:51:20

ishanshui · 2024-03-01 00:51:20 发布

即使没有GPU显卡，在CPU上借用ollama服务工具，也可以一键体验对话式语言模型。

ollama：一个开源的大型语言模型服务，提供了类似OpenAI的API接口和聊天界面，可以非常方便地部署最新版本的GPT模型并通过接口使用。支持热加载模型文件，无需重新启动即可切换不同的模型。
Llama2-chinese：Llama2预训练模型包含7B、13B和70B三个版本。Llama2-Chat模型基于预训练模型进行了监督微调，具备更强的对话能力。Llama2-chinese是基于中文指令数据集对Llama2-Chat模型进行的微调，使得Llama2模型有着更强的中文对话能力。

环境介绍

硬件：AMD Ryzen 3

软件：WSL Debian （使用微软应用商店安装Debian）

安装ollama

官网：https://ollama.com/

简介：（摘自官网）

Get up and running with large language models, locally.
Run Llama 2, Code Llama, and other models. Customize and create your own.

下面以Linux环境为例，说明ollama的下载和安装命令：

curl -fsSL https://ollama.com/install.sh | sh

以上命令下载后，会自动安装，等待安装完成。

部署Llama2-chinese模型

安装ollama后，输入命令ollama，会提示命令格式和说明。

~$ ollama
Usage:
  ollama [flags]
  ollama [command]

Available Commands:
  serve       Start ollama
  create      Create a model from a Modelfile
  show        Show information for a model
  run         Run a model
  pull        Pull a model from a registry
  push        Push a model to a registry
  list        List models
  cp          Copy a model
  rm          Remove a model
  help        Help about any command

Flags:
  -h, --help      help for ollama
  -v, --version   Show version information

Use "ollama [command] --help" for more information about a command.

下载模型前，必须先开启ollama服务，命令如下：

ollama serve &

开启ollama服务后，可下载并运行模型Llama2-chinese，命令如下：

ollama pull llama2-chinese
ollama run llama2-chinese

运行模型后，会进入对话流程，输入问题即可开启聊天问答：

~$ ollama run llama2-chinese
[GIN] 2024/03/01 - 00:47:32 | 200 |      30.418µs |       127.0.0.1 | HEAD     "/"
[GIN] 2024/03/01 - 00:47:32 | 200 |     366.398µs |       127.0.0.1 | POST     "/api/show"
[GIN] 2024/03/01 - 00:47:32 | 200 |     406.273µs |       127.0.0.1 | POST     "/api/show"
[GIN] 2024/03/01 - 00:47:32 | 200 |     446.069µs |       127.0.0.1 | POST     "/api/chat"
>>> 你好

名字：你好
[GIN] 2024/03/01 - 00:47:41 | 200 |   1.65430645s |       127.0.0.1 | POST     "/api/chat"


>>> 你是谁

名字：你不知道吗？我是一个语言模型，用于回答问题和提供信息。
[GIN] 2024/03/01 - 00:48:03 | 200 | 11.945286677s |       127.0.0.1 | POST     "/api/chat"


>>> Send a message (/? for help)

输入“ ctrl + d ” 可以结束聊天对话。

AtomGit 开源协作平台测评赛

瓜分20万奖金获得内推名额丰厚实物奖励易参与易上手

更多推荐

ADS1292R 使用过程心电图高精度ADC模块

文章目录1 Fundamentals ofPrecision ADC Noise Analysis 精密模数转换器噪声分析基础1 Fundamentals ofPrecision ADC Noise Analysis 精密模数转换器噪声分析基础https://www.ti.com.cn/cn/lit/wp/slyy192/slyy192.pdf?ts=1600659610730&ref_u

开放原子开发者工作坊

实现一个家庭安防与环境监测系统（一）

开放原子开发者工作坊

【cf】Codeforces Round #774 (Div. 2) 前4题

题目A. Square Counting 简单数学题目大意题解代码B. Quality vs Quantity 排序题目大意题解代码C. Factorials and Powers of Two 状态压缩dp+位运算题目大意题解代码D. Weight the Tree 树形dp+dfs题目大意题解代码E. Power Board 看起来像是数论？许多年没打cf了，偶尔打了一盘，恢复紫名了。A. S