Upload folder using huggingface_hub
- .gitattributes +21 -0
- README.md +131 -3
- Yee-R1-mini-BF16.gguf +3 -0
- Yee-R1-mini-F16.gguf +3 -0
- Yee-R1-mini-IQ3_M-imat.gguf +3 -0
- Yee-R1-mini-IQ3_S-imat.gguf +3 -0
- Yee-R1-mini-IQ3_XS-imat.gguf +3 -0
- Yee-R1-mini-IQ3_XXS-imat.gguf +3 -0
- Yee-R1-mini-IQ4_NL-imat.gguf +3 -0
- Yee-R1-mini-IQ4_XS-imat.gguf +3 -0
- Yee-R1-mini-Q2_K-imat.gguf +3 -0
- Yee-R1-mini-Q2_K_S-imat.gguf +3 -0
- Yee-R1-mini-Q3_K_L-imat.gguf +3 -0
- Yee-R1-mini-Q3_K_M-imat.gguf +3 -0
- Yee-R1-mini-Q3_K_S-imat.gguf +3 -0
- Yee-R1-mini-Q4_K_M-imat.gguf +3 -0
- Yee-R1-mini-Q4_K_S-imat.gguf +3 -0
- Yee-R1-mini-Q5_K_M-imat.gguf +3 -0
- Yee-R1-mini-Q5_K_S-imat.gguf +3 -0
- Yee-R1-mini-Q6_K-imat.gguf +3 -0
- Yee-R1-mini-Q8_0-imat.gguf +3 -0
- imatrix.dat +3 -0
- logo.png +3 -0
.gitattributes
CHANGED
@@ -33,3 +33,24 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
36 |
+
Yee-R1-mini-BF16.gguf filter=lfs diff=lfs merge=lfs -text
|
37 |
+
Yee-R1-mini-F16.gguf filter=lfs diff=lfs merge=lfs -text
|
38 |
+
Yee-R1-mini-IQ3_M-imat.gguf filter=lfs diff=lfs merge=lfs -text
|
39 |
+
Yee-R1-mini-IQ3_S-imat.gguf filter=lfs diff=lfs merge=lfs -text
|
40 |
+
Yee-R1-mini-IQ3_XS-imat.gguf filter=lfs diff=lfs merge=lfs -text
|
41 |
+
Yee-R1-mini-IQ3_XXS-imat.gguf filter=lfs diff=lfs merge=lfs -text
|
42 |
+
Yee-R1-mini-IQ4_NL-imat.gguf filter=lfs diff=lfs merge=lfs -text
|
43 |
+
Yee-R1-mini-IQ4_XS-imat.gguf filter=lfs diff=lfs merge=lfs -text
|
44 |
+
Yee-R1-mini-Q2_K-imat.gguf filter=lfs diff=lfs merge=lfs -text
|
45 |
+
Yee-R1-mini-Q2_K_S-imat.gguf filter=lfs diff=lfs merge=lfs -text
|
46 |
+
Yee-R1-mini-Q3_K_L-imat.gguf filter=lfs diff=lfs merge=lfs -text
|
47 |
+
Yee-R1-mini-Q3_K_M-imat.gguf filter=lfs diff=lfs merge=lfs -text
|
48 |
+
Yee-R1-mini-Q3_K_S-imat.gguf filter=lfs diff=lfs merge=lfs -text
|
49 |
+
Yee-R1-mini-Q4_K_M-imat.gguf filter=lfs diff=lfs merge=lfs -text
|
50 |
+
Yee-R1-mini-Q4_K_S-imat.gguf filter=lfs diff=lfs merge=lfs -text
|
51 |
+
Yee-R1-mini-Q5_K_M-imat.gguf filter=lfs diff=lfs merge=lfs -text
|
52 |
+
Yee-R1-mini-Q5_K_S-imat.gguf filter=lfs diff=lfs merge=lfs -text
|
53 |
+
Yee-R1-mini-Q6_K-imat.gguf filter=lfs diff=lfs merge=lfs -text
|
54 |
+
Yee-R1-mini-Q8_0-imat.gguf filter=lfs diff=lfs merge=lfs -text
|
55 |
+
imatrix.dat filter=lfs diff=lfs merge=lfs -text
|
56 |
+
logo.png filter=lfs diff=lfs merge=lfs -text
|
README.md
CHANGED
@@ -1,3 +1,131 @@
|
|
1 |
-
---
|
2 |
-
|
3 |
-
|
1 |
+
---
|
2 |
+
library_name: transformers
|
3 |
+
license: apache-2.0
|
4 |
+
license_link: https://huggingface.co/Qwen/Qwen3-1.7B/blob/main/LICENSE
|
5 |
+
pipeline_tag: text-generation
|
6 |
+
base_model:
|
7 |
+
- Qwen/Qwen3-1.7B-Base
|
8 |
+
---
|
9 |
+
|
10 |
+
# Yee (小熠): AI Data Security Expert
|
11 |
+
|
12 |
+

|
13 |
+
|
14 |
+
> An intelligent data security assistant built on large language model technology, developed by [广州熠数信息技术有限公司](https://shining-data.com).
|
15 |
+
> This repository contains the GGUF model files for Yee-R1-mini.
|
16 |
+
|
17 |
+
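Yee (小熠) is an AI expert system focused on the **data security domain**. It builds on the **Qwen3-1.7B** large language model architecture and integrates specialized capabilities such as data classification and grading, security auditing, and protection and detection. It offers lightweight, intelligent data security solutions for industry, government, telecom operators, and other sectors, helping users reach the data security goals of being "compliant, visible, controllable, and defensible".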
|
18 |
+
|
19 |
+
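With the **AI data security expert large model** as its core technical foundation, Yee provides a full-stack data security auditing system and an end-to-end data leak prevention system. It is deployed across three scenarios, "cloud", "pipe" (network), and "device" (endpoint), to help enterprises meet the security challenges of the digital economy.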
|
20 |
+
|
21 |
+
---
|
22 |
+
|
23 |
+
## 🔍 Key Features
|
24 |
+
|
25 |
+
- **Built on Qwen3-1.7B**
|
26 |
+
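  - Uses Qwen3, the latest generation of Alibaba's Tongyi Qianwen large language models, with strong reasoning, logical judgment, and instruction-following capabilities.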
|
27 |
+
  - Supports flexible switching between **Thinking Mode** and **Non-Thinking Mode** to suit different application scenarios.
|
28 |
+
|
29 |
+
- **Dual-mode reasoning**
|
30 |
+
  - Enable Thinking Mode for complex logical tasks (such as code analysis, mathematical computation, and strategy planning).
|
31 |
+
  - Use Non-Thinking Mode for everyday conversation and fast-response scenarios to improve efficiency.
|
32 |
+
|
33 |
+
- **Agent capabilities**
|
34 |
+
  - Integrates the Qwen-Agent framework and supports calling external tools (such as database interfaces, log analyzers, and APIs) to automate task execution.
|
35 |
+
|
36 |
+
- **High compatibility**
|
37 |
+
  - Supports mainstream deployment options: local execution, Docker containers, Kubernetes clusters, SaaS APIs, and more.
|
38 |
+
  - Compatible with inference frameworks such as HuggingFace Transformers, vLLM, SGLang, and Ollama.
|
39 |
+
|
40 |
+
---
|
41 |
+
|
42 |
+
## 📊 Performance Evaluation
|
43 |
+
|
44 |
+
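The table below lists Yee's overall scores across multiple security domains on [CS-Eval](https://cs-eval.com/#/app/leaderBoard), produced by an evaluation framework that simulates real business scenarios: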
|
45 |
+
|
46 |
+
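| Overall score | System and software security fundamentals | Access control and identity management | Encryption and key management | Infrastructure security | AI and network security | Vulnerability management and penetration testing | Threat detection and prevention | Data security and privacy protection | Supply chain security | Security architecture design | Business continuity and emergency response and recovery | Chinese tasks | English tasks |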
|
47 |
+
|----------|------------------------|--------------------|--------------------|--------------|--------------|--------------------|----------------|--------------------|------------|--------------|--------------------------|----------|----------|
|
48 |
+
| 77.48 | 78.00 | 79.31 | 71.90 | 78.37 | 84.65 | 75.24 | 78.41 | 73.02 | 86.71 | 80.49 | 71.33 | 77.58 | 76.03 |
|
49 |
+
|
50 |
+
---
|
51 |
+
|
52 |
+
## 📦 Quick Start
|
53 |
+
|
54 |
+
```python
|
55 |
+
from transformers import AutoTokenizer, AutoModelForCausalLM
|
56 |
+
|
57 |
+
# Load the tokenizer and model
|
58 |
+
tokenizer = AutoTokenizer.from_pretrained("sds-ai/Yee-R1-mini")
|
59 |
+
model = AutoModelForCausalLM.from_pretrained(
|
60 |
+
    "sds-ai/Yee-R1-mini",
|
61 |
+
    torch_dtype="auto",
|
62 |
+
    device_map="auto"
|
63 |
+
)
|
64 |
+
|
65 |
+
# Input prompt
|
66 |
+
prompt = "请帮我检查这份数据是否包含敏感字段?"  # "Please check whether this data contains sensitive fields."
|
67 |
+
|
68 |
+
# Apply the chat template and select the mode
|
69 |
+
messages = [{"role": "user", "content": prompt}]
|
70 |
+
text = tokenizer.apply_chat_template(
|
71 |
+
    messages,
|
72 |
+
    tokenize=False,
|
73 |
+
    add_generation_prompt=True,
|
74 |
+
    enable_thinking=True  # switch to thinking mode
|
75 |
+
)
|
76 |
+
|
77 |
+
# Encode the input
|
78 |
+
inputs = tokenizer([text], return_tensors="pt").to(model.device)
|
79 |
+
|
80 |
+
# Generate the response
|
81 |
+
response_ids = model.generate(**inputs, max_new_tokens=32768)
|
82 |
+
response = tokenizer.decode(response_ids[0][len(inputs.input_ids[0]):], skip_special_tokens=True)
|
83 |
+
|
84 |
+
print("Yee:\n", response)
|
85 |
+
```
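In thinking mode the decoded response usually carries the reasoning trace ahead of the final answer. The helper below is a minimal sketch for separating the two. It assumes the decoded text still contains Qwen3's `</think>` marker, and `split_thinking` is a hypothetical name, not part of Transformers or of this repository.

```python
def split_thinking(response: str, end_tag: str = "</think>") -> tuple[str, str]:
    """Split a decoded response into (thinking, answer).

    Assumes the reasoning trace, if present, ends with `end_tag`.
    When the tag is absent, the whole text is treated as the answer.
    """
    head, sep, tail = response.rpartition(end_tag)
    if not sep:
        return "", response.strip()
    # Drop the opening tag (if any) and surrounding whitespace.
    thinking = head.replace("<think>", "", 1).strip()
    return thinking, tail.strip()


thinking, answer = split_thinking(
    "<think>\nCheck column names.\n</think>\n\nNo sensitive fields found."
)
print(answer)
```

With the quick-start variables in scope, this could be called as `split_thinking(response)`.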
|
86 |
+
|
87 |
+
---
|
88 |
+
|
89 |
+
## 🛠️ Deployment
|
90 |
+
|
91 |
+
You can deploy Yee in any of the following ways:
|
92 |
+
|
93 |
+
### Serve with SGLang
|
94 |
+
```bash
|
95 |
+
python -m sglang.launch_server --model-path sds-ai/Yee-R1-mini --reasoning-parser qwen3
|
96 |
+
```
|
97 |
+
|
98 |
+
### Serve with vLLM
|
99 |
+
```bash
|
100 |
+
vllm serve sds-ai/Yee-R1-mini --enable-reasoning --reasoning-parser deepseek_r1
|
101 |
+
```
|
102 |
+
|
103 |
+
### Use Ollama / LMStudio / llama.cpp / KTransformers
|
104 |
+
Qwen3 is widely supported by mainstream local LLM tools; see their official documentation for details.
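For local tools that load GGUF files, one practical question is which quantization to download. The sketch below picks the largest file that fits a size budget, using the byte sizes recorded in the Git LFS pointers of this commit. `pick_quant` is a hypothetical helper, and file size is only a rough proxy: actual memory use also depends on context length and runtime overhead, and a larger file is not guaranteed to give better answers.

```python
from typing import Optional

# File sizes in bytes, copied from the Git LFS pointers in this commit.
GGUF_SIZES = {
    "Yee-R1-mini-Q2_K_S-imat.gguf": 732969280,
    "Yee-R1-mini-IQ3_XXS-imat.gguf": 754360640,
    "Yee-R1-mini-Q2_K-imat.gguf": 777795904,
    "Yee-R1-mini-IQ3_XS-imat.gguf": 834222400,
    "Yee-R1-mini-IQ3_S-imat.gguf": 867252544,
    "Yee-R1-mini-Q3_K_S-imat.gguf": 867252544,
    "Yee-R1-mini-IQ3_M-imat.gguf": 895662400,
    "Yee-R1-mini-Q3_K_M-imat.gguf": 939538752,
    "Yee-R1-mini-Q3_K_L-imat.gguf": 1003501888,
    "Yee-R1-mini-IQ4_XS-imat.gguf": 1010383168,
    "Yee-R1-mini-IQ4_NL-imat.gguf": 1054423360,
    "Yee-R1-mini-Q4_K_S-imat.gguf": 1060190528,
    "Yee-R1-mini-Q4_K_M-imat.gguf": 1107409216,
    "Yee-R1-mini-Q5_K_S-imat.gguf": 1230584128,
    "Yee-R1-mini-Q5_K_M-imat.gguf": 1257879872,
    "Yee-R1-mini-Q6_K-imat.gguf": 1417754944,
    "Yee-R1-mini-Q8_0-imat.gguf": 1834426688,
    "Yee-R1-mini-F16.gguf": 3447349312,
    "Yee-R1-mini-BF16.gguf": 3447349312,
}


def pick_quant(budget_bytes: int) -> Optional[str]:
    """Return the largest listed file that fits the budget, or None."""
    fitting = {name: size for name, size in GGUF_SIZES.items() if size <= budget_bytes}
    if not fitting:
        return None
    return max(fitting, key=fitting.get)


print(pick_quant(1_200_000_000))
```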
|
105 |
+
|
106 |
+
---
|
107 |
+
|
108 |
+
## 📚 Best Practices
|
109 |
+
|
110 |
+
For best performance, use the following recommended settings:
|
111 |
+
|
112 |
+
| Scenario | Temperature | TopP | TopK | MinP | Presence Penalty |
|
113 |
+
|------|------|------|------|------|------------------|
|
114 |
+
| Thinking mode (`enable_thinking=True`) | 0.6 | 0.95 | 20 | 0 | 1.5 (reduces repetitive output) |
|
115 |
+
| Non-thinking mode (`enable_thinking=False`) | 0.7 | 0.8 | 20 | 0 | Not recommended |
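The sampling values in the table can be passed to `model.generate` as keyword arguments. The sketch below is one way to package them. `sampling_kwargs` is a hypothetical helper, and the presence penalty column is left out because, as far as I know, it is a request-level option of OpenAI-compatible servers rather than a `generate` argument.

```python
def sampling_kwargs(enable_thinking: bool) -> dict:
    """Return `generate` keyword arguments matching the recommended settings."""
    if enable_thinking:
        params = {"temperature": 0.6, "top_p": 0.95}
    else:
        params = {"temperature": 0.7, "top_p": 0.8}
    # Values shared by both modes; do_sample=True is required for the
    # temperature/top-p/top-k settings to take effect.
    params.update({"top_k": 20, "min_p": 0.0, "do_sample": True})
    return params


# Usage with the quick-start objects (assumed to be in scope):
# response_ids = model.generate(**inputs, max_new_tokens=32768, **sampling_kwargs(True))
print(sampling_kwargs(True))
```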
|
116 |
+
|
117 |
+
- Set the output length to **32,768 tokens**; for complex tasks it can be raised to **38,912 tokens**.
|
118 |
+
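- In multi-turn conversations, keep only the final output in the history; leaving the thinking content in can interfere with how the model reads the context.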
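The multi-turn advice can be applied with a small preprocessing step before the history is sent back to the model. This is a sketch under the assumption that assistant turns store their reasoning inside `<think>...</think>`; `strip_thinking_from_history` is a hypothetical name.

```python
import re

# Matches one reasoning block plus any whitespace that follows it.
THINK_BLOCK = re.compile(r"<think>.*?</think>\s*", flags=re.DOTALL)


def strip_thinking_from_history(messages: list[dict]) -> list[dict]:
    """Return a copy of the chat history with reasoning removed from assistant turns."""
    cleaned = []
    for message in messages:
        if message.get("role") == "assistant":
            content = THINK_BLOCK.sub("", message["content"]).strip()
            message = {**message, "content": content}  # copy, do not mutate the input
        cleaned.append(message)
    return cleaned


history = [
    {"role": "user", "content": "Does this table contain ID numbers?"},
    {"role": "assistant", "content": "<think>\nScan the columns.\n</think>\n\nNo."},
]
print(strip_thinking_from_history(history)[1]["content"])
```

The cleaned list can then be extended with the next user message and passed to `tokenizer.apply_chat_template` as in the quick start.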
|
119 |
+
|
120 |
+
|
121 |
+
---
|
122 |
+
|
123 |
+
## 📞 Contact Us
|
124 |
+
|
125 |
+
To learn more about Yee, visit the [熠数信息 official website](https://shining-data.com).
|
126 |
+
|
127 |
+
---
|
128 |
+
|
129 |
+
## 🌟 Acknowledgements
|
130 |
+
|
131 |
+
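Thanks to Alibaba's Tongyi Lab for open-sourcing the Qwen3 model, which gives Yee a solid foundation in language understanding and generation.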
|
Yee-R1-mini-BF16.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:d6acc969e1fb3705fc7b86219d5ab2e1e4b83a674367c86e1d2a95d7a94f1b79
|
3 |
+
size 3447349312
|
Yee-R1-mini-F16.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:3bb29e0aada5d37a32f902f5efb9aa3c1946a26d30443f2d3d922e999b0eb723
|
3 |
+
size 3447349312
|
Yee-R1-mini-IQ3_M-imat.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:78e95229833048629a188805834198898174f3be14e8313f54e49ffef812544c
|
3 |
+
size 895662400
|
Yee-R1-mini-IQ3_S-imat.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:f0b81e61890b681f90465306f67df2114c5ea3e2fc0e119170c625369747dd2f
|
3 |
+
size 867252544
|
Yee-R1-mini-IQ3_XS-imat.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:db4f8fe2eca00239b2f1217c9922f9a7e96322b4468c015d74409fea7b8ab669
|
3 |
+
size 834222400
|
Yee-R1-mini-IQ3_XXS-imat.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:fc7808eb31d1f0489a6fcddc421db7c9f7a051e4386152b3853e7a68fef0cfd9
|
3 |
+
size 754360640
|
Yee-R1-mini-IQ4_NL-imat.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:319fe7d63be53e57e22025f0d83983d62d5740b0a00ed190687be55bd2a6f87a
|
3 |
+
size 1054423360
|
Yee-R1-mini-IQ4_XS-imat.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:0743b76ddef583e82e4e55ca86de2c3558008c00bf0aaa9af0bb7242d7c98640
|
3 |
+
size 1010383168
|
Yee-R1-mini-Q2_K-imat.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:0e7f77275a0948bfc0879c29f43f80e250fc77c5c5580bd44ad960db5fe41953
|
3 |
+
size 777795904
|
Yee-R1-mini-Q2_K_S-imat.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:597496cdee311809a964b0da7660f9e8004949078fbda0459311ad0f35d3b0d6
|
3 |
+
size 732969280
|
Yee-R1-mini-Q3_K_L-imat.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:e96692f592481498c2426f2230dddee990031e0611b2ca137fe39227a66bc332
|
3 |
+
size 1003501888
|
Yee-R1-mini-Q3_K_M-imat.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:dd71d036f9b727a3b2631df2dcb697b747454825e47f008912ae2ce48f7b181a
|
3 |
+
size 939538752
|
Yee-R1-mini-Q3_K_S-imat.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:0a73681582a56abc7ca30689a70f07ef74904170e1757d8d9edafaedf967c3a2
|
3 |
+
size 867252544
|
Yee-R1-mini-Q4_K_M-imat.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:5e5aea0bd9d19f17d0c715091ae30d5fc59ec9c8ebcf4b9f122d8c500e1f5959
|
3 |
+
size 1107409216
|
Yee-R1-mini-Q4_K_S-imat.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:8c44717dbc85e5fd44319f3094f792eabc52ba4f234231c5758605f77ff08380
|
3 |
+
size 1060190528
|
Yee-R1-mini-Q5_K_M-imat.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:209bfdd51337b690da65bf965b10032e14606047c3ca4cb4a71d17932e312678
|
3 |
+
size 1257879872
|
Yee-R1-mini-Q5_K_S-imat.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:c5352570f2c728084e73e5f7c00af761bd3e2659777c675fb30395375d4f2abe
|
3 |
+
size 1230584128
|
Yee-R1-mini-Q6_K-imat.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:bc7b66d86f4701658b4834b7db34976b1d4efc24460f7b1b2c2d730aa8df2399
|
3 |
+
size 1417754944
|
Yee-R1-mini-Q8_0-imat.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:e94037669972cde9b4794c3497641d379a293494e71588eefa7279be21e40313
|
3 |
+
size 1834426688
|
imatrix.dat
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:40b90656b56f42098911321a09c8520de634be8d93b0cac8d200117bcb844875
|
3 |
+
size 2070887
|
logo.png
ADDED