Shining-Data commited on
Commit
a40fc81
·
verified ·
1 Parent(s): 189ed70

Upload folder using huggingface_hub

Browse files
.gitattributes CHANGED
@@ -33,3 +33,5 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ logo.png filter=lfs diff=lfs merge=lfs -text
37
+ tokenizer.json filter=lfs diff=lfs merge=lfs -text
Modelfile ADDED
@@ -0,0 +1,14 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # ollama modelfile auto-generated by llamafactory
2
+
3
+ FROM .
4
+
5
+ TEMPLATE """<bos>{{ if .System }}{{ .System }}
6
+
7
+ {{ end }}{{ range .Messages }}{{ if eq .Role "user" }}<start_of_turn>user
8
+ {{ .Content }}<end_of_turn>
9
+ <start_of_turn>model
10
+ {{ else if eq .Role "assistant" }}{{ .Content }}<end_of_turn>
11
+ {{ end }}{{ end }}"""
12
+
13
+ PARAMETER stop "<end_of_turn>"
14
+ PARAMETER num_ctx 4096
README.md CHANGED
@@ -1,3 +1,125 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: transformers
3
+ license: apache-2.0
4
+ license_link: https://huggingface.co/google/gemma-3-270m-it/blob/main/LICENSE
5
+ pipeline_tag: text-generation
6
+ base_model:
7
+ - google/gemma-3-270m-it
8
+ ---
9
+
10
+ # 小熠(Yee)AI 数据安全专家
11
+
12
+ ![Logo](logo.png)
13
+
14
+ > 由 [广州熠数信息技术有限公司](https://shining-data.com) 开发,基于大语言模型技术构建的数据安全智能助手。
15
+ ---
16
+
17
+ ## 🔍 核心特点
18
+
19
+ - **基于 Gemma-3-270M-IT 微调**
20
+ - 使用 Google 最新一代轻量级大语言模型 Gemma-3,具备优秀的推理能力和指令跟随能力。
21
+ - 模型参数精简,推理速度快,资源消耗低,适合资源受限环境部署。
22
+
23
+ - **高效推理机制**
24
+ - 针对数据安全场景优化,在保证准确性的同时提供快速响应。
25
+ - 支持流式输出,提升用户体验。
26
+
27
+ - **高兼容性**
28
+ - 支持主流部署方式:本地运行、Docker 容器、Kubernetes 集群、SaaS API 接口等。
29
+ - 兼容 HuggingFace Transformers、vLLM、Ollama 等推理框架。
30
+
31
+ ---
32
+
33
+ ## 📊 性能特点
34
+
35
+ Yee-270m 基于 Gemma-3-270M-IT 微调,在保持轻量级特性的同时,针对数据安全领域任务进行了专门优化:
36
+
37
+ - 模型参数量:2.7亿,部署资源需求低
38
+ - 支持上下文长度:8K tokens
39
+ - 推理速度快,响应延迟低
40
+ - 针对中文数据安全场景优化
41
+
42
+ ---
43
+
44
+ ## 📦 快速开始
45
+
46
+ ```python
47
+ from transformers import AutoTokenizer, AutoModelForCausalLM
48
+
49
+ # 加载 tokenizer 和模型
50
+ tokenizer = AutoTokenizer.from_pretrained("sds-ai/Yee-270m")
51
+ model = AutoModelForCausalLM.from_pretrained(
52
+ "sds-ai/Yee-270m",
53
+ torch_dtype="auto",
54
+ device_map="auto"
55
+ )
56
+
57
+ # 输入提示
58
+ prompt = "请帮我检查这份数据是否包含敏感字段?"
59
+
60
+ # 应用聊天模板
61
+ messages = [{"role": "user", "content": prompt}]
62
+ text = tokenizer.apply_chat_template(
63
+ messages,
64
+ tokenize=False,
65
+ add_generation_prompt=True
66
+ )
67
+
68
+ # 编码输入
69
+ inputs = tokenizer([text], return_tensors="pt").to(model.device)
70
+
71
+ # 生成响应
72
+ response_ids = model.generate(**inputs, max_new_tokens=1024)
73
+ response = tokenizer.decode(response_ids[0][len(inputs.input_ids[0]):], skip_special_tokens=True)
74
+
75
+ print("小熠:\n", response)
76
+ ```
77
+
78
+ ---
79
+
80
+ ## 🛠️ 部署方式
81
+
82
+ 你可以通过以下任意一种方式部署小熠:
83
+
84
+ ### 使用 Transformers 直接加载
85
+ ```python
86
+ from transformers import pipeline
87
+
88
+ pipe = pipeline("text-generation", model="sds-ai/Yee-270m")
89
+ response = pipe("数据安全最佳实践有哪些?")
90
+ ```
91
+
92
+ ### 使用 vLLM 启动服务
93
+ ```bash
94
+ vllm serve sds-ai/Yee-270m
95
+ ```
96
+
97
+ ### 使用 Ollama / LMStudio
98
+ Gemma-3 已被主流本地化 LLM 工具广泛支持,详情请参考官方文档。
99
+
100
+ ---
101
+
102
+ ## 📚 最佳实践建议
103
+
104
+ 为获得最佳性能,请遵循以下推荐设置:
105
+
106
+ | 参数 | 推荐值 | 说明 |
107
+ |------|--------|------|
108
+ | 温度 | 0.6 | 平衡创造性和一致性 |
109
+ | TopP | 0.9 | 核采样参数 |
110
+ | Max Length | 2048 | 最大生成长度 |
111
+
112
+ - 输出长度建议设为 **1024 tokens**,复杂任务可适当增加。
113
+ - 在多轮对话中,建议保留完整的对话历史以获得最佳效果。
114
+
115
+ ---
116
+
117
+ ## 📞 联系我们
118
+
119
+ 了解更多关于小熠的信息,请访问 [熠数信息官网](https://shining-data.com)
120
+
121
+ ---
122
+
123
+ ## 🌟 致谢
124
+
125
+ 感谢 Google 开源 Gemma-3 模型,为小熠提供了高效的语言理解和生成能力基础。
chat_template.jinja ADDED
@@ -0,0 +1,47 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {{ bos_token }}
2
+ {%- if messages[0]['role'] == 'system' -%}
3
+ {%- if messages[0]['content'] is string -%}
4
+ {%- set first_user_prefix = messages[0]['content'] + '
5
+
6
+ ' -%}
7
+ {%- else -%}
8
+ {%- set first_user_prefix = messages[0]['content'][0]['text'] + '
9
+
10
+ ' -%}
11
+ {%- endif -%}
12
+ {%- set loop_messages = messages[1:] -%}
13
+ {%- else -%}
14
+ {%- set first_user_prefix = "" -%}
15
+ {%- set loop_messages = messages -%}
16
+ {%- endif -%}
17
+ {%- for message in loop_messages -%}
18
+ {%- if (message['role'] == 'user') != (loop.index0 % 2 == 0) -%}
19
+ {{ raise_exception("Conversation roles must alternate user/assistant/user/assistant/...") }}
20
+ {%- endif -%}
21
+ {%- if (message['role'] == 'assistant') -%}
22
+ {%- set role = "model" -%}
23
+ {%- else -%}
24
+ {%- set role = message['role'] -%}
25
+ {%- endif -%}
26
+ {{ '<start_of_turn>' + role + '
27
+ ' + (first_user_prefix if loop.first else "") }}
28
+ {%- if message['content'] is string -%}
29
+ {{ message['content'] | trim }}
30
+ {%- elif message['content'] is iterable -%}
31
+ {%- for item in message['content'] -%}
32
+ {%- if item['type'] == 'image' -%}
33
+ {{ '<start_of_image>' }}
34
+ {%- elif item['type'] == 'text' -%}
35
+ {{ item['text'] | trim }}
36
+ {%- endif -%}
37
+ {%- endfor -%}
38
+ {%- else -%}
39
+ {{ raise_exception("Invalid content type") }}
40
+ {%- endif -%}
41
+ {{ '<end_of_turn>
42
+ ' }}
43
+ {%- endfor -%}
44
+ {%- if add_generation_prompt -%}
45
+ {{'<start_of_turn>model
46
+ '}}
47
+ {%- endif -%}
config.json ADDED
@@ -0,0 +1,54 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_sliding_window_pattern": 6,
3
+ "architectures": [
4
+ "Gemma3ForCausalLM"
5
+ ],
6
+ "attention_bias": false,
7
+ "attention_dropout": 0.0,
8
+ "attn_logit_softcapping": null,
9
+ "bos_token_id": 2,
10
+ "eos_token_id": 1,
11
+ "final_logit_softcapping": null,
12
+ "head_dim": 256,
13
+ "hidden_activation": "gelu_pytorch_tanh",
14
+ "hidden_size": 640,
15
+ "initializer_range": 0.02,
16
+ "intermediate_size": 2048,
17
+ "layer_types": [
18
+ "sliding_attention",
19
+ "sliding_attention",
20
+ "sliding_attention",
21
+ "sliding_attention",
22
+ "sliding_attention",
23
+ "full_attention",
24
+ "sliding_attention",
25
+ "sliding_attention",
26
+ "sliding_attention",
27
+ "sliding_attention",
28
+ "sliding_attention",
29
+ "full_attention",
30
+ "sliding_attention",
31
+ "sliding_attention",
32
+ "sliding_attention",
33
+ "sliding_attention",
34
+ "sliding_attention",
35
+ "full_attention"
36
+ ],
37
+ "max_position_embeddings": 32768,
38
+ "model_type": "gemma3_text",
39
+ "num_attention_heads": 4,
40
+ "num_hidden_layers": 18,
41
+ "num_key_value_heads": 1,
42
+ "pad_token_id": 0,
43
+ "query_pre_attn_scalar": 256,
44
+ "rms_norm_eps": 1e-06,
45
+ "rope_local_base_freq": 10000.0,
46
+ "rope_scaling": null,
47
+ "rope_theta": 1000000.0,
48
+ "sliding_window": 512,
49
+ "torch_dtype": "bfloat16",
50
+ "transformers_version": "4.55.0.dev0",
51
+ "use_bidirectional_attention": false,
52
+ "use_cache": true,
53
+ "vocab_size": 262144
54
+ }
generation_config.json ADDED
@@ -0,0 +1,11 @@
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "cache_implementation": "hybrid",
3
+ "do_sample": true,
4
+ "eos_token_id": [
5
+ 1,
6
+ 106
7
+ ],
8
+ "top_k": 64,
9
+ "top_p": 0.95,
10
+ "transformers_version": "4.55.0.dev0"
11
+ }
logo.png ADDED

Git LFS Details

  • SHA256: 3b5b212bbaf3c677396a93637e44e26c9adc810ac6bc99192e280c0ae283a0fe
  • Pointer size: 132 Bytes
  • Size of remote file: 3.4 MB
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:421f80b5110955a40ea570f3f391484c191d3c9478d03f717da1834a531b2c68
3
+ size 536223056
special_tokens_map.json ADDED
@@ -0,0 +1,33 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "boi_token": "<start_of_image>",
3
+ "bos_token": {
4
+ "content": "<bos>",
5
+ "lstrip": false,
6
+ "normalized": false,
7
+ "rstrip": false,
8
+ "single_word": false
9
+ },
10
+ "eoi_token": "<end_of_image>",
11
+ "eos_token": {
12
+ "content": "<end_of_turn>",
13
+ "lstrip": false,
14
+ "normalized": false,
15
+ "rstrip": false,
16
+ "single_word": false
17
+ },
18
+ "image_token": "<image_soft_token>",
19
+ "pad_token": {
20
+ "content": "<pad>",
21
+ "lstrip": false,
22
+ "normalized": false,
23
+ "rstrip": false,
24
+ "single_word": false
25
+ },
26
+ "unk_token": {
27
+ "content": "<unk>",
28
+ "lstrip": false,
29
+ "normalized": false,
30
+ "rstrip": false,
31
+ "single_word": false
32
+ }
33
+ }
tokenizer.json ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4667f2089529e8e7657cfb6d1c19910ae71ff5f28aa7ab2ff2763330affad795
3
+ size 33384568
tokenizer_config.json ADDED
The diff for this file is too large to render. See raw diff